Vocal learning, the substrate of human language acquisition, has rarely been described in other mammals. Often, group-specific vocal dialects in wild populations provide the main evidence for vocal learning. While social learning is often the most plausible explanation for these intergroup differences, it is usually impossible to exclude other driving factors, such as genetic or ecological backgrounds. Here, we show the formation of dialects through social vocal learning in fruit bats under controlled conditions. We raised 3 groups of pups in conditions mimicking their natural roosts. Namely, pups could hear their mothers' vocalizations but were also exposed to a manipulation playback. The vocalizations in the 3 playbacks mainly differed in their fundamental frequency. From the age of approximately 6 months and onwards, the pups demonstrated distinct dialects, where each group was biased towards its playback. We demonstrate the emergence of dialects through social learning in a mammalian model in a tightly controlled environment. Unlike in the extensively studied case of songbirds where specific tutors are imitated, we demonstrate that bats do not only learn their vocalizations directly from their mothers, but that they are actually influenced by the sounds of the entire crowd. This process, which we term “crowd vocal learning,” might be relevant to many other social animals such as cetaceans and pinnipeds.