Haplogroup F, also known as F-M89 and previously as Haplogroup FT, is a very common Y-chromosome haplogroup. The clade and its subclades constitute over 90% of paternal lineages outside of Africa.
The vast majority of individual males with F-M89 fall into its direct descendant Haplogroup GHIJK (F1329/M3658/PF2622/YSC0001299).[9] in addition to GHIJK, haplogroup F has three other immediate descendant subclades: F1 (P91/P104), F2 (M427/M428), and F3 (M481). These three, with F* (M89*), constitute the paragroupF(xGHIJK). They are primarily found throughout South Asia, Southeast Asia and parts of East Asia.
Haplogroup GHIJK branches subsequently split into two direct descendants: G (M201/PF2957) and HIJK (F929/M578/PF3494/S6397). HIJK in turn splits into H (L901/M2939) and IJK (F-L15). The descendants of the haplogroup IJK include the clades I, J, K, and, ultimately, several major haplogroups descended from Haplogroup K, namely: haplogroups M, N, O, P, Q, R, S, L, and T.
It is estimated that the SNP M89 appeared 38,700–55,700 years ago, and most likely originated in West Asia[2][10]orSoutheast Asia[11] It has also been suggested by previous research that F-M89 most likely first appeared in the Arabian Peninsula, LevantorNorth Africa, about 43,800–56,800 years ago.[3] It has also been speculated that the possible location of this lineage's first expansion and rise to prevalence appears to have been in the Indian Subcontinent, or somewhere close to it, and most of the descendant subclades and haplogroups appear to have radiated outward from The Persian Gulf and/or neighbouring parts of the Middle East and Southeast Asia.
Some lineages derived from Haplogroup F-M89 appear to have back-migrated into Africa from West Asia, during prehistory. For example, subclades of F-M89 were discovered in ancient DNA samples from Sudan, which were associated with both Meroitic and Post-Meroitic [1]Archived 2022-08-25 at the Wayback Machine burials.
The vast majority of living individuals carrying F-M89 belong to subclades of GHIJK. By comparison, cases of the paragroup F(xG,H,I,J,K) – that is, either basal F* (M89) or the primary subclades F1 (P91; P104), F2 (M427; M428) and F3 (M481) – are relatively rare worldwide.
A lack of precise, high resolution testing in the past makes it difficult to discuss F*, F1, F2* and F3* separately. ISOGG states that F(xG,H,I,J,K) has not been well studied, occurs "infrequently" in modern populations and peaks in South Asia, especially Sri Lanka.[9] It also appears to have long been present in South East Asia. However, the possibility of misidentification is considered to be relatively high and some cases may in fact belong to misidentified subclades of Haplogroup GHIJK. This was, for instance, the case with the subclade Haplogroup H2 (P96), which was originally named "F3", i.e. – a name that has since been reassigned to F-M481.
F(xF1,F2,F3) has been reported among 10% of males in Sri Lanka, 5.2% of males across India (including up to 10% of males in South India), 5% in Pakistan, as well as lower levels among the Tamang people (Nepal), and in Iran.
Men originating in Indonesia have also been reported to carry F(xG,H,I,J,K) – especially F-M89* – at relatively significant levels. It has been reported at rates of 4-5% in Sulawesi and Lembata. One study, which did not comprehensively screen for other subclades of F-M89 (including some subclades of GHIJK), found that Indonesian men with the SNP P14/PF2704 (which is equivalent to M89), comprise 1.8% of men in West Timor, 1.5% of Flores 5.4% of Lembata 2.3% of Sulawesi and 0.2% in Sumatra.[12][13] F1 (P91), F2 (M427) and F3 (M481; previously F5) are all highly rare and virtually exclusive to regions/ethnic minorities in Sri Lanka, India, Nepal, South China, Thailand, Burma, and Vietnam.
Kutanan et al. (2020) have found F*-M89 in 50.0% (8/16) of a sample of Red Lahu, 47.1% (8/17) of a sample of Black Lahu, and 6.7% (1/15) of a sample of LisuinMae Hong Son Province of Thailand. All these Loloish-speaking members of F*-M89 in northwestern Thailand have been found to be quite closely related in the paternal line, with the TMRCA of their Y-DNA estimated to be 584 years before present. However, the aforementioned Y-chromosomes are only distantly related to instances of F*-M89 observed in samples of other populations of Thailand, including 5.6% (1/18) of a sample of Phuan from Central Thailand, 11.8% (2/17) of a sample of Soa from Northeast Thailand, and 29% (2/7) of a sample of Saek from Northeast Thailand. The TMRCA of the Loloish cluster from North Thailand and the Y-DNA of the Phuan individual from Central Thailand has been estimated to be 12,675 years before present. The TMRCA of the F*-M89 cluster from Northeast Thailand has been estimated to be 6,492 years before present. The TMRCA of all these F*-M89 individuals from Thailand has been estimated to be 16,006 years before present.[15]
Neolithic migration into Europe from Southwest Asia, by first wave of farmers in Europe has been put forward as the source of F and G2a found in European Neolithic remains, dating from circa 4000 BCE. These remains, according to Herrerra et al. (2012) showed a "greater genetic similarity" to "individuals from the modern Near East" than to modern Europeans.[19] F(xG,H,I,J,K) may have been found in Bronze Age remains from Europe, namely the individuals known as DEB 20 and DEB 38, who lived about 7,000–7,210 BP, and were found at the Derenburg Meerenstieg II site in Germany.[20]
Three less certain cases, which have not been tested for all subclades of GHIJK, have been found among Neolithic remains in Europe. I0411 (Troc 4), who lived 7,195–7,080 years BP, was found in the Els Trocs cave, near Bisaurri (modern Spain) – while haplogroups G, I1 (I-M450; I-S247) J, L1b2, Q1b1, Q1a2a, R1a1a1 (R-L449), R1b1a2b1a (R-M35) and T were ruled out,[20] I2a1b1 and R1b1a2 were found in other remains from the same site (Troc 5 and Troc 2). Similarly, three sets of remains from Hungary were not tested for all subclades of GHIJK: BAM 17, BAM 26 (both from Alsónyék Bátaszék, circa 7,850–7,675 years BP) and TOLM 3 (7,030–7,230 BP, found in Tolna-Mözs).[20] (An individual known to scholars as "Oase 1", who lived circa 37,800 years BP in Eastern Europe, was initially classified as belonging either to paragroup F(xGHIJK) or within K.[18] However, subsequent research has revealed that Oase 1 belonged to K2a*.[21])
Some cases reported amongst modern populations of Europeans, Native Americans and Pacific Islanders may be due to migration and admixture of F(xG,H,I,J,K), as a result of contact with South and/or South East Asia, during the early modern era (16th–19th Century).[9]
Such examples include:
The newly defined and rare subclade F3 (M481; previously F5) has been found in India and Nepal, among the Tharu people and in Andhra Pradesh.[29] F-M481 should not be confused with Haplogroup H2 (L279, L281, L284, L285, L286, M282, P96), which was previously misclassified under F-M89, as "F3".
In Y-chromosome phylogenetics, subclades are the branches of haplogroups. These subclades are also defined by single nucleotide polymorphisms (SNPs) or unique event polymorphisms (UEPs).
There are several confirmed and proposed phylogenetic trees available for haplogroup F-M89. The scientifically accepted one is the Y-Chromosome Consortium (YCC) one published in Karafet 2008[2] and subsequently updated. A draft tree that shows emerging science is provided by Thomas Krahn at the Genomic Research Center in Houston, Texas. The International Society of Genetic Genealogy (ISOGG) also provides an amateur tree.
This is the official scientific tree produced by the Y-Chromosome Consortium (YCC). The last major update was in 2008.[2] Subsequent updates have been quarterly and biannual. The current version is a revision of the 2010 update.[citation needed]
Prior to 2002, there were in academic literature at least seven naming systems for the Y-Chromosome Phylogenetic tree. This led to considerable confusion. In 2002, the major research groups came together and formed the Y-Chromosome Consortium (YCC). They published a joint paper that created a single new tree that all agreed to use. Later, a group of citizen scientists with an interest in population genetics and genetic genealogy formed a working group to create an amateur tree aiming at being above all timely. The table below brings together all of these works at the point of the landmark 2002 YCC Tree. This allows a researcher reviewing older published literature to quickly move between nomenclatures.
^Van Oven M, Van Geystelen A, Kayser M, Decorte R, Larmuseau HD (2014). "Seeing the wood for the trees: a minimal reference phylogeny for the human Y chromosome". Human Mutation. 35 (2): 187–91. doi:10.1002/humu.22468. PMID24166809. S2CID23291764.
^K-M2313*, which as yet has no phylogenetic name, has been documented in two living individuals, who have ethnic ties to India and South East Asia. In addition, K-Y28299, which appears to be a primary branch of K-M2313, has been found in three living individuals from India. See: Poznik op. cit.; YFull YTree v5.08, 2017, "K-M2335", and; PhyloTree, 2017, "Details of the Y-SNP markers included in the minimal Y tree" (Access date of these pages: 9 December 2017)
^ Haplogroup S, as of 2017, is also known as K2b1a. (Previously the name Haplogroup S was assigned to K2b1a4.)
^ Haplogroup M, as of 2017, is also known as K2b1b. (Previously the name Haplogroup M was assigned to K2b1d.)
^Wibhu Kutanan, Rasmi Shoocongdej, Metawee Srikummool, et al. (2020), "Cultural variation impacts paternal and maternal genetic lineages of the Hmong-Mien and Sino-Tibetan groups from Thailand." European Journal of Human Genetics. https://doi.org/10.1038/s41431-020-0693-x
^Phillip Edward Melton, 2008, Genetic History and Pre-Columbian Diaspora of Chibchan Speaking Populations: Molecular Genetic Evidence; Ann Arbor, Michigan; ProQuest, p. 29.