Origin of the various defined STR clusters based on user provided ancestor locations

 

John McEwan

 

4th February 2006

 

Background

Many people ask:

 

 I have haplotype XYZ, and according to your how to guide I am R1bSTR19 can you tell me where my ancestors most likely came from”.

 

This question has several hidden features the first is that relatively sophisticated analyses may try to identify the emergence of a founder haplotype (or SNP based haplogroup) in a certain location followed by various migrations. Often these migrations may be in several potential “waves” from the focal region. However, often people simply know their ancestors may have came from a certain region perhaps 100-300 years ago and simply want to know based on haplotype if that is a likely scenario. 

 

I have used the simplest tabulation method and presented the raw numbers from the phase 3 analysis of the 37 STR Ysearch haplotypes plus a summary based on percentages, either overall, or within a putative haplogroup. Potentially much more could be done but the data suffers from several major flaws. Many of the individuals list their origin as unknown or USA, Canada, Australia. In the latter case almost all immigrated from elsewhere and these have been classed as other. The second issue is that those submitting samples are a biased subset that is heavily weighted to Scots, Irish and English origins and infrequently from other regions. This means the latter estimates are sparse or absent. These values do have one major advantage though in that they use estimated haplogroups, so many more individuals can be included other than those that have been SNP tested or estimated based on minor deviations from SNP tested individuals. This process also has identified errors both in data entry and SNP genotyping.

 

Regions of origin

The groups were defined as follows. United Kingdom was where this was the only location provided; Africa was all countries in this continent; Asia was defined as east of the Urals; Eastern European was defined by the longtitude of the western border of Poland; England as it is geographically defined; Iberia consists of Spain and Portugal; Ireland included Northern and Southern Ireland; Middle East included Turkey and Iran and was bounded by the Red sea to the South; Other was unknown and countries largely founded by recent immigrants including USA, Canda, Australia; Scandinavia as geographically defined; Scotland as geographically defined; Southern Europe included Italy, Greece, Albania, Mediterranean Islands and the former Yugoslavia; Wales as it is geographically defined; Western Europe was the region bounded by the above regions.

 

Tables

            Table 1. By percentage for estimated major haplogroup and region of origin

            Table 2. By percentage for estimated haplogroup E cluster and region of origin

            Table 3. By percentage for estimated haplogroup G cluster and region of origin

            Table 4. By percentage for estimated haplogroup I cluster and region of origin

            Table 5. By percentage for estimated haplogroup J cluster and region of origin

            Table 6. By percentage for estimated haplogroup R1b cluster and region of origin

            Table 7. Raw data for all subgroups by region of origin

 

 

Table 1. Percentage of major estimated haplogroups by region of origin, with the number of samples by region listed at the bottom: abbreviations UK, United Kingdom; AF, Africa; AS, Asia; EE, Eastern European; EN, England; IB, Iberia; IR, Ireland; ME, Middle East; OT, Other; SD, Scandinavia; SC, Scotland; SE, Southern Europe; WA, Wales; WE, Western Europe. 

 

 

 

 

 

 

 

Origin

 

 

 

 

 

 

 

Haplogroup (Est)

AF

AS

EE

EN

IB

IR

ME

OT

SC

SD

SE

UK

WA

WE

AB

0

0

0

0

0

0

0

0

0

0

0

0

0

0

E

33

0

16

2

15

1

17

3

1

0

4

0

2

12

F

0

43

1

0

0

0

0

0

0

0

4

0

0

0

G

0

0

7

3

4

2

17

1

0

5

8

0

7

6

HO3

0

29

0

0

0

0

0

0

0

0

0

0

0

0

I

0

0

13

24

8

14

0

22

17

50

21

28

10

19

J

33

0

13

2

27

1

33

3

1

0

21

0

0

6

K2

0

0

3

0

8

0

0

0

1

0

0

0

0

0

N

0

0

0

0

0

0

0

0

0

8

0

0

0

1

Q

0

0

1

0

4

0

0

1

1

0

4

0

0

0

R1a

0

14

24

4

0

5

33

4

5

13

8

0

0

8

R1b

33

14

21

64

35

77

0

65

74

25

29

72

80

48

N samples

3

7

107

625

26

422

6

2087

324

40

24

36

41

247

 

Table 2. Percentage of clusters in haplogroup E by region of origin, with the number of samples by region listed at the bottom: abbreviations UK, United Kingdom; AF, Africa; AS, Asia; EE, Eastern European; EN, England; IB, Iberia; IR, Ireland; ME, Middle East; OT, Other; SD, Scandinavia; SC, Scotland; SE, Southern Europe; WA, Wales; WE, Western Europe. 

 

 

 

 

 

 

 

Origin

 

 

 

 

 

 

 

Haplogroup (est)

AF

AS

EE

EN

IB

IR

ME

OT

SC

SD

SE

UK

WA

WE

E3a

0

0

6

8

0

20

0

46

0

0

0

0

0

10

E3bSTR1

100

0

59

17

75

20

100

20

0

0

0

0

0

66

E3bSTR2

0

0

12

58

0

60

0

34

100

0

100

0

100

21

E3bSTR3

0

0

24

17

25

0

0

0

0

0

0

0

0

3

N samples

1

0

17

12

4

5

1

70

2

0

1

0

1

29

Note: haplogroup E SNP subclades are under active revision no attempt has been made to reconcile these STR clusters with these subclades. However, in many cases there is a strong homology between them.

 

 

 Table 3. Percentage of clusters in haplogroup G by region of origin, with the number of samples by region listed at the bottom: abbreviations UK, United Kingdom; AF, Africa; AS, Asia; EE, Eastern European; EN, England; IB, Iberia; IR, Ireland; ME, Middle East; OT, Other; SD, Scandinavia; SC, Scotland; SE, Southern Europe; WA, Wales; WE, Western Europe.

 

 

 

 

 

 

 

Origin

 

 

 

 

 

 

 

Haplogroup (est)

AF

AS

EE

EN

IB

IR

ME

OT

SC

SD

SE

UK

WA

WE

Fx(GG2)

0

0

0

0

0

70

0

0

0

0

0

0

0

7

GG2

0

0

25

57

100

20

0

70

0

100

50

0

67

87

GG2STR2

0

0

25

43

0

0

100

23

0

0

50

0

33

7

Gx

0

0

50

0

0

10

0

7

0

0

0

0

0

0

N Samples

0

0

8

21

1

10

1

30

0

2

2

0

3

15

Note: When this analysis was undertaken a number of individuals were mislabeled as F based on a faulty SNP test these have been now confirmed as part of G

 

Table 4. Percentage of clusters in haplogroup I by region of origin, with the number of samples by region listed at the bottom: abbreviations UK, United Kingdom; AF, Africa; AS, Asia; EE, Eastern European; EN, England; IB, Iberia; IR, Ireland; ME, Middle East; OT, Other; SD, Scandinavia; SC, Scotland; SE, Southern Europe; WA, Wales; WE, Western Europe.

 

 

 

 

 

 

 

Origin

 

 

 

 

 

 

 

Haplogroup (est)

AF

AS

EE

EN

IB

IR

ME

OT

SC

SD

SE

UK

WA

WE

I*

0

0

0

2

0

0

0

3

0

0

0

0

0

4

I1aSTR1

0

0

0

5

0

2

0

5

2

10

0

10

0

11

I1aSTR10

0

0

0

3

0

12

0

8

11

5

0

0

0

0

I1aSTR2

0

0

21

5

50

4

0

7

13

10

0

0

0

0

I1aSTR3

0

0

0

4

0

18

0

4

2

0

0

0

0

9

I1aSTR4

0

0

0

9

0

2

0

10

4

5

0

0

25

11

I1aSTR5

0

0

7

13

0

2

0

8

11

5

0

0

25

11

I1aSTR6

0

0

14

9

0

0

0

8

2

5

0

20

0

6

I1aSTR7

0

0

0

14

0

12

0

12

4

30

0

0

50

6

I1aSTR8

0

0

0

3

50

2

0

6

15

20

20

0

0

2

I1aSTR9

0

0

0

2

0

0

0

3

2

5

0

0

0

2

I1b2

0

0

0

1

0

2

0

1

2

0

0

0

0

4

I1bSTR1

0

0

57

1

0

5

0

3

6

5

40

10

0

9

WesternI1b

0

0

0

5

0

0

0

2

2

0

20

0

0

0

I1cSTR1

0

0

0

17

0

11

0

10

17

0

20

50

0

21

I1cSTR2

0

0

0

1

0

2

0

1

0

0

0

0

0

0

I1cSTR3

0

0

0

0

0

5

0

1

0

0

0

0

0

0

IslesI1c

0

0

0

2

0

18

0

3

9

0

0

0

0

0

RootsI1c

0

0

0

4

0

2

0

2

0

0

0

10

0

0

Ix

0

0

0

2

0

4

0

2

0

0

0

0

0

4

N Samples

0

0

14

151

2

57

0

455

54

20

5

10

4

47

Note: haplogroup I SNP subclades are under active revision no attempt has been made to reconcile these STR clusters with these subclades. However, in many cases there is a strong homology between them.

 

Table 5. Percentage of clusters in haplogroup J by region of origin, with the number of samples by region listed at the bottom: abbreviations UK, United Kingdom; AF, Africa; AS, Asia; EE, Eastern European; EN, England; IB, Iberia; IR, Ireland; ME, Middle East; OT, Other; SD, Scandinavia; SC, Scotland; SE, Southern Europe; WA, Wales; WE, Western Europe.

 

 

 

 

 

 

 

Origin

 

 

 

 

 

 

 

Haplogroup (est)

AF

AS

EE

EN

IB

IR

ME

OT

SC

SD

SE

UK

WA

WE

J1

100

0

7

40

57

0

0

13

50

0

0

0

0

13

J2

0

0

29

50

29

83

0

41

0

0

40

0

0

19

J2e

0

0

0

0

14

17

0

20

0

0

20

0

0

31

J2STR1

0

0

7

0

0

0

0

0

0

0

0

0

0

6

J2STR2

0

0

21

0

0

0

0

2

0

0

0

0

0

0

J2STR3

0

0

7

0

0

0

50

0

0

0

20

0

0

6

J2STR4

0

0

21

0

0

0

0

6

25

0

0

0

0

6

J2x

0

0

7

10

0

0

50

19

25

0

20

0

0

19

N Samples

1

0

14

10

7

6

2

54

4

0

5

0

0

16

Note: haplogroup J SNP subclades are under active revision no attempt has been made to reconcile these STR clusters with these subclades. However, in many cases there is a strong homology between them.

 

 

Table 6. Percentage of clusters in haplogroup R1b by region of origin, with the number of samples by region listed at the bottom: abbreviations UK, United Kingdom; AF, Africa; AS, Asia; EE, Eastern European; EN, England; IB, Iberia; IR, Ireland; ME, Middle East; OT, Other; SD, Scandinavia; SC, Scotland; SE, Southern Europe; WA, Wales; WE, Western Europe.  

 

 

 

 

 

 

 

Origin

 

 

 

 

 

 

 

Haplogroup (est)

AF

AS

EE

EN

IB

IR

ME

OT

SC

SD

SE

UK

WA

WE

R1bSTR1

0

0

4

2

0

2

0

2

1

0

0

0

3

7

R1bSTR10

0

0

0

2

0

2

0

2

6

10

0

0

3

1

R1bSTR11

0

0

0

2

0

1

0

2

3

0

0

0

9

0

R1bSTR12

0

0

0

2

11

2

0

1

2

0

0

0

0

1

R1bSTR13

0

0

0

0

0

0

0

1

1

0

0

0

0

1

R1bSTR14

0

0

0

3

0

2

0

1

0

0

0

0

3

2

R1bSTR15

0

0

4

1

11

4

0

2

2

0

0

0

12

1

R1bSTR16

0

0

0

1

0

3

0

1

2

0

14

4

0

1

R1bSTR17

0

0

0

2

11

1

0

3

0

0

0

4

0

6

R1bSTR18

0

0

0

2

0

1

0

1

2

0

0

4

0

2

R1bSTR19Irish

0

0

0

1

0

20

0

6

9

0

0

12

0

3

R1bSTR2

0

0

0

3

11

1

0

2

1

0

0

8

12

2

R1bSTR20

0

0

4

2

0

1

0

2

1

0

0

0

0

3

R1bSTR21

0

0

0

0

0

1

0

1

1

0

0

0

6

2

R1bSTR22Frisian

0

0

0

8

0

2

0

5

3

0

0

4

3

3

R1bSTR23

0

0

0

0

0

2

0

1

0

0

0

0

0

3

R1bSTR24

0

0

35

1

0

2

0

2

3

0

29

0

3

3

R1bSTR25

0

0

4

3

0

4

0

3

1

0

0

0

9

3

R1bSTR25a

0

0

0

2

0

1

0

1

0

0

0

0

0

4

R1bSTR26

0

0

0

0

0

3

0

1

1

0

0

0

0

2

R1bSTR27

0

0

4

3

11

1

0

3

1

10

0

0

0

0

R1bSTR28

0

0

0

2

0

3

0

2

1

10

0

8

0

3

R1bSTR29

0

0

0

0

0

0

0

1

1

40

0

0

0

1

R1bSTR3

0

0

0

3

0

0

0

3

0

0

0

8

0

2

R1bSTR30

0

0

0

1

0

1

0

2

1

0

0

0

0

0

R1bSTR31

0

0

0

0

0

0

0

1

0

0

0

0

0

0

R1bSTR32

0

0

0

1

11

0

0

2

2

0

14

0

0

1

R1bSTR33

0

0

0

1

0

1

0

1

1

0

0

0

0

0

R1bSTR34

0

0

9

3

11

1

0

2

1

0

0

4

3

0

R1bSTR35

0

0

0

1

0

1

0

1

1

0

0

0

3

0

R1bSTR36

0

0

0

1

0

1

0

1

0

0

0

0

0

9

R1bSTR37

0

0

0

2

0

2

0

3

1

0

14

4

6

3

R1bSTR38

0

0

0

0

0

2

0

1

2

0

0

0

3

2

R1bSTR39

0

0

0

4

0

2

0

3

1

10

0

12

0

3

R1bSTR4

0

0

0

0

0

1

0

1

0

0

0

0

3

1

R1bSTR40

0

100

0

4

0

1

0

3

2

0

0

0

3

3

R1bSTR41

0

0

4

0

0

2

0

1

1

0

0

0

0

2

R1bSTR42

100

0

0

4

22

1

0

3

5

0

0

4

3

0

R1bSTR43

0

0

13

3

0

3

0

4

5

0

0

0

3

8

R1bSTR44

0

0

0

3

0

6

0

3

2

10

0

4

0

7

R1bSTR45

0

0

4

1

0

2

0

1

1

0

14

8

0

2

R1bSTR46

0

0

4

0

0

2

0

1

1

0

0

0

0

1

R1bSTR47Scots

0

0

0

3

0

2

0

4

21

0

0

8

3

1

R1bSTR48

0

0

4

2

0

1

0

2

0

0

14

0

0

1

R1bSTR49

0

0

0

1

0

4

0

2

1

10

0

0

3

0

R1bSTR5

0

0

0

1

0

0

0

1

0

0

0

0

0

0

R1bSTR6

0

0

4

1

0

0

0

2

2

0

0

0

3

1

R1bSTR7

0

0

0

5

0

4

0

2

3

0

0

4

0

0

R1bSTR8

0

0

0

1

0

0

0

1

1

0

0

0

0

3

R1bSTR9

0

0

0

2

0

2

0

3

2

0

0

4

0

3

N samples

1

1

23

401

9

323

0

1359

241

10

7

26

33

118

  

 

Table 7. Raw data for estimated groups by region of origin, with the number of samples by region listed at the bottom: abbreviations UK, United Kingdom; AF, Africa; AS, Asia; EE, Eastern European; EN, England; IB, Iberia; IR, Ireland; ME, Middle East; OT, Other; SD, Scandinavia; SC, Scotland; SE, Southern Europe; WA, Wales; WE, Western Europe. 

 

 

 

 

 

 

 

Origin

 

 

 

 

 

 

 

Haplogroup (Est)

AF

AS

EE

EN

IB

IR

ME

OT

SC

SD

SE

UK

WA

WE

Grand Total

AB

 

 

 

1

 

1

 

6

1

 

 

 

 

 

9

E3a

 

 

1

1

 

1

 

32

 

 

 

 

 

3

38

E3bSTR1

1

 

10

2

3

1

1

14

 

 

 

 

 

19

51

E3bSTR2

 

 

2

7

 

3

 

24

2

 

1

 

1

6

46

E3bSTR3

 

 

4

2

1

 

 

 

 

 

 

 

 

1

8

F

 

3

1

1

 

1

 

5

 

 

1

 

 

 

12

Fx

 

 

 

 

 

7

 

 

 

 

 

 

 

1

8

GG2

 

 

2

12

1

2

 

21

 

2

1

 

2

13

56

GG2STR2

 

 

2

9

 

 

1

7

 

 

1

 

1

1

22

Gx

 

 

4

 

 

1

 

2

 

 

 

 

 

 

7

HO3

 

2

 

 

 

 

 

1

 

 

 

 

 

 

3

I

 

 

 

3

 

 

 

14

 

 

 

 

 

2

19

I1aSTR1

 

 

 

7

 

1

 

25

1

2

 

1

 

5

42

I1aSTR10

 

 

 

4

 

7

 

37

6

1

 

 

 

 

55

I1aSTR2

 

 

3

8

1

2

 

33

7

2

 

 

 

 

56

I1aSTR3

 

 

 

6

 

10

 

16

1

 

 

 

 

4

37

I1aSTR4

 

 

 

13

 

1

 

46

2

1

 

 

1

5

69

I1aSTR5

 

 

1

20

 

1

 

36

6

1

 

 

1

5

71

I1aSTR6

 

 

2

13

 

 

 

37

1

1

 

2

 

3

59

I1aSTR7

 

 

 

21

 

7

 

53

2

6

 

 

2

3

94

I1aSTR8

 

 

 

4

1

1

 

27

8

4

1

 

 

1

47

I1aSTR9

 

 

 

3

 

 

 

14

1

1

 

 

 

1

20

I1b2

 

 

 

1

 

1

 

5

1

 

 

 

 

2

10

I1bSTR1

 

 

8

2

 

3

 

15

3

1

2

1

 

4

39

WesternI1b

 

 

 

7

 

 

 

7

1

 

1

 

 

 

16

I1cSTR1

 

 

 

26

 

6

 

46

9

 

1

5

 

10

103

I1cSTR2

 

 

 

1

 

1

 

5

 

 

 

 

 

 

7

I1cSTR3

 

 

 

 

 

3

 

4

 

 

 

 

 

 

7

IslesI1c

 

 

 

3

 

10

 

15

5

 

 

 

 

 

33

RootsI1c

 

 

 

6

 

1

 

10

 

 

 

1

 

 

18

Ix

 

 

 

3

 

2

 

10

 

 

 

 

 

2

17

J1

1

 

1

4

4

 

 

7

2

 

 

 

 

2

21

J2

 

 

4

5

2

5

 

22

 

 

2

 

 

3

43

J2e

 

 

 

 

1

1

 

11

 

 

1

 

 

5

19

J2STR1

 

 

1

 

 

 

 

 

 

 

 

 

 

1

2

J2STR2

 

 

3

 

 

 

 

1

 

 

 

 

 

 

4

J2STR3

 

 

1

 

 

 

1

 

 

 

1

 

 

1

4

J2STR4

 

 

3

 

 

 

 

3

1

 

 

 

 

1

8

J2x

 

 

1

1

 

 

1

10

1

 

1

 

 

3

18

K2

 

 

3

2

2

 

 

8

2

 

 

 

 

 

17

N

 

 

 

 

 

 

 

3

 

3

 

 

 

2

8

Q

 

 

1

2

1

 

 

15

3

 

1

 

 

1

24

R1a

 

1

26

24

 

19

2

81

17

5

2

 

 

19

196

R1bSTR1

 

 

1

8

 

6

 

29

3

 

 

 

1

8

56

R1bSTR10

 

 

 

9

 

7

 

27

14

1

 

 

1

1

60

R1bSTR11

 

 

 

10

 

3

 

30

7

 

 

 

3

 

53

R1bSTR12

 

 

 

8

1

7

 

19

5

 

 

 

 

1

41

R1bSTR13

 

 

 

 

 

 

 

11

2

 

 

 

 

1

14

R1bSTR14

 

 

 

14

 

5

 

20

1

 

 

 

1

2

43

R1bSTR15

 

 

1

5

1

12

 

31

5

 

 

 

4

1

60

R1bSTR16

 

 

 

6

 

9

 

15

6

 

1

1

 

1

39

R1bSTR17

 

 

 

7

1

4

 

36

1

 

 

1

 

7

57

R1bSTR18

 

 

 

7

 

4

 

17

6

 

 

1

 

2

37

R1bSTR19Irish

 

 

 

6

 

64

 

86

22

 

 

3

 

3

184

R1bSTR2

 

 

 

12

1

3

 

30

2

 

 

2

4

2

56

R1bSTR20

 

 

1

7

 

2

 

21

3

 

 

 

 

3

37

R1bSTR21

 

 

 

 

 

4

 

16

2

 

 

 

2

2

26

R1bSTR22Frisian

 

 

 

31

 

5

 

68

8

 

 

1

1

3

117

R1bSTR23

 

 

 

1

 

5

 

17

1

 

 

 

 

3

27

R1bSTR24

 

 

8

6

 

6

 

29

8

 

2

 

1

4

64

R1bSTR25

 

 

1

13

 

13

 

45

3

 

 

 

3

4

82

R1bSTR25a

 

 

 

7

 

4

 

19

 

 

 

 

 

5

35

R1bSTR26

 

 

 

2

 

9

 

11

2

 

 

 

 

2

26

R1bSTR27

 

 

1

13

1

3

 

34

2

1

 

 

 

 

55

R1bSTR28

 

 

 

9

 

11

 

28

2

1

 

2

 

3

56

R1bSTR29

 

 

 

2

 

 

 

10

2

4

 

 

 

1

19

R1bSTR3

 

 

 

13

 

1

 

34

1

 

 

2

 

2

53

R1bSTR30

 

 

 

4

 

3

 

24

2

 

 

 

 

 

33

R1bSTR31

 

 

 

1

 

 

 

8

 

 

 

 

 

 

9

R1bSTR32

 

 

 

4

1

 

 

32

4

 

1

 

 

1

43

R1bSTR33

 

 

 

6

 

2

 

10

3

 

 

 

 

 

21

R1bSTR34

 

 

2

14

1

2

 

21

3

 

 

1

1

 

45

R1bSTR35

 

 

 

4

 

4

 

16

2

 

 

 

1

 

27

R1bSTR36

 

 

 

6

 

4

 

17

1

 

 

 

 

11

39

R1bSTR37

 

 

 

9

 

6

 

39

2

 

1

1

2

4

64

R1bSTR38

 

 

 

2

 

7

 

17

4

 

 

 

1

2

33

R1bSTR39

 

 

 

15

 

6

 

38

2

1

 

3

 

4

69

R1bSTR4

 

 

 

2

 

4

 

7

 

 

 

 

1

1

15

R1bSTR40

 

1

 

18

 

2

 

35

5

 

 

 

1

3

65

R1bSTR41

 

 

1

1

 

6

 

18

3

 

 

 

 

2

31

R1bSTR42

1

 

 

17

2

3

 

43

11

 

 

1

1

 

79

R1bSTR43

 

 

3

14

 

11

 

48

11

 

 

 

1

9

97

R1bSTR44

 

 

 

13

 

20

 

44

4

1

 

1

 

8

91

R1bSTR45

 

 

1

5

 

5

 

20

2

 

1

2

 

2

38

R1bSTR46

 

 

1

2

 

8

 

10

3

 

 

 

 

1

25

R1bSTR47Scots

 

 

 

11

 

8

 

60

50

 

 

2

1

1

133

R1bSTR48

 

 

1

9

 

3

 

23

 

 

1

 

 

1

38

R1bSTR49

 

 

 

6

 

13

 

29

2

1

 

 

1

 

52

R1bSTR5

 

 

 

4

 

 

 

13

 

 

 

 

 

 

17

R1bSTR6

 

 

1

5

 

 

 

24

4

 

 

 

1

1

36

R1bSTR7

 

 

 

21

 

13

 

26

8

 

 

1

 

 

69

R1bSTR8

 

 

 

5

 

1

 

18

2

 

 

 

 

3

29

R1bSTR9

 

 

 

7

 

5

 

36

5

 

 

1

 

3

57

Grand Total

3

7

107

625

26

422

6

2087

324

40

24

36

41

247

3995