; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014002 (gene) of Snake gourd v1 genome

Gene IDTan0014002
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG06:12492808..12494948
RNA-Seq ExpressionTan0014002
SyntenyTan0014002
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576012.1 hypothetical protein SDJN03_26651, partial [Cucurbita argyrosperma subsp. sororia]1.4e-6858.95Show/hide
Query:  SSSIPHIASTLLLLLLLIPSLKILNINGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFMVVLGFVK
        SS IP +ASTLLLLLLLIPSLK+L INGLNSL  LV CLFFKAP C+LVS L  L+ PA+A L A QSLA+AL+ + VS ++M   I+GSF M VLG +K
Subjt:  SSSIPHIASTLLLLLLLIPSLKILNINGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFMVVLGFVK

Query:  NVVVFGSFADSGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLKG-----SWSMEG
        N VVFGSF + G   G L  K KAS + S  LEQ+R+I+G VS  I+  VL  ASSSAGGVFGF  T IS LLN+PGS +G+LV  LKG       SM G
Subjt:  NVVVFGSFADSGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLKG-----SWSMEG

Query:  VRGIIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG
        VRGI+ S +EKV    L V SSS  G FE VK    +V++SG+T+GG VEKT+AALE+L +E+LR ++QS+A++SVNMII Y LG
Subjt:  VRGIIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG

XP_011659765.2 uncharacterized protein LOC105436268 [Cucumis sativus]2.5e-7058.08Show/hide
Query:  SSFSSSIPHIASTLLLLLLLIPSLKILNINGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFMVVLG
        SS SS+I H+ ST LLLLLLIPSLKI+ ING NSLS L +CLF KAP C+++SFLK +KLPA+AFLSAFQSL EAL+SI VS I+M F I+ SF M VL 
Subjt:  SSFSSSIPHIASTLLLLLLLIPSLKILNINGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFMVVLG

Query:  FVKNVVVFGSFAD-SGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLK-------G
         V N VVFGSF + S    G L + TK SW+     EQ+R I+ S  + ++ QV  IA+S AGG+F F  T +ST+ NEPGS IG LVE+LK       G
Subjt:  FVKNVVVFGSFAD-SGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLK-------G

Query:  SWSMEGVRGIIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG
        SW MEGV+GI+   +EK+ N   EV +SST G FEIVK +F +V+DSGY++GG VEKTR  LE+L++E+LRGI+ +IAKISVNM+I YL G
Subjt:  SWSMEGVRGIIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG

XP_022954027.1 uncharacterized protein LOC111456411 [Cucurbita moschata]1.8e-6557.19Show/hide
Query:  SSSIPHIASTLLLLLLLIPSLKILNINGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFMVVLGFVK
        SS IP +ASTLLLLLLLIPSLK+L INGLNSL  LV CLFFKAP C+LVS L  L+ PA+A L A QSLA+AL+ + VS ++M   I+GSF M VLG +K
Subjt:  SSSIPHIASTLLLLLLLIPSLKILNINGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFMVVLGFVK

Query:  NVVVFGSFADSGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLKG-----SWSMEG
        N VVFGSF + G   G L  K +AS + S  LE +R+I+G VS  I+ +VL  A+SSAGGVFGF  T IS LLN+PGS +G+LV  LKG       SM G
Subjt:  NVVVFGSFADSGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLKG-----SWSMEG

Query:  VRGIIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG
        VRGI+ S +EKV    L V SSS  G FE VK    +V++SG+T+GG VEKT+AALE+L +E+LR ++QS+A++ VNM I Y LG
Subjt:  VRGIIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG

XP_022991981.1 uncharacterized protein LOC111488468 [Cucurbita maxima]6.7e-6858.6Show/hide
Query:  SSSIPHIASTLLLLLLLIPSLKILNINGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFMVVLGFVK
        SS IP +ASTLLLLLLLIPSLK+L INGLNSL  LV CLFFKAP C+LVS L  L+ PA+A L A QSLA+AL+ + VS ++M   I+GSF M VLG +K
Subjt:  SSSIPHIASTLLLLLLLIPSLKILNINGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFMVVLGFVK

Query:  NVVVFGSFADSGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLKGSW-----SMEG
        N VVFGSF + G   G L  K KAS + S  LEQ+R+I+G VS  I+  VL  A+SSAGGVFGF  T IS  LN+PGS +G+LV  LKGS      SM G
Subjt:  NVVVFGSFADSGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLKGSW-----SMEG

Query:  VRGIIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG
        VRGI+ S +EKV    L V SSS  G FE VK    +V++SG+T+GG VEKT+AALE+L +E+LR ++QS+A++SVNMII Y LG
Subjt:  VRGIIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG

XP_038899505.1 uncharacterized protein LOC120086784 [Benincasa hispida]3.0e-6859.07Show/hide
Query:  HIASTLLLLLLLIPSLKILNINGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFMVVLGFVKNVVVF
        H+AST+L LL+LIPSLKIL ING N LS LV+CLF KAP  I++SFLK ++L  +A LSAFQSLAEAL+ I VS I+M   IL S  M VL      VVF
Subjt:  HIASTLLLLLLLIPSLKILNINGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFMVVLGFVKNVVVF

Query:  GSFADSGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLKGSW------SMEGVRGI
        GS  +SG   G L +  KASW E   LEQ+R+I+GS+ + I+ +   IA+SSAGG+F FV   ISTLLNEPGS IG+LV  LK S       SMEGVRGI
Subjt:  GSFADSGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLKGSW------SMEGVRGI

Query:  IGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG
        +GSF+EK+ N   EV SSST G FEIVK +F +V++SGYT+GG +E TRAALE+LR+EE+RGI  S+AK+ VN II YLLG
Subjt:  IGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG

TrEMBL top hitse value%identityAlignment
A0A0A0K6V2 Uncharacterized protein9.5e-6857.8Show/hide
Query:  IASTLLLLLLLIPSLKILNINGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFMVVLGFVKNVVVFG
        + ST LLLLLLIPSLKI+ ING NSLS L +CLF KAP C+++SFLK +KLPA+AFLSAFQSL EAL+SI VS I+M F I+ SF M VL  V N VVFG
Subjt:  IASTLLLLLLLIPSLKILNINGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFMVVLGFVKNVVVFG

Query:  SFAD-SGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLK-------GSWSMEGVRG
        SF + S    G L + TK SW+     EQ+R I+ S  + ++ QV  IA+S AGG+F F  T +ST+ NEPGS IG LVE+LK       GSW MEGV+G
Subjt:  SFAD-SGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLK-------GSWSMEGVRG

Query:  IIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG
        I+   +EK+ N   EV +SST G FEIVK +F +V+DSGY++GG VEKTR  LE+L++E+LRGI+ +IAKISVNM+I YL G
Subjt:  IIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG

A0A5D3BCI5 Uncharacterized protein1.2e-3552.31Show/hide
Query:  MVVLGFVKNVVVFGSFADSGW-GLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLKGSW
        M VL  V N +VFGSF +S     G L + T   W+     EQ+R I+ S  + ++ QV  IA+S AGG+F F  TGIST+ NEP S +G LV  LK S 
Subjt:  MVVLGFVKNVVVFGSFADSGW-GLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLKGSW

Query:  ------SMEGVRGIIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG
              SMEGVRGI+ SF+EK+ N   EV SSS  G FEIVK +  +V+DSGY++GG VEKTR ALE+LR+EELR I+ +IA I VNMI+ YLLG
Subjt:  ------SMEGVRGIIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG

A0A6J1DST0 uncharacterized protein LOC1110235982.1e-5148.11Show/hide
Query:  SSSIPHIASTLLL-LLLLIPSLKILNI------NGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFM
        SS +   AST LL LLL+I SLKILN+      NG NSL   +  LFFK+  C+L+SF   +KLPA A LSAFQ LA+A+R++L+  I+M   I+ SF +
Subjt:  SSSIPHIASTLLL-LLLLIPSLKILNI------NGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFM

Query:  VVLGFVKNVVVFGSFADSGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLK----G
        +VL F+KN  VFGS  +SG   G L +KTK+S+ ES   +Q+R+I+ S+S+ II   L  ASS AG +F FVK  I  LLNEP S IG+LVE +K    G
Subjt:  VVLGFVKNVVVFGSFADSGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLK----G

Query:  SWSMEGVRGIIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG
        S +M+GVR I+ +F+ K+      V SSS  G FE VK    + ++SG T+GG +EK + +LE+L +E LRGI++SI+KI +++I  YL G
Subjt:  SWSMEGVRGIIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG

A0A6J1GPW1 uncharacterized protein LOC1114564118.9e-6657.19Show/hide
Query:  SSSIPHIASTLLLLLLLIPSLKILNINGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFMVVLGFVK
        SS IP +ASTLLLLLLLIPSLK+L INGLNSL  LV CLFFKAP C+LVS L  L+ PA+A L A QSLA+AL+ + VS ++M   I+GSF M VLG +K
Subjt:  SSSIPHIASTLLLLLLLIPSLKILNINGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFMVVLGFVK

Query:  NVVVFGSFADSGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLKG-----SWSMEG
        N VVFGSF + G   G L  K +AS + S  LE +R+I+G VS  I+ +VL  A+SSAGGVFGF  T IS LLN+PGS +G+LV  LKG       SM G
Subjt:  NVVVFGSFADSGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLKG-----SWSMEG

Query:  VRGIIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG
        VRGI+ S +EKV    L V SSS  G FE VK    +V++SG+T+GG VEKT+AALE+L +E+LR ++QS+A++ VNM I Y LG
Subjt:  VRGIIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG

A0A6J1JSB1 uncharacterized protein LOC1114884683.3e-6858.6Show/hide
Query:  SSSIPHIASTLLLLLLLIPSLKILNINGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFMVVLGFVK
        SS IP +ASTLLLLLLLIPSLK+L INGLNSL  LV CLFFKAP C+LVS L  L+ PA+A L A QSLA+AL+ + VS ++M   I+GSF M VLG +K
Subjt:  SSSIPHIASTLLLLLLLIPSLKILNINGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFMVVLGFVK

Query:  NVVVFGSFADSGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLKGSW-----SMEG
        N VVFGSF + G   G L  K KAS + S  LEQ+R+I+G VS  I+  VL  A+SSAGGVFGF  T IS  LN+PGS +G+LV  LKGS      SM G
Subjt:  NVVVFGSFADSGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLKGSW-----SMEG

Query:  VRGIIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG
        VRGI+ S +EKV    L V SSS  G FE VK    +V++SG+T+GG VEKT+AALE+L +E+LR ++QS+A++SVNMII Y LG
Subjt:  VRGIIGSFMEKVGNGVLEVGSSSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCTTTTTCTTCTTCAATTCCTCACATAGCTTCAACTTTGCTCCTCCTTCTCCTCCTAATCCCATCTCTAAAAATCCTCAATATCAATGGCCTCAACTCCTTATC
AAATTTGGTTCTTTGCCTATTCTTCAAAGCCCCACTTTGTATTCTTGTTTCCTTTCTCAAGGGCCTCAAGCTCCCAGCAGATGCCTTTCTCAGTGCATTTCAAAGCCTGG
CTGAAGCATTGAGGTCCATTTTGGTGAGCATAATTGACATGGTTTTTCGAATTTTAGGCTCTTTTTTTATGGTTGTGTTGGGTTTTGTGAAGAATGTGGTCGTTTTTGGT
TCTTTTGCTGACTCTGGTTGGGGACTTGGTGACCTCTTCCAGAAGACTAAGGCTTCTTGGCAAGAGTCTTATTTTTTGGAACAACTGCGGGACATTGTTGGGAGCGTTTC
TCAGAACATCATTCACCAGGTTTTGGGCATTGCCAGTTCTTCTGCAGGCGGAGTGTTCGGGTTTGTAAAGACGGGCATTTCAACGCTGTTGAACGAGCCCGGTTCGGGCA
TCGGACAGTTGGTGGAGTCGTTGAAGGGCAGTTGGTCCATGGAAGGAGTGCGAGGAATTATTGGGAGCTTCATGGAGAAGGTGGGCAATGGGGTTTTGGAAGTGGGGAGT
TCTTCTACAGGTGGTTCGTTTGAAATTGTGAAGGCTATTTTTGGTGTTGTGATGGACTCTGGTTACACTCTTGGAGGGTTCGTGGAGAAGACGAGGGCTGCATTAGAGCT
TCTGCGAATCGAAGAATTACGAGGGATTATGCAGAGTATTGCTAAGATTAGTGTAAATATGATTATTAGATATTTACTAGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCTTTTTCTTCTTCAATTCCTCACATAGCTTCAACTTTGCTCCTCCTTCTCCTCCTAATCCCATCTCTAAAAATCCTCAATATCAATGGCCTCAACTCCTTATC
AAATTTGGTTCTTTGCCTATTCTTCAAAGCCCCACTTTGTATTCTTGTTTCCTTTCTCAAGGGCCTCAAGCTCCCAGCAGATGCCTTTCTCAGTGCATTTCAAAGCCTGG
CTGAAGCATTGAGGTCCATTTTGGTGAGCATAATTGACATGGTTTTTCGAATTTTAGGCTCTTTTTTTATGGTTGTGTTGGGTTTTGTGAAGAATGTGGTCGTTTTTGGT
TCTTTTGCTGACTCTGGTTGGGGACTTGGTGACCTCTTCCAGAAGACTAAGGCTTCTTGGCAAGAGTCTTATTTTTTGGAACAACTGCGGGACATTGTTGGGAGCGTTTC
TCAGAACATCATTCACCAGGTTTTGGGCATTGCCAGTTCTTCTGCAGGCGGAGTGTTCGGGTTTGTAAAGACGGGCATTTCAACGCTGTTGAACGAGCCCGGTTCGGGCA
TCGGACAGTTGGTGGAGTCGTTGAAGGGCAGTTGGTCCATGGAAGGAGTGCGAGGAATTATTGGGAGCTTCATGGAGAAGGTGGGCAATGGGGTTTTGGAAGTGGGGAGT
TCTTCTACAGGTGGTTCGTTTGAAATTGTGAAGGCTATTTTTGGTGTTGTGATGGACTCTGGTTACACTCTTGGAGGGTTCGTGGAGAAGACGAGGGCTGCATTAGAGCT
TCTGCGAATCGAAGAATTACGAGGGATTATGCAGAGTATTGCTAAGATTAGTGTAAATATGATTATTAGATATTTACTAGGTTAA
Protein sequenceShow/hide protein sequence
MSSFSSSIPHIASTLLLLLLLIPSLKILNINGLNSLSNLVLCLFFKAPLCILVSFLKGLKLPADAFLSAFQSLAEALRSILVSIIDMVFRILGSFFMVVLGFVKNVVVFG
SFADSGWGLGDLFQKTKASWQESYFLEQLRDIVGSVSQNIIHQVLGIASSSAGGVFGFVKTGISTLLNEPGSGIGQLVESLKGSWSMEGVRGIIGSFMEKVGNGVLEVGS
SSTGGSFEIVKAIFGVVMDSGYTLGGFVEKTRAALELLRIEELRGIMQSIAKISVNMIIRYLLG