; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017200 (gene) of Snake gourd v1 genome

Gene IDTan0017200
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTail fiber
Genome locationLG02:95389879..95394042
RNA-Seq ExpressionTan0017200
SyntenyTan0017200
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585460.1 hypothetical protein SDJN03_18193, partial [Cucurbita argyrosperma subsp. sororia]9.3e-10291.74Show/hide
Query:  MANSTPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDI
        MA STPS  S+STASK WIRNLS +ASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFP PVVKALLYPGAIVNGLVMNLTVPSWS+LLD+
Subjt:  MANSTPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDI

Query:  YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKV-IRSA
        YNLTN+KEASA+TDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMI+++ICAFSSVKYD+KKV  RSA
Subjt:  YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKV-IRSA

Query:  PARPIAKPLQSSSKSKLK
        PARPIAKPLQSSSKSKLK
Subjt:  PARPIAKPLQSSSKSKLK

XP_016900009.1 PREDICTED: uncharacterized protein LOC103488297 isoform X3 [Cucumis melo]3.5e-10189.86Show/hide
Query:  MANSTPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDI
        MA STPS  STSTASK W+RNLS +ASR+YF LIILQIPLFR+ CRSGMCTTPLHVTSSQLIASEVFP PVVKALLYPGA+VNGLVMNLTVPSWS+L DI
Subjt:  MANSTPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDI

Query:  YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAP
        YNLTN+KEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFG+LLVIWGLVKEGILGKPVNTDP KAVYVYPTMIL++ICAFSSVKYDVKKV+R AP
Subjt:  YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAP

Query:  ARPIAKPLQSSSKSKLK
        ARPIAKPLQSSSKSKLK
Subjt:  ARPIAKPLQSSSKSKLK

XP_022131291.1 uncharacterized protein LOC111004557 [Momordica charantia]6.4e-10391.71Show/hide
Query:  MANSTPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDI
        MA  TPS  STST S+GW+RNLS +ASRI+FFLIILQIPLFR+PCRSGMCTTPLHVTSSQLIASEVFP PVVKALLYPGAIVNGLVMNLTVPSWSNLLDI
Subjt:  MANSTPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDI

Query:  YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAP
        YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFG+LLVIWGLVKEGILGKPVNTDP K+VYVYPTMIL++ICAFSSVKYDVKKV+RSAP
Subjt:  YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAP

Query:  ARPIAKPLQSSSKSKLK
        ARPIAKPLQSSSKSKLK
Subjt:  ARPIAKPLQSSSKSKLK

XP_023002496.1 uncharacterized protein LOC111496320 [Cucurbita maxima]5.4e-10291.55Show/hide
Query:  TPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDIYNLT
        T + +S+STASK WIRNLS +ASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFP PVVKALLYPGAIVNGLVMNLTVPSWS+LLD+YNLT
Subjt:  TPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDIYNLT

Query:  NVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAPARPI
        N+KEASA+TDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMI+++ICAFSSVKYD+KKV RSAPARPI
Subjt:  NVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAPARPI

Query:  AKPLQSSSKSKLK
        AKPLQSSSKSKLK
Subjt:  AKPLQSSSKSKLK

XP_038883986.1 uncharacterized protein LOC120074948 [Benincasa hispida]4.2e-10291.71Show/hide
Query:  MANSTPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDI
        MA ST S  STS ASK WIRNLS +ASRIYFFLIILQIPLFR+PCRSGMCTTPLHVTSSQLIASEVFP PVVKALLYPGAIVNGLVMNLTVPSW+NL DI
Subjt:  MANSTPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDI

Query:  YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAP
        YNLTN+KEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFG+LLVIWGLVKEGILGKPVNTDPAKAVYVYPTMIL++ICAFSSVKYDVKKV+R AP
Subjt:  YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAP

Query:  ARPIAKPLQSSSKSKLK
        ARPIAKPLQSSSKSKLK
Subjt:  ARPIAKPLQSSSKSKLK

TrEMBL top hitse value%identityAlignment
A0A1S3BCZ0 uncharacterized protein LOC103488297 isoform X41.7e-10189.86Show/hide
Query:  MANSTPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDI
        MA STPS  STSTASK W+RNLS +ASR+YF LIILQIPLFR+ CRSGMCTTPLHVTSSQLIASEVFP PVVKALLYPGA+VNGLVMNLTVPSWS+L DI
Subjt:  MANSTPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDI

Query:  YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAP
        YNLTN+KEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFG+LLVIWGLVKEGILGKPVNTDP KAVYVYPTMIL++ICAFSSVKYDVKKV+R AP
Subjt:  YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAP

Query:  ARPIAKPLQSSSKSKLK
        ARPIAKPLQSSSKSKLK
Subjt:  ARPIAKPLQSSSKSKLK

A0A1S4DVJ9 uncharacterized protein LOC103488297 isoform X31.7e-10189.86Show/hide
Query:  MANSTPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDI
        MA STPS  STSTASK W+RNLS +ASR+YF LIILQIPLFR+ CRSGMCTTPLHVTSSQLIASEVFP PVVKALLYPGA+VNGLVMNLTVPSWS+L DI
Subjt:  MANSTPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDI

Query:  YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAP
        YNLTN+KEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFG+LLVIWGLVKEGILGKPVNTDP KAVYVYPTMIL++ICAFSSVKYDVKKV+R AP
Subjt:  YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAP

Query:  ARPIAKPLQSSSKSKLK
        ARPIAKPLQSSSKSKLK
Subjt:  ARPIAKPLQSSSKSKLK

A0A5A7VF29 Uncharacterized protein1.7e-10189.86Show/hide
Query:  MANSTPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDI
        MA STPS  STSTASK W+RNLS +ASR+YF LIILQIPLFR+ CRSGMCTTPLHVTSSQLIASEVFP PVVKALLYPGA+VNGLVMNLTVPSWS+L DI
Subjt:  MANSTPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDI

Query:  YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAP
        YNLTN+KEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFG+LLVIWGLVKEGILGKPVNTDP KAVYVYPTMIL++ICAFSSVKYDVKKV+R AP
Subjt:  YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAP

Query:  ARPIAKPLQSSSKSKLK
        ARPIAKPLQSSSKSKLK
Subjt:  ARPIAKPLQSSSKSKLK

A0A6J1BP64 uncharacterized protein LOC1110045573.1e-10391.71Show/hide
Query:  MANSTPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDI
        MA  TPS  STST S+GW+RNLS +ASRI+FFLIILQIPLFR+PCRSGMCTTPLHVTSSQLIASEVFP PVVKALLYPGAIVNGLVMNLTVPSWSNLLDI
Subjt:  MANSTPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDI

Query:  YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAP
        YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFG+LLVIWGLVKEGILGKPVNTDP K+VYVYPTMIL++ICAFSSVKYDVKKV+RSAP
Subjt:  YNLTNVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAP

Query:  ARPIAKPLQSSSKSKLK
        ARPIAKPLQSSSKSKLK
Subjt:  ARPIAKPLQSSSKSKLK

A0A6J1KLG8 uncharacterized protein LOC1114963202.6e-10291.55Show/hide
Query:  TPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDIYNLT
        T + +S+STASK WIRNLS +ASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFP PVVKALLYPGAIVNGLVMNLTVPSWS+LLD+YNLT
Subjt:  TPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDIYNLT

Query:  NVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAPARPI
        N+KEASA+TDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMI+++ICAFSSVKYD+KKV RSAPARPI
Subjt:  NVKEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAPARPI

Query:  AKPLQSSSKSKLK
        AKPLQSSSKSKLK
Subjt:  AKPLQSSSKSKLK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G80200.1 unknown protein5.9e-3034.51Show/hide
Query:  ASKGWIRNLSPVASRIYFFLIILQIPLFR-----------------------------VPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNG
        + K W   +S +AS ++  LI+ QIPLFR                             V CR+  C TPL V SS+LIA+++ P  +VK LLYPGAI   
Subjt:  ASKGWIRNLSPVASRIYFFLIILQIPLFR-----------------------------VPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNG

Query:  LVMNLTVPSWSNLLDIYNLTNV-KEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVN---TDPAKAVYVYPTMILS
        L     +PS+ +L   Y+   + + +S  TD+  LEV AGS   + GA + L KP R++  G+LL+ WGL+++ +L    +   +    +V VYPT+ L+
Subjt:  LVMNLTVPSWSNLLDIYNLTNV-KEASAVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVN---TDPAKAVYVYPTMILS

Query:  LICAFSSVKYDVKKVIRSAPARPIAK
         + AF S++ DV+K+IR   +  ++K
Subjt:  LICAFSSVKYDVKKVIRSAPARPIAK

AT5G11280.1 unknown protein2.9e-9380.3Show/hide
Query:  SKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDIYNLTNVKEASAVTD
        SKGWI+  + +AS +YF LI+ QIPLFRVPCRSGMC++P+HVTSSQLI+SE+FPVP++KALLYPGA+VNGL +N+T P W N+LDIYNLTNVKEASAVTD
Subjt:  SKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDIYNLTNVKEASAVTD

Query:  LQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAPARPIAKPLQSSSKS
        LQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLL++WGLVKEGILGKPVNTDPAK VYVYPTM+L++ICAFS +KYD++K  R+APARPIAKPL SSSKS
Subjt:  LQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAPARPIAKPLQSSSKS

Query:  KLK
        KLK
Subjt:  KLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAACTCTACGCCATCATCGGCATCAACATCAACAGCATCAAAAGGTTGGATCAGAAATCTTTCACCAGTTGCTTCTCGCATCTATTTCTTCCTTATCATACTTCA
GATCCCTCTCTTCAGGGTCCCATGCAGATCTGGCATGTGTACAACCCCTTTGCACGTGACTTCATCGCAGTTGATTGCAAGTGAAGTCTTTCCTGTACCTGTAGTTAAGG
CACTTCTCTATCCTGGAGCAATTGTGAATGGCCTTGTCATGAACTTGACAGTTCCTAGCTGGAGTAACTTGTTAGACATATATAACTTGACCAATGTGAAGGAAGCCTCT
GCTGTGACCGATCTTCAACGTTTAGAGGTTCTTGCCGGAAGCTATTTTTCAGTTGCTGGAGCCTTTGTGGGTCTTTTGAAGCCTGGGAGAATGAGCATGTTTGGAAGTCT
CTTGGTAATTTGGGGTCTTGTTAAGGAAGGAATCCTCGGAAAACCTGTGAACACAGATCCTGCCAAAGCTGTTTATGTTTATCCTACAATGATTCTTTCCCTGATCTGTG
CTTTCTCATCGGTTAAGTATGATGTGAAGAAGGTAATTAGAAGTGCCCCTGCTCGACCAATTGCAAAGCCACTTCAAAGCTCATCAAAATCTAAGCTTAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATTTCCAATTTCAACATCCTTCGGTTCAGAACCTTCACGATCGATTTCTAATTAGACCGCAATTACCAAATCCTATTTCTTTCATCTTCCTCGTCTGACCTTTTTTCTCA
AGTTGTCAACAATGGCAAACTCTACGCCATCATCGGCATCAACATCAACAGCATCAAAAGGTTGGATCAGAAATCTTTCACCAGTTGCTTCTCGCATCTATTTCTTCCTT
ATCATACTTCAGATCCCTCTCTTCAGGGTCCCATGCAGATCTGGCATGTGTACAACCCCTTTGCACGTGACTTCATCGCAGTTGATTGCAAGTGAAGTCTTTCCTGTACC
TGTAGTTAAGGCACTTCTCTATCCTGGAGCAATTGTGAATGGCCTTGTCATGAACTTGACAGTTCCTAGCTGGAGTAACTTGTTAGACATATATAACTTGACCAATGTGA
AGGAAGCCTCTGCTGTGACCGATCTTCAACGTTTAGAGGTTCTTGCCGGAAGCTATTTTTCAGTTGCTGGAGCCTTTGTGGGTCTTTTGAAGCCTGGGAGAATGAGCATG
TTTGGAAGTCTCTTGGTAATTTGGGGTCTTGTTAAGGAAGGAATCCTCGGAAAACCTGTGAACACAGATCCTGCCAAAGCTGTTTATGTTTATCCTACAATGATTCTTTC
CCTGATCTGTGCTTTCTCATCGGTTAAGTATGATGTGAAGAAGGTAATTAGAAGTGCCCCTGCTCGACCAATTGCAAAGCCACTTCAAAGCTCATCAAAATCTAAGCTTA
AGTGAGTCGAAATGTAATTTCCCTCTACCGGTTTTGTTTGATTCTCACTCCTGACGAGTTCATGCATTATTTATGCATGCTGAGATTCATTTTCTGGATCTATTAGTTCA
AGTTGTGTTGTGATATACAAATTTCTTTCCAGATCTTCTTTTTAGCACTTCCCGACTTTTTCAACTTGAATTGGTAGAACTGCATCTCTCTGGTTCAACTATGTGAATCA
GAAAGAGTGGTTCCTGGTGATCACTTCTGATTCCTTCTTCTCTATGACCTCAAAACCAAATTT
Protein sequenceShow/hide protein sequence
MANSTPSSASTSTASKGWIRNLSPVASRIYFFLIILQIPLFRVPCRSGMCTTPLHVTSSQLIASEVFPVPVVKALLYPGAIVNGLVMNLTVPSWSNLLDIYNLTNVKEAS
AVTDLQRLEVLAGSYFSVAGAFVGLLKPGRMSMFGSLLVIWGLVKEGILGKPVNTDPAKAVYVYPTMILSLICAFSSVKYDVKKVIRSAPARPIAKPLQSSSKSKLK