; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021193 (gene) of Snake gourd v1 genome

Gene IDTan0021193
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationLG10:53929047..53943507
RNA-Seq ExpressionTan0021193
SyntenyTan0021193
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053639.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.0e-3245.75Show/hide
Query:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQEAYLIKILNENGDVFQWP
        M WL+SL   ++ W  L +    + R +++K DPSLTK+ VSLK +MK+W   DQ FL+E + +E     + E   G  +E E  +  +L +   VF+WP
Subjt:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQEAYLIKILNENGDVFQWP

Query:  NQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPS
           PP+R +DHHI+LK GTNPVN+RP  Y   QK E++RLVDEM++SG+IRPS
Subjt:  NQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPS

KAA0056478.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]5.8e-3248.05Show/hide
Query:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQEAY-LIKILNENGDVFQW
        M WL+SL    + W  LT+    E + I +K DPSLTKS +SLK M K W D D+ FL++ + ++ R E   E +N      EA  L  +L + GDVF W
Subjt:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQEAY-LIKILNENGDVFQW

Query:  PNQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPS
        P + PPRR+++H IHLKEGTNP+N+RP  Y   QK E+++LV+EM+TSG+IRPS
Subjt:  PNQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPS

KAA0063463.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.4e-3247.44Show/hide
Query:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQ-EAYLIKILNENGDVFQW
        M WL SL      W  LT+      + I +K DPSLTK+ VSLK ++K WE+HD  +L+E + +E     +++ S+  +KE+ +  LI ILN+  DVF+W
Subjt:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQ-EAYLIKILNENGDVFQW

Query:  PNQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPSAT
        P + PPRR ++H IHLKEGTNPVN+RP  Y   QK E+++LV+EM+ SG+IRPSA+
Subjt:  PNQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPSAT

TYK22240.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.4e-3247.44Show/hide
Query:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQ-EAYLIKILNENGDVFQW
        M WL SL      W  LT+      + I +K DPSLTK+ VSLK ++K WE+HD  +L+E + +E     +++ S+  +KE+ +  LI ILN+  DVF+W
Subjt:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQ-EAYLIKILNENGDVFQW

Query:  PNQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPSAT
        P + PPRR ++H IHLKEGTNPVN+RP  Y   QK E+++LV+EM+ SG+IRPSA+
Subjt:  PNQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPSAT

TYK30083.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.4e-3247.44Show/hide
Query:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQ-EAYLIKILNENGDVFQW
        M WL SL      W  LT+      + I +K DPSLTK+ VSLK ++K WE+HD  +L+E + +E     +++ S+  +KE+ +  LI ILN+  DVF+W
Subjt:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQ-EAYLIKILNENGDVFQW

Query:  PNQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPSAT
        P + PPRR ++H IHLKEGTNPVN+RP  Y   QK E+++LV+EM+ SG+IRPSA+
Subjt:  PNQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPSAT

TrEMBL top hitse value%identityAlignment
A0A5A7UJK0 Ty3/gypsy retrotransposon protein9.7e-3345.75Show/hide
Query:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQEAYLIKILNENGDVFQWP
        M WL+SL   ++ W  L +    + R +++K DPSLTK+ VSLK +MK+W   DQ FL+E + +E     + E   G  +E E  +  +L +   VF+WP
Subjt:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQEAYLIKILNENGDVFQWP

Query:  NQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPS
           PP+R +DHHI+LK GTNPVN+RP  Y   QK E++RLVDEM++SG+IRPS
Subjt:  NQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPS

A0A5A7UN05 Transposon Tf2-1 polyprotein isoform X12.8e-3248.05Show/hide
Query:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQEAY-LIKILNENGDVFQW
        M WL+SL    + W  LT+    E + I +K DPSLTKS +SLK M K W D D+ FL++ + ++ R E   E +N      EA  L  +L + GDVF W
Subjt:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQEAY-LIKILNENGDVFQW

Query:  PNQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPS
        P + PPRR+++H IHLKEGTNP+N+RP  Y   QK E+++LV+EM+TSG+IRPS
Subjt:  PNQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPS

A0A5A7V5U6 Ty3/gypsy retrotransposon protein1.7e-3247.44Show/hide
Query:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQ-EAYLIKILNENGDVFQW
        M WL SL      W  LT+      + I +K DPSLTK+ VSLK ++K WE+HD  +L+E + +E     +++ S+  +KE+ +  LI ILN+  DVF+W
Subjt:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQ-EAYLIKILNENGDVFQW

Query:  PNQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPSAT
        P + PPRR ++H IHLKEGTNPVN+RP  Y   QK E+++LV+EM+ SG+IRPSA+
Subjt:  PNQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPSAT

A0A5D3DFT1 Ty3/gypsy retrotransposon protein1.7e-3247.44Show/hide
Query:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQ-EAYLIKILNENGDVFQW
        M WL SL      W  LT+      + I +K DPSLTK+ VSLK ++K WE+HD  +L+E + +E     +++ S+  +KE+ +  LI ILN+  DVF+W
Subjt:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQ-EAYLIKILNENGDVFQW

Query:  PNQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPSAT
        P + PPRR ++H IHLKEGTNPVN+RP  Y   QK E+++LV+EM+ SG+IRPSA+
Subjt:  PNQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPSAT

A0A5D3E1V9 Ty3/gypsy retrotransposon protein1.7e-3247.44Show/hide
Query:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQ-EAYLIKILNENGDVFQW
        M WL SL      W  LT+      + I +K DPSLTK+ VSLK ++K WE+HD  +L+E + +E     +++ S+  +KE+ +  LI ILN+  DVF+W
Subjt:  MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQ-EAYLIKILNENGDVFQW

Query:  PNQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPSAT
        P + PPRR ++H IHLKEGTNPVN+RP  Y   QK E+++LV+EM+ SG+IRPSA+
Subjt:  PNQFPPRREVDHHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPSAT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTTGGTTGAATAGCTTATGGAAAATAGACATTCACTGGCCAACCCTCACGATTAAAGTGGAGATGGAGAACCGAGTGATTATTTTGAAAAGTGACCCAAGCTTAAC
TAAGTCAAATGTATCCCTTAAGAAGATGATGAAAGCATGGGAGGACCATGATCAACGCTTTTTAATGGAATTAAAAATGTTAGAGGCAAGGCCAGAGGGCAAGGTGGAAG
TAAGCAATGGGGGAGACAAAGAACAGGAAGCATATTTGATTAAAATATTAAATGAGAATGGAGACGTGTTTCAATGGCCAAATCAGTTCCCTCCACGAAGGGAAGTAGAT
CATCACATACATTTAAAAGAAGGAACCAATCCAGTGAATTTGAGGCCAAATTGTTATACACAATTACAAAAAGGAGAGATCAAAAGGTTAGTTGATGAAATGATGACTAG
TGGAGTGATAAGGCCTAGTGCAACCCATATTCGAGTCCAGTACTCTTGGTAA
mRNA sequenceShow/hide mRNA sequence
GTTATTTCTTAGAATTCACACTCAAAAGTCTTGGATCTGTTTCAAAAAAAAAAAAAAAAAGTCTTGGATTGGATCAGAATGTGATCAAATTCATGGATGGCGTTCCATAT
TCTTAATCTAAATTTCCATCTGCCCCTTTGCTCTTAAAAGCTCTCTAAATGGCTGAAGCTTCATCGTCGACTTCTCCAGCCATATCTACAACAACTATTCAAGCAAAATA
TGATGTGTTCTTGAGTTTTCGAGACCGACGGTAGTTATGCAGAAGCATTTGGTCAACACTTGGAGACTCTCAAGGGTGGGCAAGATGAAATACAAAGGTGGAAGGATGCA
CTAACAGAATTAGCAAATTTGTCTGGTTGGGATTTACTTGATCGGTAAATAAGCTAAGTATATGACTTTTTAAAAAAAAACTTTTATTTTGTTGCTCTCAAGAATTTTGC
CCATTTGAGTATTTTTAAGTTTTGAGTTCAACAAAATTGGATAGGCATAATTAATATATATTTTCCTAAAAAAAGTTAGGGGAAAAAATAATTTTTTAAAAACTTAGCAA
GATATATCAATTTCTAACTAGCTTAATTGTTGTAATCTTTACACTAAATTTGATTAAGTATTACAAATTTTACCACTAAATAGATAAAATCTCTTATAAAAATATATAAA
TGTACATTTACTGTTGTGATAAAACAGATGTCTAAAATAAAATTGATGGACTTAAGATAAAAATTGTACCACAGTTGAAATTTTTGTGGGTTTTAACAGATTCATTTATG
GTTAATTTTAGGCATCCATCAACTTAAATCTTATGTGAGGTTTGGTTTTTTTTTTTTAGGAAAATTTTTTACTTTGTTTTATTGTTAAAATTAGTAGGTAAATTTAACGG
AAGTGGAACATAACCCAATAATTGCAACGTTGGACTAAGTTTAGAGATCACAATTAATTACTATGTGTTTTATTTATTTATTTATCTTTTGAAAAGGAGAAATTAATTAA
CGACCACCAACCAAGCCAACAGTAAAAAAAAAAAAAGAAAGAAAAGAAAAGAGAAAAAATAAACCAATCCTCATTAAAGAAATTTACAATTCTGCCTCTTCCTTTTGATT
TGTTCCACCAAGCATCGGTACGAGTTCGAAGAGAGGGGTCCAAATTGTGTGACTCGAATGGCTCTCCATCGAAGCCGACAACAGGCATTTCGTTTCATGCTACTCTCCAC
CAAACTTTTCTAGACAATATGCCCATCTTACAAATTGAGGAAAATCAGCTGGATTACCACTTGAATTTAGGAAATCTCTGTAGGACCACGTCATTGGAATGGCTTTTCCA
CAAAGAGTGAGAAGCCTCCTCATCAGAAGATGAAGGTTGGTCCCTTTTCGACCTTTAGTTTAGATAGCTAGATGATGATCTCTCTTGACAGAGAAGGCTCCTTTTGCATC
TGGACCCCAGCCCAAATAATAGGGTCACTGGAACCTGGACGGCTAGATGGCTTATTCAGGATATCATTCACATTTAAATTAATATTTGAACTCTTTTAAATATTTAATTC
TCTCATATAATTTAATATGAATCATATTCACATTAAATTTATAATATAAAGTTTAAGTTCCAAAACTTTATATTATAACGTATTTATATACATTAAATTTATTTCCTAAT
GAATTTGAACATTTCAAATTCAAACGATATAAGAACCCTTTACGAGCTAGAAGGTGGACCTAATGGACTCTACTGGTCGGAAGGCTCCAACGATACTGAAATTATTCTTG
TTAATCTCATTGACCTCCTAATCACCATTCGTTAATCTGAGAACACTCCACTAAAGTCTCACAAATTTGCACTCTTCTCACTCGCAGATATTTTTACGTGGCCACAAATA
TTGACCAATAACAAAGAAGTCAATCCTTCACGAATGTTAAGTAACACCAACCAGGTCAAATTACCGCTTTACCCCAGGGTTACATCTTGTACCTTAAGTACCAGTTCTCC
TCTAATGAACAATTTGTTTGTGGTCCTACCAACAAACAGAAAGTCCTTTTCGGGCCAATAAGAGGGTTGGGCCCTTTGTTCAAGTCCCGGAGACACTACTTAAGGGAACA
TTCATCTACCCCTAGTAGGCAAGAAGAATTGAATTCCATCTTGCTAAGTAAGTCCCCAGTCGCTCACTCGATCTTGTCCCCAAGAAGGTAGGCATATTGAGTCGGCGAAT
TTGGCCACTCTCACCCATACTAGTCAAAGGACAATCTCTCGCAAATAGGAGTTTGTAACCTGCTCAGGATTGAGATCAAGTTGCCTAGGTCATCATAGTGAAATAGAAAC
CTAACTAGTCAACGGAGTTACATCTAGCGGTTACTATTTCGCGGTTCGGTCTTATGTAATCTCATTACATAGGATATCTCCACTCACATGTCATCTATACGAACAGGATA
GGATCACAGTGTTTGTATCATATACAAAGTGGGTCACATCCATAGTGTTACCATGGTAAGGTACCCAAACCTTATCTCCTTACTATAGACTCTTTAGGTTGTATCTCGAA
CTGAGATTCTTTATATGTACACTACATTCAGTTCAAGAGACATTTTACAACATTGGATGTTAGTTTATTAGATTTAGGGTTAAAATAAGGGCAATGTCGTAAATAACGAA
CAATAAATCTTTATTGAAATAATAACTATTTATTACAATAATTAGAACAAAATTACAAAACTACGAGTTTTAGGGCACAAACCCCAACAGGATTGATGTGGTGTTAGGCA
TGACTTGGTTGAATAGCTTATGGAAAATAGACATTCACTGGCCAACCCTCACGATTAAAGTGGAGATGGAGAACCGAGTGATTATTTTGAAAAGTGACCCAAGCTTAACT
AAGTCAAATGTATCCCTTAAGAAGATGATGAAAGCATGGGAGGACCATGATCAACGCTTTTTAATGGAATTAAAAATGTTAGAGGCAAGGCCAGAGGGCAAGGTGGAAGT
AAGCAATGGGGGAGACAAAGAACAGGAAGCATATTTGATTAAAATATTAAATGAGAATGGAGACGTGTTTCAATGGCCAAATCAGTTCCCTCCACGAAGGGAAGTAGATC
ATCACATACATTTAAAAGAAGGAACCAATCCAGTGAATTTGAGGCCAAATTGTTATACACAATTACAAAAAGGAGAGATCAAAAGGTTAGTTGATGAAATGATGACTAGT
GGAGTGATAAGGCCTAGTGCAACCCATATTCGAGTCCAGTACTCTTGGTAAAGAAGAAAGATGGTGATGGAGATTATGTGTCGACTATAGAGCCTTAAATAATGTGATCA
TACCAGACAAATTTCTCATTCCAGTAGTAGAAGAGTTATGATCATTCGTTACACGCAAATGCATGCGATCGAAGTAATAAAATTGGAGTACTCCAGAGTATCAAACGCAC
AAGGAACGGGTTTCACCAAGTTACTTCGGGAAAACAAGAGTTGCTACGAAGATGCAAGTGTGGTAACAAAGTTTTTAAGAATGTTTTGGTCGTAAAGAATTGTTGCTAAT
ATGTAAAGTAAACAATGAGATAAAAGAAGAGAATTCAAGTTGTTTGCGCCCCTTCGAAGAAAATGTTGGCGAAAATGCAGTAGCAATTGTGCCAACAATCCTAAATCTAA
GTAAATCGGGTTCGACTCCTATTCGTTTTTGCTCCCAAAACCATTAGGGGACTAAACAATTAAACCTACATGTTTGTGGCTAATTGAGCTTAGATGGTTCCGCACACAAC
CCTTTTGATTTTGGAGCCTCGCCCTATCTATGTCTAAGGTTGAAGTTTCAGATCTTATGTCTATGTCATTTCCTCCGTTGGTTTTATGAGATCCATTATCCCTTGACCCT
TTCAGTATCAAGTTCGAGCTTGATGGACAAATCAGTGATATGCTAGGGTTACCCAAATAATATTCAATAAAAATAAACTTAGAGTTGGTTGCCGACACAACTACTTTACT
AATTTAAGTTTGGCAACTCTCTTCTAAGGCATTCACAACTTAAACCCCTAAAAAGGATTTAGCACATGGTTTAAATAAAGGAAAATATTGTCTTGGTTGATGGAAATGTA
AAAGACATCTTGCAAATAGAAGGTAAAAGACGACAATAAGAGGAATAATAAAGAAACTATAAATTGGAAAATAAATGGAGGCCTCACCCCTTACCTTTTATTAAATACAT
AACTCAATCTAAATCAGTATGGGTTGGATTAAACTCGAATACAGAAAGTTTTGGAGTGTAGCATGCGATGAACGAGAAAAATCTCGAGAATCCTTTCGTAGATCGACTTC
AAAATCCCAAGGGAAGTTAAGAATCTTTACCTCCAATACTTTCGTAAGGAGGGATCAACCAAGGATTCGTTACAAACTAAGAAATAAGAAAAACTTGTTGGAATAAAAAT
ACTGGAAACAAAATCTTGTTGTTATGGATACAAAGTAACGAAAAACTTGAAAAACAAATGGAAAAGAAATTGATCTAATAACCTTCCTCCCGATTGCCTCTTTCTCTAGG
GTTTCTTGTGTATTTTTAGGTGTTTTCCTTTAGGGTTTCCTTGTTTGGACTCCCATGTATGGTGGAAAATCAATAAATCACGAATCTGGGTTGTCAACGGTTCCAAAAAT
GGCAAAAATCTTGTAATTCCTGCGACATCCCTTTTCTGCAGATTCTTACAATGCAGTCCACATTTTAGCTCGCATTTTGTCTTGTCTGGTCTTGTAAGCCCTAAACTCTA
TTTTCTTTTATCATTTTGCACTCATTCATGATGTTTTGAAATATGCAAAATCGTATAGGTTGCACTAAATGATGTCGTGATCGGATTTGGAAATCTATTGAATTTGGAGC
TACAATGTGTCTAAGCTTCATAAGAAGATAGGAGCTGGCCAAGGGAGGTGAAGCCTCGGAGTTAAACATGGCAAGTAGGTCTTTGTCAAATTCATGAGAAAAACAGCAAG
CAGCTGGAAAATGGGTCATCTTCAAACGATCATATCTCTCAATTTACTGAGTCAAATTCGTTGATTTGTAAGTCAAAATCTCATAAATTGAGTTTAGAATCCGAAACTTG
TCGACTTTTGGAAAAATACAGAGAGGACGAAGGAGAAAAGTGTAGAACAGTGAGAGGTGTTCAAAGTTCTCGGGTAACAGAACAGTGTGAGGAGTTCAGTCTGTTGTGAG
CTCATTAGGACGAATTAGAAGCTGATATTTTGAGGTGCAAGGGAAAAACTGCTAGCAAGAGATTGAAGGAGATTCTCTTCCATTTCTATATACCTTACAACCAATTGTGT
GGCAAACGGAGATAGGAAAAGCGTATAACCCTTGGAACTCTTTTGGTGATTGTAGCAGACGAGTTTCCAGTGCTTAGCCTTGACAAGACCTATTTGCCACCACGCCATTT
AGCTCCTTGACCATGGCTCGAGGTGGCAAACGAGGCAGGCAAGTGGAGACTGGAACTCAAGAAGCTACTGGTGATAGAGGATGAGAGGTATCAAAGGGAGAGTCTAGTCA
TCCTCAGCAAGAGGTGAACATGGAGGAACAGATCTTTACGAGGATAACTCAAAGATTAGCTGAAAGTGTTGGATCAACACAAGCAGATCCAGAAAAGAAGTATGGCATTG
AAAGACTGAAAGGCTAGGTGCAATAATATTTGAAGGCACGACAGATCCCGCCGATGCTGAGGTTTGGTTAAATCTGATTGAGAAGTGCTTTAGGGTCATGCGATGCCCTG
AAGACAGGAAGGTCGATTTAGCGAAGATTCTTACTCCGAGAAAGGAGCGGATGATTGGTGGAAGATAACGAGAGAGCGAGAAAAGGGAAGCTTGGAGTGGAAAAGAGTTT
CGAAAGGCCTTTGAAGATAAGTATTATCCGAGATCTTATCGTGATGCAAAGAGGAACGAGTTTCTAGGACTTGTTCAGGGATCGATGACAGTAAGAGAATATGAGAAGAA
GTTCACAGAGTTATCAAAGTATGCTAGCACTATTGTTGCAAACGAGACAGATCGATGTAAGAGGTTCGAGGATGGTCTACGAACAGAGATCGTAACGCTTTGTGCAAAGT
CAAGTTCGGAGTGGGTTGAGTTCTCCAAGCTTGTTGAGACGACATTACGGGTAGAAACGAAGCTTTGTAGATGACGAGAATGGGAAAAGGGG
Protein sequenceShow/hide protein sequence
MTWLNSLWKIDIHWPTLTIKVEMENRVIILKSDPSLTKSNVSLKKMMKAWEDHDQRFLMELKMLEARPEGKVEVSNGGDKEQEAYLIKILNENGDVFQWPNQFPPRREVD
HHIHLKEGTNPVNLRPNCYTQLQKGEIKRLVDEMMTSGVIRPSATHIRVQYSW