; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0027121 (gene) of Chayote v1 genome

Gene IDSed0027121
OrganismSechium edule (Chayote v1)
DescriptionReverse transcriptase
Genome locationLG07:32849063..32850762
RNA-Seq ExpressionSed0027121
SyntenySed0027121
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR032567 - LDOC1-related
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037906.1 reverse transcriptase [Cucumis melo var. makuwa]4.5e-3130.17Show/hide
Query:  MPVTR-AERGHPRGGRGVRIGSDTVGTSSQSRGPIAIRCSSPVEPDGKSPELGSSGDPFVSKRRARRHPTQQSQRPLRRKWREGADELGKWLKDFVKWNP
        MP  R A RG  RGGRG   G      ++ +  P A   ++PV+    +P       P          P Q S                K L+DF K+NP
Subjt:  MPVTR-AERGHPRGGRGVRIGSDTVGTSSQSRGPIAIRCSSPVEPDGKSPELGSSGDPFVSKRRARRHPTQQSQRPLRRKWREGADELGKWLKDFVKWNP

Query:  DKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPATWEFFKTEFKAKYISDEAQEKMVALFQNLQQGSDSI
          FD + D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+            TWE FK  F AK+ S   +   +  F NL+QG  ++
Subjt:  DKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPATWEFFKTEFKAKYISDEAQEKMVALFQNLQQGSDSI

Query:  EEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIYNGKYPVERIVAAPAKDCPTVVG--REEEGGP-----------
        E+Y+ +F +   F P ++     +  KF+ GLR  +   VRA  P T+A A+++A   +     ER  ++ A    + +G  R+ E  P           
Subjt:  EEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIYNGKYPVERIVAAPAKDCPTVVG--REEEGGP-----------

Query:  ---RQIPACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSCDKPAMAIQITQQSSQQAG
           R++PAC TCGRVH G C  G   CFRC + GH    C +        Q S+ Q G
Subjt:  ---RQIPACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSCDKPAMAIQITQQSSQQAG

KAA0047534.1 gag protease polyprotein [Cucumis melo var. makuwa]4.2e-2928.89Show/hide
Query:  LVPHSVGPLQLVSELRVINSIIASMPVTRAERGHPRGGRG-----VRIGSDTVGTSSQSRGPIAIRCSSPVEPDGKSPELGSSGDPFVSKRRARRHPTQQ
        L+  S G  +L+     I  +I  MP  R  R   RGGRG     V++    V  ++    P+     + +E   +           + + R ++ P   
Subjt:  LVPHSVGPLQLVSELRVINSIIASMPVTRAERGHPRGGRG-----VRIGSDTVGTSSQSRGPIAIRCSSPVEPDGKSPELGSSGDPFVSKRRARRHPTQQ

Query:  SQRPLRRKWREGADEL---GKWLKDFVKWNPDKFDAT-GDALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPATWEFF
        +  P         D+L    K L+DF K+NP  FD +  D   A  W++ LE  F  M CP+ Q+ +CA  +L      WWE+T           TW+ F
Subjt:  SQRPLRRKWREGADEL---GKWLKDFVKWNPDKFDAT-GDALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPATWEFF

Query:  KTEFKAKYISDEAQEKMVALFQNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIYNGKYPVERIVA
        K  F AK+ S   ++     F NL+QG  ++E+Y+ +  +   F P++IAT   +  KF+ GLR  +   VRA    T+A A++LA   + +        
Subjt:  KTEFKAKYISDEAQEKMVALFQNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIYNGKYPVERIVA

Query:  APA-----KDCPTVVGREEEGGPRQIPACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSC
        A       +  P   G    G     P C TCG+ H G C  G R CF+C +EGH    C
Subjt:  APA-----KDCPTVVGREEEGGPRQIPACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSC

TYK05193.1 pol protein [Cucumis melo var. makuwa]5.4e-2930.03Show/hide
Query:  PVTRAERGHPRGGRGVRIGSDTVGTSSQSRGPIAIRCSSPVEPDGKSPE------LGSSGDPFVSKRRARRHPTQQSQ--RPLRRKWREGADEL---GKW
        P   A RG  RGGRGV  G        Q   P A+  ++PV  D  + E      L ++  PF++ ++ +  P Q      P   + +    +L    K 
Subjt:  PVTRAERGHPRGGRGVRIGSDTVGTSSQSRGPIAIRCSSPVEPDGKSPE------LGSSGDPFVSKRRARRHPTQQSQ--RPLRRKWREGADEL---GKW

Query:  LKDFVKWNPDKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPA--TWEFFKTEFKAKYISDEAQEKMVAL
        L+DF K+NP  FD + D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER  G   +  TWE FK  F AK+ S   +   +  
Subjt:  LKDFVKWNPDKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPA--TWEFFKTEFKAKYISDEAQEKMVAL

Query:  FQNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIYNGKYPVERIVAAPAKDCPTVVGREEE-----
        F NL+QG  ++E+Y+ +F +   F P ++     +  KF+ GLR  +   VRA  P T+A A+++A   +     ER+ ++ A    T +G++ +     
Subjt:  FQNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIYNGKYPVERIVAAPAKDCPTVVGREEE-----

Query:  ----------GGP---------------RQIPACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSCDKPAMAIQITQQSSQQAG
                  GG                R++PAC TCGRVH G C  G   CFRC + GH    C +        Q S+ Q G
Subjt:  ----------GGP---------------RQIPACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSCDKPAMAIQITQQSSQQAG

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]4.2e-2932.01Show/hide
Query:  KWLKDFVKWNPDKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPATWEFFKTEFKAKYISDEAQEKMVAL
        +++KDF ++ P  FD   + A A   WI  LE  +  + C D  + + A  +L G A  WW+S  +     + P  W  FK      Y  +  ++   A 
Subjt:  KWLKDFVKWNPDKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPATWEFFKTEFKAKYISDEAQEKMVAL

Query:  FQNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIY-----NGKYPVERIVAA-------PAKDCPT
        F +L QG+ S+ +YERKF+    F  +LI T  LKI +F+ GLR+G+   V  Q P TYA AV+ A +      N   P+  + ++       P+     
Subjt:  FQNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIY-----NGKYPVERIVAA-------PAKDCPT

Query:  VVGREEEGGPRQ--IPACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSCDKPAMAIQITQQSSQQAGCSLLNRQRPSV
        V+   +     Q   P C TC + H G C TG + CFRCG+EGH  R C   A   Q   Q       +  N QR  V
Subjt:  VVGREEEGGPRQ--IPACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSCDKPAMAIQITQQSSQQAGCSLLNRQRPSV

XP_038891712.1 uncharacterized protein LOC120081110 [Benincasa hispida]3.8e-3029.66Show/hide
Query:  PTQQSQRPLRRKWREGADELGKWLKDFVKWNPDKFDAT-GDALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPATWEF
        P  Q+Q  ++ +   G     K L+DF K+NP  F+ +  D   A  WI+ +E  F  M CP+ Q+ +CA  +L   A+ WW+        G  P TWE 
Subjt:  PTQQSQRPLRRKWREGADELGKWLKDFVKWNPDKFDAT-GDALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPATWEF

Query:  FKTEFKAKYISDEAQEKMVALFQNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLAT------------
        FK  F AKY S   +      F  L+QG  S+EEY+++F     F P+L+AT  ++  +FI GL++ +   V+A  P T+  A++LA             
Subjt:  FKTEFKAKYISDEAQEKMVALFQNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLAT------------

Query:  IYNGKYPVERIVAAPAKDCPTVVGREEEGGPRQI-----------------PACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSCDKPAM
        ++       +   A  KD      ++ +G  R +                 P C +CGR H+G C  G   CF C ++GH++  C    M
Subjt:  IYNGKYPVERIVAAPAKDCPTVVGREEEGGPRQI-----------------PACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSCDKPAM

TrEMBL top hitse value%identityAlignment
A0A5A7SJH3 Reverse transcriptase1.7e-2833.71Show/hide
Query:  KWLKDFVKWNPDKFDAT-GDALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPA--TWEFFKTEFKAKYISDEAQEKMV
        K L+DF K+NP  FD +  D   A  W++ LE  F  M CP+ Q+ +CA  +L      WWE+T  ER  G   +  TW+ FK  F AK+ S   ++   
Subjt:  KWLKDFVKWNPDKFDAT-GDALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPA--TWEFFKTEFKAKYISDEAQEKMV

Query:  ALFQNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIYNGKYPVERIVAA----------PAKDCPT
          F NL+QG  ++E+Y+ +F +   F P++IAT   +  KF+ GLR  +   VRA  P T+A A++LA   + +       AA           A+  P 
Subjt:  ALFQNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIYNGKYPVERIVAA----------PAKDCPT

Query:  VVGRE--EEGG---------------PRQIPACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSC
         V +     GG                R  P C TCG+ H G C  G R CF+C +EGH    C
Subjt:  VVGRE--EEGG---------------PRQIPACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSC

A0A5A7TX58 Gag protease polyprotein2.0e-2928.89Show/hide
Query:  LVPHSVGPLQLVSELRVINSIIASMPVTRAERGHPRGGRG-----VRIGSDTVGTSSQSRGPIAIRCSSPVEPDGKSPELGSSGDPFVSKRRARRHPTQQ
        L+  S G  +L+     I  +I  MP  R  R   RGGRG     V++    V  ++    P+     + +E   +           + + R ++ P   
Subjt:  LVPHSVGPLQLVSELRVINSIIASMPVTRAERGHPRGGRG-----VRIGSDTVGTSSQSRGPIAIRCSSPVEPDGKSPELGSSGDPFVSKRRARRHPTQQ

Query:  SQRPLRRKWREGADEL---GKWLKDFVKWNPDKFDAT-GDALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPATWEFF
        +  P         D+L    K L+DF K+NP  FD +  D   A  W++ LE  F  M CP+ Q+ +CA  +L      WWE+T           TW+ F
Subjt:  SQRPLRRKWREGADEL---GKWLKDFVKWNPDKFDAT-GDALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPATWEFF

Query:  KTEFKAKYISDEAQEKMVALFQNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIYNGKYPVERIVA
        K  F AK+ S   ++     F NL+QG  ++E+Y+ +  +   F P++IAT   +  KF+ GLR  +   VRA    T+A A++LA   + +        
Subjt:  KTEFKAKYISDEAQEKMVALFQNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIYNGKYPVERIVA

Query:  APA-----KDCPTVVGREEEGGPRQIPACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSC
        A       +  P   G    G     P C TCG+ H G C  G R CF+C +EGH    C
Subjt:  APA-----KDCPTVVGREEEGGPRQIPACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSC

A0A5A7V5X5 Pol protein1.3e-2830.4Show/hide
Query:  MPVTR-AERGHPRGGRGVRIG---SDTVGTSSQSRGPIAIRCSSPVEPDGKSPELGSSGDPFVSKRRARRHPTQ-QSQRPLRRKWREGA----DELGKWL
        MP  R A RG  RGGRG   G   +  V  +     P+     + +E       L ++  PF++ ++ +  P Q Q+  P   +  +          K L
Subjt:  MPVTR-AERGHPRGGRGVRIG---SDTVGTSSQSRGPIAIRCSSPVEPDGKSPELGSSGDPFVSKRRARRHPTQ-QSQRPLRRKWREGA----DELGKWL

Query:  KDFVKWNPDKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPA--TWEFFKTEFKAKYISDEAQEKMVALF
        +DF K+NP  FD + D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER  G   +  TWE FK  F AK+ S   +   +  F
Subjt:  KDFVKWNPDKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPA--TWEFFKTEFKAKYISDEAQEKMVALF

Query:  QNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIYNGKYPVERIVAAPAKDCPTVVGREEEGGPRQI
         NL+QG  ++E+Y+ +F +   F P ++     +  KF+ GLR  +   VRA  P T+A A+++A   +     +   AA       +         R++
Subjt:  QNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIYNGKYPVERIVAAPAKDCPTVVGREEEGGPRQI

Query:  PACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSCDKPAMAIQITQQSSQQAG
        PAC TCGRVH G C  G   CFRC + GH    C +        Q S+ Q G
Subjt:  PACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSCDKPAMAIQITQQSSQQAG

A0A5A7VBY3 Reverse transcriptase2.6e-2930.03Show/hide
Query:  PVTRAERGHPRGGRGVRIGSDTVGTSSQSRGPIAIRCSSPVEPDGKSPE------LGSSGDPFVSKRRARRHPTQQSQ--RPLRRKWREGADEL---GKW
        P   A RG  RGGRGV  G        Q   P A+  ++PV  D  + E      L ++  PF++ ++ +  P Q      P   + +    +L    K 
Subjt:  PVTRAERGHPRGGRGVRIGSDTVGTSSQSRGPIAIRCSSPVEPDGKSPE------LGSSGDPFVSKRRARRHPTQQSQ--RPLRRKWREGADEL---GKW

Query:  LKDFVKWNPDKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPA--TWEFFKTEFKAKYISDEAQEKMVAL
        L+DF K+NP  FD + D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER  G   +  TWE FK  F AK+ S   +   +  
Subjt:  LKDFVKWNPDKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPA--TWEFFKTEFKAKYISDEAQEKMVAL

Query:  FQNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIYNGKYPVERIVAAPAKDCPTVVGREEE-----
        F NL+QG  ++E+Y+ +F +   F P ++     +  KF+ GLR  +   VRA  P T+A A+++A   +     ER+ ++ A    T +G++ +     
Subjt:  FQNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIYNGKYPVERIVAAPAKDCPTVVGREEE-----

Query:  ----------GGP---------------RQIPACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSCDKPAMAIQITQQSSQQAG
                  GG                R++PAC TCGRVH G C  G   CFRC + GH    C +        Q S+ Q G
Subjt:  ----------GGP---------------RQIPACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSCDKPAMAIQITQQSSQQAG

A0A5D3BZN1 Reverse transcriptase2.6e-2930.03Show/hide
Query:  PVTRAERGHPRGGRGVRIGSDTVGTSSQSRGPIAIRCSSPVEPDGKSPE------LGSSGDPFVSKRRARRHPTQQSQ--RPLRRKWREGADEL---GKW
        P   A RG  RGGRGV  G        Q   P A+  ++PV  D  + E      L ++  PF++ ++ +  P Q      P   + +    +L    K 
Subjt:  PVTRAERGHPRGGRGVRIGSDTVGTSSQSRGPIAIRCSSPVEPDGKSPE------LGSSGDPFVSKRRARRHPTQQSQ--RPLRRKWREGADEL---GKW

Query:  LKDFVKWNPDKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPA--TWEFFKTEFKAKYISDEAQEKMVAL
        L+DF K+NP  FD + D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER  G   +  TWE FK  F AK+ S   +   +  
Subjt:  LKDFVKWNPDKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPA--TWEFFKTEFKAKYISDEAQEKMVAL

Query:  FQNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIYNGKYPVERIVAAPAKDCPTVVGREEE-----
        F NL+QG  ++E+Y+ +F +   F P ++     +  KF+ GLR  +   VRA  P T+A A+++A   +     ER+ ++ A    T +G++ +     
Subjt:  FQNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIYNGKYPVERIVAAPAKDCPTVVGREEE-----

Query:  ----------GGP---------------RQIPACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSCDKPAMAIQITQQSSQQAG
                  GG                R++PAC TCGRVH G C  G   CFRC + GH    C +        Q S+ Q G
Subjt:  ----------GGP---------------RQIPACVTCGRVHFGACQTGPRRCFRCGKEGHMIRSCDKPAMAIQITQQSSQQAG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCTTTACTTTGAATGCATGCATTGGGGTAAGGGCCCATTTAGTGCCCCATTCGGTCGGTCCCTTACAGTTGGTATCAGAGCTCAGAGTCATAAATTCTATAATCGC
CAGCATGCCAGTTACCCGCGCCGAAAGAGGACACCCGAGAGGAGGCCGAGGTGTCAGGATCGGATCTGACACCGTAGGGACATCGTCCCAAAGCCGAGGCCCGATAGCGA
TCCGCTGTAGCTCCCCTGTTGAGCCCGACGGGAAGAGTCCCGAGCTCGGATCGAGCGGTGACCCGTTTGTTTCAAAGAGGCGCGCAAGAAGACACCCGACTCAGCAGAGC
CAACGACCTTTGAGGCGTAAGTGGCGAGAAGGTGCCGATGAGCTTGGTAAATGGTTGAAGGACTTCGTTAAGTGGAACCCTGATAAGTTTGACGCCACAGGTGATGCTTT
AGCAGCAGCTAGATGGATTGCCCATTTGGAGTACACCTTCCTGATTATGGTGTGCCCCGATGTGCAGAGGCCGAGGTGTGCAACCCATGTGTTAGGGGGCGTAGCTAGAT
GGTGGTGGGAGTCCACCTTGAGTGAGAGACCAGCTGGGTCTCCACCTGCTACCTGGGAGTTCTTCAAGACAGAATTTAAAGCCAAGTACATTAGTGATGAGGCTCAAGAA
AAGATGGTAGCGCTTTTCCAGAATCTGCAGCAGGGCTCTGATTCTATAGAGGAGTACGAGAGGAAGTTTTCAGTGTATGGTTATTTTGATCCGCAGTTGATAGCGACCCC
TGAGTTGAAAATTTTGAAGTTCATTTCTGGTCTGAGGCAGGGTGTGACTCAGGATGTCCGGGCACAGGCTCCTATCACTTATGCTAGAGCTGTTCAGCTAGCCACTATTT
ACAATGGAAAGTATCCTGTAGAGCGAATTGTTGCCGCTCCGGCCAAAGACTGTCCCACCGTGGTAGGGCGAGAAGAGGAGGGCGGTCCTAGGCAGATTCCCGCCTGTGTC
ACTTGTGGAAGAGTCCACTTCGGAGCTTGTCAGACCGGGCCCAGGCGGTGTTTCCGATGTGGAAAGGAGGGGCATATGATCCGTTCTTGTGATAAGCCAGCTATGGCGAT
TCAAATAACGCAGCAAAGCAGCCAGCAGGCCGGGTGTTCGCTACTGAACCGTCAGAGGCCGAGCGTCACGAGATGCCGCAGATGGCGAGTACGCTTCTTGTTTTGGATCA
TTGTGCATTTGTCTTGTTTGACTGCAGTTCCTACGCACTCGTTTGTATCTGCATTTTGTTTGGTGAGCATGCTAGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCCTTTACTTTGAATGCATGCATTGGGGTAAGGGCCCATTTAGTGCCCCATTCGGTCGGTCCCTTACAGTTGGTATCAGAGCTCAGAGTCATAAATTCTATAATCGC
CAGCATGCCAGTTACCCGCGCCGAAAGAGGACACCCGAGAGGAGGCCGAGGTGTCAGGATCGGATCTGACACCGTAGGGACATCGTCCCAAAGCCGAGGCCCGATAGCGA
TCCGCTGTAGCTCCCCTGTTGAGCCCGACGGGAAGAGTCCCGAGCTCGGATCGAGCGGTGACCCGTTTGTTTCAAAGAGGCGCGCAAGAAGACACCCGACTCAGCAGAGC
CAACGACCTTTGAGGCGTAAGTGGCGAGAAGGTGCCGATGAGCTTGGTAAATGGTTGAAGGACTTCGTTAAGTGGAACCCTGATAAGTTTGACGCCACAGGTGATGCTTT
AGCAGCAGCTAGATGGATTGCCCATTTGGAGTACACCTTCCTGATTATGGTGTGCCCCGATGTGCAGAGGCCGAGGTGTGCAACCCATGTGTTAGGGGGCGTAGCTAGAT
GGTGGTGGGAGTCCACCTTGAGTGAGAGACCAGCTGGGTCTCCACCTGCTACCTGGGAGTTCTTCAAGACAGAATTTAAAGCCAAGTACATTAGTGATGAGGCTCAAGAA
AAGATGGTAGCGCTTTTCCAGAATCTGCAGCAGGGCTCTGATTCTATAGAGGAGTACGAGAGGAAGTTTTCAGTGTATGGTTATTTTGATCCGCAGTTGATAGCGACCCC
TGAGTTGAAAATTTTGAAGTTCATTTCTGGTCTGAGGCAGGGTGTGACTCAGGATGTCCGGGCACAGGCTCCTATCACTTATGCTAGAGCTGTTCAGCTAGCCACTATTT
ACAATGGAAAGTATCCTGTAGAGCGAATTGTTGCCGCTCCGGCCAAAGACTGTCCCACCGTGGTAGGGCGAGAAGAGGAGGGCGGTCCTAGGCAGATTCCCGCCTGTGTC
ACTTGTGGAAGAGTCCACTTCGGAGCTTGTCAGACCGGGCCCAGGCGGTGTTTCCGATGTGGAAAGGAGGGGCATATGATCCGTTCTTGTGATAAGCCAGCTATGGCGAT
TCAAATAACGCAGCAAAGCAGCCAGCAGGCCGGGTGTTCGCTACTGAACCGTCAGAGGCCGAGCGTCACGAGATGCCGCAGATGGCGAGTACGCTTCTTGTTTTGGATCA
TTGTGCATTTGTCTTGTTTGACTGCAGTTCCTACGCACTCGTTTGTATCTGCATTTTGTTTGGTGAGCATGCTAGGTTGA
Protein sequenceShow/hide protein sequence
MSFTLNACIGVRAHLVPHSVGPLQLVSELRVINSIIASMPVTRAERGHPRGGRGVRIGSDTVGTSSQSRGPIAIRCSSPVEPDGKSPELGSSGDPFVSKRRARRHPTQQS
QRPLRRKWREGADELGKWLKDFVKWNPDKFDATGDALAAARWIAHLEYTFLIMVCPDVQRPRCATHVLGGVARWWWESTLSERPAGSPPATWEFFKTEFKAKYISDEAQE
KMVALFQNLQQGSDSIEEYERKFSVYGYFDPQLIATPELKILKFISGLRQGVTQDVRAQAPITYARAVQLATIYNGKYPVERIVAAPAKDCPTVVGREEEGGPRQIPACV
TCGRVHFGACQTGPRRCFRCGKEGHMIRSCDKPAMAIQITQQSSQQAGCSLLNRQRPSVTRCRRWRVRFLFWIIVHLSCLTAVPTHSFVSAFCLVSMLG