; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002428 (gene) of Snake gourd v1 genome

Gene IDTan0002428
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG05:71762358..71765280
RNA-Seq ExpressionTan0002428
SyntenyTan0002428
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3453657.1 reverse transcriptase [Gossypium australe]2.1e-3529.15Show/hide
Query:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVIDSDN--PP
        MKIL WNVRG+        LK  L   +P ++FL ETK+   R  +++ K GF N + V + G  GGL + W   +++ L SF+ +HID+ ++ ++    
Subjt:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVIDSDN--PP

Query:  WRFTGFYGDLETSKRKESWKLISRLCSLNSIPWLIAGDFNEI----------------------------------------------IDNT--------
        WRFTGFY     ++RKESWKL+  L   N  PWLI GDFNEI                                              +DN         
Subjt:  WRFTGFYGDLETSKRKESWKLISRLCSLNSIPWLIAGDFNEI----------------------------------------------IDNT--------

Query:  ------------------------------------------------------------------KKRVNSITKIKGEDGKWRNTEEGIQLAFSEYFKG
                                                                          +KR N I ++K E G      + I    +EYFK 
Subjt:  ------------------------------------------------------------------KKRVNSITKIKGEDGKWRNTEEGIQLAFSEYFKG

Query:  LFTANAQSE-NAILRNFL-----------------KEVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRGRSIEDINSTVITLIP
        +F++   S  + ++ +F                  +EV  A+K     KA G+D F ALFY+KYWHIVGEEVT +CL+VLN  R+I++IN T I LIP
Subjt:  LFTANAQSE-NAILRNFL-----------------KEVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRGRSIEDINSTVITLIP

KAA3462561.1 reverse transcriptase [Gossypium australe]1.4e-3937.72Show/hide
Query:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVIDSD-NPPW
        MKIL WNVRG+ N  A   L+  L L  P +VF  ETK+ KF+   ++ + G+ + + VES G  GGL + W  E+++ L SF+  HID++I+ D     
Subjt:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVIDSD-NPPW

Query:  RFTGFYGDLETSKRKESWKLISRLCSLNS-------IPWLIAGDFNEII----DNTKKRVNSITKIKGEDGK-WRNTEEGIQLAFSEYFKGLFTANAQ--
        R TGFYG      R+++W L+ R+ +          I WL  GD N          +KR N I K++ EDG+     EE I++A S YF  LF+  +Q  
Subjt:  RFTGFYGDLETSKRKESWKLISRLCSLNS-------IPWLIAGDFNEII----DNTKKRVNSITKIKGEDGK-WRNTEEGIQLAFSEYFKGLFTANAQ--

Query:  --------------SENAILR-NFLK-EVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRGRSIEDINSTVITLIP
                       +N  L+ +F K E+  AL + G +KAPGED   A+FY+K W I+GEEV+ +CL  LN G  +  IN T I L+P
Subjt:  --------------SENAILR-NFLK-EVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRGRSIEDINSTVITLIP

KAA3485636.1 reverse transcriptase [Gossypium australe]2.4e-3131.23Show/hide
Query:  ETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVI--DSDNPPWRFTGFYGDLETSKRKESWKLISRLCSLNSIPWLI
        ETKL + R +  +   GF + + V++EG  GGLC+ W   I V L SF+K+HID+++  DS    WRFT FYG   + ++ + W+L+  L    + PWL+
Subjt:  ETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVI--DSDNPPWRFTGFYGDLETSKRKESWKLISRLCSLNSIPWLI

Query:  AGDFNEII--------------------------------------------------------------------------DNTKKRVNSITKIKGEDG
        AGDFNEI+                                                                            T +R N ITK+  +DG
Subjt:  AGDFNEII--------------------------------------------------------------------------DNTKKRVNSITKIKGEDG

Query:  KWRNTEEGIQLAFSEYFKGLFTANAQSE----------------NAILRN--FLKEVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLN
        K    E  +Q A   +F+ LF+++  ++                N +L +     E+L+ALK  G  KAPG D F ALF++KYW IVG++V  FCL VLN
Subjt:  KWRNTEEGIQLAFSEYFKGLFTANAQSE----------------NAILRN--FLKEVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLN

Query:  RGRSIEDINSTVITLIP
         G+ +   NST I LIP
Subjt:  RGRSIEDINSTVITLIP

KAF7824053.1 hypothetical protein G2W53_022197 [Senna tora]1.9e-3128.98Show/hide
Query:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEG----KSGGLCVLWNNEIHVKLCSFNKFHIDLVIDSD-
        M  + WN RG+    A  +LK++    +P+L+FL ETK        LK +LGFD V  V+  G    ++GGL + W N + + L SF+  HID+++    
Subjt:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEG----KSGGLCVLWNNEIHVKLCSFNKFHIDLVIDSD-

Query:  -NPPWRFTGFYGDLETSKRKESWKLISRLCSLNSIPWLIAGDFNEIIDNTKKRVNSITK--------------------IKGEDGKWRN---TEEGIQLA
         N  WR TG +G  E   + ++W L+  L S + +PWL  GDFNEI+  ++K+  +                        KG    W N    +  IQ  
Subjt:  -NPPWRFTGFYGDLETSKRKESWKLISRLCSLNSIPWLIAGDFNEIIDNTKKRVNSITK--------------------IKGEDGKWRN---TEEGIQLA

Query:  FSEYF---KGLFTANAQSENAILRNFLKEVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRGRSIEDINSTVITLIPGRNIANNAIL
            F   + L        N I  +       AL+ +  S  P         +R       +E    C +V+N      D  S  +T +        A L
Subjt:  FSEYF---KGLFTANAQSENAILRNFLKEVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRGRSIEDINSTVITLIPGRNIANNAIL

Query:  GYEKSLLWE-----RDLLESGMRWRMRDGKQVRIMEDKWLSRPFSLKPLLNPNVNSE-TKVEDLLNGDGR-WNEVLIGNFFGKEDIDQILKLPQISTKGP
            S  W      R +L+ G  WR+ +G Q+ I ED W+S    L+ L    ++S+   V DL+N + R W   LI + F  E    IL LP       
Subjt:  GYEKSLLWE-----RDLLESGMRWRMRDGKQVRIMEDKWLSRPFSLKPLLNPNVNSE-TKVEDLLNGDGR-WNEVLIGNFFGKEDIDQILKLPQISTKGP

Query:  DKLLWHYEKDDFYSVKSGYNL
        DK +W +EK   YSVKS Y++
Subjt:  DKLLWHYEKDDFYSVKSGYNL

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]1.4e-3129.4Show/hide
Query:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVIDSDNPPWR
        MK L WNV G+ N   F  L+ ++  S+P LVFL+ETK         K +L FD  V V S GKSGGL +LWN++ +V++ S +  HID +I      WR
Subjt:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVIDSDNPPWR

Query:  FTGFYGDLETSKRKESWKLISRLCSLNSIPWLIAGDFNEIIDNTKK------------------------------------------------------
        FTGFYG+  T KR  SWKL+ RL  +  +PW+I GDFNEI+  T+K                                                      
Subjt:  FTGFYGDLETSKRKESWKLISRLCSLNSIPWLIAGDFNEIIDNTKK------------------------------------------------------

Query:  ---------RVNSITKIKGED-------------GKW-------------------RNTEEGIQLAFSEYFKG-----------LFTANAQSENAIL-RN
                   +   KI+ E+             G W                       E  ++  +   KG           L   +  S+NAIL ++
Subjt:  ---------RVNSITKIKGED-------------GKW-------------------RNTEEGIQLAFSEYFKG-----------LFTANAQSENAIL-RN

Query:  FLK-EVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRGRSIED--INSTVITLIPGRNIANNAILGYE
        F + E+ +ALK    SK PG D   A+F++K+W ++          + NR +S+ D  I+ T    +PGR+I +NAI+G+E
Subjt:  FLK-EVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRGRSIED--INSTVITLIPGRNIANNAILGYE

TrEMBL top hitse value%identityAlignment
A0A5B6U9Z8 Reverse transcriptase1.0e-3529.15Show/hide
Query:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVIDSDN--PP
        MKIL WNVRG+        LK  L   +P ++FL ETK+   R  +++ K GF N + V + G  GGL + W   +++ L SF+ +HID+ ++ ++    
Subjt:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVIDSDN--PP

Query:  WRFTGFYGDLETSKRKESWKLISRLCSLNSIPWLIAGDFNEI----------------------------------------------IDNT--------
        WRFTGFY     ++RKESWKL+  L   N  PWLI GDFNEI                                              +DN         
Subjt:  WRFTGFYGDLETSKRKESWKLISRLCSLNSIPWLIAGDFNEI----------------------------------------------IDNT--------

Query:  ------------------------------------------------------------------KKRVNSITKIKGEDGKWRNTEEGIQLAFSEYFKG
                                                                          +KR N I ++K E G      + I    +EYFK 
Subjt:  ------------------------------------------------------------------KKRVNSITKIKGEDGKWRNTEEGIQLAFSEYFKG

Query:  LFTANAQSE-NAILRNFL-----------------KEVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRGRSIEDINSTVITLIP
        +F++   S  + ++ +F                  +EV  A+K     KA G+D F ALFY+KYWHIVGEEVT +CL+VLN  R+I++IN T I LIP
Subjt:  LFTANAQSE-NAILRNFL-----------------KEVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRGRSIEDINSTVITLIP

A0A5B6V0I7 Reverse transcriptase6.9e-4037.72Show/hide
Query:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVIDSD-NPPW
        MKIL WNVRG+ N  A   L+  L L  P +VF  ETK+ KF+   ++ + G+ + + VES G  GGL + W  E+++ L SF+  HID++I+ D     
Subjt:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVIDSD-NPPW

Query:  RFTGFYGDLETSKRKESWKLISRLCSLNS-------IPWLIAGDFNEII----DNTKKRVNSITKIKGEDGK-WRNTEEGIQLAFSEYFKGLFTANAQ--
        R TGFYG      R+++W L+ R+ +          I WL  GD N          +KR N I K++ EDG+     EE I++A S YF  LF+  +Q  
Subjt:  RFTGFYGDLETSKRKESWKLISRLCSLNS-------IPWLIAGDFNEII----DNTKKRVNSITKIKGEDGK-WRNTEEGIQLAFSEYFKGLFTANAQ--

Query:  --------------SENAILR-NFLK-EVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRGRSIEDINSTVITLIP
                       +N  L+ +F K E+  AL + G +KAPGED   A+FY+K W I+GEEV+ +CL  LN G  +  IN T I L+P
Subjt:  --------------SENAILR-NFLK-EVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRGRSIEDINSTVITLIP

A0A803PPS5 Uncharacterized protein2.1e-3629.26Show/hide
Query:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVI-DSDNPPW
        MK L WNV+GM N     +LK  +    P+LVF++E++L K +A  L+  LGF     VE+ GKSG L +LW+ E+   + SF+ FHID  I + D+  W
Subjt:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVI-DSDNPPW

Query:  RFTGFYGDLETSKRKESWKLISRLCSLNSIPWLIAGDFNEII----------------DNTKK-------------------------------------
        RFTGFYGD + ++R +SW+L++R+  + S PW+I GDFNEI+                +N +K                                     
Subjt:  RFTGFYGDLETSKRKESWKLISRLCSLNSIPWLIAGDFNEII----------------DNTKK-------------------------------------

Query:  -----------------RVNS------ITKIKGEDGK---------------WRNTEEGIQLAFSEYFKGLFTANAQSENAI----------LRNFLKEV
                         R+NS      +  +K   G+               W N E+  ++    +  G  +AN  +   +          + N   ++
Subjt:  -----------------RVNS------ITKIKGEDGK---------------WRNTEEGIQLAFSEYFKGLFTANAQSENAI----------LRNFLKEV

Query:  LL----------ALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRGRSIEDINSTVITLIPGRNIAN
        LL          A++     KAPG D  + LFYR+YW  +GEEV+  CL +LN G+ ++DI  T+I LIP  + +N
Subjt:  LL----------ALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRGRSIEDINSTVITLIPGRNIAN

A0A803QLY3 Uncharacterized protein4.3e-4232.48Show/hide
Query:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVIDSDNPP-W
        M  L WNV+G+ N     +L  ++    P LVF+ E+KL    A  L  KLGF     VE++GKSG L +LW+ ++   + SF+ FHID  I  +    W
Subjt:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVIDSDNPP-W

Query:  RFTGFYGDLETSKRKESWKLISRLCSLNSIPWLIAGDFNEIIDNTKKR---------VNSITK-----------IKGEDGKWRNTEEGIQLAFS------
        RFT FYGD + S+R +SWKL++R+C + S PW + GDFNEI+   +K          + +  K            +G D  W N  +   L F       
Subjt:  RFTGFYGDLETSKRKESWKLISRLCSLNSIPWLIAGDFNEIIDNTKKR---------VNSITK-----------IKGEDGKWRNTEEGIQLAFS------

Query:  ----------EYFKGLFTANA-------------------QSENAILRNF-LKEVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRG
                  +YF  LFT+N+                    +   +L  F ++E+  A++     KAPG D    LFY+KYW  +G EV+  CL +LN  
Subjt:  ----------EYFKGLFTANA-------------------QSENAILRNF-LKEVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRG

Query:  RSIEDINSTVITLIP----GRNIANNAILGYEKSLLWERDLLESGMRWRMR
          +E+IN T+I LIP     + I +NAI+G+E     E     +G +  ++
Subjt:  RSIEDINSTVITLIP----GRNIANNAILGYEKSLLWERDLLESGMRWRMR

A0A803QLY3 Uncharacterized protein1.2e-1235.14Show/hide
Query:  KSLLWERDLLESGMRWRMRDGKQVRIMEDKWLSRPFSLKPLLNPNVNSETKVEDLLNGDGRWNEVLIGNFFGKEDIDQILKLPQISTKGPDKLLWHYEKD
        K ++W R +L  G RWR+ +G+ +R+ EDKWL RP        P +   TK++  +  +G W   ++   F +EDI  I  +P I     D L W Y  +
Subjt:  KSLLWERDLLESGMRWRMRDGKQVRIMEDKWLSRPFSLKPLLNPNVNSETKVEDLLNGDGRWNEVLIGNFFGKEDIDQILKLPQISTKGPDKLLWHYEKD

Query:  DFYSVKSGYNL
          Y VKSGY +
Subjt:  DFYSVKSGYNL

A0A803QLY3 Uncharacterized protein1.6e-4122.34Show/hide
Query:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVI-DSDNPPW
        M  L WNV+GM N      LK ++    P LVF++E++L K RA NL+  LGF     VE++GKSGGL ++W+++I   + SF+ FHID  I   ++  W
Subjt:  MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVI-DSDNPPW

Query:  RFTGFYGDLETSKRKESWKLISRLCSLNSIPWLIAGDFNEII----------------------------------------------------------
        RFT FYGD + SKR +SW +++R+  + + PWLI GDFNEI+                                                          
Subjt:  RFTGFYGDLETSKRKESWKLISRLCSLNSIPWLIAGDFNEII----------------------------------------------------------

Query:  ------------------------------------------------------------------------DNTKKRVNSITKIKGEDGKWRNTEEGIQ
                                                                                 N++K+ NSIT +  ++ +W      + 
Subjt:  ------------------------------------------------------------------------DNTKKRVNSITKIKGEDGKWRNTEEGIQ

Query:  LAFSEYFKGLFTA-------------------NAQSENAILRNFLK-EVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRGRSIEDI
             YF+ LFTA                   + ++   ++  F K +V+ A++     KAPG D    LFYRK+W I+GEEVTT CL +LN G+S+E I
Subjt:  LAFSEYFKGLFTA-------------------NAQSENAILRNFLK-EVLLALKQTGSSKAPGEDEFTALFYRKYWHIVGEEVTTFCLEVLNRGRSIEDI

Query:  NSTVITLIP-------------------------------------------------GRNIANNAILGYE-----------------------------
        N T+I LIP                                                 GR I +N I+GYE                             
Subjt:  NSTVITLIP-------------------------------------------------GRNIANNAILGYE-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------KSLLWERDLLESGMRWRMRDGKQVRIMEDKWLSRP---FSLKPLLNPNVNSETKVEDLLNGDG
                                             K ++W R+++  G RWR+ +G+ +R+ +DKWL RP    + +PL   + N  T V  LLN   
Subjt:  -------------------------------------KSLLWERDLLESGMRWRMRDGKQVRIMEDKWLSRP---FSLKPLLNPNVNSETKVEDLLNGDG

Query:  RWNEVLIGNFFGKEDIDQILKLPQISTKGPDKLLWHYEKDDFYSVKSGYNLA
         WNE ++  +F KED+  IL +P I     D L+W + KD  Y VKSGY +A
Subjt:  RWNEVLIGNFFGKEDIDQILKLPQISTKGPDKLLWHYEKDDFYSVKSGYNLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein7.6e-0731.45Show/hide
Query:  SLLWERDLLESGMRWRMRDGKQVRIMEDKWL-SRPFSLKPLLNPNVNSETKVEDLLNGDGR---WNEVLIGNFFGKEDIDQILKLPQISTKGPDKLLWHY
        SLL    LL+ G R  + DG+ +RI  D  + S P   +PL       E  + +L    G    W++  I  F  + D   I ++    +K PDK++W+Y
Subjt:  SLLWERDLLESGMRWRMRDGKQVRIMEDKWL-SRPFSLKPLLNPNVNSETKVEDLLNGDGR---WNEVLIGNFFGKEDIDQILKLPQISTKGPDKLLWHY

Query:  EKDDFYSVKSGYNLAWSLLNRAST
             Y+V+SGY   W L +  ST
Subjt:  EKDDFYSVKSGYNLAWSLLNRAST


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATTCTTTACTGGAATGTCAGAGGAATGAACAATCTTCATGCATTCCACAACTTGAAAAAGGTGTTAAACCTATCTAAACCCAATTTGGTGTTCCTAGCTGAAAC
AAAACTCCTAAAGTTTAGGGCGAACAATTTGAAAACTAAACTTGGCTTTGATAATGTAGTTTGTGTGGAAAGTGAAGGAAAAAGTGGAGGGCTTTGTGTACTATGGAACA
ATGAAATTCATGTAAAGTTATGTTCTTTCAATAAATTTCATATTGATTTGGTGATTGATAGTGACAACCCACCCTGGAGATTTACTGGATTCTATGGAGACCTTGAGACT
TCTAAGAGGAAAGAATCTTGGAAACTTATTTCCAGATTATGCAGCTTGAATTCTATTCCTTGGCTAATTGCAGGTGATTTTAACGAGATAATAGACAATACAAAGAAAAG
AGTAAATTCCATTACAAAAATCAAGGGTGAAGATGGAAAGTGGAGAAACACAGAAGAGGGGATTCAGTTGGCATTTTCAGAGTACTTCAAGGGACTATTTACTGCTAATG
CACAAAGTGAGAATGCCATTCTTCGAAATTTCTTAAAAGAGGTTTTGTTAGCCCTAAAACAAACAGGGTCTTCTAAGGCCCCTGGCGAGGATGAGTTTACTGCTTTATTC
TACAGGAAATATTGGCACATTGTGGGTGAAGAAGTTACAACCTTTTGTCTGGAAGTTCTAAATAGAGGAAGATCCATTGAAGATATTAATTCTACAGTGATAACATTAAT
CCCAGGAAGGAACATTGCAAATAATGCTATCTTGGGATATGAGAAAAGTTTACTCTGGGAAAGAGATTTACTAGAATCTGGAATGAGATGGAGGATGAGAGATGGTAAAC
AAGTAAGGATTATGGAAGATAAATGGTTAAGTCGACCATTCTCTTTGAAACCACTTCTAAATCCAAATGTTAATTCGGAAACCAAAGTTGAAGATTTGCTAAATGGTGAT
GGTCGGTGGAATGAAGTACTCATAGGAAATTTTTTTGGTAAAGAGGATATTGATCAAATTCTGAAGCTCCCTCAGATTTCCACTAAGGGCCCTGATAAGCTATTATGGCA
CTACGAGAAAGATGACTTCTACTCTGTTAAATCAGGTTATAATCTTGCTTGGAGCTTATTAAATAGAGCTTCAACTTTTGATGATGATCAAATGGCAAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAATTCTTTACTGGAATGTCAGAGGAATGAACAATCTTCATGCATTCCACAACTTGAAAAAGGTGTTAAACCTATCTAAACCCAATTTGGTGTTCCTAGCTGAAAC
AAAACTCCTAAAGTTTAGGGCGAACAATTTGAAAACTAAACTTGGCTTTGATAATGTAGTTTGTGTGGAAAGTGAAGGAAAAAGTGGAGGGCTTTGTGTACTATGGAACA
ATGAAATTCATGTAAAGTTATGTTCTTTCAATAAATTTCATATTGATTTGGTGATTGATAGTGACAACCCACCCTGGAGATTTACTGGATTCTATGGAGACCTTGAGACT
TCTAAGAGGAAAGAATCTTGGAAACTTATTTCCAGATTATGCAGCTTGAATTCTATTCCTTGGCTAATTGCAGGTGATTTTAACGAGATAATAGACAATACAAAGAAAAG
AGTAAATTCCATTACAAAAATCAAGGGTGAAGATGGAAAGTGGAGAAACACAGAAGAGGGGATTCAGTTGGCATTTTCAGAGTACTTCAAGGGACTATTTACTGCTAATG
CACAAAGTGAGAATGCCATTCTTCGAAATTTCTTAAAAGAGGTTTTGTTAGCCCTAAAACAAACAGGGTCTTCTAAGGCCCCTGGCGAGGATGAGTTTACTGCTTTATTC
TACAGGAAATATTGGCACATTGTGGGTGAAGAAGTTACAACCTTTTGTCTGGAAGTTCTAAATAGAGGAAGATCCATTGAAGATATTAATTCTACAGTGATAACATTAAT
CCCAGGAAGGAACATTGCAAATAATGCTATCTTGGGATATGAGAAAAGTTTACTCTGGGAAAGAGATTTACTAGAATCTGGAATGAGATGGAGGATGAGAGATGGTAAAC
AAGTAAGGATTATGGAAGATAAATGGTTAAGTCGACCATTCTCTTTGAAACCACTTCTAAATCCAAATGTTAATTCGGAAACCAAAGTTGAAGATTTGCTAAATGGTGAT
GGTCGGTGGAATGAAGTACTCATAGGAAATTTTTTTGGTAAAGAGGATATTGATCAAATTCTGAAGCTCCCTCAGATTTCCACTAAGGGCCCTGATAAGCTATTATGGCA
CTACGAGAAAGATGACTTCTACTCTGTTAAATCAGGTTATAATCTTGCTTGGAGCTTATTAAATAGAGCTTCAACTTTTGATGATGATCAAATGGCAAGATAG
Protein sequenceShow/hide protein sequence
MKILYWNVRGMNNLHAFHNLKKVLNLSKPNLVFLAETKLLKFRANNLKTKLGFDNVVCVESEGKSGGLCVLWNNEIHVKLCSFNKFHIDLVIDSDNPPWRFTGFYGDLET
SKRKESWKLISRLCSLNSIPWLIAGDFNEIIDNTKKRVNSITKIKGEDGKWRNTEEGIQLAFSEYFKGLFTANAQSENAILRNFLKEVLLALKQTGSSKAPGEDEFTALF
YRKYWHIVGEEVTTFCLEVLNRGRSIEDINSTVITLIPGRNIANNAILGYEKSLLWERDLLESGMRWRMRDGKQVRIMEDKWLSRPFSLKPLLNPNVNSETKVEDLLNGD
GRWNEVLIGNFFGKEDIDQILKLPQISTKGPDKLLWHYEKDDFYSVKSGYNLAWSLLNRASTFDDDQMAR