; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh00G000940 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh00G000940
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr00:17568919..17571235
RNA-Seq ExpressionCmoCh00G000940
SyntenyCmoCh00G000940
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:1901800 - positive regulation of proteasomal protein catabolic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0000502 - proteasome complex (cellular component)
GO:0016887 - ATPase activity (molecular function)
GO:0036402 - proteasome-activating ATPase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0008194 - UDP-glycosyltransferase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR036397 - Ribonuclease H superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR001584 - Integrase, catalytic core
IPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025848.1 pol protein [Cucumis melo var. makuwa]5.4e-27968.83Show/hide
Query:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW
        MSFGLTNAPAVFM++MNRVFKE LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKCEFWLR+V+FLGHVVS +G+SVDP K+EA+T W
Subjt:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW

Query:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA
         RP+TV+++RSFLGLAGYYRRFV+DF +I+SPLT+LT+KG  F W  ACE SFQ LK++LVT P L VP+ S  +VIYSDAS KGLGCVLMQ GKVVAYA
Subjt:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA

Query:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP----------------------------------------VPPWESKLVVDALSRKTVHSSVLITR
        SRQLK +E+NYPTHDLELAAVVFALKIWRHYLY                                           P ++ +V DALSRK  HS+ LIT+
Subjt:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP----------------------------------------VPPWESKLVVDALSRKTVHSSVLITR

Query:  EVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMH
        +  + R+FERA IAV+   V AQLA+LTVQPTLRQ+IIA+Q +DP L +    ++    +GFS SSD+GL+++GRLCVP    ++ E+L EAH+SPF MH
Subjt:  EVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMH

Query:  PGGTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTV
        PG TKMY+DL+  +WW+ MKRDVA FVS+CLVCQQVK+PRQ  AGLLQPLS+P WKWE+  MDFI GLPKT +GYTVIWVVVDRLTKSAHF+PGK TYT 
Subjt:  PGGTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTV

Query:  DNWAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATI
          W QLY+ EIVRLHGV VSI+SDRD RFT  FW+GLQ ALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVL+F  SWDS LHLMEF+YNNS+QATI
Subjt:  DNWAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATI

Query:  GMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN
        GMAPFEALYGK CRSP+CW EVGE+ ++GPELV+ TN A+Q IRARM TAQSRQKSYADVRRK LEFEVGD +FLKVAPMKGVLRF  KGKL+
Subjt:  GMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN

KAA0031437.1 pol protein [Cucumis melo var. makuwa]5.4e-27968.83Show/hide
Query:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW
        MSFGLTNAPAVFM++MNRVFKE LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKCEFWLR+V+FLGHVVS +G+SVDP K+EA+T W
Subjt:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW

Query:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA
         RP+TV+++RSFLGLAGYYRRFV+DF +I+SPLT+LT+KG  F W  ACE SFQ LK++LVT P L VP+ S  +VIYSDAS KGLGCVLMQ GKVVAYA
Subjt:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA

Query:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP----------------------------------------VPPWESKLVVDALSRKTVHSSVLITR
        SRQLK +E+NYPTHDLELAAVVFALKIWRHYLY                                           P ++ +V DALSRK  HS+ LIT+
Subjt:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP----------------------------------------VPPWESKLVVDALSRKTVHSSVLITR

Query:  EVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMH
        +  + R+FERA IAV+   V AQLA+LTVQPTLRQ+IIA+Q +DP L +    ++    +GFS SSD+GL+++GRLCVP    ++ E+L EAH+SPF MH
Subjt:  EVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMH

Query:  PGGTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTV
        PG TKMY+DL+  +WW+ MKRDVA FVS+CLVCQQVK+PRQ  AGLLQPLS+P WKWE+  MDFI GLPKT +GYTVIWVVVDRLTKSAHF+PGK TYT 
Subjt:  PGGTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTV

Query:  DNWAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATI
          W QLY+ EIVRLHGV VSI+SDRD RFT  FW+GLQ ALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVL+F  SWDS LHLMEF+YNNS+QATI
Subjt:  DNWAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATI

Query:  GMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN
        GMAPFEALYGK CRSP+CW EVGE+ ++GPELV+ TN A+Q IRARM TAQSRQKSYADVRRK LEFEVGD +FLKVAPMKGVLRF  KGKL+
Subjt:  GMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN

KAA0033825.1 pol protein [Cucumis melo var. makuwa]5.4e-27968.83Show/hide
Query:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW
        MSFGLTNAPAVFM++MNRVFKE LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKCEFWLR+V+FLGHVVS +G+SVDP K+EA+T W
Subjt:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW

Query:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA
         RP+TV+++RSFLGLAGYYRRFV+DF +I+SPLT+LT+KG  F W  ACE SFQ LK++LVT P L VP+ S  +VIYSDAS KGLGCVLMQ GKVVAYA
Subjt:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA

Query:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP----------------------------------------VPPWESKLVVDALSRKTVHSSVLITR
        SRQLK +E+NYPTHDLELAAVVFALKIWRHYLY                                           P ++ +V DALSRK  HS+ LIT+
Subjt:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP----------------------------------------VPPWESKLVVDALSRKTVHSSVLITR

Query:  EVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMH
        +  + R+FERA IAV+   V AQLA+LTVQPTLRQ+IIA+Q +DP L +    ++    +GFS SSD+GL+++GRLCVP    ++ E+L EAH+SPF MH
Subjt:  EVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMH

Query:  PGGTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTV
        PG TKMY+DL+  +WW+ MKRDVA FVS+CLVCQQVK+PRQ  AGLLQPLS+P WKWE+  MDFI GLPKT +GYTVIWVVVDRLTKSAHF+PGK TYT 
Subjt:  PGGTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTV

Query:  DNWAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATI
          W QLY+ EIVRLHGV VSI+SDRD RFT  FW+GLQ ALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVL+F  SWDS LHLMEF+YNNS+QATI
Subjt:  DNWAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATI

Query:  GMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN
        GMAPFEALYGK CRSP+CW EVGE+ ++GPELV+ TN A+Q IRARM TAQSRQKSYADVRRK LEFEVGD +FLKVAPMKGVLRF  KGKL+
Subjt:  GMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN

TYK20443.1 pol protein [Cucumis melo var. makuwa]5.4e-27968.98Show/hide
Query:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW
        MSFGLTNAPAVFM++MNRVFK+ LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKCEFWLR+V+FLGHVVS +G+SVDP K+EA+T W
Subjt:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW

Query:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA
         RP+TV+++RSFLGLAGYYRRFV+DF +I+SPLT+LT+KG  F W  ACE SFQ LK++LVT P L VP+ S  +VIYSDAS KGLGCVLMQ GKVVAYA
Subjt:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA

Query:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP----------------------------------------VPPWESKLVVDALSRKTVHSSVLITR
        SRQLK +E+NYPTHDLELAAVVFALKIWRHYLY                                           P ++ +V DALSRK  HS+ LIT+
Subjt:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP----------------------------------------VPPWESKLVVDALSRKTVHSSVLITR

Query:  EVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMH
        +  + R+FERA IAV+   V AQLA+LTVQPTLRQ+IIA+Q +DP L +    ++    +GFS SSD+GL+++GRLCVP    ++ E+L EAH+SPF MH
Subjt:  EVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMH

Query:  PGGTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTV
        PG TKMY+DL+  +WW+ MKRDVA FVS+CLVCQQVK+PRQ  AGLLQPLS+P WKWE+  MDFI GLPKT KGYTVIWVVVDRLTKSAHF+PGK TYT 
Subjt:  PGGTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTV

Query:  DNWAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATI
          W QLY+ EIVRLHGV VSIVSDRD RFT  FW+GLQ ALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVL+F  SWDS LHLMEF+YNNS+QATI
Subjt:  DNWAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATI

Query:  GMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN
        GMAPFEALYGK CRSP+CW EVGE+ ++GPELV+ TN A+Q IRARM TAQSRQKSYADVRRK LEFEVGD +FLKVAPMKGVLRF  KGKL+
Subjt:  GMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN

XP_022933231.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111440131 [Cucurbita moschata]1.6e-30255.82Show/hide
Query:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW
        MSFGLTNAPAVFME+MNRVFKECLD+FVIVFIDDILIYS+TD +H+EHLRK LT LRE+KLYA F+KCEFW+ QVSFLGH+VSKDGI VDP K+EA+TK 
Subjt:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW

Query:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA
        +RPTTVT++RSFLGL GYYRRFV DF +I++PLT+LTKKGV F WDD CE SFQ LK+RLV+ P L VPESS GY IYSDAS KGLGCVLMQHGKVVAYA
Subjt:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA

Query:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP------------------------------------------------------------------
        S QLK+YEKNYPTHDLELAAVVFALKIWRHY Y                                                                   
Subjt:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------VPP--------------------
                                                                                     VPP                    
Subjt:  -----------------------------------------------------------------------------VPP--------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------WESKL----------------------------------------------------
                                                   W++KL                                                    
Subjt:  -------------------------------------------WESKL----------------------------------------------------

Query:  -------------------------------------------------------VVDALSRKTVHSSVLITREVRVQREFERANIAVATKGVIAQLARL
                                                               VVDALSRKTVHSS LITREVRVQREFERANIAVATKGVIAQLARL
Subjt:  -------------------------------------------------------VVDALSRKTVHSSVLITREVRVQREFERANIAVATKGVIAQLARL

Query:  TVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPGGTKMYEDLKQHFWWKSMKRDVAGFV
        TVQPTLRQRIIASQREDPNLQKVLGQLD+S VDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPGGTKMY+DLKQHFWWKSMKRDVAGFV
Subjt:  TVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPGGTKMYEDLKQHFWWKSMKRDVAGFV

Query:  SKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDNWAQLYVKEIVRLHGVLVSIVSDRDP
        SKCLVCQ+VK+PRQKTAGLLQPLSIPEWKWEN VMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDNWAQLYVKEIVRLHGVLVSIVSDRDP
Subjt:  SKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDNWAQLYVKEIVRLHGVLVSIVSDRDP

Query:  RFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEALYGKRCRSPLCWDEVGEKEL
        RFTLAFWRGLQKALGTRLDFSTAFHPQ DGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEALYGKRCRSPLCWDEVGEKEL
Subjt:  RFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEALYGKRCRSPLCWDEVGEKEL

Query:  VGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN
        VGPELVRLTNEAVQIIRARMRT QSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKL+
Subjt:  VGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN

TrEMBL top hitse value%identityAlignment
A0A5A7TEQ2 Reverse transcriptase2.6e-27968.83Show/hide
Query:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW
        MSFGLTNAPAVFM++MNRVFKE LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKCEFWLR+V+FLGHVVS +G+SVDP K+EA+T W
Subjt:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW

Query:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA
         RP+TV+++RSFLGLAGYYRRFV+DF +I+SPLT+LT+KG  F W  ACE SFQ LK++LVT P L VP+ S  +VIYSDAS KGLGCVLMQ GKVVAYA
Subjt:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA

Query:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP----------------------------------------VPPWESKLVVDALSRKTVHSSVLITR
        SRQLK +E+NYPTHDLELAAVVFALKIWRHYLY                                           P ++ +V DALSRK  HS+ LIT+
Subjt:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP----------------------------------------VPPWESKLVVDALSRKTVHSSVLITR

Query:  EVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMH
        +  + R+FERA IAV+   V AQLA+LTVQPTLRQ+IIA+Q +DP L +    ++    +GFS SSD+GL+++GRLCVP    ++ E+L EAH+SPF MH
Subjt:  EVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMH

Query:  PGGTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTV
        PG TKMY+DL+  +WW+ MKRDVA FVS+CLVCQQVK+PRQ  AGLLQPLS+P WKWE+  MDFI GLPKT +GYTVIWVVVDRLTKSAHF+PGK TYT 
Subjt:  PGGTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTV

Query:  DNWAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATI
          W QLY+ EIVRLHGV VSI+SDRD RFT  FW+GLQ ALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVL+F  SWDS LHLMEF+YNNS+QATI
Subjt:  DNWAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATI

Query:  GMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN
        GMAPFEALYGK CRSP+CW EVGE+ ++GPELV+ TN A+Q IRARM TAQSRQKSYADVRRK LEFEVGD +FLKVAPMKGVLRF  KGKL+
Subjt:  GMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN

A0A5A7TSL0 Reverse transcriptase2.6e-27968.83Show/hide
Query:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW
        MSFGLTNAPAVFM++MNRVFKE LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKCEFWLR+V+FLGHVVS +G+SVDP K+EA+T W
Subjt:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW

Query:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA
         RP+TV+++RSFLGLAGYYRRFV+DF +I+SPLT+LT+KG  F W  ACE SFQ LK++LVT P L VP+ S  +VIYSDAS KGLGCVLMQ GKVVAYA
Subjt:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA

Query:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP----------------------------------------VPPWESKLVVDALSRKTVHSSVLITR
        SRQLK +E+NYPTHDLELAAVVFALKIWRHYLY                                           P ++ +V DALSRK  HS+ LIT+
Subjt:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP----------------------------------------VPPWESKLVVDALSRKTVHSSVLITR

Query:  EVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMH
        +  + R+FERA IAV+   V AQLA+LTVQPTLRQ+IIA+Q +DP L +    ++    +GFS SSD+GL+++GRLCVP    ++ E+L EAH+SPF MH
Subjt:  EVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMH

Query:  PGGTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTV
        PG TKMY+DL+  +WW+ MKRDVA FVS+CLVCQQVK+PRQ  AGLLQPLS+P WKWE+  MDFI GLPKT +GYTVIWVVVDRLTKSAHF+PGK TYT 
Subjt:  PGGTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTV

Query:  DNWAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATI
          W QLY+ EIVRLHGV VSI+SDRD RFT  FW+GLQ ALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVL+F  SWDS LHLMEF+YNNS+QATI
Subjt:  DNWAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATI

Query:  GMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN
        GMAPFEALYGK CRSP+CW EVGE+ ++GPELV+ TN A+Q IRARM TAQSRQKSYADVRRK LEFEVGD +FLKVAPMKGVLRF  KGKL+
Subjt:  GMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN

A0A5A7UBS1 Reverse transcriptase2.6e-27968.83Show/hide
Query:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW
        MSFGLTNAPAVFM++MNRVFKE LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKCEFWLR+V+FLGHVVS +G+SVDP K+EA+T W
Subjt:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW

Query:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA
         RP+TV+++RSFLGLAGYYRRFV+DF +I+SPLT+LT+KG  F W  ACE SFQ LK++LVT P L VP+ S  +VIYSDAS KGLGCVLMQ GKVVAYA
Subjt:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA

Query:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP----------------------------------------VPPWESKLVVDALSRKTVHSSVLITR
        SRQLK +E+NYPTHDLELAAVVFALKIWRHYLY                                           P ++ +V DALSRK  HS+ LIT+
Subjt:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP----------------------------------------VPPWESKLVVDALSRKTVHSSVLITR

Query:  EVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMH
        +  + R+FERA IAV+   V AQLA+LTVQPTLRQ+IIA+Q +DP L +    ++    +GFS SSD+GL+++GRLCVP    ++ E+L EAH+SPF MH
Subjt:  EVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMH

Query:  PGGTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTV
        PG TKMY+DL+  +WW+ MKRDVA FVS+CLVCQQVK+PRQ  AGLLQPLS+P WKWE+  MDFI GLPKT +GYTVIWVVVDRLTKSAHF+PGK TYT 
Subjt:  PGGTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTV

Query:  DNWAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATI
          W QLY+ EIVRLHGV VSI+SDRD RFT  FW+GLQ ALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVL+F  SWDS LHLMEF+YNNS+QATI
Subjt:  DNWAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATI

Query:  GMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN
        GMAPFEALYGK CRSP+CW EVGE+ ++GPELV+ TN A+Q IRARM TAQSRQKSYADVRRK LEFEVGD +FLKVAPMKGVLRF  KGKL+
Subjt:  GMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN

A0A5D3CQB5 Reverse transcriptase2.6e-27968.83Show/hide
Query:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW
        MSFGLTNAPAVFM++MNRVFKE LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKCEFWLR+V+FLGHVVS +G+SVDP K+EA+T W
Subjt:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW

Query:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA
         RP+TV+++RSFLGLAGYYRRFV+DF +I+SPLT+LT+KG  F W  ACE SFQ LK++LVT P L VP+ S  +VIYSDAS KGLGCVLMQ GKVVAYA
Subjt:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA

Query:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP----------------------------------------VPPWESKLVVDALSRKTVHSSVLITR
        SRQLK +E+NYPTHDLELAAVVFALKIWRHYLY                                           P ++ +V DALSRK  HS+ LIT+
Subjt:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP----------------------------------------VPPWESKLVVDALSRKTVHSSVLITR

Query:  EVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMH
        +  + R+FERA IAV+   V AQLA+LTVQPTLRQ+IIA+Q +DP L +    ++    +GFS SSD+GL+++GRLCVP    ++ E+L EAH+SPF MH
Subjt:  EVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMH

Query:  PGGTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTV
        PG TKMY+DL+  +WW+ MKRDVA FVS+CLVCQQVK+PRQ  AGLLQPLS+P WKWE+  MDFI GLPKT +GYTVIWVVVDRLTKSAHF+PGK TYT 
Subjt:  PGGTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTV

Query:  DNWAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATI
          W QLY+ EIVRLHGV VSI+SDRD RFT  FW+GLQ ALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVL+F  SWDS LHLMEF+YNNS+QATI
Subjt:  DNWAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATI

Query:  GMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN
        GMAPFEALYGK CRSP+CW EVGE+ ++GPELV+ TN A+Q IRARM TAQSRQKSYADVRRK LEFEVGD +FLKVAPMKGVLRF  KGKL+
Subjt:  GMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN

A0A6J1EYH9 Reverse transcriptase7.6e-30355.82Show/hide
Query:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW
        MSFGLTNAPAVFME+MNRVFKECLD+FVIVFIDDILIYS+TD +H+EHLRK LT LRE+KLYA F+KCEFW+ QVSFLGH+VSKDGI VDP K+EA+TK 
Subjt:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW

Query:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA
        +RPTTVT++RSFLGL GYYRRFV DF +I++PLT+LTKKGV F WDD CE SFQ LK+RLV+ P L VPESS GY IYSDAS KGLGCVLMQHGKVVAYA
Subjt:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYA

Query:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP------------------------------------------------------------------
        S QLK+YEKNYPTHDLELAAVVFALKIWRHY Y                                                                   
Subjt:  SRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYP------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------VPP--------------------
                                                                                     VPP                    
Subjt:  -----------------------------------------------------------------------------VPP--------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------WESKL----------------------------------------------------
                                                   W++KL                                                    
Subjt:  -------------------------------------------WESKL----------------------------------------------------

Query:  -------------------------------------------------------VVDALSRKTVHSSVLITREVRVQREFERANIAVATKGVIAQLARL
                                                               VVDALSRKTVHSS LITREVRVQREFERANIAVATKGVIAQLARL
Subjt:  -------------------------------------------------------VVDALSRKTVHSSVLITREVRVQREFERANIAVATKGVIAQLARL

Query:  TVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPGGTKMYEDLKQHFWWKSMKRDVAGFV
        TVQPTLRQRIIASQREDPNLQKVLGQLD+S VDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPGGTKMY+DLKQHFWWKSMKRDVAGFV
Subjt:  TVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPGGTKMYEDLKQHFWWKSMKRDVAGFV

Query:  SKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDNWAQLYVKEIVRLHGVLVSIVSDRDP
        SKCLVCQ+VK+PRQKTAGLLQPLSIPEWKWEN VMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDNWAQLYVKEIVRLHGVLVSIVSDRDP
Subjt:  SKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDNWAQLYVKEIVRLHGVLVSIVSDRDP

Query:  RFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEALYGKRCRSPLCWDEVGEKEL
        RFTLAFWRGLQKALGTRLDFSTAFHPQ DGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEALYGKRCRSPLCWDEVGEKEL
Subjt:  RFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEALYGKRCRSPLCWDEVGEKEL

Query:  VGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN
        VGPELVRLTNEAVQIIRARMRT QSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKL+
Subjt:  VGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLN

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein3.9e-7827.81Show/hide
Query:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW
        M +G++ APA F   +N +  E  +  V+ ++DDILI+S+++ EH +H++ VL  L+   L    +KCEF   QV F+G+ +S+ G +     ++ + +W
Subjt:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW

Query:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGK-----
        ++P    ++R FLG   Y R+F+    +++ PL  L KK V ++W      + +N+K+ LV+ P L   + S+  ++ +DAS   +G VL Q        
Subjt:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGK-----

Query:  VVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYL-YPVPPWESKLVVD---ALSRKTVHSSVLITREVRVQ------------REFERANIAVATKG
         V Y S ++ + + NY   D E+ A++ +LK WRHYL   + P+  K++ D    + R T  S     R  R Q            R     +IA A   
Subjt:  VVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYL-YPVPPWESKLVVD---ALSRKTVHSSVLITREVRVQ------------REFERANIAVATKG

Query:  VIAQ---------------LARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGR--LCVPAIEDLRKEILMEAHNSPFAMHPG
        ++ +               + ++++    + +++     D  L  +L   D+ + +       +GLL   +  + +P    L + I+ + H     +HPG
Subjt:  VIAQ---------------LARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGR--LCVPAIEDLRKEILMEAHNSPFAMHPG

Query:  GTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDN
           +   + + F WK +++ +  +V  C  CQ  KS   K  G LQP+   E  WE+  MDFI  LP++  GY  ++VVVDR +K A  +P   + T + 
Subjt:  GTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDN

Query:  WAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGM
         A+++ + ++   G    I++D D  FT   W+         + FS  + PQTDGQTER NQ +E +LR        +W   + L++ SYNN+  +   M
Subjt:  WAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGM

Query:  APFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSL-EFEVGDPIFLK
         PFE ++      SPL      +K     E  + T +  Q ++  + T   + K Y D++ + + EF+ GD + +K
Subjt:  APFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSL-EFEVGDPIFLK

P0CT35 Transposon Tf2-2 polyprotein3.9e-7827.81Show/hide
Query:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW
        M +G++ APA F   +N +  E  +  V+ ++DDILI+S+++ EH +H++ VL  L+   L    +KCEF   QV F+G+ +S+ G +     ++ + +W
Subjt:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW

Query:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGK-----
        ++P    ++R FLG   Y R+F+    +++ PL  L KK V ++W      + +N+K+ LV+ P L   + S+  ++ +DAS   +G VL Q        
Subjt:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGK-----

Query:  VVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYL-YPVPPWESKLVVD---ALSRKTVHSSVLITREVRVQ------------REFERANIAVATKG
         V Y S ++ + + NY   D E+ A++ +LK WRHYL   + P+  K++ D    + R T  S     R  R Q            R     +IA A   
Subjt:  VVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYL-YPVPPWESKLVVD---ALSRKTVHSSVLITREVRVQ------------REFERANIAVATKG

Query:  VIAQ---------------LARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGR--LCVPAIEDLRKEILMEAHNSPFAMHPG
        ++ +               + ++++    + +++     D  L  +L   D+ + +       +GLL   +  + +P    L + I+ + H     +HPG
Subjt:  VIAQ---------------LARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGR--LCVPAIEDLRKEILMEAHNSPFAMHPG

Query:  GTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDN
           +   + + F WK +++ +  +V  C  CQ  KS   K  G LQP+   E  WE+  MDFI  LP++  GY  ++VVVDR +K A  +P   + T + 
Subjt:  GTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDN

Query:  WAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGM
         A+++ + ++   G    I++D D  FT   W+         + FS  + PQTDGQTER NQ +E +LR        +W   + L++ SYNN+  +   M
Subjt:  WAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGM

Query:  APFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSL-EFEVGDPIFLK
         PFE ++      SPL      +K     E  + T +  Q ++  + T   + K Y D++ + + EF+ GD + +K
Subjt:  APFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSL-EFEVGDPIFLK

P0CT36 Transposon Tf2-3 polyprotein3.9e-7827.81Show/hide
Query:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW
        M +G++ APA F   +N +  E  +  V+ ++DDILI+S+++ EH +H++ VL  L+   L    +KCEF   QV F+G+ +S+ G +     ++ + +W
Subjt:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW

Query:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGK-----
        ++P    ++R FLG   Y R+F+    +++ PL  L KK V ++W      + +N+K+ LV+ P L   + S+  ++ +DAS   +G VL Q        
Subjt:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGK-----

Query:  VVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYL-YPVPPWESKLVVD---ALSRKTVHSSVLITREVRVQ------------REFERANIAVATKG
         V Y S ++ + + NY   D E+ A++ +LK WRHYL   + P+  K++ D    + R T  S     R  R Q            R     +IA A   
Subjt:  VVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYL-YPVPPWESKLVVD---ALSRKTVHSSVLITREVRVQ------------REFERANIAVATKG

Query:  VIAQ---------------LARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGR--LCVPAIEDLRKEILMEAHNSPFAMHPG
        ++ +               + ++++    + +++     D  L  +L   D+ + +       +GLL   +  + +P    L + I+ + H     +HPG
Subjt:  VIAQ---------------LARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGR--LCVPAIEDLRKEILMEAHNSPFAMHPG

Query:  GTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDN
           +   + + F WK +++ +  +V  C  CQ  KS   K  G LQP+   E  WE+  MDFI  LP++  GY  ++VVVDR +K A  +P   + T + 
Subjt:  GTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDN

Query:  WAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGM
         A+++ + ++   G    I++D D  FT   W+         + FS  + PQTDGQTER NQ +E +LR        +W   + L++ SYNN+  +   M
Subjt:  WAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGM

Query:  APFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSL-EFEVGDPIFLK
         PFE ++      SPL      +K     E  + T +  Q ++  + T   + K Y D++ + + EF+ GD + +K
Subjt:  APFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSL-EFEVGDPIFLK

P0CT37 Transposon Tf2-4 polyprotein3.9e-7827.81Show/hide
Query:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW
        M +G++ APA F   +N +  E  +  V+ ++DDILI+S+++ EH +H++ VL  L+   L    +KCEF   QV F+G+ +S+ G +     ++ + +W
Subjt:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW

Query:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGK-----
        ++P    ++R FLG   Y R+F+    +++ PL  L KK V ++W      + +N+K+ LV+ P L   + S+  ++ +DAS   +G VL Q        
Subjt:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGK-----

Query:  VVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYL-YPVPPWESKLVVD---ALSRKTVHSSVLITREVRVQ------------REFERANIAVATKG
         V Y S ++ + + NY   D E+ A++ +LK WRHYL   + P+  K++ D    + R T  S     R  R Q            R     +IA A   
Subjt:  VVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYL-YPVPPWESKLVVD---ALSRKTVHSSVLITREVRVQ------------REFERANIAVATKG

Query:  VIAQ---------------LARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGR--LCVPAIEDLRKEILMEAHNSPFAMHPG
        ++ +               + ++++    + +++     D  L  +L   D+ + +       +GLL   +  + +P    L + I+ + H     +HPG
Subjt:  VIAQ---------------LARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGR--LCVPAIEDLRKEILMEAHNSPFAMHPG

Query:  GTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDN
           +   + + F WK +++ +  +V  C  CQ  KS   K  G LQP+   E  WE+  MDFI  LP++  GY  ++VVVDR +K A  +P   + T + 
Subjt:  GTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDN

Query:  WAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGM
         A+++ + ++   G    I++D D  FT   W+         + FS  + PQTDGQTER NQ +E +LR        +W   + L++ SYNN+  +   M
Subjt:  WAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGM

Query:  APFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSL-EFEVGDPIFLK
         PFE ++      SPL      +K     E  + T +  Q ++  + T   + K Y D++ + + EF+ GD + +K
Subjt:  APFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSL-EFEVGDPIFLK

P0CT41 Transposon Tf2-12 polyprotein3.9e-7827.81Show/hide
Query:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW
        M +G++ APA F   +N +  E  +  V+ ++DDILI+S+++ EH +H++ VL  L+   L    +KCEF   QV F+G+ +S+ G +     ++ + +W
Subjt:  MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKW

Query:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGK-----
        ++P    ++R FLG   Y R+F+    +++ PL  L KK V ++W      + +N+K+ LV+ P L   + S+  ++ +DAS   +G VL Q        
Subjt:  ERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGK-----

Query:  VVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYL-YPVPPWESKLVVD---ALSRKTVHSSVLITREVRVQ------------REFERANIAVATKG
         V Y S ++ + + NY   D E+ A++ +LK WRHYL   + P+  K++ D    + R T  S     R  R Q            R     +IA A   
Subjt:  VVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYL-YPVPPWESKLVVD---ALSRKTVHSSVLITREVRVQ------------REFERANIAVATKG

Query:  VIAQ---------------LARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGR--LCVPAIEDLRKEILMEAHNSPFAMHPG
        ++ +               + ++++    + +++     D  L  +L   D+ + +       +GLL   +  + +P    L + I+ + H     +HPG
Subjt:  VIAQ---------------LARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGLLYQGR--LCVPAIEDLRKEILMEAHNSPFAMHPG

Query:  GTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDN
           +   + + F WK +++ +  +V  C  CQ  KS   K  G LQP+   E  WE+  MDFI  LP++  GY  ++VVVDR +K A  +P   + T + 
Subjt:  GTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWVVVDRLTKSAHFLPGKVTYTVDN

Query:  WAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGM
         A+++ + ++   G    I++D D  FT   W+         + FS  + PQTDGQTER NQ +E +LR        +W   + L++ SYNN+  +   M
Subjt:  WAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGM

Query:  APFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSL-EFEVGDPIFLK
         PFE ++      SPL      +K     E  + T +  Q ++  + T   + K Y D++ + + EF+ GD + +K
Subjt:  APFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSL-EFEVGDPIFLK

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein7.8e-2644Show/hide
Query:  HLRKVLTTLREHKLYAKFSKCEFWLRQVSFLG--HVVSKDGISVDPTKVEAITKWERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRW
        HL  VL    +H+ YA   KC F   Q+++LG  H++S +G+S DP K+EA+  W  P   T++R FLGL GYYRRFV+++GKI  PLT+L KK  S +W
Subjt:  HLRKVLTTLREHKLYAKFSKCEFWLRQVSFLG--HVVSKDGISVDPTKVEAITKWERPTTVTKVRSFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRW

Query:  DDACEASFQNLKERLVTTPELIVPE
         +    +F+ LK  + T P L +P+
Subjt:  DDACEASFQNLKERLVTTPELIVPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTTGGCCTCACCAATGCTCCAGCTGTATTTATGGAGGTAATGAACCGGGTATTCAAAGAATGCCTAGACATGTTTGTGATTGTGTTCATTGACGACATCCTCAT
ATACTCGAGAACCGACCAAGAGCACGAGGAACACCTCCGAAAAGTCCTAACCACCCTAAGAGAGCACAAGTTGTACGCCAAGTTCTCCAAATGCGAATTTTGGTTACGAC
AAGTCTCTTTCCTAGGACACGTGGTGTCAAAGGACGGAATATCTGTAGATCCCACCAAGGTCGAAGCGATCACAAAGTGGGAACGCCCAACTACGGTAACGAAAGTAAGG
AGTTTCCTAGGATTGGCGGGATATTATCGAAGGTTCGTACAAGACTTCGGTAAAATATCCTCGCCTTTAACAAAGTTAACAAAAAAAGGGGTGTCATTTAGATGGGATGA
TGCTTGTGAGGCAAGCTTCCAGAACCTAAAAGAGAGATTGGTAACCACCCCGGAACTCATAGTACCCGAGAGCTCAGAAGGATATGTGATCTATAGTGATGCCTCCATGA
AAGGACTGGGATGTGTGTTAATGCAACACGGCAAGGTTGTCGCATACGCATCTCGTCAACTTAAAGAATATGAAAAGAACTACCCTACCCATGACCTAGAGTTGGCCGCT
GTAGTGTTCGCGCTGAAAATCTGGCGACATTACCTATATCCAGTACCACCCTGGGAAAGCAAATTGGTTGTAGATGCCTTGAGTAGGAAGACGGTCCACTCGTCGGTCCT
CATTACGAGGGAAGTAAGGGTACAAAGAGAGTTCGAGCGAGCCAACATAGCTGTAGCGACCAAGGGAGTCATAGCACAGCTGGCTCGACTCACGGTACAACCTACGCTTA
GGCAGAGAATTATTGCCTCCCAACGAGAGGATCCTAACCTACAGAAAGTCCTAGGACAGCTAGACGAAAGTCTAGTAGATGGATTCTCGAAGTCATCAGATGAAGGACTA
TTGTATCAGGGACGCTTATGTGTTCCAGCAATAGAAGATTTAAGGAAGGAAATACTGATGGAAGCTCACAACTCACCATTTGCGATGCATCCGGGAGGTACTAAGATGTA
CGAAGATTTAAAACAACACTTTTGGTGGAAGAGCATGAAGAGGGATGTGGCCGGGTTTGTGAGCAAGTGCTTAGTTTGTCAACAAGTGAAATCTCCAAGACAAAAGACAG
CGGGCTTGTTGCAGCCCCTAAGCATACCGGAGTGGAAGTGGGAAAACACAGTGATGGACTTCATAGTAGGTTTACCCAAAACGCCCAAGGGCTATACAGTGATCTGGGTA
GTTGTCGATAGGTTGACCAAGTCGGCACACTTTCTACCTGGGAAGGTTACATATACAGTTGACAATTGGGCACAACTGTACGTGAAAGAAATAGTAAGACTACATGGAGT
CCTAGTGTCTATAGTGTCGGATCGGGATCCACGCTTTACGTTAGCGTTTTGGCGCGGACTTCAAAAAGCACTAGGTACCCGCCTCGACTTTAGTACCGCCTTTCACCCCC
AAACAGATGGACAAACGGAGCGTTTAAACCAAATTCTAGAGGACATGCTACGCGCTTGCGTACTAGATTTTAAGGAAAGTTGGGATTCCAAACTACACCTAATGGAATTC
TCGTATAACAACAGTTTTCAAGCAACTATTGGAATGGCACCGTTTGAGGCCCTGTACGGGAAACGGTGTAGGTCCCCACTGTGTTGGGACGAGGTAGGAGAGAAAGAATT
AGTAGGACCCGAGTTGGTTCGACTCACCAATGAGGCTGTCCAGATAATTCGAGCGAGGATGCGTACCGCTCAAAGTAGACAGAAAAGCTACGCCGATGTAAGGCGTAAAA
GCCTGGAGTTTGAGGTGGGGGACCCAATATTCCTCAAGGTGGCACCTATGAAAGGTGTGTTAAGATTCGGACATAAGGGCAAGTTAAATGCGGACGCGCACCTTATGATG
AGCGCGAATGCGCACAGATGCGAAAGCACACAGATGAGGGTGTTCATGAGGCGCATGACACTATGGGGTTCCGCTGACCTCCGGACGTCGTTACAGATTAGCTTGACCAG
AGGGTTCAGGGGGTGTGCGAGTGCCCTGGGGACTCACACTCGCACGTGTGAGTCGTGTGTAGGGAAGTACTACACATCCAATTTATCCGAGACTGGAGGCCACCCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCATTTGGCCTCACCAATGCTCCAGCTGTATTTATGGAGGTAATGAACCGGGTATTCAAAGAATGCCTAGACATGTTTGTGATTGTGTTCATTGACGACATCCTCAT
ATACTCGAGAACCGACCAAGAGCACGAGGAACACCTCCGAAAAGTCCTAACCACCCTAAGAGAGCACAAGTTGTACGCCAAGTTCTCCAAATGCGAATTTTGGTTACGAC
AAGTCTCTTTCCTAGGACACGTGGTGTCAAAGGACGGAATATCTGTAGATCCCACCAAGGTCGAAGCGATCACAAAGTGGGAACGCCCAACTACGGTAACGAAAGTAAGG
AGTTTCCTAGGATTGGCGGGATATTATCGAAGGTTCGTACAAGACTTCGGTAAAATATCCTCGCCTTTAACAAAGTTAACAAAAAAAGGGGTGTCATTTAGATGGGATGA
TGCTTGTGAGGCAAGCTTCCAGAACCTAAAAGAGAGATTGGTAACCACCCCGGAACTCATAGTACCCGAGAGCTCAGAAGGATATGTGATCTATAGTGATGCCTCCATGA
AAGGACTGGGATGTGTGTTAATGCAACACGGCAAGGTTGTCGCATACGCATCTCGTCAACTTAAAGAATATGAAAAGAACTACCCTACCCATGACCTAGAGTTGGCCGCT
GTAGTGTTCGCGCTGAAAATCTGGCGACATTACCTATATCCAGTACCACCCTGGGAAAGCAAATTGGTTGTAGATGCCTTGAGTAGGAAGACGGTCCACTCGTCGGTCCT
CATTACGAGGGAAGTAAGGGTACAAAGAGAGTTCGAGCGAGCCAACATAGCTGTAGCGACCAAGGGAGTCATAGCACAGCTGGCTCGACTCACGGTACAACCTACGCTTA
GGCAGAGAATTATTGCCTCCCAACGAGAGGATCCTAACCTACAGAAAGTCCTAGGACAGCTAGACGAAAGTCTAGTAGATGGATTCTCGAAGTCATCAGATGAAGGACTA
TTGTATCAGGGACGCTTATGTGTTCCAGCAATAGAAGATTTAAGGAAGGAAATACTGATGGAAGCTCACAACTCACCATTTGCGATGCATCCGGGAGGTACTAAGATGTA
CGAAGATTTAAAACAACACTTTTGGTGGAAGAGCATGAAGAGGGATGTGGCCGGGTTTGTGAGCAAGTGCTTAGTTTGTCAACAAGTGAAATCTCCAAGACAAAAGACAG
CGGGCTTGTTGCAGCCCCTAAGCATACCGGAGTGGAAGTGGGAAAACACAGTGATGGACTTCATAGTAGGTTTACCCAAAACGCCCAAGGGCTATACAGTGATCTGGGTA
GTTGTCGATAGGTTGACCAAGTCGGCACACTTTCTACCTGGGAAGGTTACATATACAGTTGACAATTGGGCACAACTGTACGTGAAAGAAATAGTAAGACTACATGGAGT
CCTAGTGTCTATAGTGTCGGATCGGGATCCACGCTTTACGTTAGCGTTTTGGCGCGGACTTCAAAAAGCACTAGGTACCCGCCTCGACTTTAGTACCGCCTTTCACCCCC
AAACAGATGGACAAACGGAGCGTTTAAACCAAATTCTAGAGGACATGCTACGCGCTTGCGTACTAGATTTTAAGGAAAGTTGGGATTCCAAACTACACCTAATGGAATTC
TCGTATAACAACAGTTTTCAAGCAACTATTGGAATGGCACCGTTTGAGGCCCTGTACGGGAAACGGTGTAGGTCCCCACTGTGTTGGGACGAGGTAGGAGAGAAAGAATT
AGTAGGACCCGAGTTGGTTCGACTCACCAATGAGGCTGTCCAGATAATTCGAGCGAGGATGCGTACCGCTCAAAGTAGACAGAAAAGCTACGCCGATGTAAGGCGTAAAA
GCCTGGAGTTTGAGGTGGGGGACCCAATATTCCTCAAGGTGGCACCTATGAAAGGTGTGTTAAGATTCGGACATAAGGGCAAGTTAAATGCGGACGCGCACCTTATGATG
AGCGCGAATGCGCACAGATGCGAAAGCACACAGATGAGGGTGTTCATGAGGCGCATGACACTATGGGGTTCCGCTGACCTCCGGACGTCGTTACAGATTAGCTTGACCAG
AGGGTTCAGGGGGTGTGCGAGTGCCCTGGGGACTCACACTCGCACGTGTGAGTCGTGTGTAGGGAAGTACTACACATCCAATTTATCCGAGACTGGAGGCCACCCCTAA
Protein sequenceShow/hide protein sequence
MSFGLTNAPAVFMEVMNRVFKECLDMFVIVFIDDILIYSRTDQEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTKVR
SFLGLAGYYRRFVQDFGKISSPLTKLTKKGVSFRWDDACEASFQNLKERLVTTPELIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLELAA
VVFALKIWRHYLYPVPPWESKLVVDALSRKTVHSSVLITREVRVQREFERANIAVATKGVIAQLARLTVQPTLRQRIIASQREDPNLQKVLGQLDESLVDGFSKSSDEGL
LYQGRLCVPAIEDLRKEILMEAHNSPFAMHPGGTKMYEDLKQHFWWKSMKRDVAGFVSKCLVCQQVKSPRQKTAGLLQPLSIPEWKWENTVMDFIVGLPKTPKGYTVIWV
VVDRLTKSAHFLPGKVTYTVDNWAQLYVKEIVRLHGVLVSIVSDRDPRFTLAFWRGLQKALGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLDFKESWDSKLHLMEF
SYNNSFQATIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQIIRARMRTAQSRQKSYADVRRKSLEFEVGDPIFLKVAPMKGVLRFGHKGKLNADAHLMM
SANAHRCESTQMRVFMRRMTLWGSADLRTSLQISLTRGFRGCASALGTHTRTCESCVGKYYTSNLSETGGHP