; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0023416 (gene) of Chayote v1 genome

Gene IDSed0023416
OrganismSechium edule (Chayote v1)
DescriptionReverse transcriptase
Genome locationLG01:32532429..32535278
RNA-Seq ExpressionSed0023416
SyntenySed0023416
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025242.1 pol protein [Cucumis melo var. makuwa]1.4e-19845.84Show/hide
Query:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL
        MP  R T RGG R GRG       AG   Q E   + P   P AP      T+    A    + D ++A +   L+ Q+++   +   Q E + AQ +  
Subjt:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL

Query:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP
            Q          Q  P    AE      + L+DF K+ P+ FD + D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER 
Subjt:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP

Query:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT
         G   +  TWE FK  F  K+ S   +   L                              +     +  KF+ GLR  +   VRA  P T+A A+++A 
Subjt:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT

Query:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH
          +   P     +    R   +A+GQKRK + +         R  G FQ   +HR  +    R     R++PAC +CG+VH G C  G  VCFRC + GH
Subjt:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH

Query:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK
            C +      P Q    QQGRVFATTRQEAER    V G            F+                          LSV+TPSG+ +L++++IK
Subjt:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK

Query:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS
          R+EI+  +L  TL+VL M  FDVILGM+WL+ANHA IDC+ KEVVF PP   SFKFRG  G VC P ++SA+KA +LL+ G WG LA +     P +S
Subjt:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS

Query:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
        +   PVV+E+ DVFP+ELPGLPP REVDF IELEP T PIS+APYRMAPAELKELKVQLQE+LD+GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
Subjt:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK

Query:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------
        VTVKN+Y LPRIDDLFDQLQGA+VFSKIDLRSGYHQLRIRD DIPK AFRSRYGHYEFVVMSFGLTNAPA FMD+M+R                      
Subjt:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------

Query:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG
                                          V FLGHVVS +G+ VDPAKIEAV++W +P++V+EIRSFLGLAGYYRRFV+DFS IASPLT+LTRKG
Subjt:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG

Query:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS
          FVW   C+ SF  L   L +  VL    +G+G FV+YSDASK GLGCVLMQ GKV+AYAS
Subjt:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS

KAA0025848.1 pol protein [Cucumis melo var. makuwa]3.1e-19845.84Show/hide
Query:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL
        MP  R T RGG R GRG       AG   Q E   + P   P AP      T+    A    + D ++A +   L+ Q+++   +   Q E + AQ +  
Subjt:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL

Query:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP
            Q          Q  P V ++  A    + L+DF K+ P+ FD + D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER 
Subjt:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP

Query:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT
         G   +  TWE FK  F  K+ S   +   L                              +     +  KF+ GLR  +   VRA  P T+A A+++A 
Subjt:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT

Query:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH
          +   P     +    R   +A+GQKRK + +         R  G FQ   +HR  +    R     R++PAC +CG+VH G C  G  VCFRC + GH
Subjt:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH

Query:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK
            C +      P Q    QQGRVFATTRQEAER    V G            F+                          LSV+TPSG+ +L++++IK
Subjt:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK

Query:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS
          R+EI+  +L  TL+VL M  FDVILGM+WL+ANHA IDC+ KEVVF PP   SFKFRG  G VC P ++SA+KA +LL+ G WG LA +     P +S
Subjt:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS

Query:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
        +   PVV+E+ DVFP+ELPGLPP REVDF IELEP T PIS+APYRMAPAELKELKVQLQE+LD+GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
Subjt:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK

Query:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------
        VTVKN+Y LPRIDDLFDQLQGA+VFSKIDLRSGYHQLRIRD DIPK AFRSRYGHYEFVVMSFGLTNAPA FMD+M+R                      
Subjt:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------

Query:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG
                                          V FLGHVVS +G+ VDPAKIEAV++W +P++V+EIRSFLGLAGYYRRFV+DFS IASPLT+LTRKG
Subjt:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG

Query:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS
          FVW   C+ SF  L   L +  VL    +G+G FV+YSDASK GLGCVLMQ GKV+AYAS
Subjt:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS

KAA0033825.1 pol protein [Cucumis melo var. makuwa]3.1e-19845.84Show/hide
Query:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL
        MP  R T RGG R GRG       AG   Q E   + P   P AP      T+    A    + D ++A +   L+ Q+++   +   Q E + AQ +  
Subjt:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL

Query:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP
            Q          Q  P V ++  A    + L+DF K+ P+ FD + D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER 
Subjt:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP

Query:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT
         G   +  TWE FK  F  K+ S   +   L                              +     +  KF+ GLR  +   VRA  P T+A A+++A 
Subjt:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT

Query:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH
          +   P     +    R   +A+GQKRK + +         R  G FQ   +HR  +    R     R++PAC +CG+VH G C  G  VCFRC + GH
Subjt:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH

Query:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK
            C +      P Q    QQGRVFATTRQEAER    V G            F+                          LSV+TPSG+ +L++++IK
Subjt:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK

Query:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS
          R+EI+  +L  TL+VL M  FDVILGM+WL+ANHA IDC+ KEVVF PP   SFKFRG  G VC P ++SA+KA +LL+ G WG LA +     P +S
Subjt:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS

Query:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
        +   PVV+E+ DVFP+ELPGLPP REVDF IELEP T PIS+APYRMAPAELKELKVQLQE+LD+GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
Subjt:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK

Query:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------
        VTVKN+Y LPRIDDLFDQLQGA+VFSKIDLRSGYHQLRIRD DIPK AFRSRYGHYEFVVMSFGLTNAPA FMD+M+R                      
Subjt:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------

Query:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG
                                          V FLGHVVS +G+ VDPAKIEAV++W +P++V+EIRSFLGLAGYYRRFV+DFS IASPLT+LTRKG
Subjt:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG

Query:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS
          FVW   C+ SF  L   L +  VL    +G+G FV+YSDASK GLGCVLMQ GKV+AYAS
Subjt:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS

TYJ95850.1 pol protein [Cucumis melo var. makuwa]1.4e-19845.84Show/hide
Query:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL
        MP  R T RGG R GRG       AG   Q E   + P   P AP      T+    A    + D ++A +   L+ Q+++   +   Q E + AQ +  
Subjt:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL

Query:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP
            Q          Q  P    AE      + L+DF K+ P+ FD + D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER 
Subjt:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP

Query:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT
         G   +  TWE FK  F  K+ S   +   L                              +     +  KF+ GLR  +   VRA  P T+A A+++A 
Subjt:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT

Query:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH
          +   P     +    R   +A+GQKRK + +         R  G FQ   +HR  +    R     R++PAC +CG+VH G C  G  VCFRC + GH
Subjt:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH

Query:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK
            C +      P Q    QQGRVFATTRQEAER    V G            F+                          LSV+TPSG+ +L++++IK
Subjt:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK

Query:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS
          R+EI+  +L  TL+VL M  FDVILGM+WL+ANHA IDC+ KEVVF PP   SFKFRG  G VC P ++SA+KA +LL+ G WG LA +     P +S
Subjt:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS

Query:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
        +   PVV+E+ DVFP+ELPGLPP REVDF IELEP T PIS+APYRMAPAELKELKVQLQE+LD+GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
Subjt:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK

Query:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------
        VTVKN+Y LPRIDDLFDQLQGA+VFSKIDLRSGYHQLRIRD DIPK AFRSRYGHYEFVVMSFGLTNAPA FMD+M+R                      
Subjt:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------

Query:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG
                                          V FLGHVVS +G+ VDPAKIEAV++W +P++V+EIRSFLGLAGYYRRFV+DFS IASPLT+LTRKG
Subjt:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG

Query:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS
          FVW   C+ SF  L   L +  VL    +G+G FV+YSDASK GLGCVLMQ GKV+AYAS
Subjt:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS

TYK20443.1 pol protein [Cucumis melo var. makuwa]3.1e-19845.84Show/hide
Query:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL
        MP  R T RGG R GRG       AG   Q E   + P   P AP      T+    A    + D ++A +   L+ Q+++   +   Q E + AQ +  
Subjt:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL

Query:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP
            Q          Q  P V ++  A    + L+DF K+ P+ FD + D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER 
Subjt:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP

Query:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT
         G   +  TWE FK  F  K+ S   +   L                              +     +  KF+ GLR  +   VRA  P T+A A+++A 
Subjt:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT

Query:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH
          +   P     +    R   +A+GQKRK + +         R  G FQ   +HR  +    R     R++PAC +CG+VH G C  G  VCFRC + GH
Subjt:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH

Query:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK
            C +      P Q    QQGRVFATTRQEAER    V G            F+                          LSV+TPSG+ +L++++IK
Subjt:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK

Query:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS
          R+EI+  +L  TL+VL M  FDVILGM+WL+ANHA IDC+ KEVVF PP   SFKFRG  G VC P ++SA+KA +LL+ G WG LA +     P +S
Subjt:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS

Query:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
        +   PVV+E+ DVFP+ELPGLPP REVDF IELEP T PIS+APYRMAPAELKELKVQLQE+LD+GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
Subjt:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK

Query:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------
        VTVKN+Y LPRIDDLFDQLQGA+VFSKIDLRSGYHQLRIRD DIPK AFRSRYGHYEFVVMSFGLTNAPA FMD+M+R                      
Subjt:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------

Query:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG
                                          V FLGHVVS +G+ VDPAKIEAV++W +P++V+EIRSFLGLAGYYRRFV+DFS IASPLT+LTRKG
Subjt:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG

Query:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS
          FVW   C+ SF  L   L +  VL    +G+G FV+YSDASK GLGCVLMQ GKV+AYAS
Subjt:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS

TrEMBL top hitse value%identityAlignment
A0A5A7TEQ2 Reverse transcriptase1.5e-19845.84Show/hide
Query:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL
        MP  R T RGG R GRG       AG   Q E   + P   P AP      T+    A    + D ++A +   L+ Q+++   +   Q E + AQ +  
Subjt:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL

Query:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP
            Q          Q  P V ++  A    + L+DF K+ P+ FD + D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER 
Subjt:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP

Query:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT
         G   +  TWE FK  F  K+ S   +   L                              +     +  KF+ GLR  +   VRA  P T+A A+++A 
Subjt:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT

Query:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH
          +   P     +    R   +A+GQKRK + +         R  G FQ   +HR  +    R     R++PAC +CG+VH G C  G  VCFRC + GH
Subjt:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH

Query:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK
            C +      P Q    QQGRVFATTRQEAER    V G            F+                          LSV+TPSG+ +L++++IK
Subjt:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK

Query:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS
          R+EI+  +L  TL+VL M  FDVILGM+WL+ANHA IDC+ KEVVF PP   SFKFRG  G VC P ++SA+KA +LL+ G WG LA +     P +S
Subjt:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS

Query:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
        +   PVV+E+ DVFP+ELPGLPP REVDF IELEP T PIS+APYRMAPAELKELKVQLQE+LD+GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
Subjt:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK

Query:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------
        VTVKN+Y LPRIDDLFDQLQGA+VFSKIDLRSGYHQLRIRD DIPK AFRSRYGHYEFVVMSFGLTNAPA FMD+M+R                      
Subjt:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------

Query:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG
                                          V FLGHVVS +G+ VDPAKIEAV++W +P++V+EIRSFLGLAGYYRRFV+DFS IASPLT+LTRKG
Subjt:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG

Query:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS
          FVW   C+ SF  L   L +  VL    +G+G FV+YSDASK GLGCVLMQ GKV+AYAS
Subjt:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS

A0A5A7TSL0 Reverse transcriptase6.7e-19945.84Show/hide
Query:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL
        MP  R T RGG R GRG       AG   Q E   + P   P AP      T+    A    + D ++A +   L+ Q+++   +   Q E + AQ +  
Subjt:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL

Query:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP
            Q          Q  P    AE      + L+DF K+ P+ FD + D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER 
Subjt:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP

Query:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT
         G   +  TWE FK  F  K+ S   +   L                              +     +  KF+ GLR  +   VRA  P T+A A+++A 
Subjt:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT

Query:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH
          +   P     +    R   +A+GQKRK + +         R  G FQ   +HR  +    R     R++PAC +CG+VH G C  G  VCFRC + GH
Subjt:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH

Query:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK
            C +      P Q    QQGRVFATTRQEAER    V G            F+                          LSV+TPSG+ +L++++IK
Subjt:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK

Query:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS
          R+EI+  +L  TL+VL M  FDVILGM+WL+ANHA IDC+ KEVVF PP   SFKFRG  G VC P ++SA+KA +LL+ G WG LA +     P +S
Subjt:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS

Query:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
        +   PVV+E+ DVFP+ELPGLPP REVDF IELEP T PIS+APYRMAPAELKELKVQLQE+LD+GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
Subjt:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK

Query:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------
        VTVKN+Y LPRIDDLFDQLQGA+VFSKIDLRSGYHQLRIRD DIPK AFRSRYGHYEFVVMSFGLTNAPA FMD+M+R                      
Subjt:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------

Query:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG
                                          V FLGHVVS +G+ VDPAKIEAV++W +P++V+EIRSFLGLAGYYRRFV+DFS IASPLT+LTRKG
Subjt:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG

Query:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS
          FVW   C+ SF  L   L +  VL    +G+G FV+YSDASK GLGCVLMQ GKV+AYAS
Subjt:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS

A0A5A7UBS1 Reverse transcriptase1.5e-19845.84Show/hide
Query:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL
        MP  R T RGG R GRG       AG   Q E   + P   P AP      T+    A    + D ++A +   L+ Q+++   +   Q E + AQ +  
Subjt:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL

Query:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP
            Q          Q  P V ++  A    + L+DF K+ P+ FD + D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER 
Subjt:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP

Query:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT
         G   +  TWE FK  F  K+ S   +   L                              +     +  KF+ GLR  +   VRA  P T+A A+++A 
Subjt:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT

Query:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH
          +   P     +    R   +A+GQKRK + +         R  G FQ   +HR  +    R     R++PAC +CG+VH G C  G  VCFRC + GH
Subjt:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH

Query:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK
            C +      P Q    QQGRVFATTRQEAER    V G            F+                          LSV+TPSG+ +L++++IK
Subjt:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK

Query:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS
          R+EI+  +L  TL+VL M  FDVILGM+WL+ANHA IDC+ KEVVF PP   SFKFRG  G VC P ++SA+KA +LL+ G WG LA +     P +S
Subjt:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS

Query:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
        +   PVV+E+ DVFP+ELPGLPP REVDF IELEP T PIS+APYRMAPAELKELKVQLQE+LD+GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
Subjt:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK

Query:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------
        VTVKN+Y LPRIDDLFDQLQGA+VFSKIDLRSGYHQLRIRD DIPK AFRSRYGHYEFVVMSFGLTNAPA FMD+M+R                      
Subjt:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------

Query:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG
                                          V FLGHVVS +G+ VDPAKIEAV++W +P++V+EIRSFLGLAGYYRRFV+DFS IASPLT+LTRKG
Subjt:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG

Query:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS
          FVW   C+ SF  L   L +  VL    +G+G FV+YSDASK GLGCVLMQ GKV+AYAS
Subjt:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS

A0A5D3CQB5 Reverse transcriptase6.7e-19945.84Show/hide
Query:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL
        MP  R T RGG R GRG       AG   Q E   + P   P AP      T+    A    + D ++A +   L+ Q+++   +   Q E + AQ +  
Subjt:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL

Query:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP
            Q          Q  P    AE      + L+DF K+ P+ FD + D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER 
Subjt:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP

Query:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT
         G   +  TWE FK  F  K+ S   +   L                              +     +  KF+ GLR  +   VRA  P T+A A+++A 
Subjt:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT

Query:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH
          +   P     +    R   +A+GQKRK + +         R  G FQ   +HR  +    R     R++PAC +CG+VH G C  G  VCFRC + GH
Subjt:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH

Query:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK
            C +      P Q    QQGRVFATTRQEAER    V G            F+                          LSV+TPSG+ +L++++IK
Subjt:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK

Query:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS
          R+EI+  +L  TL+VL M  FDVILGM+WL+ANHA IDC+ KEVVF PP   SFKFRG  G VC P ++SA+KA +LL+ G WG LA +     P +S
Subjt:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS

Query:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
        +   PVV+E+ DVFP+ELPGLPP REVDF IELEP T PIS+APYRMAPAELKELKVQLQE+LD+GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
Subjt:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK

Query:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------
        VTVKN+Y LPRIDDLFDQLQGA+VFSKIDLRSGYHQLRIRD DIPK AFRSRYGHYEFVVMSFGLTNAPA FMD+M+R                      
Subjt:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------

Query:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG
                                          V FLGHVVS +G+ VDPAKIEAV++W +P++V+EIRSFLGLAGYYRRFV+DFS IASPLT+LTRKG
Subjt:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG

Query:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS
          FVW   C+ SF  L   L +  VL    +G+G FV+YSDASK GLGCVLMQ GKV+AYAS
Subjt:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS

A0A5D3DY07 Reverse transcriptase1.5e-19845.84Show/hide
Query:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL
        MP  R T RGG R GRG       AG   Q E   + P   P AP      T+    A    + D ++A +   L+ Q+++   +   Q E + AQ +  
Subjt:  MPVTR-TERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQL

Query:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP
            Q          Q  P V ++  A    + L+DF K+ P+ FD + D    A  W+  +E  F  M CP+ Q+ +CA   L      WWE+  +ER 
Subjt:  TRMFQRGRKALHSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGD-ALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERP

Query:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT
         G   +  TWE FK  F  K+ S   +   L                              +     +  KF+ GLR  +   VRA  P T+A A+++A 
Subjt:  AGSPPA--TWEFFKSEFKVKYISDEAQEKML------------------------------IATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLAT

Query:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH
          +   P     +    R   +A+GQKRK + +         R  G FQ   +HR  +    R     R++PAC +CG+VH G C  G  VCFRC + GH
Subjt:  IYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGG-----GQRRPGQFQNRDQHRAPIRFQPRQQGGPRQVPACASCGKVHTGACQPGPRVCFRCGKEGH

Query:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK
            C +      P Q    QQGRVFATTRQEAER    V G            F+                          LSV+TPSG+ +L++++IK
Subjt:  VVRFCTQPAMAAQPAQ----QQGRVFATTRQEAERTDAAVAG------------FE--------------------------LSVATPSGDDMLARDKIK

Query:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS
          R+EI+  +L  TL+VL M  FDVILGM+WL+ANHA IDC+ KEVVF PP   SFKFRG  G VC P ++SA+KA +LL+ G WG LA +     P +S
Subjt:  NGRIEISGELLVATLIVLPMSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVC-PPMVSAVKARRLLAHGAWGYLAKLSAESHPSLS

Query:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
        +   PVV+E+ DVFP+ELPGLPP REVDF IELEP T PIS+APYRMAPAELKELKVQLQE+LD+GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
Subjt:  VDQVPVVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK

Query:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------
        VTVKN+Y LPRIDDLFDQLQGA+VFSKIDLRSGYHQLRIRD DIPK AFRSRYGHYEFVVMSFGLTNAPA FMD+M+R                      
Subjt:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDR----------------------

Query:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG
                                          V FLGHVVS +G+ VDPAKIEAV++W +P++V+EIRSFLGLAGYYRRFV+DFS IASPLT+LTRKG
Subjt:  ----------------------------------VAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG

Query:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS
          FVW   C+ SF  L   L +  VL    +G+G FV+YSDASK GLGCVLMQ GKV+AYAS
Subjt:  VEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.63.5e-4331.4Show/hide
Query:  VVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNK
        ++Q++ D+   E   L    +   TI  +   LP+  + Y    A  +E++ Q+Q+ML++G IR S SP+ +P+  V KK+D S     R+ IDYR+LN+
Subjt:  VVQEFADVFPEELPGLPPKREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNK

Query:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDRV---------------------
        +TV +++ +P +D++  +L   + F+ IDL  G+HQ+ +    + K AF +++GHYE++ M FGL NAPA F   M+ +                     
Subjt:  VTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDRV---------------------

Query:  -----------------------------------AFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG
                                            FLGHV++ DGI  +P KIEA+  +  PT   EI++FLGL GYYR+F+ +F+ IA P+T+  +K 
Subjt:  -----------------------------------AFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKG

Query:  VEF-VWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS
        ++    + + D +F  L   L S+  + K  + T  F + +DAS   LG VL Q G  ++Y S
Subjt:  VEF-VWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYAS

P10394 Retrovirus-related Pol polyprotein from transposon 4123.4e-3828.5Show/hide
Query:  QEFADVFPEELPGLPPKREVDFTIELEPRTL--------------PISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDG------
        + F ++F  +L  +  +    F +E EP T+              P+    YR   ++++E++ Q+Q+++    + PSVS + +P+L V KK        
Subjt:  QEFADVFPEELPGLPPKREVDFTIELEPRTL--------------PISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDG------

Query:  SMRLCIDYRELNKVTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDM------------
          RL IDYR++NK  + +K+ LPRIDD+ DQL  A  FS +DL SG+HQ+ + +      +F +  G Y F  + FGL  AP +F  M            
Subjt:  SMRLCIDYRELNKVTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDM------------

Query:  --------------------------------------------MDRVAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFS
                                                    M  V FLGH  +  GI+ D  K + + ++  P      R F+    YYRRF+K+F+
Subjt:  --------------------------------------------MDRVAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFS

Query:  AIASPLTRLTRKGVEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQ----HGKVIAYAS
          +  +TRL +K V F W  +C ++F  L + L + T+L +  + +  F + +DASK   G VL Q    H   +AYAS
Subjt:  AIASPLTRLTRKGVEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQ----HGKVIAYAS

P20825 Retrovirus-related Pol polyprotein from transposon 2975.0e-4232.12Show/hide
Query:  PISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSG
        PI    Y +A     E++ Q+QEML++G IR S SP+ +P   V KK         R+ IDYR+LN++T+ ++Y +P +D++  +L     F+ IDL  G
Subjt:  PISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSG

Query:  YHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDRVA-----------------------------------------------------
        +HQ+ + +  I K AF ++ GHYE++ M FGL NAPA F   M+ +                                                      
Subjt:  YHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDRVA-----------------------------------------------------

Query:  ---FLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKGVEF-VWDSKCDESFAVLCAALTSKTVLRKSQEG
           FLGH+V+ DGI  +P K++A+ S+  PT   EIR+FLGL GYYR+F+ +++ IA P+T   +K  +      +  E+F  L A +    +L +  + 
Subjt:  ---FLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKGVEF-VWDSKCDESFAVLCAALTSKTVLRKSQEG

Query:  TGGFVVYSDASKFGLGCVLMQHGKVIAYAS
           FV+ +DAS   LG VL Q+G  I++ S
Subjt:  TGGFVVYSDASKFGLGCVLMQHGKVIAYAS

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.4e-3830.72Show/hide
Query:  QEFADVFPEELPGLPP---KREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVK
        Q++ ++   +LP  P       V   IE++P        PY +     +E+   +Q++LD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ 
Subjt:  QEFADVFPEELPGLPP---KREVDFTIELEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVK

Query:  NKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMM----------------------------
        + + LPRID+L  ++  A +F+ +DL SGYHQ+ +   D  K AF +  G YE+ VM FGL NAP+ F   M                            
Subjt:  NKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMM----------------------------

Query:  --------------------------DRVAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKGVEFVWD
                                  +   FLG+ +    I     K  A+  +  P +V + + FLG+  YYRRF+ + S IA P+        +  W 
Subjt:  --------------------------DRVAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKGVEFVWD

Query:  SKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVL
         K D++   L AAL +  VL         + + +DASK G+G VL
Subjt:  SKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.0e-4229.81Show/hide
Query:  TLIVLP-MSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVCPPMVSAVKARRLLAHGAWGYLAKLSAESHPSLSVDQV-PVVQEFAD
        T  VLP +  FD I+G + L    A++D     ++  P             G+  P         LLA  +      L+AE HP  + + +  ++ EF  
Subjt:  TLIVLP-MSGFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVCPPMVSAVKARRLLAHGAWGYLAKLSAESHPSLSVDQV-PVVQEFAD

Query:  VFPEELPGLPPKREVDFTIELEPRTL---PISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDYRELNKVTVK
        +F   L G+     V+  ++ E RT    PI    Y        E++ Q+ E+L  G IRPS SP+ +P+  V KK     +   R+ +D++ LN VT+ 
Subjt:  VFPEELPGLPPKREVDFTIELEPRTL---PISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDYRELNKVTVK

Query:  NKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMD---------------------------
        + Y +P I+     L  A  F+ +DL SG+HQ+ ++++DIPK AF +  G YEF+ + FGL NAPA F  M+D                           
Subjt:  NKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDADIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMD---------------------------

Query:  -----------------------------RVAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTR------
                                     +V FLG++V+ DGI  DP K+ A+S    PTSV E++ FLG+  YYR+F++D++ +A PLT LTR      
Subjt:  -----------------------------RVAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTR------

Query:  -----KGVEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQ----HGKVIAYAS
               V    D    +SF  L + L S  +L      T  F + +DAS + +G VL Q      + IAY S
Subjt:  -----KGVEFVWDSKCDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQ----HGKVIAYAS

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.9e-1643.16Show/hide
Query:  RVAFLG--HVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKGVEFVWDSKCDESFAVLCAALTSKTVL
        ++A+LG  H++S +G+  DPAK+EA+  W +P +  E+R FLGL GYYRRFVK++  I  PLT L +K     W      +F  L  A+T+  VL
Subjt:  RVAFLG--HVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKGVEFVWDSKCDESFAVLCAALTSKTVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAGTCACCCGCACCGAAAGAGGAGGTTCGAGAGCAGGTCGAGGTATCAGGATCGAATCTGATACTGCAGGGACATCGTCCCAAGCTGAGACTGATGGCATTCCACC
AGTAGCTCCCCCTGGTGCTCCTAGAGGAGGCGTTGAGGCGACCGAGCGCCGTGAGACTGCTTTTGGAGCAGATTTTGCTGATTCTATGAGGGCTATGATGGTCAGCCTGT
TGAGTGACCAGAGGTCAGAGACCCAGGCCTTGTTGGCAGCGCAGCGGGAAGAGTCCCGAGCTCAAATTGAGCAGTTGACCCGCATGTTTCAAAGAGGGCGCAAAGCACTT
CACTCCACAGAGCAAGTAGCCCCAGAGGTAGATATGGCAGAGGGTGCCGGGGAGTTGGGTAGATGGTTGAAGGACTTCGTGAAGTGGAAGCCTGAGAAGTTTGATGCCAC
AGGAGATGCTTTAGCGGCAGCCAGATGGATTGCCCATTTGGAGTACACCTTCCTGATTATGGTGTGCCCAGATGTTCAGAGGCCGAGGTGTGCGGCTCATGTGCTAGGAG
GCACAACGAGAAGGTGGTGGGAATCCACCTTGAGTGAGAGGCCAGCTGGGTCTCCACCTGCTACTTGGGAGTTCTTCAAGTCGGAGTTCAAGGTCAAGTACATCAGTGAT
GAGGCTCAAGAGAAGATGTTGATAGCCACCCCAGAGTTAAAGATTCGGAAGTTCATCTCTGGTTTGAGGCAGGGGGTTTCCCAGGATGTCAGGGCTCAGGCCCCTGACAC
TTATGCTAGAGCTGTTCAGTTAGCCACCATTTATAATGGGAAGTATCCTGTTGAGCGGATTGTTGCCACTCCGGTTCAGCGTCCTGTCCCAGCTGCAGTGGGGCAGAAGA
GGAAGTTTGATTTCCGAGGAGGCGGCCAGCGCCGTCCAGGTCAGTTTCAGAACAGGGATCAGCATAGAGCCCCGATCCGTTTCCAGCCTCGCCAGCAGGGCGGTCCCAGG
CAAGTTCCCGCTTGTGCTTCTTGTGGGAAAGTCCACACTGGAGCTTGTCAGCCAGGGCCCAGGGTGTGTTTTCGCTGTGGTAAGGAGGGGCATGTGGTTCGTTTTTGTAC
TCAGCCAGCTATGGCAGCTCAGCCAGCGCAGCAACAAGGTCGGGTGTTTGCTACGACCCGTCAGGAGGCCGAGCGTACGGATGCCGCTGTGGCAGGTTTTGAGTTGTCTG
TTGCAACTCCTTCGGGAGATGATATGTTAGCTAGAGATAAGATTAAGAATGGCCGGATCGAGATATCCGGAGAGTTGCTTGTGGCTACGCTCATAGTTCTTCCTATGAGC
GGTTTTGATGTGATTCTTGGTATGGAGTGGTTAGCTGCTAACCACGCGTTGATTGACTGCCACCGGAAAGAAGTAGTGTTCAGGCCGCCAGGTCGGCCCAGCTTTAAGTT
TCGTGGAGTCCAGGGTGGAGTTTGCCCGCCGATGGTGTCAGCCGTGAAAGCTCGTCGTTTGCTAGCTCATGGAGCTTGGGGCTATCTAGCCAAGTTGTCTGCAGAGAGCC
ACCCCTCTCTGTCTGTTGATCAGGTGCCGGTTGTGCAGGAGTTTGCGGATGTGTTTCCTGAGGAGTTACCAGGGTTGCCCCCGAAGAGAGAGGTTGACTTCACAATTGAG
CTCGAGCCAAGGACCCTCCCTATCTCTAAGGCTCCGTATCGTATGGCGCCAGCCGAGTTAAAGGAACTAAAGGTTCAGTTGCAGGAGATGTTAGACCGAGGGTTCATCCG
TCCGAGTGTGTCCCCATGGGGAGCACCAGTTCTATTTGTGAAGAAGAAGGATGGTTCGATGCGCTTATGTATTGATTATCGCGAGCTTAATAAGGTGACTGTGAAGAATA
AGTATCTGTTGCCCCGTATTGATGATTTGTTCGATCAGCTTCAGGGAGCCTCCGTGTTCTCCAAGATAGATCTGAGATCCGGTTATCATCAATTGCGGATCAGGGATGCA
GACATTCCTAAGAAAGCTTTCAGGTCTAGATATGGTCACTACGAGTTTGTGGTCATGTCGTTTGGATTGACTAATGCCCCAGCCGCGTTTATGGACATGATGGATAGGGT
GGCCTTCCTAGGTCACGTAGTTTCGAGAGATGGTATAGTCGTGGACCCCGCAAAGATCGAGGCCGTGAGTTCGTGGGCCAAGCCGACTTCAGTTGCCGAGATACGTAGCT
TTCTGGGATTGGCAGGTTACTACCGTCGGTTCGTTAAGGACTTTTCCGCCATAGCCTCGCCCCTCACTCGTCTGACGAGGAAGGGTGTCGAGTTTGTTTGGGACAGTAAG
TGTGATGAGAGCTTTGCTGTGCTCTGCGCAGCTTTGACTTCAAAGACGGTGTTGCGCAAATCCCAGGAGGGGACAGGTGGTTTCGTGGTCTACAGCGATGCTTCCAAGTT
TGGGTTAGGTTGTGTGTTGATGCAGCACGGGAAGGTCATTGCCTATGCTTCCTGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCAGTCACCCGCACCGAAAGAGGAGGTTCGAGAGCAGGTCGAGGTATCAGGATCGAATCTGATACTGCAGGGACATCGTCCCAAGCTGAGACTGATGGCATTCCACC
AGTAGCTCCCCCTGGTGCTCCTAGAGGAGGCGTTGAGGCGACCGAGCGCCGTGAGACTGCTTTTGGAGCAGATTTTGCTGATTCTATGAGGGCTATGATGGTCAGCCTGT
TGAGTGACCAGAGGTCAGAGACCCAGGCCTTGTTGGCAGCGCAGCGGGAAGAGTCCCGAGCTCAAATTGAGCAGTTGACCCGCATGTTTCAAAGAGGGCGCAAAGCACTT
CACTCCACAGAGCAAGTAGCCCCAGAGGTAGATATGGCAGAGGGTGCCGGGGAGTTGGGTAGATGGTTGAAGGACTTCGTGAAGTGGAAGCCTGAGAAGTTTGATGCCAC
AGGAGATGCTTTAGCGGCAGCCAGATGGATTGCCCATTTGGAGTACACCTTCCTGATTATGGTGTGCCCAGATGTTCAGAGGCCGAGGTGTGCGGCTCATGTGCTAGGAG
GCACAACGAGAAGGTGGTGGGAATCCACCTTGAGTGAGAGGCCAGCTGGGTCTCCACCTGCTACTTGGGAGTTCTTCAAGTCGGAGTTCAAGGTCAAGTACATCAGTGAT
GAGGCTCAAGAGAAGATGTTGATAGCCACCCCAGAGTTAAAGATTCGGAAGTTCATCTCTGGTTTGAGGCAGGGGGTTTCCCAGGATGTCAGGGCTCAGGCCCCTGACAC
TTATGCTAGAGCTGTTCAGTTAGCCACCATTTATAATGGGAAGTATCCTGTTGAGCGGATTGTTGCCACTCCGGTTCAGCGTCCTGTCCCAGCTGCAGTGGGGCAGAAGA
GGAAGTTTGATTTCCGAGGAGGCGGCCAGCGCCGTCCAGGTCAGTTTCAGAACAGGGATCAGCATAGAGCCCCGATCCGTTTCCAGCCTCGCCAGCAGGGCGGTCCCAGG
CAAGTTCCCGCTTGTGCTTCTTGTGGGAAAGTCCACACTGGAGCTTGTCAGCCAGGGCCCAGGGTGTGTTTTCGCTGTGGTAAGGAGGGGCATGTGGTTCGTTTTTGTAC
TCAGCCAGCTATGGCAGCTCAGCCAGCGCAGCAACAAGGTCGGGTGTTTGCTACGACCCGTCAGGAGGCCGAGCGTACGGATGCCGCTGTGGCAGGTTTTGAGTTGTCTG
TTGCAACTCCTTCGGGAGATGATATGTTAGCTAGAGATAAGATTAAGAATGGCCGGATCGAGATATCCGGAGAGTTGCTTGTGGCTACGCTCATAGTTCTTCCTATGAGC
GGTTTTGATGTGATTCTTGGTATGGAGTGGTTAGCTGCTAACCACGCGTTGATTGACTGCCACCGGAAAGAAGTAGTGTTCAGGCCGCCAGGTCGGCCCAGCTTTAAGTT
TCGTGGAGTCCAGGGTGGAGTTTGCCCGCCGATGGTGTCAGCCGTGAAAGCTCGTCGTTTGCTAGCTCATGGAGCTTGGGGCTATCTAGCCAAGTTGTCTGCAGAGAGCC
ACCCCTCTCTGTCTGTTGATCAGGTGCCGGTTGTGCAGGAGTTTGCGGATGTGTTTCCTGAGGAGTTACCAGGGTTGCCCCCGAAGAGAGAGGTTGACTTCACAATTGAG
CTCGAGCCAAGGACCCTCCCTATCTCTAAGGCTCCGTATCGTATGGCGCCAGCCGAGTTAAAGGAACTAAAGGTTCAGTTGCAGGAGATGTTAGACCGAGGGTTCATCCG
TCCGAGTGTGTCCCCATGGGGAGCACCAGTTCTATTTGTGAAGAAGAAGGATGGTTCGATGCGCTTATGTATTGATTATCGCGAGCTTAATAAGGTGACTGTGAAGAATA
AGTATCTGTTGCCCCGTATTGATGATTTGTTCGATCAGCTTCAGGGAGCCTCCGTGTTCTCCAAGATAGATCTGAGATCCGGTTATCATCAATTGCGGATCAGGGATGCA
GACATTCCTAAGAAAGCTTTCAGGTCTAGATATGGTCACTACGAGTTTGTGGTCATGTCGTTTGGATTGACTAATGCCCCAGCCGCGTTTATGGACATGATGGATAGGGT
GGCCTTCCTAGGTCACGTAGTTTCGAGAGATGGTATAGTCGTGGACCCCGCAAAGATCGAGGCCGTGAGTTCGTGGGCCAAGCCGACTTCAGTTGCCGAGATACGTAGCT
TTCTGGGATTGGCAGGTTACTACCGTCGGTTCGTTAAGGACTTTTCCGCCATAGCCTCGCCCCTCACTCGTCTGACGAGGAAGGGTGTCGAGTTTGTTTGGGACAGTAAG
TGTGATGAGAGCTTTGCTGTGCTCTGCGCAGCTTTGACTTCAAAGACGGTGTTGCGCAAATCCCAGGAGGGGACAGGTGGTTTCGTGGTCTACAGCGATGCTTCCAAGTT
TGGGTTAGGTTGTGTGTTGATGCAGCACGGGAAGGTCATTGCCTATGCTTCCTGCTAG
Protein sequenceShow/hide protein sequence
MPVTRTERGGSRAGRGIRIESDTAGTSSQAETDGIPPVAPPGAPRGGVEATERRETAFGADFADSMRAMMVSLLSDQRSETQALLAAQREESRAQIEQLTRMFQRGRKAL
HSTEQVAPEVDMAEGAGELGRWLKDFVKWKPEKFDATGDALAAARWIAHLEYTFLIMVCPDVQRPRCAAHVLGGTTRRWWESTLSERPAGSPPATWEFFKSEFKVKYISD
EAQEKMLIATPELKIRKFISGLRQGVSQDVRAQAPDTYARAVQLATIYNGKYPVERIVATPVQRPVPAAVGQKRKFDFRGGGQRRPGQFQNRDQHRAPIRFQPRQQGGPR
QVPACASCGKVHTGACQPGPRVCFRCGKEGHVVRFCTQPAMAAQPAQQQGRVFATTRQEAERTDAAVAGFELSVATPSGDDMLARDKIKNGRIEISGELLVATLIVLPMS
GFDVILGMEWLAANHALIDCHRKEVVFRPPGRPSFKFRGVQGGVCPPMVSAVKARRLLAHGAWGYLAKLSAESHPSLSVDQVPVVQEFADVFPEELPGLPPKREVDFTIE
LEPRTLPISKAPYRMAPAELKELKVQLQEMLDRGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYLLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIRDA
DIPKKAFRSRYGHYEFVVMSFGLTNAPAAFMDMMDRVAFLGHVVSRDGIVVDPAKIEAVSSWAKPTSVAEIRSFLGLAGYYRRFVKDFSAIASPLTRLTRKGVEFVWDSK
CDESFAVLCAALTSKTVLRKSQEGTGGFVVYSDASKFGLGCVLMQHGKVIAYASC