; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g20230 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g20230
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr7:14575587..14576402
RNA-Seq ExpressionMoc07g20230
SyntenyMoc07g20230
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035531.1 uncharacterized protein E6C27_scaffold285G001970 [Cucumis melo var. makuwa]2.6e-12580.44Show/hide
Query:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW
        M FGLTNAPATFCTLMNQVFHEYL+KFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+NQLYVK EKCSFAQERI+FLGHVIECGRIGME+GK+ AI++W
Subjt:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW

Query:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYE
         +P S++ELRSFLGLANYYRRFVEGFSK   PLTELLKK+  WNW  ECQ AF+ LK+ +MEG +LGIADVT+PF+VET+ASD+ALGGVLLQ+GH IAYE
Subjt:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYE

Query:  SRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF
        SRKLN  ERRY  SEKEMLAVVHCLRAWRQYLLG  FVVKTDNS+ CHFF QPKL+SKQARWQE+LAEFDF
Subjt:  SRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF

KAA0037220.1 reverse transcriptase [Cucumis melo var. makuwa]4.4e-12580.44Show/hide
Query:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW
        M FGLTNAPATFCTLMNQVFHEYL+KFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+NQLYVK EKCSFAQERI+FLGHVIECGRIGME+GK+ AI++W
Subjt:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW

Query:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYE
         +P S++ELRSFLGLANYYRRFVEGFSK   PLTELLKK+  WNW  ECQ AF+ LK+ +MEG +LGIADVT+PF+VET+ASD+ALGGVLLQ+GH IAYE
Subjt:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYE

Query:  SRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF
        SRKLN  ERRY  SEKEMLAVVHCLRAWRQYLLG  FVVKTDNS+ CHFF QPKL+SKQARWQE+LAEFDF
Subjt:  SRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF

KAA0067557.1 reverse transcriptase [Cucumis melo var. makuwa]4.4e-12580.44Show/hide
Query:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW
        M FGLTNAPATFCTLMNQVFHEYL+KFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+NQLYVK EKCSFAQERI+FLGHVIECGRIGME+GK+ AI++W
Subjt:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW

Query:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYE
         +P S++ELRSFLGLANYYRRFVEGFSK   PLTELLKK+  WNW  ECQ AF+ LK+ +MEG +LGIADVT+PF+VET+ASD+ALGGVLLQ+GH IAYE
Subjt:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYE

Query:  SRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF
        SRKLN  ERRY  SEKEMLAVVHCLRAWRQYLLG  FVVKTDNS+ CHFF QPKL+SKQARWQE+LAEFDF
Subjt:  SRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF

TYK07954.1 reverse transcriptase [Cucumis melo var. makuwa]1.5e-12580.81Show/hide
Query:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW
        M FGLTNAPATFCTLMNQVFHEYL+KFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+NQLYVK EKCSFAQERI+FLGHVIECGRIGME+GK+ AI++W
Subjt:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW

Query:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYE
         +P S++ELRSFLGLANYYRRFVEGFSK   PLTELLKK+  WNW  ECQAAF+ LK+ +MEG +LGIADVT+PF+VET+ASD+ALGGVLLQ+GH IAYE
Subjt:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYE

Query:  SRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF
        SRKLN  ERRY  SEKEMLAVVHCLRAWRQYLLG  FVVKTDNS+ CHFF QPKL+SKQARWQE+LAEFDF
Subjt:  SRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia]6.1e-13594.49Show/hide
Query:  QVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEWKIPTSITELRSFLGLAN
        QVFHEYL+KFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVK EKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEWKIPTSITELRSFLGLAN
Subjt:  QVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEWKIPTSITELRSFLGLAN

Query:  YYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYESRKLNDTERRYAASEKE
        YYRRFVEGFSK TGPLT+LLKKNQKWNWT EC AAFESLKK MMEG VLGIADVTRPF+VET+ASDFALGGVLLQDGH IAYES+KLND ERRYAASEKE
Subjt:  YYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYESRKLNDTERRYAASEKE

Query:  MLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF
        MLAVVHCLRAWRQYLLG KFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF
Subjt:  MLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF

TrEMBL top hitse value%identityAlignment
A0A5A7T1S6 Reverse transcriptase domain-containing protein1.2e-12580.44Show/hide
Query:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW
        M FGLTNAPATFCTLMNQVFHEYL+KFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+NQLYVK EKCSFAQERI+FLGHVIECGRIGME+GK+ AI++W
Subjt:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW

Query:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYE
         +P S++ELRSFLGLANYYRRFVEGFSK   PLTELLKK+  WNW  ECQ AF+ LK+ +MEG +LGIADVT+PF+VET+ASD+ALGGVLLQ+GH IAYE
Subjt:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYE

Query:  SRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF
        SRKLN  ERRY  SEKEMLAVVHCLRAWRQYLLG  FVVKTDNS+ CHFF QPKL+SKQARWQE+LAEFDF
Subjt:  SRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF

A0A5D3BRZ6 Reverse transcriptase2.1e-12580.44Show/hide
Query:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW
        M FGLTNAPATFCTLMNQVFHEYL+KFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+NQLYVK EKCSFAQERI+FLGHVIECGRIGME+GK+ AI++W
Subjt:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW

Query:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYE
         +P S++ELRSFLGLANYYRRFVEGFSK   PLTELLKK+  WNW  ECQ AF+ LK+ +MEG +LGIADVT+PF+VET+ASD+ALGGVLLQ+GH IAYE
Subjt:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYE

Query:  SRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF
        SRKLN  ERRY  SEKEMLAVVHCLRAWRQYLLG  FVVKTDNS+ CHFF QPKL+SKQARWQE+LAEFDF
Subjt:  SRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF

A0A5D3C4R1 Reverse transcriptase2.1e-12580.44Show/hide
Query:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW
        M FGLTNAPATFCTLMNQVFHEYL+KFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+NQLYVK EKCSFAQERI+FLGHVIECGRIGME+GK+ AI++W
Subjt:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW

Query:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYE
         +P S++ELRSFLGLANYYRRFVEGFSK   PLTELLKK+  WNW  ECQ AF+ LK+ +MEG +LGIADVT+PF+VET+ASD+ALGGVLLQ+GH IAYE
Subjt:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYE

Query:  SRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF
        SRKLN  ERRY  SEKEMLAVVHCLRAWRQYLLG  FVVKTDNS+ CHFF QPKL+SKQARWQE+LAEFDF
Subjt:  SRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF

A0A5D3C9P8 Reverse transcriptase7.3e-12680.81Show/hide
Query:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW
        M FGLTNAPATFCTLMNQVFHEYL+KFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+NQLYVK EKCSFAQERI+FLGHVIECGRIGME+GK+ AI++W
Subjt:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW

Query:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYE
         +P S++ELRSFLGLANYYRRFVEGFSK   PLTELLKK+  WNW  ECQAAF+ LK+ +MEG +LGIADVT+PF+VET+ASD+ALGGVLLQ+GH IAYE
Subjt:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYE

Query:  SRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF
        SRKLN  ERRY  SEKEMLAVVHCLRAWRQYLLG  FVVKTDNS+ CHFF QPKL+SKQARWQE+LAEFDF
Subjt:  SRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF

A0A6J1DLQ6 uncharacterized protein LOC1110223202.9e-13594.49Show/hide
Query:  QVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEWKIPTSITELRSFLGLAN
        QVFHEYL+KFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVK EKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEWKIPTSITELRSFLGLAN
Subjt:  QVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEWKIPTSITELRSFLGLAN

Query:  YYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYESRKLNDTERRYAASEKE
        YYRRFVEGFSK TGPLT+LLKKNQKWNWT EC AAFESLKK MMEG VLGIADVTRPF+VET+ASDFALGGVLLQDGH IAYES+KLND ERRYAASEKE
Subjt:  YYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYESRKLNDTERRYAASEKE

Query:  MLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF
        MLAVVHCLRAWRQYLLG KFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF
Subjt:  MLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.9e-5943.75Show/hide
Query:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW
        M FGL NAPATF   MN +    LNK  +VYLDDI+V+S +++EH   L LVFEKL +  L ++ +KC F ++  +FLGHV+    I     K++AIQ++
Subjt:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW

Query:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWT-AECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAY
         IPT   E+++FLGL  YYR+F+  F+    P+T+ LKKN K + T  E  +AF+ LK ++ E  +L + D T+ F + T+ASD ALG VL QDGH ++Y
Subjt:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWT-AECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAY

Query:  ESRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF
         SR LN+ E  Y+  EKE+LA+V   + +R YLLG  F + +D+  +   +     +SK  RW+  L+EFDF
Subjt:  ESRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF

P0CT34 Transposon Tf2-1 polyprotein4.3e-4335.94Show/hide
Query:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLG-HVIECGRIGMEDGKVKAIQE
        M +G++ APA F   +N +  E     VV Y+DDI+++S +  EH  H++ V +KLK   L +   KC F Q ++ F+G H+ E G    ++   K +Q 
Subjt:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLG-HVIECGRIGMEDGKVKAIQE

Query:  WKIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDG-----
        WK P +  ELR FLG  NY R+F+   S+ T PL  LLKK+ +W WT     A E++K+ ++   VL   D ++   +ET+ASD A+G VL Q       
Subjt:  WKIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDG-----

Query:  HSIAYESRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGV--KFVVKTDNSS-VCHFFNQPKLSSKQ-ARWQEYLAEFDF
        + + Y S K++  +  Y+ S+KEMLA++  L+ WR YL      F + TD+ + +    N+ +  +K+ ARWQ +L +F+F
Subjt:  HSIAYESRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGV--KFVVKTDNSS-VCHFFNQPKLSSKQ-ARWQEYLAEFDF

P0CT35 Transposon Tf2-2 polyprotein4.3e-4335.94Show/hide
Query:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLG-HVIECGRIGMEDGKVKAIQE
        M +G++ APA F   +N +  E     VV Y+DDI+++S +  EH  H++ V +KLK   L +   KC F Q ++ F+G H+ E G    ++   K +Q 
Subjt:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLG-HVIECGRIGMEDGKVKAIQE

Query:  WKIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDG-----
        WK P +  ELR FLG  NY R+F+   S+ T PL  LLKK+ +W WT     A E++K+ ++   VL   D ++   +ET+ASD A+G VL Q       
Subjt:  WKIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDG-----

Query:  HSIAYESRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGV--KFVVKTDNSS-VCHFFNQPKLSSKQ-ARWQEYLAEFDF
        + + Y S K++  +  Y+ S+KEMLA++  L+ WR YL      F + TD+ + +    N+ +  +K+ ARWQ +L +F+F
Subjt:  HSIAYESRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGV--KFVVKTDNSS-VCHFFNQPKLSSKQ-ARWQEYLAEFDF

P0CT41 Transposon Tf2-12 polyprotein4.3e-4335.94Show/hide
Query:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLG-HVIECGRIGMEDGKVKAIQE
        M +G++ APA F   +N +  E     VV Y+DDI+++S +  EH  H++ V +KLK   L +   KC F Q ++ F+G H+ E G    ++   K +Q 
Subjt:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLG-HVIECGRIGMEDGKVKAIQE

Query:  WKIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDG-----
        WK P +  ELR FLG  NY R+F+   S+ T PL  LLKK+ +W WT     A E++K+ ++   VL   D ++   +ET+ASD A+G VL Q       
Subjt:  WKIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDG-----

Query:  HSIAYESRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGV--KFVVKTDNSS-VCHFFNQPKLSSKQ-ARWQEYLAEFDF
        + + Y S K++  +  Y+ S+KEMLA++  L+ WR YL      F + TD+ + +    N+ +  +K+ ARWQ +L +F+F
Subjt:  HSIAYESRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGV--KFVVKTDNSS-VCHFFNQPKLSSKQ-ARWQEYLAEFDF

P20825 Retrovirus-related Pol polyprotein from transposon 2971.2e-5341.18Show/hide
Query:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW
        M FGL NAPATF   MN +    LNK  +VYLDDI+++S ++ EH   +QLVF KL    L ++ +KC F ++  +FLGH++    I     KVKAI  +
Subjt:  MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEW

Query:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWN-WTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAY
         IPT   E+R+FLGL  YYR+F+  ++    P+T  LKK  K +    E   AFE LK +++   +L + D  + F + T+AS+ ALG VL Q+GH I++
Subjt:  KIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWN-WTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAY

Query:  ESRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF
         SR LND E  Y+A EKE+LA+V   + +R YLLG +F++ +D+  +    N  +  +K  RW+  L+E+ F
Subjt:  ESRKLNDTERRYAASEKEMLAVVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein5.7e-2241.54Show/hide
Query:  HLQLVFEKLKQNQLYVKWEKCSFAQERISFLG--HVIECGRIGMEDGKVKAIQEWKIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNW
        HL +V +  +Q+Q Y   +KC+F Q +I++LG  H+I    +  +  K++A+  W  P + TELR FLGL  YYRRFV+ + K   PLTELLKKN    W
Subjt:  HLQLVFEKLKQNQLYVKWEKCSFAQERISFLG--HVIECGRIGMEDGKVKAIQEWKIPTSITELRSFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNW

Query:  TAECQAAFESLKKVMMEGLVLGIADVTRPF
        T     AF++LK  +    VL + D+  PF
Subjt:  TAECQAAFESLKKVMMEGLVLGIADVTRPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCTTTGGCCTTACGAATGCTCCTGCCACCTTCTGCACCTTAATGAACCAAGTGTTTCATGAATATCTGAACAAATTTGTGGTGGTCTACCTAGATGATATAGTCGT
CTACAGTCCAACCATGGAAGAACACCAGGTGCATTTACAGCTTGTTTTCGAGAAGCTCAAGCAGAACCAGTTGTATGTTAAATGGGAAAAGTGTTCTTTTGCTCAAGAAC
GCATCAGTTTCTTGGGTCATGTGATAGAATGTGGTCGAATAGGCATGGAAGATGGAAAGGTCAAAGCCATTCAAGAGTGGAAAATTCCGACGTCCATCACAGAACTACGT
TCATTCCTCGGATTAGCCAATTACTATAGGCGATTCGTAGAAGGATTCTCGAAATGGACCGGGCCCTTAACGGAATTACTAAAGAAGAACCAAAAGTGGAACTGGACTGC
AGAATGTCAAGCCGCATTTGAGAGTTTGAAAAAGGTGATGATGGAGGGACTGGTGTTGGGAATTGCAGACGTCACACGACCATTCAAAGTCGAGACTAATGCGTCAGACT
TCGCCCTAGGAGGAGTGCTTCTCCAAGACGGTCATTCCATTGCATACGAGAGTCGGAAGTTGAATGATACTGAAAGGAGGTATGCCGCCTCCGAGAAAGAGATGTTAGCA
GTAGTCCACTGCTTGAGGGCCTGGAGGCAATATCTCCTAGGGGTCAAGTTCGTTGTCAAGACTGACAACAGCTCAGTCTGTCACTTCTTCAACCAACCGAAGTTGTCGTC
CAAGCAAGCTAGGTGGCAAGAATACCTTGCCGAGTTTGATTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCCTTTGGCCTTACGAATGCTCCTGCCACCTTCTGCACCTTAATGAACCAAGTGTTTCATGAATATCTGAACAAATTTGTGGTGGTCTACCTAGATGATATAGTCGT
CTACAGTCCAACCATGGAAGAACACCAGGTGCATTTACAGCTTGTTTTCGAGAAGCTCAAGCAGAACCAGTTGTATGTTAAATGGGAAAAGTGTTCTTTTGCTCAAGAAC
GCATCAGTTTCTTGGGTCATGTGATAGAATGTGGTCGAATAGGCATGGAAGATGGAAAGGTCAAAGCCATTCAAGAGTGGAAAATTCCGACGTCCATCACAGAACTACGT
TCATTCCTCGGATTAGCCAATTACTATAGGCGATTCGTAGAAGGATTCTCGAAATGGACCGGGCCCTTAACGGAATTACTAAAGAAGAACCAAAAGTGGAACTGGACTGC
AGAATGTCAAGCCGCATTTGAGAGTTTGAAAAAGGTGATGATGGAGGGACTGGTGTTGGGAATTGCAGACGTCACACGACCATTCAAAGTCGAGACTAATGCGTCAGACT
TCGCCCTAGGAGGAGTGCTTCTCCAAGACGGTCATTCCATTGCATACGAGAGTCGGAAGTTGAATGATACTGAAAGGAGGTATGCCGCCTCCGAGAAAGAGATGTTAGCA
GTAGTCCACTGCTTGAGGGCCTGGAGGCAATATCTCCTAGGGGTCAAGTTCGTTGTCAAGACTGACAACAGCTCAGTCTGTCACTTCTTCAACCAACCGAAGTTGTCGTC
CAAGCAAGCTAGGTGGCAAGAATACCTTGCCGAGTTTGATTTTTAG
Protein sequenceShow/hide protein sequence
MSFGLTNAPATFCTLMNQVFHEYLNKFVVVYLDDIVVYSPTMEEHQVHLQLVFEKLKQNQLYVKWEKCSFAQERISFLGHVIECGRIGMEDGKVKAIQEWKIPTSITELR
SFLGLANYYRRFVEGFSKWTGPLTELLKKNQKWNWTAECQAAFESLKKVMMEGLVLGIADVTRPFKVETNASDFALGGVLLQDGHSIAYESRKLNDTERRYAASEKEMLA
VVHCLRAWRQYLLGVKFVVKTDNSSVCHFFNQPKLSSKQARWQEYLAEFDF