; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0020161 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0020161
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr01:18261224..18262672
RNA-Seq ExpressionCmc01g0020161
SyntenyCmc01g0020161
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036584.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.3e-248100Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV

Query:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR
        IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR

Query:  IATPLTQLTRKGALFVWSKACCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWMHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELV
        IATPLTQLTRKGALFVWSKACCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWMHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELV
Subjt:  IATPLTQLTRKGALFVWSKACCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWMHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELV

Query:  KDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISS
        KDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISS
Subjt:  KDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISS

Query:  DGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPG
        DGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPG
Subjt:  DGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPG

KAA0040689.1 pol protein [Cucumis melo var. makuwa]3.3e-23989Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRS+YGHYEFIVMSFGLTNAPAVFMDLMNRVFREF+DTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV

Query:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR
        IVFIDDILIYSKTEAEHEEHLRMVLQT RDNKL+AKFSKCEFWLKQVSFLGHVVSKA VSVDP KIEAVT WTRPSTVSEVRSFLGLAGYYRRF+ENFSR
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR

Query:  IATPLTQLTRKGALFVWSKAC--------------------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        IATPLTQLTRKGA FVWSKAC                                      CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  IATPLTQLTRKGALFVWSKAC--------------------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLT
         HYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILY+PGKANVVADALSRKVSHSAALITRQAPLHRDLERA+IAVSVG VTMQLAQLT
Subjt:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLT

Query:  VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPGSTKMYQDL
        VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDS IKTELLSEAHSSPFSMHPGSTKMYQDL
Subjt:  VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPGSTKMYQDL

KAA0046185.1 pol protein [Cucumis melo var. makuwa]1.9e-23988.59Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRS+YGHYEFIVMSFGLTNAPAVFMDLMNRVFREF+DTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV

Query:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR
        IVFIDDILIYSKTEAEHEEHLRMVLQT RDNKL+AKFSKCEFWLKQVSFLGH+VSKA VSVDPAKIEAVT WTRPSTVSEVRSFLGLAGYYRRF+ENFSR
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR

Query:  IATPLTQLTRKGALFVWSKAC--------------------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        IATPLTQLTRKGA FVWSKAC                                      CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  IATPLTQLTRKGALFVWSKAC--------------------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLT
         HYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILY+PGKANVV DALSRKVSHSAALITRQAPLHRDLERA+IAVSVG VTMQLAQLT
Subjt:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLT

Query:  VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPGSTKMYQDL
        VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDS +KTELLSEAHSSPFSMHPGSTKMYQDL
Subjt:  VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPGSTKMYQDL

KAA0048687.1 pol protein [Cucumis melo var. makuwa]5.7e-23988.59Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRS+YGHYEFIVMSFGLTNAPAVFMDLMNRVFREF+DTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV

Query:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR
        IVFIDDILIYSKTEAEHEEHLRMVLQT RDNKL+AKFSKCEFWLKQVSFLGHVVSKA VSVDPAKIEAVT WTRPSTVSEVRSFLGLAGYYRRF+ENFSR
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR

Query:  IATPLTQLTRKGALFVWSKAC--------------------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        IATPLTQLTRKGA FVWSKAC                                      CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  IATPLTQLTRKGALFVWSKAC--------------------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLT
         HYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILY+PGKANVVADALSRKVSHSAALITRQAPLHRDLERA+IAVSVG VTMQLAQLT
Subjt:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLT

Query:  VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPGSTKMYQDL
        VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDSV+KTELLSEAHSSPFSMHPGSTKMY+D+
Subjt:  VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPGSTKMYQDL

TYK01613.1 pol protein [Cucumis melo var. makuwa]2.2e-23888.38Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRS+YGHYEFIVMSFGLTNAPAVFMDLMNRVFREF+DTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV

Query:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR
        IVFIDDILIYSKTEAEHEEHLRMVLQT RDNKL+AKFSKCEFWLKQVSFLGHVVSKA VSVDPAKIEAVT WTRPSTVSEVRSFLGLAGYYRRF+ENFSR
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR

Query:  IATPLTQLTRKGALFVWSKAC--------------------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        IATPLTQLTRKGA FVWSKAC                                      CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  IATPLTQLTRKGALFVWSKAC--------------------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLT
         HYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILY+PGKANVVADALSRKVSHSAALITRQAPLHRDLERA+IAVSVG VTMQLAQLT
Subjt:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLT

Query:  VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPGSTKMYQDL
        VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ  EFS+SSDGGLLFERRLCVPSDS +KTELLSEAHSSPFSMHPGSTKMYQDL
Subjt:  VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPGSTKMYQDL

TrEMBL top hitse value%identityAlignment
A0A5A7SZ74 Ty3-gypsy retrotransposon protein6.5e-249100Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV

Query:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR
        IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR

Query:  IATPLTQLTRKGALFVWSKACCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWMHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELV
        IATPLTQLTRKGALFVWSKACCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWMHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELV
Subjt:  IATPLTQLTRKGALFVWSKACCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWMHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELV

Query:  KDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISS
        KDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISS
Subjt:  KDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISS

Query:  DGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPG
        DGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPG
Subjt:  DGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPG

A0A5A7THE6 Reverse transcriptase1.6e-23989Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRS+YGHYEFIVMSFGLTNAPAVFMDLMNRVFREF+DTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV

Query:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR
        IVFIDDILIYSKTEAEHEEHLRMVLQT RDNKL+AKFSKCEFWLKQVSFLGHVVSKA VSVDP KIEAVT WTRPSTVSEVRSFLGLAGYYRRF+ENFSR
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR

Query:  IATPLTQLTRKGALFVWSKAC--------------------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        IATPLTQLTRKGA FVWSKAC                                      CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  IATPLTQLTRKGALFVWSKAC--------------------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLT
         HYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILY+PGKANVVADALSRKVSHSAALITRQAPLHRDLERA+IAVSVG VTMQLAQLT
Subjt:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLT

Query:  VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPGSTKMYQDL
        VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDS IKTELLSEAHSSPFSMHPGSTKMYQDL
Subjt:  VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPGSTKMYQDL

A0A5A7TXM6 Reverse transcriptase9.4e-24088.59Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRS+YGHYEFIVMSFGLTNAPAVFMDLMNRVFREF+DTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV

Query:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR
        IVFIDDILIYSKTEAEHEEHLRMVLQT RDNKL+AKFSKCEFWLKQVSFLGH+VSKA VSVDPAKIEAVT WTRPSTVSEVRSFLGLAGYYRRF+ENFSR
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR

Query:  IATPLTQLTRKGALFVWSKAC--------------------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        IATPLTQLTRKGA FVWSKAC                                      CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  IATPLTQLTRKGALFVWSKAC--------------------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLT
         HYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILY+PGKANVV DALSRKVSHSAALITRQAPLHRDLERA+IAVSVG VTMQLAQLT
Subjt:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLT

Query:  VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPGSTKMYQDL
        VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDS +KTELLSEAHSSPFSMHPGSTKMYQDL
Subjt:  VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPGSTKMYQDL

A0A5A7U330 Reverse transcriptase2.7e-23988.59Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRS+YGHYEFIVMSFGLTNAPAVFMDLMNRVFREF+DTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV

Query:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR
        IVFIDDILIYSKTEAEHEEHLRMVLQT RDNKL+AKFSKCEFWLKQVSFLGHVVSKA VSVDPAKIEAVT WTRPSTVSEVRSFLGLAGYYRRF+ENFSR
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR

Query:  IATPLTQLTRKGALFVWSKAC--------------------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        IATPLTQLTRKGA FVWSKAC                                      CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  IATPLTQLTRKGALFVWSKAC--------------------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLT
         HYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILY+PGKANVVADALSRKVSHSAALITRQAPLHRDLERA+IAVSVG VTMQLAQLT
Subjt:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLT

Query:  VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPGSTKMYQDL
        VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDSV+KTELLSEAHSSPFSMHPGSTKMY+D+
Subjt:  VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPGSTKMYQDL

A0A5D3BPI1 Reverse transcriptase1.0e-23888.38Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRS+YGHYEFIVMSFGLTNAPAVFMDLMNRVFREF+DTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV

Query:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR
        IVFIDDILIYSKTEAEHEEHLRMVLQT RDNKL+AKFSKCEFWLKQVSFLGHVVSKA VSVDPAKIEAVT WTRPSTVSEVRSFLGLAGYYRRF+ENFSR
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR

Query:  IATPLTQLTRKGALFVWSKAC--------------------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        IATPLTQLTRKGA FVWSKAC                                      CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  IATPLTQLTRKGALFVWSKAC--------------------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLT
         HYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILY+PGKANVVADALSRKVSHSAALITRQAPLHRDLERA+IAVSVG VTMQLAQLT
Subjt:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGVVTMQLAQLT

Query:  VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPGSTKMYQDL
        VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ  EFS+SSDGGLLFERRLCVPSDS +KTELLSEAHSSPFSMHPGSTKMYQDL
Subjt:  VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPGSTKMYQDL

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.66.6e-6536.29Show/hide
Query:  RLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFVI
        R+ IDYR+LN++TV +R+P+P +D++  +L     F+ IDL  G+HQ+ +    V KTAF +K+GHYE++ M FGL NAPA F   MN + R  ++   +
Subjt:  RLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFVI

Query:  VFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSRI
        V++DDI+++S +  EH + L +V +      L  +  KCEF  ++ +FLGHV++   +  +P KIEA+  +  P+   E+++FLGL GYYR+F+ NF+ I
Subjt:  VFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSRI

Query:  ATPLTQLTRKG-----------ALFVWSKAC----------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        A P+T+  +K            + F   K                               VL Q G  ++Y SR L  HE NY T + EL A+V+A K +
Subjt:  ATPLTQLTRKG-----------ALFVWSKAC----------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSR
         HYL G   +I +DH+ L + +  K+ N +  RW   + ++D +I Y  GK N VADALSR
Subjt:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSR

P0CT34 Transposon Tf2-1 polyprotein9.0e-6230.72Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV
        +R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV

Query:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR
        + ++DDILI+SK+E+EH +H++ VLQ  ++  L    +KCEF   QV F+G+ +S+   +     I+ V  W +P    E+R FLG   Y R+F+   S+
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR

Query:  IATPLTQLTRKGALFVWS----------KAC----------------------------CVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVF
        +  PL  L +K   + W+          K C                             VL Q+        V Y S ++   + NY   D E+ A++ 
Subjt:  IATPLTQLTRKGALFVWS----------KAC----------------------------CVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVF

Query:  ALKIWMHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGV
        +LK W HYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I      
Subjt:  ALKIWMHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGV

Query:  VTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSVIKTELLSEAHSSPFSMHPG
            + Q+++    + +++   +ND  L+    L    + VE +I    GLL   +  + +P+D+ +   ++ + H     +HPG
Subjt:  VTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSVIKTELLSEAHSSPFSMHPG

P0CT35 Transposon Tf2-2 polyprotein9.0e-6230.72Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV
        +R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV

Query:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR
        + ++DDILI+SK+E+EH +H++ VLQ  ++  L    +KCEF   QV F+G+ +S+   +     I+ V  W +P    E+R FLG   Y R+F+   S+
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR

Query:  IATPLTQLTRKGALFVWS----------KAC----------------------------CVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVF
        +  PL  L +K   + W+          K C                             VL Q+        V Y S ++   + NY   D E+ A++ 
Subjt:  IATPLTQLTRKGALFVWS----------KAC----------------------------CVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVF

Query:  ALKIWMHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGV
        +LK W HYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I      
Subjt:  ALKIWMHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGV

Query:  VTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSVIKTELLSEAHSSPFSMHPG
            + Q+++    + +++   +ND  L+    L    + VE +I    GLL   +  + +P+D+ +   ++ + H     +HPG
Subjt:  VTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSVIKTELLSEAHSSPFSMHPG

P0CT41 Transposon Tf2-12 polyprotein9.0e-6230.72Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV
        +R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFV

Query:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR
        + ++DDILI+SK+E+EH +H++ VLQ  ++  L    +KCEF   QV F+G+ +S+   +     I+ V  W +P    E+R FLG   Y R+F+   S+
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSR

Query:  IATPLTQLTRKGALFVWS----------KAC----------------------------CVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVF
        +  PL  L +K   + W+          K C                             VL Q+        V Y S ++   + NY   D E+ A++ 
Subjt:  IATPLTQLTRKGALFVWS----------KAC----------------------------CVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVF

Query:  ALKIWMHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGV
        +LK W HYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I      
Subjt:  ALKIWMHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGV

Query:  VTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSVIKTELLSEAHSSPFSMHPG
            + Q+++    + +++   +ND  L+    L    + VE +I    GLL   +  + +P+D+ +   ++ + H     +HPG
Subjt:  VTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSVIKTELLSEAHSSPFSMHPG

P20825 Retrovirus-related Pol polyprotein from transposon 2971.5e-6436.01Show/hide
Query:  RLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFVI
        R+ IDYR+LN++T+ +RYP+P +D++  +L     F+ IDL  G+HQ+ + +  + KTAF +K GHYE++ M FGL NAPA F   MN + R  ++   +
Subjt:  RLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFVI

Query:  VFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSRI
        V++DDI+I+S +  EH   +++V     D  L  +  KCEF  K+ +FLGH+V+   +  +P K++A+ S+  P+   E+R+FLGL GYYR+F+ N++ I
Subjt:  VFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSRI

Query:  ATPLTQLTRKGA-----------LFVWSKAC----------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        A P+T   +K              F   KA                              VL Q G  +++ SR L  HE NY   + EL A+V+A K +
Subjt:  ATPLTQLTRKGA-----------LFVWSKAC----------------------------CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSR
         HYL G +  I +DH+ L++    KE   +  RW   + +Y  +I Y  GK N VADALSR
Subjt:  MHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.0e-2046.39Show/hide
Query:  HLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLG--HVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSRIATPLTQLTRKGAL
        HL MVLQ +  ++ +A   KC F   Q+++LG  H++S   VS DPAK+EA+  W  P   +E+R FLGL GYYRRF++N+ +I  PLT+L +K +L
Subjt:  HLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLG--HVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSRIATPLTQLTRKGAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCCTATGCATTGACTATAGGGAGTTGAACAAGGTAACCGTTAAGAACAGATATCCTTTGCCCAGGATCGACGATCTGTTTGACCAATTACAGGGAGCTACAGTGTT
CTCTAAGATTGATCTTCGGTCGGGATATCATCAGCTGAGGATTAAGGATGGTGATGTACCGAAGACAGCATTTCGTTCCAAATATGGACACTATGAGTTTATTGTGATGT
CTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTTGTAGACACTTTTGTGATCGTGTTTATTGATGATATCTTGATATAT
TCCAAGACGGAGGCCGAGCATGAGGAGCATTTACGTATGGTTTTGCAAACATTTCGGGATAATAAACTGCATGCAAAGTTTTCGAAATGCGAGTTTTGGCTGAAGCAGGT
GTCCTTTCTAGGCCATGTGGTTTCTAAGGCTGAAGTTTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCAGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCT
TTCTGGGTCTAGCAGGTTATTATCGACGGTTTTTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCTTTTTGTTTGGAGCAAGGCA
TGTTGTGTTTTGATGCAGCAAGGTAAGGTAGTCGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATTTAGAGTTGGCAGCAGTGGTTTT
TGCTTTGAAAATATGGATGCATTACTTATATGGTGAAAAGATACAGATCTTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGACAGC
GAAGATGGCTTGAGTTAGTGAAGGATTACGACTGTGAGATACTGTATTATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGGAAGGTATCACATTCAGCAGCA
CTTATTACCCGACAGGCCCCATTGCATCGAGATCTTGAGCGGGCTAAGATTGCAGTGTCAGTGGGGGTAGTCACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTT
GAGGCAAAGGATCATTGATGCTCAGAGTAACGATCCGTATTTGGTTGAGAAGCGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTCCATATCCTCTGATGGTGGAC
TTTTGTTTGAGAGGCGCCTCTGTGTGCCATCAGATAGTGTGATTAAAACAGAATTATTATCTGAGGCTCACAGTTCTCCATTTTCCATGCACCCAGGTAGTACGAAGATG
TATCAGGACCTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGCCTATGCATTGACTATAGGGAGTTGAACAAGGTAACCGTTAAGAACAGATATCCTTTGCCCAGGATCGACGATCTGTTTGACCAATTACAGGGAGCTACAGTGTT
CTCTAAGATTGATCTTCGGTCGGGATATCATCAGCTGAGGATTAAGGATGGTGATGTACCGAAGACAGCATTTCGTTCCAAATATGGACACTATGAGTTTATTGTGATGT
CTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTTGTAGACACTTTTGTGATCGTGTTTATTGATGATATCTTGATATAT
TCCAAGACGGAGGCCGAGCATGAGGAGCATTTACGTATGGTTTTGCAAACATTTCGGGATAATAAACTGCATGCAAAGTTTTCGAAATGCGAGTTTTGGCTGAAGCAGGT
GTCCTTTCTAGGCCATGTGGTTTCTAAGGCTGAAGTTTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCAGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCT
TTCTGGGTCTAGCAGGTTATTATCGACGGTTTTTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCTTTTTGTTTGGAGCAAGGCA
TGTTGTGTTTTGATGCAGCAAGGTAAGGTAGTCGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATTTAGAGTTGGCAGCAGTGGTTTT
TGCTTTGAAAATATGGATGCATTACTTATATGGTGAAAAGATACAGATCTTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGACAGC
GAAGATGGCTTGAGTTAGTGAAGGATTACGACTGTGAGATACTGTATTATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGGAAGGTATCACATTCAGCAGCA
CTTATTACCCGACAGGCCCCATTGCATCGAGATCTTGAGCGGGCTAAGATTGCAGTGTCAGTGGGGGTAGTCACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTT
GAGGCAAAGGATCATTGATGCTCAGAGTAACGATCCGTATTTGGTTGAGAAGCGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTCCATATCCTCTGATGGTGGAC
TTTTGTTTGAGAGGCGCCTCTGTGTGCCATCAGATAGTGTGATTAAAACAGAATTATTATCTGAGGCTCACAGTTCTCCATTTTCCATGCACCCAGGTAGTACGAAGATG
TATCAGGACCTGTAG
Protein sequenceShow/hide protein sequence
MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFVIVFIDDILIY
SKTEAEHEEHLRMVLQTFRDNKLHAKFSKCEFWLKQVSFLGHVVSKAEVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFLENFSRIATPLTQLTRKGALFVWSKA
CCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWMHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYYPGKANVVADALSRKVSHSAA
LITRQAPLHRDLERAKIAVSVGVVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSVIKTELLSEAHSSPFSMHPGSTKM
YQDL