; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0223381 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0223381
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr08:12834695..12836731
RNA-Seq ExpressionCmc08g0223381
SyntenyCmc08g0223381
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008194 - UDP-glycosyltransferase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025823.1 pol protein [Cucumis melo var. makuwa]0.0e+0097.57Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNRVFKDFLDTFF
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNRVFKDFLDTFF
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNRVFKDFLDTFF

Query:  IVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTSYYRRFVEDFSRIASPLTQLTRKGTPFVWS
        IVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVS               SYYRRFVEDFSRIASPLTQLTRKGTPFVWS
Subjt:  IVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTSYYRRFVEDFSRIASPLTQLTRKGTPFVWS

Query:  LACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLELAVMVFALKIWRHYLYGEKIQIFTNHKSL
        LACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLELAVMVFALKIWRHYLYGEKIQIFTNHKSL
Subjt:  LACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLELAVMVFALKIWRHYLYGEKIQIFTNHKSL

Query:  KYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVGEMTSQLAQLTVQSTLRQRIIVAQLNDPY
        KYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVGEMTSQLAQLTVQSTLRQRIIVAQLNDPY
Subjt:  KYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVGEMTSQLAQLTVQSTLRQRIIVAQLNDPY

Query:  LVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRNMKREVADFVSRCLVCQQVKAPRQKPTGL
        LVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRNMKREVADFVSRCLVCQQVKAPRQKPTGL
Subjt:  LVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRNMKREVADFVSRCLVCQQVKAPRQKPTGL

Query:  LQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVPLSIISDRDARFTLKFWKGLQLALGTRLD
        LQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVPLSIISDRDARFTLKFWKGLQLALGTRLD
Subjt:  LQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVPLSIISDRDARFTLKFWKGLQLALGTRLD

Query:  FSTAFHPQTDGQTKRLNQ
        FSTAFHPQTDGQTKRLNQ
Subjt:  FSTAFHPQTDGQTKRLNQ

KAA0025998.1 pol protein [Cucumis melo var. makuwa]0.0e+0086.14Show/hide
Query:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR
        +LFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKIDLRSGYHQLRI+D  IP+TAF SRY HYEF+VMSFGLTNAPA++MDLMNR
Subjt:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR

Query:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTS------------------Y
        VFKDFLD+F IVFIDDILIYSKT+A+HEEHLHQ+LETLRANKLYAKFSKCEFWL+KVTF  HVVSSE VSVDP KIEAVT+                  Y
Subjt:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTS------------------Y

Query:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLEL
        YRRFVEDFSRIASPLTQLTRKGTPFVWS ACE SFQELKQKLV+APVLTV DG G+FVIYSDASKK LGCVLMQQG VVAYASRQLK HEQNYPTHDLEL
Subjt:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLEL

Query:  AVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVG
        A +VFALKIWRHYLYGEKIQI+T+HKSLKYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVVADALSRKVAHSAALIT Q PLLRDFERAEIAVSVG
Subjt:  AVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVG

Query:  EMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRN
        E+T+QLAQLTVQ TLRQ+II AQL+DPYL EKR +VE  Q E FSISSDDGL F GRLCV EDSAVKTELLTEAHSSPFTMH GSTKMYQDLR  YWWR 
Subjt:  EMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRN

Query:  MKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVP
        MKR+VADFVSRCLVCQQVKAPRQ P GLLQPL VP WKW SVSMDFITGLP+TLKGY VIWVVVDRLTKSAHFV GK TYT SKWGQLYMTEIVRLHGVP
Subjt:  MKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVP

Query:  LSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNSY
        +SI+SDRDARFT KFWKGLQ+ALGTRLDFSTAFHPQTDGQT+RLNQ+LED+LRACVLEFSGSWDSHLHLMEFAYNNSY
Subjt:  LSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNSY

KAA0035890.1 pol protein [Cucumis melo var. makuwa]0.0e+0086.28Show/hide
Query:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR
        +LFVKKKDGSM LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKI+LRSGYHQLRI+D  IP+TAFHSRY HYEF+VMSFGLTNAPA+ MDLMNR
Subjt:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR

Query:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTS------------------Y
        VFKDFLD+F IVFIDDILIYSK +A+HEEHLHQ+LETLRANKLYAKFSKCEFWL+KV F  HVVSSE VSVDP KIEAVT+                  Y
Subjt:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTS------------------Y

Query:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLEL
        YRRFVEDFSRIASPLTQLTRKGTPFVWS ACESSFQELKQKLV+APVLTV DG G+FVIYSDASKK LGCVLMQQG VVAYASRQLK HEQNYPTHDLEL
Subjt:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLEL

Query:  AVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVG
        A +VFALKIWRHYLYGEKIQI+T+HKSLKYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVVADALSRKVAHSAALIT Q PLLRDFERAEIAVSVG
Subjt:  AVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVG

Query:  EMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRN
        E+T+QLAQL+VQ TLRQ+II AQLNDPYL EKR +VE GQ EDFSISSDDGL F GRLCV EDSAVK ELLTEAHSSPFTMH GSTKMYQDLR  YWWR 
Subjt:  EMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRN

Query:  MKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVP
        MKREVADFVSRCLVCQQVKAPRQ P GLLQPL VP WKW SVSMDFITGLP+TLKGY VIWVVVDRLTKSAHFV GK TYT SKWGQLYMTEIVRLHGVP
Subjt:  MKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVP

Query:  LSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNSY
        +SIISDRDARFT KFWKGLQLALGTRLDFST FHPQTDGQT+RLNQ+LED+LRACVLEFSGSWDSHLHLMEFAYNNSY
Subjt:  LSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNSY

KAA0063098.1 pol protein [Cucumis melo var. makuwa]0.0e+0086.28Show/hide
Query:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR
        +LFVKKKDGSMRLCIDYRELNKVT+KNRYPLPRIDDLFDQLQGAT+FSKIDLRSGYHQLRI+D  IP+TAF SRY HYEF+VMSFGLTNAPA++MDLMNR
Subjt:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR

Query:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTS------------------Y
        VFKDFLD+F IVFIDDILIYSKT+A+HEEHLHQ+LETLRANKLYAKFSKCEFWL+KVTF  HVVSSE VSVDP KIEAVT+                  Y
Subjt:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTS------------------Y

Query:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLEL
        YRRFVEDFSRIASPLTQLTRKGTPFVWS ACE SFQELKQKLV+APVLTV DG G+FVIYSDASKK LGCVLMQQG VVAYASRQLK HEQNYPTHDLEL
Subjt:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLEL

Query:  AVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVG
        A +VFALKIWRHYLYGEKIQI+T+HKSLKYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVVADALSRKVAHSAALIT Q PLLRDFERAEI VSVG
Subjt:  AVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVG

Query:  EMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRN
        E+T+QLAQL+VQ TLRQ+II AQLNDPYL EKR +VE GQ EDFSISSDDGL F GRLCV EDSAVKTELLTEAHSSPFTMH GSTKMYQDLR  YWWR 
Subjt:  EMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRN

Query:  MKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVP
        MKREVADFVSRCLVCQQVKAPRQ P GLLQPL VP WKW SVSMDFITGLP+T KGY VIWVVVDRLTKSAHFV GK TYT SKWGQLYMTEIVRLHGVP
Subjt:  MKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVP

Query:  LSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNSY
        +SIISDRDARFT KFWKGLQLALGTRLDFSTAFHPQTDGQT+RLNQ+LED+LRACVLEFS SWDSHLHLMEFAYNNSY
Subjt:  LSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNSY

TYK20443.1 pol protein [Cucumis melo var. makuwa]0.0e+0086.14Show/hide
Query:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR
        +LFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKIDLRSGYHQLRI+D  IP+TAF SRY HYEF+VMSFGLTNAPA++MDLMNR
Subjt:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR

Query:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTS------------------Y
        VFKDFLD+F IVFIDDILIYSKT+A+HEEHLHQ+LETLRANKLYAKFSKCEFWL+KVTF  HVVSSE VSVDP KIEAVT+                  Y
Subjt:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTS------------------Y

Query:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLEL
        YRRFVEDFSRIASPLTQLTRKGTPFVWS ACE SFQELKQKLV+APVLTV DG G+FVIYSDASKK LGCVLMQQG VVAYASRQLK HEQNYPTHDLEL
Subjt:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLEL

Query:  AVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVG
        A +VFALKIWRHYLYGEKIQI+T+HKSLKYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVVADALSRKVAHSAALIT Q PLLRDFERAEIAVSVG
Subjt:  AVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVG

Query:  EMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRN
        E+T+QLAQLTVQ TLRQ+II AQL+DPYL EKR +VE  Q E FSISSDDGL F GRLCV EDSAVKTELLTEAHSSPFTMH GSTKMYQDLR  YWWR 
Subjt:  EMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRN

Query:  MKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVP
        MKR+VADFVSRCLVCQQVKAPRQ P GLLQPL VP WKW SVSMDFITGLP+TLKGY VIWVVVDRLTKSAHFV GK TYT SKWGQLYMTEIVRLHGVP
Subjt:  MKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVP

Query:  LSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNSY
        +SI+SDRDARFT KFWKGLQ+ALGTRLDFSTAFHPQTDGQT+RLNQ+LED+LRACVLEFSGSWDSHLHLMEFAYNNSY
Subjt:  LSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNSY

TrEMBL top hitse value%identityAlignment
A0A5A7SIJ5 Reverse transcriptase0.0e+0086.14Show/hide
Query:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR
        +LFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKIDLRSGYHQLRI+D  IP+TAF SRY HYEF+VMSFGLTNAPA++MDLMNR
Subjt:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR

Query:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTS------------------Y
        VFKDFLD+F IVFIDDILIYSKT+A+HEEHLHQ+LETLRANKLYAKFSKCEFWL+KVTF  HVVSSE VSVDP KIEAVT+                  Y
Subjt:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTS------------------Y

Query:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLEL
        YRRFVEDFSRIASPLTQLTRKGTPFVWS ACE SFQELKQKLV+APVLTV DG G+FVIYSDASKK LGCVLMQQG VVAYASRQLK HEQNYPTHDLEL
Subjt:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLEL

Query:  AVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVG
        A +VFALKIWRHYLYGEKIQI+T+HKSLKYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVVADALSRKVAHSAALIT Q PLLRDFERAEIAVSVG
Subjt:  AVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVG

Query:  EMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRN
        E+T+QLAQLTVQ TLRQ+II AQL+DPYL EKR +VE  Q E FSISSDDGL F GRLCV EDSAVKTELLTEAHSSPFTMH GSTKMYQDLR  YWWR 
Subjt:  EMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRN

Query:  MKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVP
        MKR+VADFVSRCLVCQQVKAPRQ P GLLQPL VP WKW SVSMDFITGLP+TLKGY VIWVVVDRLTKSAHFV GK TYT SKWGQLYMTEIVRLHGVP
Subjt:  MKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVP

Query:  LSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNSY
        +SI+SDRDARFT KFWKGLQ+ALGTRLDFSTAFHPQTDGQT+RLNQ+LED+LRACVLEFSGSWDSHLHLMEFAYNNSY
Subjt:  LSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNSY

A0A5A7SMC0 Pol protein0.0e+0097.57Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNRVFKDFLDTFF
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNRVFKDFLDTFF
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNRVFKDFLDTFF

Query:  IVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTSYYRRFVEDFSRIASPLTQLTRKGTPFVWS
        IVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVS               SYYRRFVEDFSRIASPLTQLTRKGTPFVWS
Subjt:  IVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTSYYRRFVEDFSRIASPLTQLTRKGTPFVWS

Query:  LACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLELAVMVFALKIWRHYLYGEKIQIFTNHKSL
        LACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLELAVMVFALKIWRHYLYGEKIQIFTNHKSL
Subjt:  LACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLELAVMVFALKIWRHYLYGEKIQIFTNHKSL

Query:  KYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVGEMTSQLAQLTVQSTLRQRIIVAQLNDPY
        KYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVGEMTSQLAQLTVQSTLRQRIIVAQLNDPY
Subjt:  KYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVGEMTSQLAQLTVQSTLRQRIIVAQLNDPY

Query:  LVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRNMKREVADFVSRCLVCQQVKAPRQKPTGL
        LVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRNMKREVADFVSRCLVCQQVKAPRQKPTGL
Subjt:  LVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRNMKREVADFVSRCLVCQQVKAPRQKPTGL

Query:  LQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVPLSIISDRDARFTLKFWKGLQLALGTRLD
        LQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVPLSIISDRDARFTLKFWKGLQLALGTRLD
Subjt:  LQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVPLSIISDRDARFTLKFWKGLQLALGTRLD

Query:  FSTAFHPQTDGQTKRLNQ
        FSTAFHPQTDGQTKRLNQ
Subjt:  FSTAFHPQTDGQTKRLNQ

A0A5A7SXW6 Reverse transcriptase0.0e+0086.28Show/hide
Query:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR
        +LFVKKKDGSM LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKI+LRSGYHQLRI+D  IP+TAFHSRY HYEF+VMSFGLTNAPA+ MDLMNR
Subjt:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR

Query:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTS------------------Y
        VFKDFLD+F IVFIDDILIYSK +A+HEEHLHQ+LETLRANKLYAKFSKCEFWL+KV F  HVVSSE VSVDP KIEAVT+                  Y
Subjt:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTS------------------Y

Query:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLEL
        YRRFVEDFSRIASPLTQLTRKGTPFVWS ACESSFQELKQKLV+APVLTV DG G+FVIYSDASKK LGCVLMQQG VVAYASRQLK HEQNYPTHDLEL
Subjt:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLEL

Query:  AVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVG
        A +VFALKIWRHYLYGEKIQI+T+HKSLKYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVVADALSRKVAHSAALIT Q PLLRDFERAEIAVSVG
Subjt:  AVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVG

Query:  EMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRN
        E+T+QLAQL+VQ TLRQ+II AQLNDPYL EKR +VE GQ EDFSISSDDGL F GRLCV EDSAVK ELLTEAHSSPFTMH GSTKMYQDLR  YWWR 
Subjt:  EMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRN

Query:  MKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVP
        MKREVADFVSRCLVCQQVKAPRQ P GLLQPL VP WKW SVSMDFITGLP+TLKGY VIWVVVDRLTKSAHFV GK TYT SKWGQLYMTEIVRLHGVP
Subjt:  MKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVP

Query:  LSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNSY
        +SIISDRDARFT KFWKGLQLALGTRLDFST FHPQTDGQT+RLNQ+LED+LRACVLEFSGSWDSHLHLMEFAYNNSY
Subjt:  LSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNSY

A0A5A7V646 Reverse transcriptase0.0e+0086.28Show/hide
Query:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR
        +LFVKKKDGSMRLCIDYRELNKVT+KNRYPLPRIDDLFDQLQGAT+FSKIDLRSGYHQLRI+D  IP+TAF SRY HYEF+VMSFGLTNAPA++MDLMNR
Subjt:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR

Query:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTS------------------Y
        VFKDFLD+F IVFIDDILIYSKT+A+HEEHLHQ+LETLRANKLYAKFSKCEFWL+KVTF  HVVSSE VSVDP KIEAVT+                  Y
Subjt:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTS------------------Y

Query:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLEL
        YRRFVEDFSRIASPLTQLTRKGTPFVWS ACE SFQELKQKLV+APVLTV DG G+FVIYSDASKK LGCVLMQQG VVAYASRQLK HEQNYPTHDLEL
Subjt:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLEL

Query:  AVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVG
        A +VFALKIWRHYLYGEKIQI+T+HKSLKYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVVADALSRKVAHSAALIT Q PLLRDFERAEI VSVG
Subjt:  AVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVG

Query:  EMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRN
        E+T+QLAQL+VQ TLRQ+II AQLNDPYL EKR +VE GQ EDFSISSDDGL F GRLCV EDSAVKTELLTEAHSSPFTMH GSTKMYQDLR  YWWR 
Subjt:  EMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRN

Query:  MKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVP
        MKREVADFVSRCLVCQQVKAPRQ P GLLQPL VP WKW SVSMDFITGLP+T KGY VIWVVVDRLTKSAHFV GK TYT SKWGQLYMTEIVRLHGVP
Subjt:  MKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVP

Query:  LSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNSY
        +SIISDRDARFT KFWKGLQLALGTRLDFSTAFHPQTDGQT+RLNQ+LED+LRACVLEFS SWDSHLHLMEFAYNNSY
Subjt:  LSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNSY

A0A5D3BTN0 Reverse transcriptase0.0e+0086.14Show/hide
Query:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR
        +LFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKIDLRSGYHQLRI+D  IP+TAF SRY HYEF+VMSFGLTNAPA++MDLMNR
Subjt:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR

Query:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTS------------------Y
        VFKDFLD+F IVFIDDILIYSKT+A+HEEHLHQ+LETLRANKLYAKFSKCEFWL+KVTF  HVVSSE VSVDP KIEAVT+                  Y
Subjt:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTS------------------Y

Query:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLEL
        YRRFVEDFSRIASPLTQLTRKGTPFVWS ACE SFQELKQKLV+APVLTV DG G+FVIYSDASKK LGCVLMQQG VVAYASRQLK HEQNYPTHDLEL
Subjt:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLEL

Query:  AVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVG
        A +VFALKIWRHYLYGEKIQI+T+HKSLKYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVVADALSRKVAHSAALIT Q PLLRDFERAEIAVSVG
Subjt:  AVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVG

Query:  EMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRN
        E+T+QLAQLTVQ TLRQ+II AQL+DPYL EKR +VE  Q E FSISSDDGL F GRLCV EDSAVKTELLTEAHSSPFTMH GSTKMYQDLR  YWWR 
Subjt:  EMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRN

Query:  MKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVP
        MKR+VADFVSRCLVCQQVKAPRQ P GLLQPL VP WKW SVSMDFITGLP+TLKGY VIWVVVDRLTKSAHFV GK TYT SKWGQLYMTEIVRLHGVP
Subjt:  MKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVP

Query:  LSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNSY
        +SI+SDRDARFT KFWKGLQ+ALGTRLDFSTAFHPQTDGQT+RLNQ+LED+LRACVLEFSGSWDSHLHLMEFAYNNSY
Subjt:  LSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNSY

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein3.1e-9530.38Show/hide
Query:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR
        ++FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+TIF+K+DL+S YH +R++     + AF      +E++VM +G++ APA +   +N 
Subjt:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR

Query:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAV------------------TSY
        +  +  ++  + ++DDILI+SK++++H +H+  +L+ L+   L    +KCEF   +V F  + +S +  +     I+ V                   +Y
Subjt:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAV------------------TSY

Query:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGN-----VVAYASRQLKNHEQNYPT
         R+F+   S++  PL  L +K   + W+     + + +KQ LVS PVL   D     ++ +DAS   +G VL Q+ +      V Y S ++   + NY  
Subjt:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGN-----VVAYASRQLKNHEQNYPT

Query:  HDLELAVMVFALKIWRHYLYG--EKIQIFTNHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFE
         D E+  ++ +LK WRHYL    E  +I T+H++L    T +    N R  RW   ++D++ EI Y  G AN +ADALSR       ++    P+ +D E
Subjt:  HDLELAVMVFALKIWRHYLYG--EKIQIFTNHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFE

Query:  RAEIAVSVGEMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVL--EDSAVKTELLTEAHSSPFTMHSGSTKMY
           I        + + Q+++    + +++    ND  L+   LL    +  + +I   DGL  N +  +L   D+ +   ++ + H     +H G   + 
Subjt:  RAEIAVSVGEMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVL--EDSAVKTELLTEAHSSPFTMHSGSTKMY

Query:  QDLRRAYWWRNMKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLY
          + R + W+ +++++ ++V  C  CQ  K+   KP G LQP+   E  W S+SMDFIT LP +  GY  ++VVVDR +K A  V    + T  +  +++
Subjt:  QDLRRAYWWRNMKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLY

Query:  MTEIVRLHGVPLSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNS
           ++   G P  II+D D  FT + WK         + FS  + PQTDGQT+R NQ +E +LR        +W  H+ L++ +YNN+
Subjt:  MTEIVRLHGVPLSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNS

P0CT35 Transposon Tf2-2 polyprotein3.1e-9530.38Show/hide
Query:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR
        ++FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+TIF+K+DL+S YH +R++     + AF      +E++VM +G++ APA +   +N 
Subjt:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR

Query:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAV------------------TSY
        +  +  ++  + ++DDILI+SK++++H +H+  +L+ L+   L    +KCEF   +V F  + +S +  +     I+ V                   +Y
Subjt:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAV------------------TSY

Query:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGN-----VVAYASRQLKNHEQNYPT
         R+F+   S++  PL  L +K   + W+     + + +KQ LVS PVL   D     ++ +DAS   +G VL Q+ +      V Y S ++   + NY  
Subjt:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGN-----VVAYASRQLKNHEQNYPT

Query:  HDLELAVMVFALKIWRHYLYG--EKIQIFTNHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFE
         D E+  ++ +LK WRHYL    E  +I T+H++L    T +    N R  RW   ++D++ EI Y  G AN +ADALSR       ++    P+ +D E
Subjt:  HDLELAVMVFALKIWRHYLYG--EKIQIFTNHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFE

Query:  RAEIAVSVGEMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVL--EDSAVKTELLTEAHSSPFTMHSGSTKMY
           I        + + Q+++    + +++    ND  L+   LL    +  + +I   DGL  N +  +L   D+ +   ++ + H     +H G   + 
Subjt:  RAEIAVSVGEMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVL--EDSAVKTELLTEAHSSPFTMHSGSTKMY

Query:  QDLRRAYWWRNMKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLY
          + R + W+ +++++ ++V  C  CQ  K+   KP G LQP+   E  W S+SMDFIT LP +  GY  ++VVVDR +K A  V    + T  +  +++
Subjt:  QDLRRAYWWRNMKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLY

Query:  MTEIVRLHGVPLSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNS
           ++   G P  II+D D  FT + WK         + FS  + PQTDGQT+R NQ +E +LR        +W  H+ L++ +YNN+
Subjt:  MTEIVRLHGVPLSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNS

P0CT41 Transposon Tf2-12 polyprotein3.1e-9530.38Show/hide
Query:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR
        ++FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+TIF+K+DL+S YH +R++     + AF      +E++VM +G++ APA +   +N 
Subjt:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR

Query:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAV------------------TSY
        +  +  ++  + ++DDILI+SK++++H +H+  +L+ L+   L    +KCEF   +V F  + +S +  +     I+ V                   +Y
Subjt:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAV------------------TSY

Query:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGN-----VVAYASRQLKNHEQNYPT
         R+F+   S++  PL  L +K   + W+     + + +KQ LVS PVL   D     ++ +DAS   +G VL Q+ +      V Y S ++   + NY  
Subjt:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGN-----VVAYASRQLKNHEQNYPT

Query:  HDLELAVMVFALKIWRHYLYG--EKIQIFTNHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFE
         D E+  ++ +LK WRHYL    E  +I T+H++L    T +    N R  RW   ++D++ EI Y  G AN +ADALSR       ++    P+ +D E
Subjt:  HDLELAVMVFALKIWRHYLYG--EKIQIFTNHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFE

Query:  RAEIAVSVGEMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVL--EDSAVKTELLTEAHSSPFTMHSGSTKMY
           I        + + Q+++    + +++    ND  L+   LL    +  + +I   DGL  N +  +L   D+ +   ++ + H     +H G   + 
Subjt:  RAEIAVSVGEMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVL--EDSAVKTELLTEAHSSPFTMHSGSTKMY

Query:  QDLRRAYWWRNMKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLY
          + R + W+ +++++ ++V  C  CQ  K+   KP G LQP+   E  W S+SMDFIT LP +  GY  ++VVVDR +K A  V    + T  +  +++
Subjt:  QDLRRAYWWRNMKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLY

Query:  MTEIVRLHGVPLSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNS
           ++   G P  II+D D  FT + WK         + FS  + PQTDGQT+R NQ +E +LR        +W  H+ L++ +YNN+
Subjt:  MTEIVRLHGVPLSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNS

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.8e-9632.36Show/hide
Query:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR
        ++ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A IF+ +DL SGYHQ+ ++     +TAF +    YE+ VM FGL NAP+ +   M  
Subjt:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR

Query:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAV------------------TSY
         F+D    F  V++DDILI+S++  +H +HL  +LE L+   L  K  KC+F  ++  F  + +  ++++    K  A+                   +Y
Subjt:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAV------------------TSY

Query:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGN------VVAYASRQLKNHEQNYP
        YRRF+ + S+IA P+       +   W+   + + ++LK  L ++PVL   +   ++ + +DASK  +G VL +  N      VV Y S+ L++ ++NYP
Subjt:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGN------VVAYASRQLKNHEQNYP

Query:  THDLELAVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAE
          +LEL  ++ AL  +R+ L+G+   + T+H SL     + E   R +RWL+ +  YD  + Y  G  NVVADA+SR +      IT +     D E  +
Subjt:  THDLELAVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAE

Query:  IAVSVGEMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRR
               + S +  + ++   +  +    ++     +K+L +     +++S+  D+ + +  RL V          L   H+  F  H G T     +  
Subjt:  IAVSVGEMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRR

Query:  AYWWRNMKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIV
         Y+W  ++  +  ++  C+ CQ +K+ R +  GLLQPL + E +W+ +SMDF+TGLP T     +I VVVDR +K AHF++ + T   ++   L    I 
Subjt:  AYWWRNMKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIV

Query:  RLHGVPLSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNS
          HG P +I SDRD R T   ++ L   LG +   S+A HPQTDGQ++R  Q L  +LRA V     +W  +L  +EF YN++
Subjt:  RLHGVPLSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNS

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.4e-9532.36Show/hide
Query:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR
        ++ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A IF+ +DL SGYHQ+ ++     +TAF +    YE+ VM FGL NAP+ +   M  
Subjt:  MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNR

Query:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAV------------------TSY
         F+D    F  V++DDILI+S++  +H +HL  +LE L+   L  K  KC+F  ++  F  + +  ++++    K  A+                   +Y
Subjt:  VFKDFLDTFFIVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAV------------------TSY

Query:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGN------VVAYASRQLKNHEQNYP
        YRRF+ + S+IA P+       +   W+   + +  +LK  L ++PVL   +   ++ + +DASK  +G VL +  N      VV Y S+ L++ ++NYP
Subjt:  YRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQELKQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGN------VVAYASRQLKNHEQNYP

Query:  THDLELAVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAE
          +LEL  ++ AL  +R+ L+G+   + T+H SL     + E   R +RWL+ +  YD  + Y  G  NVVADA+SR V      IT +     D E  +
Subjt:  THDLELAVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAE

Query:  IAVSVGEMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRR
               + S +  + ++   +  +    ++     +K+L +     +++S+  D+ + +  RL V          L   H+  F  H G T     +  
Subjt:  IAVSVGEMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRLCVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRR

Query:  AYWWRNMKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIV
         Y+W  ++  +  ++  C+ CQ +K+ R +  GLLQPL + E +W+ +SMDF+TGLP T     +I VVVDR +K AHF++ + T   ++   L    I 
Subjt:  AYWWRNMKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLTKSAHFVSGKFTYTTSKWGQLYMTEIV

Query:  RLHGVPLSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNS
          HG P +I SDRD R T   ++ L   LG +   S+A HPQTDGQ++R  Q L  +LRA       +W  +L  +EF YN++
Subjt:  RLHGVPLSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNS

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein6.4e-1131.2Show/hide
Query:  HLHQILETLRANKLYAKFSKCEFWLKKVTF--SSHVVSSERVSVDPVKIEA------------------VTSYYRRFVEDFSRIASPLTQLTRKGTPFVW
        HL  +L+    ++ YA   KC F   ++ +    H++S E VS DP K+EA                  +T YYRRFV+++ +I  PLT+L +K +   W
Subjt:  HLHQILETLRANKLYAKFSKCEFWLKKVTF--SSHVVSSERVSVDPVKIEA------------------VTSYYRRFVEDFSRIASPLTQLTRKGTPFVW

Query:  SLACESSFQELKQKLVSAPVLTVLD
        +     +F+ LK  + + PVL + D
Subjt:  SLACESSFQELKQKLVSAPVLTVLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGTTTGTGAAGAAGAAGGATGGGTCGATGCGCCTTTGCATTGACTACAGAGAGCTGAACAAGGTGACAGTTAAGAATCGCTATCCCTTGCCCAGGATTGATGATTT
GTTCGATCAGTTGCAAGGAGCCACCATCTTTTCTAAGATCGACCTGCGATCAGGTTACCACCAATTGAGGATCAAGGATAGTGTTATTCCTAGGACCGCTTTCCATTCCA
GATACAGACATTACGAGTTCATTGTGATGTCTTTTGGGTTGACTAATGCTCCGGCGATATACATGGACTTGATGAACAGGGTGTTTAAGGATTTCTTAGACACGTTTTTC
ATAGTTTTCATTGATGACATTTTGATTTACTCCAAGACTAAGGCTAAGCATGAGGAGCATTTACACCAGATTTTGGAGACTCTTCGAGCTAATAAGTTGTACGCCAAGTT
CTCCAAGTGTGAGTTCTGGCTGAAGAAGGTGACTTTTTCTAGCCACGTGGTTTCCAGTGAGAGAGTTTCTGTGGACCCAGTAAAGATCGAAGCAGTTACCAGTTACTACA
GGAGGTTCGTGGAAGACTTCTCTCGTATAGCCAGTCCCTTGACCCAGTTGACCAGGAAGGGGACTCCTTTTGTTTGGAGCCTAGCTTGTGAGAGTAGCTTCCAAGAGCTC
AAGCAGAAGCTTGTGTCTGCACCAGTTTTGACAGTACTAGATGGATATGGAAGTTTCGTGATCTACAGTGATGCCTCAAAGAAAAGACTGGGTTGTGTTCTGATGCAGCA
AGGTAATGTAGTTGCTTATGCCTCTCGTCAGTTGAAGAATCATGAGCAGAACTACCCTACCCATGACCTAGAGTTGGCAGTAATGGTTTTTGCACTGAAGATATGGAGGC
ACTACTTGTACGGTGAGAAGATACAGATTTTCACTAACCATAAGAGCCTGAAGTACTTCTTCACCCAGAAGGAGTTGAACATGAGGCAGAGGAGGTGGCTTGAGTTAGTG
AAAGACTATGACTGCGAGATTTTGTATCACCTAGGTAAGGCAAACGTAGTAGCTGACGCGTTGAGTAGGAAGGTTGCGCATTCAGCAGCGCTTATCACCAACCAAGCTCC
CTTACTAAGAGATTTCGAGAGAGCTGAGATTGCAGTCTCGGTAGGGGAAATGACCTCACAATTGGCTCAGTTGACCGTGCAGTCGACCTTGAGACAGAGGATTATTGTTG
CTCAGCTAAATGATCCTTATTTGGTCGAGAAGCGTCTATTAGTAGAGGCAGGGCAAAGTGAGGATTTCTCCATATCCTCTGATGACGGACTTACTTTCAATGGACGTTTG
TGCGTGCTAGAAGACAGTGCAGTCAAGACAGAGCTTTTAACTGAGGCTCATAGTTCCCCATTTACTATGCATTCCGGAAGTACGAAGATGTACCAAGACTTGAGGCGTGC
TTATTGGTGGAGGAACATGAAGAGAGAAGTGGCAGATTTCGTTAGTAGATGTTTGGTATGCCAGCAGGTGAAGGCACCTAGACAGAAGCCAACAGGGTTGTTGCAACCCT
TGGGTGTACCAGAATGGAAATGGGTGAGTGTATCGATGGACTTCATTACAGGACTGCCTAGGACTCTAAAGGGCTATATAGTGATCTGGGTTGTTGTTGACAGACTCACG
AAGTCAGCCCACTTTGTTTCGGGGAAATTCACTTACACTACCAGTAAGTGGGGACAGTTATATATGACGGAGATAGTGAGACTACATGGAGTACCCCTATCCATCATTTC
AGACAGAGATGCTCGTTTCACATTGAAATTCTGGAAAGGACTTCAGCTAGCCTTGGGCACGAGGTTAGACTTCAGTACAGCTTTTCATCCTCAAACTGACGGTCAAACAA
AGAGGTTGAACCAGGTTCTAGAAGATATACTGCGAGCCTGTGTTCTGGAGTTCTCAGGAAGTTGGGACTCTCACTTGCACTTGATGGAATTCGCCTATAATAACAGCTAC
TAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGTTTGTGAAGAAGAAGGATGGGTCGATGCGCCTTTGCATTGACTACAGAGAGCTGAACAAGGTGACAGTTAAGAATCGCTATCCCTTGCCCAGGATTGATGATTT
GTTCGATCAGTTGCAAGGAGCCACCATCTTTTCTAAGATCGACCTGCGATCAGGTTACCACCAATTGAGGATCAAGGATAGTGTTATTCCTAGGACCGCTTTCCATTCCA
GATACAGACATTACGAGTTCATTGTGATGTCTTTTGGGTTGACTAATGCTCCGGCGATATACATGGACTTGATGAACAGGGTGTTTAAGGATTTCTTAGACACGTTTTTC
ATAGTTTTCATTGATGACATTTTGATTTACTCCAAGACTAAGGCTAAGCATGAGGAGCATTTACACCAGATTTTGGAGACTCTTCGAGCTAATAAGTTGTACGCCAAGTT
CTCCAAGTGTGAGTTCTGGCTGAAGAAGGTGACTTTTTCTAGCCACGTGGTTTCCAGTGAGAGAGTTTCTGTGGACCCAGTAAAGATCGAAGCAGTTACCAGTTACTACA
GGAGGTTCGTGGAAGACTTCTCTCGTATAGCCAGTCCCTTGACCCAGTTGACCAGGAAGGGGACTCCTTTTGTTTGGAGCCTAGCTTGTGAGAGTAGCTTCCAAGAGCTC
AAGCAGAAGCTTGTGTCTGCACCAGTTTTGACAGTACTAGATGGATATGGAAGTTTCGTGATCTACAGTGATGCCTCAAAGAAAAGACTGGGTTGTGTTCTGATGCAGCA
AGGTAATGTAGTTGCTTATGCCTCTCGTCAGTTGAAGAATCATGAGCAGAACTACCCTACCCATGACCTAGAGTTGGCAGTAATGGTTTTTGCACTGAAGATATGGAGGC
ACTACTTGTACGGTGAGAAGATACAGATTTTCACTAACCATAAGAGCCTGAAGTACTTCTTCACCCAGAAGGAGTTGAACATGAGGCAGAGGAGGTGGCTTGAGTTAGTG
AAAGACTATGACTGCGAGATTTTGTATCACCTAGGTAAGGCAAACGTAGTAGCTGACGCGTTGAGTAGGAAGGTTGCGCATTCAGCAGCGCTTATCACCAACCAAGCTCC
CTTACTAAGAGATTTCGAGAGAGCTGAGATTGCAGTCTCGGTAGGGGAAATGACCTCACAATTGGCTCAGTTGACCGTGCAGTCGACCTTGAGACAGAGGATTATTGTTG
CTCAGCTAAATGATCCTTATTTGGTCGAGAAGCGTCTATTAGTAGAGGCAGGGCAAAGTGAGGATTTCTCCATATCCTCTGATGACGGACTTACTTTCAATGGACGTTTG
TGCGTGCTAGAAGACAGTGCAGTCAAGACAGAGCTTTTAACTGAGGCTCATAGTTCCCCATTTACTATGCATTCCGGAAGTACGAAGATGTACCAAGACTTGAGGCGTGC
TTATTGGTGGAGGAACATGAAGAGAGAAGTGGCAGATTTCGTTAGTAGATGTTTGGTATGCCAGCAGGTGAAGGCACCTAGACAGAAGCCAACAGGGTTGTTGCAACCCT
TGGGTGTACCAGAATGGAAATGGGTGAGTGTATCGATGGACTTCATTACAGGACTGCCTAGGACTCTAAAGGGCTATATAGTGATCTGGGTTGTTGTTGACAGACTCACG
AAGTCAGCCCACTTTGTTTCGGGGAAATTCACTTACACTACCAGTAAGTGGGGACAGTTATATATGACGGAGATAGTGAGACTACATGGAGTACCCCTATCCATCATTTC
AGACAGAGATGCTCGTTTCACATTGAAATTCTGGAAAGGACTTCAGCTAGCCTTGGGCACGAGGTTAGACTTCAGTACAGCTTTTCATCCTCAAACTGACGGTCAAACAA
AGAGGTTGAACCAGGTTCTAGAAGATATACTGCGAGCCTGTGTTCTGGAGTTCTCAGGAAGTTGGGACTCTCACTTGCACTTGATGGAATTCGCCTATAATAACAGCTAC
TAG
Protein sequenceShow/hide protein sequence
MLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSGYHQLRIKDSVIPRTAFHSRYRHYEFIVMSFGLTNAPAIYMDLMNRVFKDFLDTFF
IVFIDDILIYSKTKAKHEEHLHQILETLRANKLYAKFSKCEFWLKKVTFSSHVVSSERVSVDPVKIEAVTSYYRRFVEDFSRIASPLTQLTRKGTPFVWSLACESSFQEL
KQKLVSAPVLTVLDGYGSFVIYSDASKKRLGCVLMQQGNVVAYASRQLKNHEQNYPTHDLELAVMVFALKIWRHYLYGEKIQIFTNHKSLKYFFTQKELNMRQRRWLELV
KDYDCEILYHLGKANVVADALSRKVAHSAALITNQAPLLRDFERAEIAVSVGEMTSQLAQLTVQSTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDGLTFNGRL
CVLEDSAVKTELLTEAHSSPFTMHSGSTKMYQDLRRAYWWRNMKREVADFVSRCLVCQQVKAPRQKPTGLLQPLGVPEWKWVSVSMDFITGLPRTLKGYIVIWVVVDRLT
KSAHFVSGKFTYTTSKWGQLYMTEIVRLHGVPLSIISDRDARFTLKFWKGLQLALGTRLDFSTAFHPQTDGQTKRLNQVLEDILRACVLEFSGSWDSHLHLMEFAYNNSY