; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G09030 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G09030
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr4:6817809..6821662
RNA-Seq ExpressionCSPI04G09030
SyntenyCSPI04G09030
Gene Ontology termsNA
InterPro domainsIPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025132.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.7e-11135.67Show/hide
Query:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------
        E  +VENLNIELSINSV+GL NPGTMKVK ++  E+V+                                     T+IKGK VC KVE+           
Subjt:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------

Query:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP
                           IG TEVDW+ LI+ F   G ++TIKGDPSLTKT+++LK M+K WD+EDQ YL+ECR L      +   D +   T    + 
Subjt:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP

Query:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH
         +L KF DVF  P  LPPQRGIEHHI+LK+G DP                                                                  
Subjt:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH

Query:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL
                                +NV P RY  QQKEEME LVDEML+  +                   G   F +   ALNNVT+PDKFPIPV+EEL
Subjt:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL

Query:  FDELNGANIY----------------------AIWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-
        FDELNGA+++                      A   H+  + F+ +                                           +  HL+ V E 
Subjt:  FDELNGANIY----------------------AIWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-

Query:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL
            E     D+         +  HY  K+G EVDPEKIRA++EW  P NVRELRGFLGLTGYYRRFVQNYG+I  PL QL K  AYKW+ E E  FEKL
Subjt:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL

Query:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------
        K+A+MTLP LAM DF+LPFEIE++ASG+GVG+VL   KKP+AYFS+ +S RDR+RPVYEREL+ VV+AVQ                              
Subjt:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------

Query:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ
                                        R+ PT HLN ++APAL+++ +IQ+EV +DP L+EI S +E+   +IP+++ HQ
Subjt:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ

KAA0049776.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.8e-11135.67Show/hide
Query:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------
        E  +VENLNIELSINSV+GL NPGTMKVK ++  E+V+                                     T+IKGK VC KVE+           
Subjt:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------

Query:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP
                           IG TEVDW+ LI+ F   G ++TIKGDPSLTKT+++LK M+K WD+EDQ YL+ECR L      +   D +   T    + 
Subjt:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP

Query:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH
         +L KF DVF  P  LPPQRGIEHHI+LK+G DP                                                                  
Subjt:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH

Query:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL
                                +NV P RY  QQKEEME LVDEML+  +                   G   F +   ALNNVT+PDKFPIPV+EEL
Subjt:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL

Query:  FDELNGANIY----------------------AIWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-
        FDELNGA+++                      A   H+  + F+ +                                           +  HL+ V E 
Subjt:  FDELNGANIY----------------------AIWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-

Query:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL
            E     D+         +  HY  K+G EVDPEKIRA++EW  P NVRELRGFLGLTGYYRRFVQNYG+I  PL QL K  AYKW+ E E  FEKL
Subjt:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL

Query:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------
        K+A+MTLP LAM DF+LPFEIE++ASG+GVG+VL   KKP+AYFS+ +S RDR+RPVYEREL+ VV+AVQ                              
Subjt:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------

Query:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ
                                        R+ PT HLN ++APAL+++ +IQ+EV +DP L+EI S +E+   +IP+++ HQ
Subjt:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ

KAA0061635.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.3e-11235.67Show/hide
Query:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------
        EV  VENLNIELSINSV+GL NPGTMKVK K+  E+V+                                     T+IKGK VC KVE+           
Subjt:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------

Query:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP
                           IG TEVDW+ LI+ F   G ++TIKGDPSLTKT+++LK M+K WD++DQ YL+ECRAL      +   D +   T +  + 
Subjt:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP

Query:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH
         +++KF DVF  P  LPPQRGIEHHI+LK+G DP                                                                  
Subjt:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH

Query:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL
                                +NV P RY  QQKEEME LVDEML   +                   G   F +   ALNNVT+PDKFPIPV+EEL
Subjt:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL

Query:  FDELNGANIYA----------------------IWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-
        FDELNGA++++                         H+  + F+ +                                           +  HL+ V E 
Subjt:  FDELNGANIYA----------------------IWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-

Query:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL
            E     D+         +  HY  K+G EVDPEKIRA++EW  P+NVRELRGFLGLTGYYRRFVQNYG+I  PL QL K   YKW+ E E  FEKL
Subjt:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL

Query:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------
        K+A+MTLP LAM DF+LPFE E++ASG+GVG+VL  AK+P+AYFS+ +SMRDR+RPVYERELI VV+AVQ                              
Subjt:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------

Query:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ
                                        R+ PT HLN ++APAL+++ +IQ+EV +DP L+EI S +E+   +IP+++ HQ
Subjt:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ

TYK15990.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.8e-11135.67Show/hide
Query:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------
        E  +VENLNIELSINSV+GL NPGTMKVK ++  E+V+                                     T+IKGK VC KVE+           
Subjt:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------

Query:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP
                           IG TEVDW+ LI+ F   G ++TIKGDPSLTKT+++LK M+K WD+EDQ YL+ECR L      +   D +   T    + 
Subjt:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP

Query:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH
         +L KF DVF  P  LPPQRGIEHHI+LK+G DP                                                                  
Subjt:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH

Query:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL
                                +NV P RY  QQKEEME LVDEML+  +                   G   F +   ALNNVT+PDKFPIPV+EEL
Subjt:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL

Query:  FDELNGANIY----------------------AIWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-
        FDELNGA+++                      A   H+  + F+ +                                           +  HL+ V E 
Subjt:  FDELNGANIY----------------------AIWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-

Query:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL
            E     D+         +  HY  K+G EVDPEKIRA++EW  P NVRELRGFLGLTGYYRRFVQNYG+I  PL QL K  AYKW+ E E  FEKL
Subjt:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL

Query:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------
        K+A+MTLP LAM DF+LPFEIE++ASG+GVG+VL   KKP+AYFS+ +S RDR+RPVYEREL+ VV+AVQ                              
Subjt:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------

Query:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ
                                        R+ PT HLN ++APAL+++ +IQ+EV +DP L+EI S +E+   +IP+++ HQ
Subjt:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ

TYK23090.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.8e-11135.67Show/hide
Query:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------
        E  +VENLNIELSINSV+GL NPGTMKVK ++  E+V+                                     T+IKGK VC KVE+           
Subjt:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------

Query:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP
                           IG TEVDW+ LI+ F   G ++TIKGDPSLTKT+++LK M+K WD+EDQ YL+ECR L      +   D +   T    + 
Subjt:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP

Query:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH
         +L KF DVF  P  LPPQRGIEHHI+LK+G DP                                                                  
Subjt:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH

Query:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL
                                +NV P RY  QQKEEME LVDEML+  +                   G   F +   ALNNVT+PDKFPIPV+EEL
Subjt:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL

Query:  FDELNGANIY----------------------AIWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-
        FDELNGA+++                      A   H+  + F+ +                                           +  HL+ V E 
Subjt:  FDELNGANIY----------------------AIWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-

Query:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL
            E     D+         +  HY  K+G EVDPEKIRA++EW  P NVRELRGFLGLTGYYRRFVQNYG+I  PL QL K  AYKW+ E E  FEKL
Subjt:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL

Query:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------
        K+A+MTLP LAM DF+LPFEIE++ASG+GVG+VL   KKP+AYFS+ +S RDR+RPVYEREL+ VV+AVQ                              
Subjt:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------

Query:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ
                                        R+ PT HLN ++APAL+++ +IQ+EV +DP L+EI S +E+   +IP+++ HQ
Subjt:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ

TrEMBL top hitse value%identityAlignment
A0A5A7SIV7 Ty3/gypsy retrotransposon protein1.8e-11135.67Show/hide
Query:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------
        E  +VENLNIELSINSV+GL NPGTMKVK ++  E+V+                                     T+IKGK VC KVE+           
Subjt:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------

Query:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP
                           IG TEVDW+ LI+ F   G ++TIKGDPSLTKT+++LK M+K WD+EDQ YL+ECR L      +   D +   T    + 
Subjt:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP

Query:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH
         +L KF DVF  P  LPPQRGIEHHI+LK+G DP                                                                  
Subjt:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH

Query:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL
                                +NV P RY  QQKEEME LVDEML+  +                   G   F +   ALNNVT+PDKFPIPV+EEL
Subjt:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL

Query:  FDELNGANIY----------------------AIWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-
        FDELNGA+++                      A   H+  + F+ +                                           +  HL+ V E 
Subjt:  FDELNGANIY----------------------AIWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-

Query:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL
            E     D+         +  HY  K+G EVDPEKIRA++EW  P NVRELRGFLGLTGYYRRFVQNYG+I  PL QL K  AYKW+ E E  FEKL
Subjt:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL

Query:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------
        K+A+MTLP LAM DF+LPFEIE++ASG+GVG+VL   KKP+AYFS+ +S RDR+RPVYEREL+ VV+AVQ                              
Subjt:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------

Query:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ
                                        R+ PT HLN ++APAL+++ +IQ+EV +DP L+EI S +E+   +IP+++ HQ
Subjt:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ

A0A5A7U6J3 Ty3/gypsy retrotransposon protein2.3e-11135.67Show/hide
Query:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------
        E  +VENLNIELSINSV+GL NPGTMKVK ++  E+V+                                     T+IKGK VC KVE+           
Subjt:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------

Query:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP
                           IG TEVDW+ LI+ F   G ++TIKGDPSLTKT+++LK M+K WD+EDQ YL+ECR L      +   D +   T    + 
Subjt:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP

Query:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH
         +L KF DVF  P  LPPQRGIEHHI+LK+G DP                                                                  
Subjt:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH

Query:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL
                                +NV P RY  QQKEEME LVDEML+  +                   G   F +   ALNNVT+PDKFPIPV+EEL
Subjt:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL

Query:  FDELNGANIY----------------------AIWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-
        FDELNGA+++                      A   H+  + F+ +                                           +  HL+ V E 
Subjt:  FDELNGANIY----------------------AIWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-

Query:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL
            E     D+         +  HY  K+G EVDPEKIRA++EW  P NVRELRGFLGLTGYYRRFVQNYG+I  PL QL K  AYKW+ E E  FEKL
Subjt:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL

Query:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------
        K+A+MTLP LAM DF+LPFEIE++ASG+GVG+VL   KKP+AYFS+ +S RDR+RPVYEREL+ VV+AVQ                              
Subjt:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------

Query:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ
                                        R+ PT HLN ++APAL+++ +IQ+EV +DP L+EI S +E+   +IP+++ HQ
Subjt:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ

A0A5A7V707 Ty3/gypsy retrotransposon protein1.6e-11235.67Show/hide
Query:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------
        EV  VENLNIELSINSV+GL NPGTMKVK K+  E+V+                                     T+IKGK VC KVE+           
Subjt:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------

Query:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP
                           IG TEVDW+ LI+ F   G ++TIKGDPSLTKT+++LK M+K WD++DQ YL+ECRAL      +   D +   T +  + 
Subjt:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP

Query:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH
         +++KF DVF  P  LPPQRGIEHHI+LK+G DP                                                                  
Subjt:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH

Query:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL
                                +NV P RY  QQKEEME LVDEML   +                   G   F +   ALNNVT+PDKFPIPV+EEL
Subjt:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL

Query:  FDELNGANIYA----------------------IWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-
        FDELNGA++++                         H+  + F+ +                                           +  HL+ V E 
Subjt:  FDELNGANIYA----------------------IWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-

Query:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL
            E     D+         +  HY  K+G EVDPEKIRA++EW  P+NVRELRGFLGLTGYYRRFVQNYG+I  PL QL K   YKW+ E E  FEKL
Subjt:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL

Query:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------
        K+A+MTLP LAM DF+LPFE E++ASG+GVG+VL  AK+P+AYFS+ +SMRDR+RPVYERELI VV+AVQ                              
Subjt:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------

Query:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ
                                        R+ PT HLN ++APAL+++ +IQ+EV +DP L+EI S +E+   +IP+++ HQ
Subjt:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ

A0A5D3CXB1 Ty3/gypsy retrotransposon protein2.3e-11135.67Show/hide
Query:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------
        E  +VENLNIELSINSV+GL NPGTMKVK ++  E+V+                                     T+IKGK VC KVE+           
Subjt:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------

Query:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP
                           IG TEVDW+ LI+ F   G ++TIKGDPSLTKT+++LK M+K WD+EDQ YL+ECR L      +   D +   T    + 
Subjt:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP

Query:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH
         +L KF DVF  P  LPPQRGIEHHI+LK+G DP                                                                  
Subjt:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH

Query:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL
                                +NV P RY  QQKEEME LVDEML+  +                   G   F +   ALNNVT+PDKFPIPV+EEL
Subjt:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL

Query:  FDELNGANIY----------------------AIWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-
        FDELNGA+++                      A   H+  + F+ +                                           +  HL+ V E 
Subjt:  FDELNGANIY----------------------AIWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-

Query:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL
            E     D+         +  HY  K+G EVDPEKIRA++EW  P NVRELRGFLGLTGYYRRFVQNYG+I  PL QL K  AYKW+ E E  FEKL
Subjt:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL

Query:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------
        K+A+MTLP LAM DF+LPFEIE++ASG+GVG+VL   KKP+AYFS+ +S RDR+RPVYEREL+ VV+AVQ                              
Subjt:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------

Query:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ
                                        R+ PT HLN ++APAL+++ +IQ+EV +DP L+EI S +E+   +IP+++ HQ
Subjt:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ

A0A5D3DI73 Ty3/gypsy retrotransposon protein2.3e-11135.67Show/hide
Query:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------
        E  +VENLNIELSINSV+GL NPGTMKVK ++  E+V+                                     T+IKGK VC KVE+           
Subjt:  EVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVI-------------------------------------TAIKGKSVCGKVEV-----------

Query:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP
                           IG TEVDW+ LI+ F   G ++TIKGDPSLTKT+++LK M+K WD+EDQ YL+ECR L      +   D +   T    + 
Subjt:  ------------------LIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECRALHSSLTVEEMYDEETKPTVKNLLP

Query:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH
         +L KF DVF  P  LPPQRGIEHHI+LK+G DP                                                                  
Subjt:  RLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRASTLHFLVVPTCSWLH

Query:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL
                                +NV P RY  QQKEEME LVDEML+  +                   G   F +   ALNNVT+PDKFPIPV+EEL
Subjt:  NQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELF------------------GRVPFHIPVPALNNVTIPDKFPIPVIEEL

Query:  FDELNGANIY----------------------AIWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-
        FDELNGA+++                      A   H+  + F+ +                                           +  HL+ V E 
Subjt:  FDELNGANIY----------------------AIWAHQCPFNFLSI-------------------------------------------NEFHLQAVYE-

Query:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL
            E     D+         +  HY  K+G EVDPEKIRA++EW  P NVRELRGFLGLTGYYRRFVQNYG+I  PL QL K  AYKW+ E E  FEKL
Subjt:  ----EFCQGTDRV--------FRAHYF-KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKL

Query:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------
        K+A+MTLP LAM DF+LPFEIE++ASG+GVG+VL   KKP+AYFS+ +S RDR+RPVYEREL+ VV+AVQ                              
Subjt:  KRAIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ------------------------------

Query:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ
                                        R+ PT HLN ++APAL+++ +IQ+EV +DP L+EI S +E+   +IP+++ HQ
Subjt:  --------------------------------RVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKHQSDIPNFSVHQ

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.0e-1935.66Show/hide
Query:  EGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPK--VSAYKWSAEAEIFFEKLKRAIMTLPTLAMLDFSLPFEIETNASG
        +G + +PEKI AI+++ IPT  +E++ FLGLTGYYR+F+ N+ +I  P+ +  K  +     + E +  F+KLK  I   P L + DF+  F + T+AS 
Subjt:  EGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPK--VSAYKWSAEAEIFFEKLKRAIMTLPTLAMLDFSLPFEIETNASG

Query:  YGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ
          +G+VL+    P++Y S+T++  + +    E+EL+ +V+A +
Subjt:  YGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ

P10401 Retrovirus-related Pol polyprotein from transposon gypsy6.1e-1628.33Show/hide
Query:  KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPK-----VSAYKWSAEAEIFFEKLKRAIMTLPT--------LAMLDF
        K+G + DPEK++AI+E+  P  V ++R FLGL  YYR F++++  I  P+  + K     VS +        F E  + A   L          L   DF
Subjt:  KEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPK-----VSAYKWSAEAEIFFEKLKRAIMTLPT--------LAMLDF

Query:  SLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQRVPPTVHLNHISAPALMN-------LTIIQEEVDRDPKLKEIKSS
          PF++ T+AS  G+G+VL+   +PI   S+T+   +++    EREL+ +V+A+ ++      N +     +N       LT    + + + K+K  KS 
Subjt:  SLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQRVPPTVHLNHISAPALMN-------LTIIQEEVDRDPKLKEIKSS

Query:  VEKHQSDIPNFSVHQEHGVDETLEGQTL--AENRTEETAA
        +++H + +  +   +E+ V + L  Q L   +N  +  AA
Subjt:  VEKHQSDIPNFSVHQEHGVDETLEGQTL--AENRTEETAA

P20825 Retrovirus-related Pol polyprotein from transposon 2977.6e-1936.36Show/hide
Query:  EGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIF--FEKLKRAIMTLPTLAMLDFSLPFEIETNASG
        +G + +P K++AI  + IPT  +E+R FLGLTGYYR+F+ NY +I  P+    K      + + E    FEKLK  I+  P L + DF   F + T+AS 
Subjt:  EGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIF--FEKLKRAIMTLPTLAMLDFSLPFEIETNASG

Query:  YGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ
          +G+VL+    PI++ S+T++  + +    E+EL+ +V+A +
Subjt:  YGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQ

P92523 Uncharacterized mitochondrial protein AtMg008606.3e-2155.56Show/hide
Query:  EGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKLKRAIMTLPTLAMLDFSLPF
        EG   DP K+ A+  W  P N  ELRGFLGLTGYYRRFV+NYG IV PL +L K ++ KW+  A + F+ LK A+ TLP LA+ D  LPF
Subjt:  EGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKLKRAIMTLPTLAMLDFSLPF

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus9.3e-1727.49Show/hide
Query:  EGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPK--VSAYKWSAEAEI----------FFEKLKRAIMTLPTLAMLDFSL
        +G + DP+K+RAI E   PT+V+EL+ FLG+T YYR+F+Q+Y  +  PL  L +   +  K S  +++           F  LK  + +   LA   F+ 
Subjt:  EGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPK--VSAYKWSAEAEI----------FFEKLKRAIMTLPTLAMLDFSL

Query:  PFEIETNASGYGVGSVLTP----AKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQRVPP--------TVHLNHISAPALMNLTIIQEEVDRDPKLKEI
        PF + T+AS + +G+VL+       +PIAY S++++  + +    E+E++ +++++  +           V+ +H        LT      + + KLK  
Subjt:  PFEIETNASGYGVGSVLTP----AKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQRVPP--------TVHLNHISAPALMNLTIIQEEVDRDPKLKEI

Query:  KSSVEKHQSDI
        K+ +E++  ++
Subjt:  KSSVEKHQSDI

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein4.4e-2255.56Show/hide
Query:  EGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKLKRAIMTLPTLAMLDFSLPF
        EG   DP K+ A+  W  P N  ELRGFLGLTGYYRRFV+NYG IV PL +L K ++ KW+  A + F+ LK A+ TLP LA+ D  LPF
Subjt:  EGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKLKRAIMTLPTLAMLDFSLPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGTGGGAGCTGTTGAAAATCTCAACATTGAGTTATCTATCAACTCAGTGATGGGATTAAACAACCCTGGAACCATGAAAGTGAAGGAGAAAGTGAAGAATGAGGA
TGTCATAACGGCTATTAAAGGAAAAAGCGTGTGCGGTAAAGTGGAAGTTTTGATTGGTGTTACCGAAGTGGATTGGAGGAAGTTAATTATGAATTTTCAGCATGATGGGA
AGAGGATTACAATCAAAGGTGATCCTAGCTTAACGAAGACAAGGGTTAATCTGAAGCATATGATGAAGATTTGGGATAGTGAAGATCAGAGTTATTTAATAGAGTGTCGA
GCCTTACACAGTAGTTTAACTGTAGAAGAGATGTATGATGAGGAGACGAAACCCACAGTCAAAAACTTACTGCCCCGTCTGTTAACTAAGTTCGATGATGTGTTCAGTTG
GCCCGAGACACTTCCACCACAACGTGGGATTGAACATCATATCCATCTCAAAGAAGGGACGGATCCAACTCCCTCTCTCCCTCACCTGCCTTCCTATCTCTCAAGGAATA
ACAGTAGGATTCTTTACAACCAAAATTCTCTGCTTTCTCCCTTCCTTCTCTTCTCTATTTATATTTGCTCCCTCACTCCCCTACCGTGGTCCCCCTTAGTGGATCGTGCG
TCGACGCTCCATTTTCTTGTGGTCCCCACCTGCAGTTGGTTACATAACCAACTCCTCGTCCTTCTTCTCTTTCCTGTTCCTTCTCCTACTATACTGGTGTATGATGGGTG
GTCTAACATAAACGTGTGGCCATGTAGATATGTAGACCAACAAAAAGAGGAAATGGAATGTTTGGTGGATGAGATGTTGGCTGAGGAATTATTTGGACGAGTACCATTTC
ATATTCCAGTCCCAGCGCTAAACAATGTAACAATCCCCGATAAATTTCCTATTCCAGTTATTGAAGAGCTCTTTGATGAACTTAATGGAGCGAATATTTATGCCATTTGG
GCTCACCAATGCCCCTTCAACTTTCTAAGCATTAATGAATTCCATCTTCAGGCCGTATATGAGGAGTTTTGCCAAGGCACGGATAGGGTATTTAGGGCACATTATTTCAA
AGAAGGGCGTGAAGTTGATCCTGAGAAAATACGAGCTATTAGGGAATGGTCAATTCCAACTAATGTTCGAGAGTTACGGGGATTTCTAGGATTGACGGGTTATTATAGAA
GATTTGTCCAAAATTATGGTAATATTGTCGGACCATTAATTCAATTACCAAAAGTGAGTGCCTATAAATGGAGTGCAGAAGCTGAAATCTTCTTTGAAAAATTGAAGAGG
GCAATAATGACTTTGCCAACGTTGGCTATGCTTGATTTTAGTTTGCCATTCGAAATCGAAACGAATGCTTCGGGGTATGGAGTTGGGTCAGTCTTGACACCAGCTAAGAA
ACCTATTGCTTACTTCAGTCAAACTGTGAGTATGAGAGATAGATCGCGGCCAGTTTATGAAAGAGAGTTGATTGATGTGGTGTTTGCAGTTCAAAGAGTACCTCCAACAG
TTCATCTTAATCATATATCAGCACCAGCATTGATGAATTTAACAATTATTCAAGAAGAGGTTGACCGAGATCCGAAGTTGAAGGAGATCAAAAGCAGTGTGGAGAAGCAC
CAATCTGATATTCCAAATTTTTCGGTACATCAAGAGCATGGTGTCGATGAAACCCTAGAAGGTCAAACACTTGCTGAAAACCGAACAGAGGAGACAGCCGCCATCAGCGC
CGTCGTTGCCACTGTTGTCAATGCTCGTATAGAGACCATCGAACCTACCCATCTCATTCAGAAAGGGCCGACCAACCCATCCCTACCACCCTTGCCTTCATTCGCAGCCG
TTAATGCCTCCCATGCGCCGCTTCATGTTCTGCCATCTACCTCCGTCGAGCCACCGTTGCTTCTGCTGACCAACTTGTATACCCTGCCATCCACTGCCTCTATTCCAAAC
CCTAGCTATCGTCCCGAGGTTGGGCATCCCCAAATCTACTCAACATTTGAAGTTGGAGAGTCTTCGGCACATTCGAACCCTAACGTGCCAATCTCTTCTTCTTTGAATAT
AGCCCACCAGCAACTAGAGGAGCTTAAACAGCAGATTGCGGCACTTGAGGCTACCTTAGGGGCAACATCCAATACCCCTAGTCATGATCTCCATACGTCTTCACCCGTGT
ATTCTGAGAATCCGTTAGTGAACATGGTTCTTGAAGGACGACATAAATTCGATTTTCTGACGAGGGAAATACCTCGTCCTTCGCCTGGGGACCCTCAGGAACGGTACTGG
AAGGGGGAGGATTCTTTTCTTCGATCCACATTAATCAATAGTATGGAGCCACAGATTGGCAAACCTTTACTCCACACTGCAACTGCTAAGGATATTTGGGACACAACCTA
G
mRNA sequenceShow/hide mRNA sequence
ATGGAGGTGGGAGCTGTTGAAAATCTCAACATTGAGTTATCTATCAACTCAGTGATGGGATTAAACAACCCTGGAACCATGAAAGTGAAGGAGAAAGTGAAGAATGAGGA
TGTCATAACGGCTATTAAAGGAAAAAGCGTGTGCGGTAAAGTGGAAGTTTTGATTGGTGTTACCGAAGTGGATTGGAGGAAGTTAATTATGAATTTTCAGCATGATGGGA
AGAGGATTACAATCAAAGGTGATCCTAGCTTAACGAAGACAAGGGTTAATCTGAAGCATATGATGAAGATTTGGGATAGTGAAGATCAGAGTTATTTAATAGAGTGTCGA
GCCTTACACAGTAGTTTAACTGTAGAAGAGATGTATGATGAGGAGACGAAACCCACAGTCAAAAACTTACTGCCCCGTCTGTTAACTAAGTTCGATGATGTGTTCAGTTG
GCCCGAGACACTTCCACCACAACGTGGGATTGAACATCATATCCATCTCAAAGAAGGGACGGATCCAACTCCCTCTCTCCCTCACCTGCCTTCCTATCTCTCAAGGAATA
ACAGTAGGATTCTTTACAACCAAAATTCTCTGCTTTCTCCCTTCCTTCTCTTCTCTATTTATATTTGCTCCCTCACTCCCCTACCGTGGTCCCCCTTAGTGGATCGTGCG
TCGACGCTCCATTTTCTTGTGGTCCCCACCTGCAGTTGGTTACATAACCAACTCCTCGTCCTTCTTCTCTTTCCTGTTCCTTCTCCTACTATACTGGTGTATGATGGGTG
GTCTAACATAAACGTGTGGCCATGTAGATATGTAGACCAACAAAAAGAGGAAATGGAATGTTTGGTGGATGAGATGTTGGCTGAGGAATTATTTGGACGAGTACCATTTC
ATATTCCAGTCCCAGCGCTAAACAATGTAACAATCCCCGATAAATTTCCTATTCCAGTTATTGAAGAGCTCTTTGATGAACTTAATGGAGCGAATATTTATGCCATTTGG
GCTCACCAATGCCCCTTCAACTTTCTAAGCATTAATGAATTCCATCTTCAGGCCGTATATGAGGAGTTTTGCCAAGGCACGGATAGGGTATTTAGGGCACATTATTTCAA
AGAAGGGCGTGAAGTTGATCCTGAGAAAATACGAGCTATTAGGGAATGGTCAATTCCAACTAATGTTCGAGAGTTACGGGGATTTCTAGGATTGACGGGTTATTATAGAA
GATTTGTCCAAAATTATGGTAATATTGTCGGACCATTAATTCAATTACCAAAAGTGAGTGCCTATAAATGGAGTGCAGAAGCTGAAATCTTCTTTGAAAAATTGAAGAGG
GCAATAATGACTTTGCCAACGTTGGCTATGCTTGATTTTAGTTTGCCATTCGAAATCGAAACGAATGCTTCGGGGTATGGAGTTGGGTCAGTCTTGACACCAGCTAAGAA
ACCTATTGCTTACTTCAGTCAAACTGTGAGTATGAGAGATAGATCGCGGCCAGTTTATGAAAGAGAGTTGATTGATGTGGTGTTTGCAGTTCAAAGAGTACCTCCAACAG
TTCATCTTAATCATATATCAGCACCAGCATTGATGAATTTAACAATTATTCAAGAAGAGGTTGACCGAGATCCGAAGTTGAAGGAGATCAAAAGCAGTGTGGAGAAGCAC
CAATCTGATATTCCAAATTTTTCGGTACATCAAGAGCATGGTGTCGATGAAACCCTAGAAGGTCAAACACTTGCTGAAAACCGAACAGAGGAGACAGCCGCCATCAGCGC
CGTCGTTGCCACTGTTGTCAATGCTCGTATAGAGACCATCGAACCTACCCATCTCATTCAGAAAGGGCCGACCAACCCATCCCTACCACCCTTGCCTTCATTCGCAGCCG
TTAATGCCTCCCATGCGCCGCTTCATGTTCTGCCATCTACCTCCGTCGAGCCACCGTTGCTTCTGCTGACCAACTTGTATACCCTGCCATCCACTGCCTCTATTCCAAAC
CCTAGCTATCGTCCCGAGGTTGGGCATCCCCAAATCTACTCAACATTTGAAGTTGGAGAGTCTTCGGCACATTCGAACCCTAACGTGCCAATCTCTTCTTCTTTGAATAT
AGCCCACCAGCAACTAGAGGAGCTTAAACAGCAGATTGCGGCACTTGAGGCTACCTTAGGGGCAACATCCAATACCCCTAGTCATGATCTCCATACGTCTTCACCCGTGT
ATTCTGAGAATCCGTTAGTGAACATGGTTCTTGAAGGACGACATAAATTCGATTTTCTGACGAGGGAAATACCTCGTCCTTCGCCTGGGGACCCTCAGGAACGGTACTGG
AAGGGGGAGGATTCTTTTCTTCGATCCACATTAATCAATAGTATGGAGCCACAGATTGGCAAACCTTTACTCCACACTGCAACTGCTAAGGATATTTGGGACACAACCTA
G
Protein sequenceShow/hide protein sequence
MEVGAVENLNIELSINSVMGLNNPGTMKVKEKVKNEDVITAIKGKSVCGKVEVLIGVTEVDWRKLIMNFQHDGKRITIKGDPSLTKTRVNLKHMMKIWDSEDQSYLIECR
ALHSSLTVEEMYDEETKPTVKNLLPRLLTKFDDVFSWPETLPPQRGIEHHIHLKEGTDPTPSLPHLPSYLSRNNSRILYNQNSLLSPFLLFSIYICSLTPLPWSPLVDRA
STLHFLVVPTCSWLHNQLLVLLLFPVPSPTILVYDGWSNINVWPCRYVDQQKEEMECLVDEMLAEELFGRVPFHIPVPALNNVTIPDKFPIPVIEELFDELNGANIYAIW
AHQCPFNFLSINEFHLQAVYEEFCQGTDRVFRAHYFKEGREVDPEKIRAIREWSIPTNVRELRGFLGLTGYYRRFVQNYGNIVGPLIQLPKVSAYKWSAEAEIFFEKLKR
AIMTLPTLAMLDFSLPFEIETNASGYGVGSVLTPAKKPIAYFSQTVSMRDRSRPVYERELIDVVFAVQRVPPTVHLNHISAPALMNLTIIQEEVDRDPKLKEIKSSVEKH
QSDIPNFSVHQEHGVDETLEGQTLAENRTEETAAISAVVATVVNARIETIEPTHLIQKGPTNPSLPPLPSFAAVNASHAPLHVLPSTSVEPPLLLLTNLYTLPSTASIPN
PSYRPEVGHPQIYSTFEVGESSAHSNPNVPISSSLNIAHQQLEELKQQIAALEATLGATSNTPSHDLHTSSPVYSENPLVNMVLEGRHKFDFLTREIPRPSPGDPQERYW
KGEDSFLRSTLINSMEPQIGKPLLHTATAKDIWDTT