; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0000602 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0000602
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionReverse transcriptase
Genome locationchr08:14873051..14885921
RNA-Seq ExpressionPay0000602
SyntenyPay0000602
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042023.1 pol protein [Cucumis melo var. makuwa]3.3e-29462.59Show/hide
Query:  PVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQLNKVTIR
        PVV+++ +VFP++L GLPP REIEF IEL PGT PIS+APY MAP+ELKELK+QLQEL+DKG+IRPSVSPWGAP+LFVKKKDG++RL IDY++LNKVT++
Subjt:  PVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQLNKVTIR

Query:  NKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT-----------------------------------------------------
        N+YPLP IDDLFDQL+GA +FSKIDLRSGYHQL++++ D+ KTAFR+                                                     
Subjt:  NKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------RQRIIDAQSNDPYLVEK
                                                                                           RQ+IIDAQS+DPYL EK
Subjt:  -----------------------------------------------------------------------------------RQRIIDAQSNDPYLVEK

Query:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPL
        RGLAEAGQAVEFS+SSDGGLLFERRLCVP DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMK+EVAEFVS+CL CQQVKAPRQKPAGLLQPL
Subjt:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPL

Query:  SIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
        S+P+WKWENVSMDFITGLPRTLRGFTV WVVVDRLTKSAHF+P KSTYT SKWAQLY+SEIVRLHGVPVSI+SDR ARFTSKFWKGLQ AMGTRLDFST 
Subjt:  SIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA

Query:  FHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRS----------RLMGPELVQSTNEAIQKIRSRMHTA
        FHPQTDGQTERLNQVLEDMLRACALEFPGSWDSH HLMEFAYNNS+QATIG+APFEALYGKCCRS          RLMGPELVQSTN+AIQKIRSRMHTA
Subjt:  FHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRS----------RLMGPELVQSTNEAIQKIRSRMHTA

Query:  QSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLKID
        QSRQKSYADVRRKDLEF+VGDKVFLKVAPM+GVLRFERRGKLSPRFVGPFE LERIGPVAYRLALPPSLS VHDVFHVSMLR YVPDPSHVVDYEPL+ID
Subjt:  QSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLKID

Query:  ENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE
        ENLSY EQPVEVLAREVK LRN+EIPLVKVLW+NHRVEEATWEREDDMR+RYP+ FEE
Subjt:  ENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE

KAA0045277.1 pol protein [Cucumis melo var. makuwa]3.7e-29864.62Show/hide
Query:  PVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQLNKVTIR
        PVV+++ +VFP++  GLPP RE+EF IEL PGT PIS+APY+MAP+ELKELK+QLQEL+DKG+IRPSVSPWGAP+LFVKKKDG++RLCIDYR+LNKVT++
Subjt:  PVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQLNKVTIR

Query:  NKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT-----------------------------------------------------
        N+YPLP IDDLFDQL+GA +FSKIDLRSGYHQL++++ D+ KTAF +                                                     
Subjt:  NKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------RQRIIDAQSNDPYLVEKRGLAEAGQAVEFS
                                                                              RQRIIDAQSNDPYLVEKRGLAEAGQAVEFS
Subjt:  ----------------------------------------------------------------------RQRIIDAQSNDPYLVEKRGLAEAGQAVEFS

Query:  LSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPKWKWENVSMD
        +SSDGGLLFERRLCVPS+SA+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRE AEFVS+CLVCQQVKAPRQKPAGLLQPL+IP+WKWENVSMD
Subjt:  LSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPKWKWENVSMD

Query:  FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN
        FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLYMSEIVRLHGVPVSIVSDRDARFT KF KGLQTAMGTRLDFSTAFHPQTDGQTERLN
Subjt:  FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN

Query:  QVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRS----------RLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRK
        QVLEDMLRACALEFPGSWDSHLHLMEFAYNNS+QATIGMAPFEALYGKCCRS          RLMGPELVQSTNEAIQKIRSRM TAQSRQKSYADVR+K
Subjt:  QVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRS----------RLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRK

Query:  DLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLKIDENLSYAEQPVEVL
        DLEF+VGDKVFLKVAPM+GVLRFERRGKLSPRFVGPFEILERI PVAYRLALPPSLS VHDVFHVSMLRKY+ DPSHVVDYEPL+IDENLSY +QPVEVL
Subjt:  DLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLKIDENLSYAEQPVEVL

Query:  AREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE
        AREVK LRN+EIPLVKVLW NHRVEEATWEREDDMRSRYP+LFEE
Subjt:  AREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE

KAA0050673.1 pol protein [Cucumis melo var. makuwa]1.9e-29461.74Show/hide
Query:  PVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQLNKVTIR
        P+V+ + +VFP++L  LPP RE+EF IEL PGT PIS+APYRMAP+ELKELK+QLQ+L+DKG+IRPSVSPWGA +LFVKKKDG++RLCIDYR+LNKVT++
Subjt:  PVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQLNKVTIR

Query:  NKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT-----------------------------------------------------
        N YPLP IDDLFDQL+GA +FSKIDLRSGYHQL++++ D+ KTAFR+                                                     
Subjt:  NKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVY
                           RQRIIDAQSNDPYLVEKRGLAEAGQAV FS+SSDGGL+FERRLCVPSDSAVKTELLSEAHS PFSMHPGSTKMYQDLKRVY
Subjt:  -------------------RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVY

Query:  WWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRL
        WWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLS+P+WKWENVSMDFITGLPRTLRGFTVIWVVVDR TKSAHFVPGKSTYT SKWAQLYMSEIVRL
Subjt:  WWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRL

Query:  HGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCR
        HGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD QTE LNQVLEDMLRACALEFPGSWDSHLHLMEFAY NSYQATI MAPFEALYGKCCR
Subjt:  HGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCR

Query:  S----------RLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLA
        S          RLMGPEL QSTNEAIQKIRSRMHTAQSRQK YADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLA
Subjt:  S----------RLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLA

Query:  LPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLKIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE
        LPPSLSTVHDVFHVSMLRK+VPDPSH+VDYEPL+IDENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYP+LFEE
Subjt:  LPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLKIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE

KAA0051522.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.5e-29963.56Show/hide
Query:  PVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQLNKVTIR
        PVV+++ +VFP++LSGLPP RE+EF IEL PGT PIS+APYRMAP+ELKELK+QLQEL+DK +I+PSVSPWGAP+LFVKKKDG++RLCIDYR+LNKVT++
Subjt:  PVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQLNKVTIR

Query:  NKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT-----------------------------------------------------
        N+YPLP IDDLFD+L+GA +FSKIDLRSGYHQL+++  D+ KTAFR+                                                     
Subjt:  NKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------------RQRII
                                                                                                       RQRII
Subjt:  -----------------------------------------------------------------------------------------------RQRII

Query:  DAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKA
        DAQSNDPYLVEKRG AEA QAVEFS+SSDGGLLF RRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKA
Subjt:  DAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKA

Query:  PRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQ
        PRQKPAGLLQPLSIP+WKWENVSMDFITGLPRTLRGFTVIWVVV+RLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFW+GLQ
Subjt:  PRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQ

Query:  TAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRS----------RLMGPELVQSTNE
        TAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGM PFEALYGKCCRS          RLMGPELVQSTNE
Subjt:  TAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRS----------RLMGPELVQSTNE

Query:  AIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDP
        AIQKIRS MHTAQ+RQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSP FVGPFE+LERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDP
Subjt:  AIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDP

Query:  SHVVDYEPLKIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE
        SHVVDYEPL+IDENLSY EQPV VLAREVK LRN+EIP VK+LWRNHRVEEATWEREDDMRSRYP+LFEE
Subjt:  SHVVDYEPLKIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE

KAA0062245.1 pol protein [Cucumis melo var. makuwa]1.5e-29960.21Show/hide
Query:  VITVQREKLKPEDVPVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLR
        V+  +   +     PVV+++ +VFP++L GLP  RE+EF IEL PGT PIS+APYRMAP+ELKELK+QLQEL+DKG+IRPSVSPWGAP+LFVKKKDG++R
Subjt:  VITVQREKLKPEDVPVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLR

Query:  LCIDYRQLNKVTIRNKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT---------------------------------------
        LCIDYR+LNKVT++N+YPLP IDDLFDQL+GA +FSKIDLRSGYHQL++++ D+ KTAFR+                                       
Subjt:  LCIDYRQLNKVTIRNKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT---------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDG
                                                                         RQRIIDAQSNDPYLVEKRGLAEAGQA EFSLSSDG
Subjt:  -----------------------------------------------------------------RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDG

Query:  GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGL
        GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAP QKPAGLLQPLSIP+WKWENVSMDFITGL
Subjt:  GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGL

Query:  PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLED
        PRTLRGF+VIWVVVDRLTKSAHFV GKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLED
Subjt:  PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLED

Query:  MLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRS----------RLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFE
        MLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCC+S          RLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFE
Subjt:  MLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRS----------RLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFE

Query:  VGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLKIDENLSYAEQPVEVLAREVK
        VGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGP+AYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPL+IDENLSYAEQPVEVLAREVK
Subjt:  VGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLKIDENLSYAEQPVEVLAREVK

Query:  TLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE
        TLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE
Subjt:  TLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE

TrEMBL top hitse value%identityAlignment
A0A5A7TFN7 Pol protein1.6e-29462.59Show/hide
Query:  PVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQLNKVTIR
        PVV+++ +VFP++L GLPP REIEF IEL PGT PIS+APY MAP+ELKELK+QLQEL+DKG+IRPSVSPWGAP+LFVKKKDG++RL IDY++LNKVT++
Subjt:  PVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQLNKVTIR

Query:  NKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT-----------------------------------------------------
        N+YPLP IDDLFDQL+GA +FSKIDLRSGYHQL++++ D+ KTAFR+                                                     
Subjt:  NKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------RQRIIDAQSNDPYLVEK
                                                                                           RQ+IIDAQS+DPYL EK
Subjt:  -----------------------------------------------------------------------------------RQRIIDAQSNDPYLVEK

Query:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPL
        RGLAEAGQAVEFS+SSDGGLLFERRLCVP DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMK+EVAEFVS+CL CQQVKAPRQKPAGLLQPL
Subjt:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPL

Query:  SIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
        S+P+WKWENVSMDFITGLPRTLRGFTV WVVVDRLTKSAHF+P KSTYT SKWAQLY+SEIVRLHGVPVSI+SDR ARFTSKFWKGLQ AMGTRLDFST 
Subjt:  SIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA

Query:  FHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRS----------RLMGPELVQSTNEAIQKIRSRMHTA
        FHPQTDGQTERLNQVLEDMLRACALEFPGSWDSH HLMEFAYNNS+QATIG+APFEALYGKCCRS          RLMGPELVQSTN+AIQKIRSRMHTA
Subjt:  FHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRS----------RLMGPELVQSTNEAIQKIRSRMHTA

Query:  QSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLKID
        QSRQKSYADVRRKDLEF+VGDKVFLKVAPM+GVLRFERRGKLSPRFVGPFE LERIGPVAYRLALPPSLS VHDVFHVSMLR YVPDPSHVVDYEPL+ID
Subjt:  QSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLKID

Query:  ENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE
        ENLSY EQPVEVLAREVK LRN+EIPLVKVLW+NHRVEEATWEREDDMR+RYP+ FEE
Subjt:  ENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE

A0A5A7TTW5 Pol protein1.8e-29864.62Show/hide
Query:  PVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQLNKVTIR
        PVV+++ +VFP++  GLPP RE+EF IEL PGT PIS+APY+MAP+ELKELK+QLQEL+DKG+IRPSVSPWGAP+LFVKKKDG++RLCIDYR+LNKVT++
Subjt:  PVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQLNKVTIR

Query:  NKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT-----------------------------------------------------
        N+YPLP IDDLFDQL+GA +FSKIDLRSGYHQL++++ D+ KTAF +                                                     
Subjt:  NKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------RQRIIDAQSNDPYLVEKRGLAEAGQAVEFS
                                                                              RQRIIDAQSNDPYLVEKRGLAEAGQAVEFS
Subjt:  ----------------------------------------------------------------------RQRIIDAQSNDPYLVEKRGLAEAGQAVEFS

Query:  LSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPKWKWENVSMD
        +SSDGGLLFERRLCVPS+SA+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRE AEFVS+CLVCQQVKAPRQKPAGLLQPL+IP+WKWENVSMD
Subjt:  LSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPKWKWENVSMD

Query:  FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN
        FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLYMSEIVRLHGVPVSIVSDRDARFT KF KGLQTAMGTRLDFSTAFHPQTDGQTERLN
Subjt:  FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN

Query:  QVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRS----------RLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRK
        QVLEDMLRACALEFPGSWDSHLHLMEFAYNNS+QATIGMAPFEALYGKCCRS          RLMGPELVQSTNEAIQKIRSRM TAQSRQKSYADVR+K
Subjt:  QVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRS----------RLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRK

Query:  DLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLKIDENLSYAEQPVEVL
        DLEF+VGDKVFLKVAPM+GVLRFERRGKLSPRFVGPFEILERI PVAYRLALPPSLS VHDVFHVSMLRKY+ DPSHVVDYEPL+IDENLSY +QPVEVL
Subjt:  DLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLKIDENLSYAEQPVEVL

Query:  AREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE
        AREVK LRN+EIPLVKVLW NHRVEEATWEREDDMRSRYP+LFEE
Subjt:  AREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE

A0A5A7U470 Pol protein9.3e-29561.74Show/hide
Query:  PVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQLNKVTIR
        P+V+ + +VFP++L  LPP RE+EF IEL PGT PIS+APYRMAP+ELKELK+QLQ+L+DKG+IRPSVSPWGA +LFVKKKDG++RLCIDYR+LNKVT++
Subjt:  PVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQLNKVTIR

Query:  NKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT-----------------------------------------------------
        N YPLP IDDLFDQL+GA +FSKIDLRSGYHQL++++ D+ KTAFR+                                                     
Subjt:  NKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVY
                           RQRIIDAQSNDPYLVEKRGLAEAGQAV FS+SSDGGL+FERRLCVPSDSAVKTELLSEAHS PFSMHPGSTKMYQDLKRVY
Subjt:  -------------------RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVY

Query:  WWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRL
        WWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLS+P+WKWENVSMDFITGLPRTLRGFTVIWVVVDR TKSAHFVPGKSTYT SKWAQLYMSEIVRL
Subjt:  WWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRL

Query:  HGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCR
        HGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD QTE LNQVLEDMLRACALEFPGSWDSHLHLMEFAY NSYQATI MAPFEALYGKCCR
Subjt:  HGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCR

Query:  S----------RLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLA
        S          RLMGPEL QSTNEAIQKIRSRMHTAQSRQK YADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLA
Subjt:  S----------RLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLA

Query:  LPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLKIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE
        LPPSLSTVHDVFHVSMLRK+VPDPSH+VDYEPL+IDENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYP+LFEE
Subjt:  LPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLKIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE

A0A5A7U6Z4 Ty3-gypsy retrotransposon protein7.3e-30063.56Show/hide
Query:  PVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQLNKVTIR
        PVV+++ +VFP++LSGLPP RE+EF IEL PGT PIS+APYRMAP+ELKELK+QLQEL+DK +I+PSVSPWGAP+LFVKKKDG++RLCIDYR+LNKVT++
Subjt:  PVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQLNKVTIR

Query:  NKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT-----------------------------------------------------
        N+YPLP IDDLFD+L+GA +FSKIDLRSGYHQL+++  D+ KTAFR+                                                     
Subjt:  NKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------------RQRII
                                                                                                       RQRII
Subjt:  -----------------------------------------------------------------------------------------------RQRII

Query:  DAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKA
        DAQSNDPYLVEKRG AEA QAVEFS+SSDGGLLF RRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKA
Subjt:  DAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKA

Query:  PRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQ
        PRQKPAGLLQPLSIP+WKWENVSMDFITGLPRTLRGFTVIWVVV+RLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFW+GLQ
Subjt:  PRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQ

Query:  TAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRS----------RLMGPELVQSTNE
        TAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGM PFEALYGKCCRS          RLMGPELVQSTNE
Subjt:  TAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRS----------RLMGPELVQSTNE

Query:  AIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDP
        AIQKIRS MHTAQ+RQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSP FVGPFE+LERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDP
Subjt:  AIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDP

Query:  SHVVDYEPLKIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE
        SHVVDYEPL+IDENLSY EQPV VLAREVK LRN+EIP VK+LWRNHRVEEATWEREDDMRSRYP+LFEE
Subjt:  SHVVDYEPLKIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE

A0A5A7V8L8 Pol protein7.3e-30060.21Show/hide
Query:  VITVQREKLKPEDVPVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLR
        V+  +   +     PVV+++ +VFP++L GLP  RE+EF IEL PGT PIS+APYRMAP+ELKELK+QLQEL+DKG+IRPSVSPWGAP+LFVKKKDG++R
Subjt:  VITVQREKLKPEDVPVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLR

Query:  LCIDYRQLNKVTIRNKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT---------------------------------------
        LCIDYR+LNKVT++N+YPLP IDDLFDQL+GA +FSKIDLRSGYHQL++++ D+ KTAFR+                                       
Subjt:  LCIDYRQLNKVTIRNKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRT---------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDG
                                                                         RQRIIDAQSNDPYLVEKRGLAEAGQA EFSLSSDG
Subjt:  -----------------------------------------------------------------RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDG

Query:  GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGL
        GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAP QKPAGLLQPLSIP+WKWENVSMDFITGL
Subjt:  GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGL

Query:  PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLED
        PRTLRGF+VIWVVVDRLTKSAHFV GKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLED
Subjt:  PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLED

Query:  MLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRS----------RLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFE
        MLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCC+S          RLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFE
Subjt:  MLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRS----------RLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFE

Query:  VGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLKIDENLSYAEQPVEVLAREVK
        VGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGP+AYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPL+IDENLSYAEQPVEVLAREVK
Subjt:  VGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLKIDENLSYAEQPVEVLAREVK

Query:  TLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE
        TLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE
Subjt:  TLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPDLFEE

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein5.1e-5632.26Show/hide
Query:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL
        + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C 
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL

Query:  VCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTS
         CQ  K+   KP G LQP+   +  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS
Subjt:  VCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTS

Query:  KFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSRLMGP-------ELV
        + WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      S L  P       E  
Subjt:  KFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSRLMGP-------ELV

Query:  QSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSML
        Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  +    FHVS L
Subjt:  QSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSML

Query:  RKY
         KY
Subjt:  RKY

P0CT34 Transposon Tf2-1 polyprotein1.3e-2230.52Show/hide
Query:  KPEDVPVVKEFLEVFPD-DLSGLP-PDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQ
        +PE   + KEF ++  + +   LP P + +EF +EL      +    Y + P +++ +  ++ + +  G IR S +    P++FV KK+GTLR+ +DY+ 
Subjt:  KPEDVPVVKEFLEVFPD-DLSGLP-PDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQ

Query:  LNKVTIRNKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRTRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR-LCV
        LNK    N YPLP I+ L  +++G+ +F+K+DL+S YH ++VR+ D  K AFR  + + +      YLV   G++ A    ++ +++  G   E   +C 
Subjt:  LNKVTIRNKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRTRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR-LCV

Query:  PSDSAVKTELLSE
          D  + ++  SE
Subjt:  PSDSAVKTELLSE

P0CT35 Transposon Tf2-2 polyprotein5.1e-5632.26Show/hide
Query:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL
        + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C 
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL

Query:  VCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTS
         CQ  K+   KP G LQP+   +  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS
Subjt:  VCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTS

Query:  KFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSRLMGP-------ELV
        + WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      S L  P       E  
Subjt:  KFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSRLMGP-------ELV

Query:  QSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSML
        Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  +    FHVS L
Subjt:  QSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSML

Query:  RKY
         KY
Subjt:  RKY

P0CT35 Transposon Tf2-2 polyprotein1.3e-2230.52Show/hide
Query:  KPEDVPVVKEFLEVFPD-DLSGLP-PDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQ
        +PE   + KEF ++  + +   LP P + +EF +EL      +    Y + P +++ +  ++ + +  G IR S +    P++FV KK+GTLR+ +DY+ 
Subjt:  KPEDVPVVKEFLEVFPD-DLSGLP-PDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQ

Query:  LNKVTIRNKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRTRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR-LCV
        LNK    N YPLP I+ L  +++G+ +F+K+DL+S YH ++VR+ D  K AFR  + + +      YLV   G++ A    ++ +++  G   E   +C 
Subjt:  LNKVTIRNKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRTRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR-LCV

Query:  PSDSAVKTELLSE
          D  + ++  SE
Subjt:  PSDSAVKTELLSE

P0CT36 Transposon Tf2-3 polyprotein5.1e-5632.26Show/hide
Query:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL
        + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C 
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL

Query:  VCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTS
         CQ  K+   KP G LQP+   +  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS
Subjt:  VCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTS

Query:  KFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSRLMGP-------ELV
        + WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      S L  P       E  
Subjt:  KFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSRLMGP-------ELV

Query:  QSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSML
        Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  +    FHVS L
Subjt:  QSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSML

Query:  RKY
         KY
Subjt:  RKY

P0CT36 Transposon Tf2-3 polyprotein1.3e-2230.52Show/hide
Query:  KPEDVPVVKEFLEVFPD-DLSGLP-PDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQ
        +PE   + KEF ++  + +   LP P + +EF +EL      +    Y + P +++ +  ++ + +  G IR S +    P++FV KK+GTLR+ +DY+ 
Subjt:  KPEDVPVVKEFLEVFPD-DLSGLP-PDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQ

Query:  LNKVTIRNKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRTRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR-LCV
        LNK    N YPLP I+ L  +++G+ +F+K+DL+S YH ++VR+ D  K AFR  + + +      YLV   G++ A    ++ +++  G   E   +C 
Subjt:  LNKVTIRNKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRTRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR-LCV

Query:  PSDSAVKTELLSE
          D  + ++  SE
Subjt:  PSDSAVKTELLSE

P0CT41 Transposon Tf2-12 polyprotein5.1e-5632.26Show/hide
Query:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL
        + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C 
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL

Query:  VCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTS
         CQ  K+   KP G LQP+   +  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS
Subjt:  VCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTS

Query:  KFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSRLMGP-------ELV
        + WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      S L  P       E  
Subjt:  KFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSRLMGP-------ELV

Query:  QSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSML
        Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  +    FHVS L
Subjt:  QSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSML

Query:  RKY
         KY
Subjt:  RKY

P0CT41 Transposon Tf2-12 polyprotein1.3e-2230.52Show/hide
Query:  KPEDVPVVKEFLEVFPD-DLSGLP-PDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQ
        +PE   + KEF ++  + +   LP P + +EF +EL      +    Y + P +++ +  ++ + +  G IR S +    P++FV KK+GTLR+ +DY+ 
Subjt:  KPEDVPVVKEFLEVFPD-DLSGLP-PDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQ

Query:  LNKVTIRNKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRTRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR-LCV
        LNK    N YPLP I+ L  +++G+ +F+K+DL+S YH ++VR+ D  K AFR  + + +      YLV   G++ A    ++ +++  G   E   +C 
Subjt:  LNKVTIRNKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRTRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR-LCV

Query:  PSDSAVKTELLSE
          D  + ++  SE
Subjt:  PSDSAVKTELLSE

Q9UR07 Transposon Tf2-11 polyprotein5.1e-5632.26Show/hide
Query:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL
        + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C 
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL

Query:  VCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTS
         CQ  K+   KP G LQP+   +  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS
Subjt:  VCQQVKAPRQKPAGLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTS

Query:  KFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSRLMGP-------ELV
        + WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      S L  P       E  
Subjt:  KFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSRLMGP-------ELV

Query:  QSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSML
        Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  +    FHVS L
Subjt:  QSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSML

Query:  RKY
         KY
Subjt:  RKY

Q9UR07 Transposon Tf2-11 polyprotein4.8e-2231.55Show/hide
Query:  KPEDVPVVKEFLEVFPD-DLSGLP-PDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQ
        +PE   + KEF ++  + +   LP P + +EF +EL      +    Y + P +++ +  ++ + +  G IR S +    P++FV KK+GTLR+ +DY+ 
Subjt:  KPEDVPVVKEFLEVFPD-DLSGLP-PDREIEFTIELLPGTTPISQAPYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQ

Query:  LNKVTIRNKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRTRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSS
        LNK    N YPLP I+ L  +++G+ +F+K+DL+S YH ++VR+ D  K AFR  + + +      YLV   G++ A    ++ +++
Subjt:  LNKVTIRNKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRTRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGTTCAGAGAGATGAGGGAGTTGGGTCCCAAACAGTTAAGCAATCGAGAGTTTCAGTGGTTCCAACAGAGGGCACCAATGGTGCAAGGCAAAAGGGAGTTGTTGG
AAGACCGAGGCAACAGGGAAAAGTCTATGCTATGACTCAACAAGAAGCCGAGGACGCACCAGGCGTTATTACTGTGCAGAGAGAAAAGCTAAAGCCAGAAGATGTTCCTG
TGGTGAAAGAGTTTCTTGAAGTATTTCCAGATGATCTATCAGGTTTGCCACCTGATAGAGAGATTGAGTTCACCATTGAATTATTACCAGGAACAACACCTATTTCACAG
GCACCATATAGAATGGCTCCAAGCGAGCTAAAAGAATTGAAGATGCAGTTACAAGAACTAGTTGACAAGGGTTACATCAGGCCTAGTGTTTCGCCGTGGGGAGCACCAAT
GCTATTTGTGAAAAAGAAAGATGGTACCCTCCGATTATGTATTGACTATAGACAGTTAAACAAGGTTACAATACGTAACAAGTATCCTTTACCATGCATCGATGACTTAT
TTGATCAACTAAGGGGAGCAGCGTTGTTCTCTAAGATTGACTTAAGGTCAGGATACCACCAGTTGAAGGTTAGAGAATCAGATATTGCTAAGACAGCATTTAGAACGAGG
CAAAGGATCATTGATGCTCAGAGTAACGATCCTTATTTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTCATTATCCTCTGATGGTGGACTTTT
GTTTGAGAGACGCCTCTGTGTGCCGTCAGATAGTGCGGTTAAGACAGAATTACTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTATC
AGGACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTGGCAGAATTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAAACCGGCG
GGTTTATTACAACCCTTGAGCATACCGAAATGGAAGTGGGAAAACGTGTCCATGGATTTCATTACAGGACTGCCGAGAACTCTGAGGGGTTTTACAGTGATTTGGGTTGT
GGTGGACAGACTTACTAAATCAGCGCACTTCGTTCCGGGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGCTGTACATGTCGGAGATAGTGAGATTACATGGAGTGC
CAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTCTGGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTAGACTTTAGTACAGCTTTCCATCCACAG
ACTGACGGTCAGACTGAGCGTCTGAACCAAGTTTTAGAGGATATGTTGCGAGCGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTACATTTGATGGAATTTGC
TTATAATAACAGTTATCAGGCTACTATTGGCATGGCACCATTTGAGGCCTTGTACGGCAAATGTTGTAGATCCCGATTGATGGGTCCTGAGTTAGTTCAGTCTACTAACG
AAGCGATACAGAAGATTAGATCACGCATGCATACCGCTCAGAGTAGGCAGAAGAGTTATGCAGATGTGAGGCGGAAGGATCTCGAATTTGAGGTAGGGGACAAGGTGTTC
TTAAAGGTAGCACCTATGAGAGGTGTCTTACGATTTGAAAGGAGGGGAAAGCTGAGTCCCCGTTTTGTTGGGCCATTTGAGATTCTGGAGCGGATTGGCCCTGTAGCTTA
TCGCTTGGCGTTGCCACCATCACTCTCGACAGTTCATGATGTGTTTCATGTTTCTATGTTGAGGAAGTACGTGCCAGATCCATCCCATGTAGTGGATTACGAGCCACTGA
AGATTGATGAAAACTTGAGCTATGCTGAACAACCTGTTGAGGTGCTTGCTAGAGAGGTGAAAACGTTGAGGAATAAAGAAATCCCTCTGGTTAAAGTCTTATGGCGGAAT
CACCGGGTAGAAGAGGCTACATGGGAGCGTGAAGATGACATGAGGTCCCGTTATCCCGATCTGTTCGAGGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGACAGTTCAGAGAGATGAGGGAGTTGGGTCCCAAACAGTTAAGCAATCGAGAGTTTCAGTGGTTCCAACAGAGGGCACCAATGGTGCAAGGCAAAAGGGAGTTGTTGG
AAGACCGAGGCAACAGGGAAAAGTCTATGCTATGACTCAACAAGAAGCCGAGGACGCACCAGGCGTTATTACTGTGCAGAGAGAAAAGCTAAAGCCAGAAGATGTTCCTG
TGGTGAAAGAGTTTCTTGAAGTATTTCCAGATGATCTATCAGGTTTGCCACCTGATAGAGAGATTGAGTTCACCATTGAATTATTACCAGGAACAACACCTATTTCACAG
GCACCATATAGAATGGCTCCAAGCGAGCTAAAAGAATTGAAGATGCAGTTACAAGAACTAGTTGACAAGGGTTACATCAGGCCTAGTGTTTCGCCGTGGGGAGCACCAAT
GCTATTTGTGAAAAAGAAAGATGGTACCCTCCGATTATGTATTGACTATAGACAGTTAAACAAGGTTACAATACGTAACAAGTATCCTTTACCATGCATCGATGACTTAT
TTGATCAACTAAGGGGAGCAGCGTTGTTCTCTAAGATTGACTTAAGGTCAGGATACCACCAGTTGAAGGTTAGAGAATCAGATATTGCTAAGACAGCATTTAGAACGAGG
CAAAGGATCATTGATGCTCAGAGTAACGATCCTTATTTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTCATTATCCTCTGATGGTGGACTTTT
GTTTGAGAGACGCCTCTGTGTGCCGTCAGATAGTGCGGTTAAGACAGAATTACTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTATC
AGGACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTGGCAGAATTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAAACCGGCG
GGTTTATTACAACCCTTGAGCATACCGAAATGGAAGTGGGAAAACGTGTCCATGGATTTCATTACAGGACTGCCGAGAACTCTGAGGGGTTTTACAGTGATTTGGGTTGT
GGTGGACAGACTTACTAAATCAGCGCACTTCGTTCCGGGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGCTGTACATGTCGGAGATAGTGAGATTACATGGAGTGC
CAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTCTGGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTAGACTTTAGTACAGCTTTCCATCCACAG
ACTGACGGTCAGACTGAGCGTCTGAACCAAGTTTTAGAGGATATGTTGCGAGCGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTACATTTGATGGAATTTGC
TTATAATAACAGTTATCAGGCTACTATTGGCATGGCACCATTTGAGGCCTTGTACGGCAAATGTTGTAGATCCCGATTGATGGGTCCTGAGTTAGTTCAGTCTACTAACG
AAGCGATACAGAAGATTAGATCACGCATGCATACCGCTCAGAGTAGGCAGAAGAGTTATGCAGATGTGAGGCGGAAGGATCTCGAATTTGAGGTAGGGGACAAGGTGTTC
TTAAAGGTAGCACCTATGAGAGGTGTCTTACGATTTGAAAGGAGGGGAAAGCTGAGTCCCCGTTTTGTTGGGCCATTTGAGATTCTGGAGCGGATTGGCCCTGTAGCTTA
TCGCTTGGCGTTGCCACCATCACTCTCGACAGTTCATGATGTGTTTCATGTTTCTATGTTGAGGAAGTACGTGCCAGATCCATCCCATGTAGTGGATTACGAGCCACTGA
AGATTGATGAAAACTTGAGCTATGCTGAACAACCTGTTGAGGTGCTTGCTAGAGAGGTGAAAACGTTGAGGAATAAAGAAATCCCTCTGGTTAAAGTCTTATGGCGGAAT
CACCGGGTAGAAGAGGCTACATGGGAGCGTGAAGATGACATGAGGTCCCGTTATCCCGATCTGTTCGAGGAATAA
Protein sequenceShow/hide protein sequence
MTVQRDEGVGSQTVKQSRVSVVPTEGTNGARQKGVVGRPRQQGKVYAMTQQEAEDAPGVITVQREKLKPEDVPVVKEFLEVFPDDLSGLPPDREIEFTIELLPGTTPISQ
APYRMAPSELKELKMQLQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPCIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRTR
QRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPA
GLLQPLSIPKWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQ
TDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVF
LKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLKIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRN
HRVEEATWEREDDMRSRYPDLFEE