; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001270 (gene) of Snake gourd v1 genome

Gene IDTan0001270
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG01:64657737..64659853
RNA-Seq ExpressionTan0001270
SyntenyTan0001270
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]6.2e-15747.51Show/hide
Query:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE
        M EG S REHVL+MM +FN+AEMNGA IDE+SQV+FILE+LP+SFLQFRSN VMNKI+Y+LTTLLNELQ F+SLM I+  + EANVA   R +HRGSTS 
Subjt:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE

Query:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV
        TK + PS    KK K KKG    K + AAA+   K K  A  G  FH N +GHWKRNC K+LA++K    GK DLLV ETCLVE++DSAWI+DSGATNHV
Subjt:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV

Query:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------
        CSSFQGISSW+QL                                                                                       
Subjt:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------

Query:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC
                           T+ KR+ +SPKEN HLWHLR GHINLNRIERLVK+GLLSELEEN LPVCESCL+GK+    F+GKG+RAK+ LEL+HSDLC
Subjt:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC

Query:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------
        GPM++KARGG+EYF++F +DYSRYGY+YLM  K E LEKFKEYK EVEN L                                                 
Subjt:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------

Query:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK
                                V+TAVY+LN VPSKSV ETP K+WNGRKGSL HF IWGCP H+                         GG FYDPK
Subjt:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK

Query:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE
        +N+V VST+A FLEEDH+++H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q    PRRSGRV   P RYM L ET  V  D D E
Subjt:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE

Query:  DPLTYDQAMTDVDKDEWIKVMD
        DPLT+ +AM DVDKDEWIK M+
Subjt:  DPLTYDQAMTDVDKDEWIKVMD

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]6.2e-15747.51Show/hide
Query:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE
        M EG S REHVL+MM +FN+AEMNGA IDE+SQV+FILE+LP+SFLQFRSN VMNKI+Y+LTTLLNELQ F+SLM I+  + EANVA   R +HRGSTS 
Subjt:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE

Query:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV
        TK + PS    KK K KKG    K + AAA+   K K  A  G  FH N +GHWKRNC K+LA++K    GK DLLV ETCLVE++DSAWI+DSGATNHV
Subjt:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV

Query:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------
        CSSFQGISSW+QL                                                                                       
Subjt:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------

Query:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC
                           T+ KR+ +SPKEN HLWHLR GHINLNRIERLVK+GLLSELEEN LPVCESCL+GK+    F+GKG+RAK+ LEL+HSDLC
Subjt:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC

Query:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------
        GPM++KARGG+EYF++F +DYSRYGY+YLM  K E LEKFKEYK EVEN L                                                 
Subjt:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------

Query:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK
                                V+TAVY+LN VPSKSV ETP K+WNGRKGSL HF IWGCP H+                         GG FYDPK
Subjt:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK

Query:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE
        +N+V VST+A FLEEDH+++H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q    PRRSGRV   P RYM L ET  V  D D E
Subjt:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE

Query:  DPLTYDQAMTDVDKDEWIKVMD
        DPLT+ +AM DVDKDEWIK M+
Subjt:  DPLTYDQAMTDVDKDEWIKVMD

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]6.2e-15747.51Show/hide
Query:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE
        M EG S REHVL+MM +FN+AEMNGA IDE+SQV+FILE+LP+SFLQFRSN VMNKI+Y+LTTLLNELQ F+SLM I+  + EANVA   R +HRGSTS 
Subjt:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE

Query:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV
        TK + PS    KK K KKG    K + AAA+   K K  A  G  FH N +GHWKRNC K+LA++K    GK DLLV ETCLVE++DSAWI+DSGATNHV
Subjt:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV

Query:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------
        CSSFQGISSW+QL                                                                                       
Subjt:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------

Query:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC
                           T+ KR+ +SPKEN HLWHLR GHINLNRIERLVK+GLLSELEEN LPVCESCL+GK+    F+GKG+RAK+ LEL+HSDLC
Subjt:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC

Query:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------
        GPM++KARGG+EYF++F +DYSRYGY+YLM  K E LEKFKEYK EVEN L                                                 
Subjt:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------

Query:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK
                                V+TAVY+LN VPSKSV ETP K+WNGRKGSL HF IWGCP H+                         GG FYDPK
Subjt:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK

Query:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE
        +N+V VST+A FLEEDH+++H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q    PRRSGRV   P RYM L ET  V  D D E
Subjt:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE

Query:  DPLTYDQAMTDVDKDEWIKVMD
        DPLT+ +AM DVDKDEWIK M+
Subjt:  DPLTYDQAMTDVDKDEWIKVMD

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]6.2e-15747.51Show/hide
Query:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE
        M EG S REHVL+MM +FN+AEMNGA IDE+SQV+FILE+LP+SFLQFRSN VMNKI+Y+LTTLLNELQ F+SLM I+  + EANVA   R +HRGSTS 
Subjt:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE

Query:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV
        TK + PS    KK K KKG    K + AAA+   K K  A  G  FH N +GHWKRNC K+LA++K    GK DLLV ETCLVE++DSAWI+DSGATNHV
Subjt:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV

Query:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------
        CSSFQGISSW+QL                                                                                       
Subjt:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------

Query:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC
                           T+ KR+ +SPKEN HLWHLR GHINLNRIERLVK+GLLSELEEN LPVCESCL+GK+    F+GKG+RAK+ LEL+HSDLC
Subjt:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC

Query:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------
        GPM++KARGG+EYF++F +DYSRYGY+YLM  K E LEKFKEYK EVEN L                                                 
Subjt:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------

Query:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK
                                V+TAVY+LN VPSKSV ETP K+WNGRKGSL HF IWGCP H+                         GG FYDPK
Subjt:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK

Query:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE
        +N+V VST+A FLEEDH+++H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q    PRRSGRV   P RYM L ET  V  D D E
Subjt:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE

Query:  DPLTYDQAMTDVDKDEWIKVMD
        DPLT+ +AM DVDKDEWIK M+
Subjt:  DPLTYDQAMTDVDKDEWIKVMD

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]6.2e-15747.51Show/hide
Query:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE
        M EG S REHVL+MM +FN+AEMNGA IDE+SQV+FILE+LP+SFLQFRSN VMNKI+Y+LTTLLNELQ F+SLM I+  + EANVA   R +HRGSTS 
Subjt:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE

Query:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV
        TK + PS    KK K KKG    K + AAA+   K K  A  G  FH N +GHWKRNC K+LA++K    GK DLLV ETCLVE++DSAWI+DSGATNHV
Subjt:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV

Query:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------
        CSSFQGISSW+QL                                                                                       
Subjt:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------

Query:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC
                           T+ KR+ +SPKEN HLWHLR GHINLNRIERLVK+GLLSELEEN LPVCESCL+GK+    F+GKG+RAK+ LEL+HSDLC
Subjt:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC

Query:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------
        GPM++KARGG+EYF++F +DYSRYGY+YLM  K E LEKFKEYK EVEN L                                                 
Subjt:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------

Query:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK
                                V+TAVY+LN VPSKSV ETP K+WNGRKGSL HF IWGCP H+                         GG FYDPK
Subjt:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK

Query:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE
        +N+V VST+A FLEEDH+++H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q    PRRSGRV   P RYM L ET  V  D D E
Subjt:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE

Query:  DPLTYDQAMTDVDKDEWIKVMD
        DPLT+ +AM DVDKDEWIK M+
Subjt:  DPLTYDQAMTDVDKDEWIKVMD

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein3.0e-15747.51Show/hide
Query:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE
        M EG S REHVL+MM +FN+AEMNGA IDE+SQV+FILE+LP+SFLQFRSN VMNKI+Y+LTTLLNELQ F+SLM I+  + EANVA   R +HRGSTS 
Subjt:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE

Query:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV
        TK + PS    KK K KKG    K + AAA+   K K  A  G  FH N +GHWKRNC K+LA++K    GK DLLV ETCLVE++DSAWI+DSGATNHV
Subjt:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV

Query:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------
        CSSFQGISSW+QL                                                                                       
Subjt:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------

Query:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC
                           T+ KR+ +SPKEN HLWHLR GHINLNRIERLVK+GLLSELEEN LPVCESCL+GK+    F+GKG+RAK+ LEL+HSDLC
Subjt:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC

Query:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------
        GPM++KARGG+EYF++F +DYSRYGY+YLM  K E LEKFKEYK EVEN L                                                 
Subjt:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------

Query:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK
                                V+TAVY+LN VPSKSV ETP K+WNGRKGSL HF IWGCP H+                         GG FYDPK
Subjt:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK

Query:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE
        +N+V VST+A FLEEDH+++H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q    PRRSGRV   P RYM L ET  V  D D E
Subjt:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE

Query:  DPLTYDQAMTDVDKDEWIKVMD
        DPLT+ +AM DVDKDEWIK M+
Subjt:  DPLTYDQAMTDVDKDEWIKVMD

A0A5A7TWB9 Gag/pol protein3.0e-15747.51Show/hide
Query:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE
        M EG S REHVL+MM +FN+AEMNGA IDE+SQV+FILE+LP+SFLQFRSN VMNKI+Y+LTTLLNELQ F+SLM I+  + EANVA   R +HRGSTS 
Subjt:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE

Query:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV
        TK + PS    KK K KKG    K + AAA+   K K  A  G  FH N +GHWKRNC K+LA++K    GK DLLV ETCLVE++DSAWI+DSGATNHV
Subjt:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV

Query:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------
        CSSFQGISSW+QL                                                                                       
Subjt:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------

Query:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC
                           T+ KR+ +SPKEN HLWHLR GHINLNRIERLVK+GLLSELEEN LPVCESCL+GK+    F+GKG+RAK+ LEL+HSDLC
Subjt:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC

Query:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------
        GPM++KARGG+EYF++F +DYSRYGY+YLM  K E LEKFKEYK EVEN L                                                 
Subjt:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------

Query:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK
                                V+TAVY+LN VPSKSV ETP K+WNGRKGSL HF IWGCP H+                         GG FYDPK
Subjt:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK

Query:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE
        +N+V VST+A FLEEDH+++H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q    PRRSGRV   P RYM L ET  V  D D E
Subjt:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE

Query:  DPLTYDQAMTDVDKDEWIKVMD
        DPLT+ +AM DVDKDEWIK M+
Subjt:  DPLTYDQAMTDVDKDEWIKVMD

A0A5A7V4M1 Gag/pol protein3.0e-15747.51Show/hide
Query:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE
        M EG S REHVL+MM +FN+AEMNGA IDE+SQV+FILE+LP+SFLQFRSN VMNKI+Y+LTTLLNELQ F+SLM I+  + EANVA   R +HRGSTS 
Subjt:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE

Query:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV
        TK + PS    KK K KKG    K + AAA+   K K  A  G  FH N +GHWKRNC K+LA++K    GK DLLV ETCLVE++DSAWI+DSGATNHV
Subjt:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV

Query:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------
        CSSFQGISSW+QL                                                                                       
Subjt:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------

Query:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC
                           T+ KR+ +SPKEN HLWHLR GHINLNRIERLVK+GLLSELEEN LPVCESCL+GK+    F+GKG+RAK+ LEL+HSDLC
Subjt:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC

Query:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------
        GPM++KARGG+EYF++F +DYSRYGY+YLM  K E LEKFKEYK EVEN L                                                 
Subjt:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------

Query:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK
                                V+TAVY+LN VPSKSV ETP K+WNGRKGSL HF IWGCP H+                         GG FYDPK
Subjt:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK

Query:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE
        +N+V VST+A FLEEDH+++H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q    PRRSGRV   P RYM L ET  V  D D E
Subjt:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE

Query:  DPLTYDQAMTDVDKDEWIKVMD
        DPLT+ +AM DVDKDEWIK M+
Subjt:  DPLTYDQAMTDVDKDEWIKVMD

A0A5D3CPJ6 Gag/pol protein3.0e-15747.51Show/hide
Query:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE
        M EG S REHVL+MM +FN+AEMNGA IDE+SQV+FILE+LP+SFLQFRSN VMNKI+Y+LTTLLNELQ F+SLM I+  + EANVA   R +HRGSTS 
Subjt:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE

Query:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV
        TK + PS    KK K KKG    K + AAA+   K K  A  G  FH N +GHWKRNC K+LA++K    GK DLLV ETCLVE++DSAWI+DSGATNHV
Subjt:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV

Query:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------
        CSSFQGISSW+QL                                                                                       
Subjt:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------

Query:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC
                           T+ KR+ +SPKEN HLWHLR GHINLNRIERLVK+GLLSELEEN LPVCESCL+GK+    F+GKG+RAK+ LEL+HSDLC
Subjt:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC

Query:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------
        GPM++KARGG+EYF++F +DYSRYGY+YLM  K E LEKFKEYK EVEN L                                                 
Subjt:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------

Query:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK
                                V+TAVY+LN VPSKSV ETP K+WNGRKGSL HF IWGCP H+                         GG FYDPK
Subjt:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK

Query:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE
        +N+V VST+A FLEEDH+++H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q    PRRSGRV   P RYM L ET  V  D D E
Subjt:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE

Query:  DPLTYDQAMTDVDKDEWIKVMD
        DPLT+ +AM DVDKDEWIK M+
Subjt:  DPLTYDQAMTDVDKDEWIKVMD

A0A5D3CSZ6 Gag/pol protein3.0e-15747.51Show/hide
Query:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE
        M EG S REHVL+MM +FN+AEMNGA IDE+SQV+FILE+LP+SFLQFRSN VMNKI+Y+LTTLLNELQ F+SLM I+  + EANVA   R +HRGSTS 
Subjt:  MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVA--YRSYHRGSTSE

Query:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV
        TK + PS    KK K KKG    K + AAA+   K K  A  G  FH N +GHWKRNC K+LA++K    GK DLLV ETCLVE++DSAWI+DSGATNHV
Subjt:  TKLIAPSHPKGKKKKMKKG----KVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNT--GKCDLLVTETCLVESNDSAWILDSGATNHV

Query:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------
        CSSFQGISSW+QL                                                                                       
Subjt:  CSSFQGISSWQQLREE------------------------------------------------------------------------------------

Query:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC
                           T+ KR+ +SPKEN HLWHLR GHINLNRIERLVK+GLLSELEEN LPVCESCL+GK+    F+GKG+RAK+ LEL+HSDLC
Subjt:  -------------------TRTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKL----FSGKGYRAKDLLELIHSDLC

Query:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------
        GPM++KARGG+EYF++F +DYSRYGY+YLM  K E LEKFKEYK EVEN L                                                 
Subjt:  GPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKIEVENLL-------------------------------------------------

Query:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK
                                V+TAVY+LN VPSKSV ETP K+WNGRKGSL HF IWGCP H+                         GG FYDPK
Subjt:  ------------------------VETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHL-------------------------GGLFYDPK

Query:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE
        +N+V VST+A FLEEDH+++H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q    PRRSGRV   P RYM L ET  V  D D E
Subjt:  ENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMD----SISARVADGASTSTSVVDPSTSSQV-RSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCE

Query:  DPLTYDQAMTDVDKDEWIKVMD
        DPLT+ +AM DVDKDEWIK M+
Subjt:  DPLTYDQAMTDVDKDEWIKVMD

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.8e-1029.01Show/hide
Query:  VNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELE-----ENYLPVCESCLDGKLFSGKGYRAKDL------LELIHSDLCGPMSLKARGGYEYFVSF
        +N   K N  LWH RFGHI+  ++  + +  + S+       E    +CE CL+GK       + KD       L ++HSD+CGP++        YFV F
Subjt:  VNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELE-----ENYLPVCESCLDGKLFSGKGYRAKDL------LELIHSDLCGPMSLKARGGYEYFVSF

Query:  INDYSRYGYIYLMHRKFETLEKFKEYKIEVE
        ++ ++ Y   YL+  K +    F+++  + E
Subjt:  INDYSRYGYIYLMHRKFETLEKFKEYKIEVE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.8e-1630.72Show/hide
Query:  NVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGK----LFSGKGYRAKDLLELIHSDLCGPMSLKARGGYEYFVSFINDYSRYGYIYLMH
        +V LWH R GH++   ++ L K  L+S  +   +  C+ CL GK     F     R  ++L+L++SD+CGPM +++ GG +YFV+FI+D SR  ++Y++ 
Subjt:  NVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGK----LFSGKGYRAKDLLELIHSDLCGPMSLKARGGYEYFVSFINDYSRYGYIYLMH

Query:  RKFETLEKFKEYKIEVENLLVETA--VYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHLG
         K +  + F+++   VE    ET   +  L +          F+ +    G  H   + G P H G
Subjt:  RKFETLEKFKEYKIEVENLLVETA--VYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHLG

P25384 Transposon Ty2-C Gag-Pol polyprotein6.7e-0526.98Show/hide
Query:  RTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLP-------VCESCLDGKLFSG---KGYRAK-----DLLELIHSDLCGPMSLKARG
        ++K VN  P     L H   GH N   I++ +K   ++ L+E+ +         C  CL GK       KG R K     +  + +H+D+ GP+    + 
Subjt:  RTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLP-------VCESCLDGKLFSG---KGYRAK-----DLLELIHSDLCGPMSLKARG

Query:  GYEYFVSFINDYSRYGYIYLMHRKFE
           YF+SF ++ +R+ ++Y +H + E
Subjt:  GYEYFVSFINDYSRYGYIYLMHRKFE

Q12472 Transposon Ty2-DR1 Gag-Pol polyprotein6.7e-0526.98Show/hide
Query:  RTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLP-------VCESCLDGKLFSG---KGYRAK-----DLLELIHSDLCGPMSLKARG
        ++K VN  P     L H   GH N   I++ +K   ++ L+E+ +         C  CL GK       KG R K     +  + +H+D+ GP+    + 
Subjt:  RTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLP-------VCESCLDGKLFSG---KGYRAK-----DLLELIHSDLCGPMSLKARG

Query:  GYEYFVSFINDYSRYGYIYLMHRKFE
           YF+SF ++ +R+ ++Y +H + E
Subjt:  GYEYFVSFINDYSRYGYIYLMHRKFE

Q12491 Transposon Ty2-B Gag-Pol polyprotein6.7e-0526.98Show/hide
Query:  RTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLP-------VCESCLDGKLFSG---KGYRAK-----DLLELIHSDLCGPMSLKARG
        ++K VN  P     L H   GH N   I++ +K   ++ L+E+ +         C  CL GK       KG R K     +  + +H+D+ GP+    + 
Subjt:  RTKRVNVSPKENVHLWHLRFGHINLNRIERLVKSGLLSELEENYLP-------VCESCLDGKLFSG---KGYRAK-----DLLELIHSDLCGPMSLKARG

Query:  GYEYFVSFINDYSRYGYIYLMHRKFE
           YF+SF ++ +R+ ++Y +H + E
Subjt:  GYEYFVSFINDYSRYGYIYLMHRKFE

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGAAGGGACGTCTGCTCGAGAACACGTTCTAGACATGATGACATACTTTAATCTAGCGGAGATGAACGGTGCTTCGATCGATGAGTCGAGCCAGGTCAACTTCAT
CTTGGAGACTCTTCCGAAGAGTTTCCTTCAGTTTCGTAGCAATGTTGTTATGAACAAAATTAGCTACTCACTGACCACCCTTCTCAATGAGCTACAGAACTTCCAATCCT
TGATGATGATCAGGACACCGGAAGCTGAGGCAAATGTTGCCTACAGGTCCTATCACAGGGGTTCGACCTCTGAGACAAAACTTATAGCTCCATCACATCCGAAAGGGAAG
AAGAAGAAGATGAAGAAGGGTAAAGTTGATCGTGCTGCCGCTCAAAAGGGAATAAAGGTCAAGGAAGTTGCAGAAAATGGAAAGAGTTTCCACTTCAATGGGGACGGGCA
CTGGAAGCGAAACTGTCTCAAGTTTCTTGCCGACAGGAAGAATACAGGTAAATGTGATTTACTAGTAACTGAAACCTGTTTAGTGGAGAGTAATGACTCTGCCTGGATAT
TAGATTCGGGCGCCACTAACCACGTTTGTTCTTCTTTTCAAGGAATTAGTTCCTGGCAACAGCTGAGAGAGGAAACACGAACTAAAAGAGTAAACGTTTCCCCAAAAGAA
AATGTCCATCTTTGGCATCTACGGTTTGGCCACATTAATCTCAATAGGATTGAGAGACTAGTGAAGAGTGGACTTCTAAGCGAGTTGGAAGAAAATTATTTACCGGTGTG
TGAGTCATGCCTCGATGGCAAATTATTTAGTGGAAAAGGATATAGAGCTAAAGATCTCCTTGAGCTTATACATTCTGACCTCTGTGGTCCGATGAGTTTAAAAGCACGAG
GTGGTTACGAATACTTTGTATCTTTTATAAATGACTATTCAAGGTATGGGTATATTTACCTAATGCATAGGAAGTTTGAAACTCTTGAAAAGTTCAAGGAGTACAAGATT
GAGGTTGAGAACCTGTTAGTGGAGACTGCGGTCTACGTTTTGAACAATGTTCCGTCGAAGAGTGTTTGTGAAACACCTTTCAAAGTCTGGAATGGACGTAAAGGCAGTTT
ACACCATTTCATAATTTGGGGATGCCCGACCCACCTGGGTGGTCTATTTTACGATCCTAAGGAAAATAGGGTGCTTGTGTCGACAGATGCCATTTTCCTTGAGGAAGACC
ACGTCAAGGATCATTTGCCTAGAAGTAAAATCGTGTTGAACGAAATGGACAGTATATCAGCAAGAGTTGCTGATGGGGCTAGTACATCAACAAGTGTTGTTGATCCTAGC
ACGTCTAGTCAAGTTCGCTCTCAAGAGTTTGGAATGCCTCGACGTAGTGGAAGGGTTGTGAGACAACCTGAACGTTACATGGGTTTAGCTGAAACCCAAGTCGTCACTCC
TGATGATGACTGCGAGGATCCATTGACCTATGATCAGGCAATGACAGACGTTGACAAGGACGAATGGATTAAAGTTATGGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGAAGGGACGTCTGCTCGAGAACACGTTCTAGACATGATGACATACTTTAATCTAGCGGAGATGAACGGTGCTTCGATCGATGAGTCGAGCCAGGTCAACTTCAT
CTTGGAGACTCTTCCGAAGAGTTTCCTTCAGTTTCGTAGCAATGTTGTTATGAACAAAATTAGCTACTCACTGACCACCCTTCTCAATGAGCTACAGAACTTCCAATCCT
TGATGATGATCAGGACACCGGAAGCTGAGGCAAATGTTGCCTACAGGTCCTATCACAGGGGTTCGACCTCTGAGACAAAACTTATAGCTCCATCACATCCGAAAGGGAAG
AAGAAGAAGATGAAGAAGGGTAAAGTTGATCGTGCTGCCGCTCAAAAGGGAATAAAGGTCAAGGAAGTTGCAGAAAATGGAAAGAGTTTCCACTTCAATGGGGACGGGCA
CTGGAAGCGAAACTGTCTCAAGTTTCTTGCCGACAGGAAGAATACAGGTAAATGTGATTTACTAGTAACTGAAACCTGTTTAGTGGAGAGTAATGACTCTGCCTGGATAT
TAGATTCGGGCGCCACTAACCACGTTTGTTCTTCTTTTCAAGGAATTAGTTCCTGGCAACAGCTGAGAGAGGAAACACGAACTAAAAGAGTAAACGTTTCCCCAAAAGAA
AATGTCCATCTTTGGCATCTACGGTTTGGCCACATTAATCTCAATAGGATTGAGAGACTAGTGAAGAGTGGACTTCTAAGCGAGTTGGAAGAAAATTATTTACCGGTGTG
TGAGTCATGCCTCGATGGCAAATTATTTAGTGGAAAAGGATATAGAGCTAAAGATCTCCTTGAGCTTATACATTCTGACCTCTGTGGTCCGATGAGTTTAAAAGCACGAG
GTGGTTACGAATACTTTGTATCTTTTATAAATGACTATTCAAGGTATGGGTATATTTACCTAATGCATAGGAAGTTTGAAACTCTTGAAAAGTTCAAGGAGTACAAGATT
GAGGTTGAGAACCTGTTAGTGGAGACTGCGGTCTACGTTTTGAACAATGTTCCGTCGAAGAGTGTTTGTGAAACACCTTTCAAAGTCTGGAATGGACGTAAAGGCAGTTT
ACACCATTTCATAATTTGGGGATGCCCGACCCACCTGGGTGGTCTATTTTACGATCCTAAGGAAAATAGGGTGCTTGTGTCGACAGATGCCATTTTCCTTGAGGAAGACC
ACGTCAAGGATCATTTGCCTAGAAGTAAAATCGTGTTGAACGAAATGGACAGTATATCAGCAAGAGTTGCTGATGGGGCTAGTACATCAACAAGTGTTGTTGATCCTAGC
ACGTCTAGTCAAGTTCGCTCTCAAGAGTTTGGAATGCCTCGACGTAGTGGAAGGGTTGTGAGACAACCTGAACGTTACATGGGTTTAGCTGAAACCCAAGTCGTCACTCC
TGATGATGACTGCGAGGATCCATTGACCTATGATCAGGCAATGACAGACGTTGACAAGGACGAATGGATTAAAGTTATGGACTAG
Protein sequenceShow/hide protein sequence
MKEGTSAREHVLDMMTYFNLAEMNGASIDESSQVNFILETLPKSFLQFRSNVVMNKISYSLTTLLNELQNFQSLMMIRTPEAEANVAYRSYHRGSTSETKLIAPSHPKGK
KKKMKKGKVDRAAAQKGIKVKEVAENGKSFHFNGDGHWKRNCLKFLADRKNTGKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGISSWQQLREETRTKRVNVSPKE
NVHLWHLRFGHINLNRIERLVKSGLLSELEENYLPVCESCLDGKLFSGKGYRAKDLLELIHSDLCGPMSLKARGGYEYFVSFINDYSRYGYIYLMHRKFETLEKFKEYKI
EVENLLVETAVYVLNNVPSKSVCETPFKVWNGRKGSLHHFIIWGCPTHLGGLFYDPKENRVLVSTDAIFLEEDHVKDHLPRSKIVLNEMDSISARVADGASTSTSVVDPS
TSSQVRSQEFGMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYDQAMTDVDKDEWIKVMD