CuGenDBv2

CSPI06G15220 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI06G15220
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Chr6:13395424..13398822
RNA-Seq Expression: CSPI06G15220
Synteny: CSPI06G15220
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
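
The genome location in this record is written as chromosome:start..end. The following is a minimal Python sketch of splitting that string and computing the length of the span. The function name parse_location is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("Chr6:13395424..13398822")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 3399 bp on Chr6, which is longer than the 2460 nt CDS listed under Sequences.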


Homology
GenBank top hits (e value, % identity, alignment)
KAA0041601.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 3.2e-219; % identity: 66.03)
Query:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI
        DKE+D+KFIERLE+WDSKNHQII WL NTSIPAIH QFDAF++                                    SVNEYLAVLQPIWTQLDQA+I
Subjt:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI

Query:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH
        +KDHLRLIKVLM LRPEYESVRAALLHRSPLPSLDAAIQEILFEE+ LGIN +K  D  L STY+    S+ FCKNCKL GHKFIN PKIECRYCHK GH
Subjt:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH

Query:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI
        ILDNCP +PP+P   ST+ KNFTKP  SS      D S++   +ISDLQ LLNQ+ISS SSAL VS GNRWLLDS CCNHMTSD+SLM+T SPTKSLPPI
Subjt:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI

Query:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------
        YAADG                                              S +  +  D         G + G++                        
Subjt:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------

Query:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE
        WHLRLGHAS EKLRHLIS+NNLN++TKFVPFNCLNCKLAKQ ALSFS S S CDKPFDL+HSDIWGPAP +TVHGYRYYVLFIDD+SRFTWIYFLKHRSE
Subjt:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE

Query:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL
        LSRTYIEFANMIRTQFS PIK LRTDNALEYKDS LLSFLSQQGT+VQRSCPHTSQQNGRAERKHRHILDS+ ALLLSASCP+KFWGEAALTSVYTINRL
Subjt:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL

Query:  PSSVLQNTSPFERLYGISPDYSNLKDF
        PSSVLQN SPFERLYG  P+YSNLK F
Subjt:  PSSVLQNTSPFERLYGISPDYSNLKDF

TYK15021.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 1.7e-220; % identity: 66.35)
Query:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI
        DKE+D+KFIERLE+WDSKNHQII WL NTSIPAIH QFDAFD+                                    SVNEYLAVLQPIWTQLDQA+I
Subjt:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI

Query:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH
        +KDHLRLIKVLM LRPEYESVRAALLHRSPLPSLDAAIQEILFEE+ LGIN +K  DV L STY+    S+ FCKNCKL GHKFIN PKIECRYCHK GH
Subjt:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH

Query:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI
        ILDNCP +PP+P   ST+ KNFTKP  SS      D+S+    +ISDLQ LLNQ+ISS SSAL VS GNRWLLDS CCNHMTSD+SLM+T SPTKSLPPI
Subjt:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI

Query:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------
        YAADG                                              S +  +  D         G + G++                        
Subjt:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------

Query:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE
        WHLRLGHAS EKLRHLIS+NNLN++TKFVPFNCLNCKLAKQ ALSFS+S S CDKPFDL+HSDIWGPAP TTVHGYRYYVLFIDD+SRFTWIYFLKHRSE
Subjt:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE

Query:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL
        LSRTYIEFANMIRTQFS PIK LRTDNALEYKDS LLSFLSQQGT+VQRSCPHTSQQNGRAERKHRHILDS+ ALLLSASCP+KFWGEAALTSVYTINRL
Subjt:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL

Query:  PSSVLQNTSPFERLYGISPDYSNLKDF
        PSSVLQN SPFE+LYG  P+YSNLK F
Subjt:  PSSVLQNTSPFERLYGISPDYSNLKDF

TYK19656.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 3.2e-219; % identity: 66.03)
Query:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI
        DKE+D+KFIERLE+WDSKNHQII WL NTSIPAIH QFDAF++                                    SVNEYLAVLQPIWTQLDQA+I
Subjt:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI

Query:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH
        +KDHLRLIKVLM LRPEYESVRAALLHRSPLPSLDAAIQEILFEE+ LGIN +K  D  L STY+    S+ FCKNCKL GHKFIN PKIECRYCHK GH
Subjt:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH

Query:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI
        ILDNCP +PP+P   ST+ KNFTKP  SS      D S++   +ISDLQ LLNQ+ISS SSAL VS GNRWLLDS CCNHMTSD+SLM+T SPTKSLPPI
Subjt:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI

Query:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------
        YAADG                                              S +  +  D         G + G++                        
Subjt:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------

Query:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE
        WHLRLGHAS EKLRHLIS+NNLN++TKFVPFNCLNCKLAKQ ALSFS S S CDKPFDL+HSDIWGPAP +TVHGYRYYVLFIDD+SRFTWIYFLKHRSE
Subjt:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE

Query:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL
        LSRTYIEFANMIRTQFS PIK LRTDNALEYKDS LLSFLSQQGT+VQRSCPHTSQQNGRAERKHRHILDS+ ALLLSASCP+KFWGEAALTSVYTINRL
Subjt:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL

Query:  PSSVLQNTSPFERLYGISPDYSNLKDF
        PSSVLQN SPFERLYG  P+YSNLK F
Subjt:  PSSVLQNTSPFERLYGISPDYSNLKDF

TYK26360.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 3.2e-219; % identity: 66.03)
Query:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI
        DKE+D+KFIERLE+WDSKNHQII WL NTSIPAIH QFDAF++                                    SVNEYLAVLQPIWTQLDQA+I
Subjt:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI

Query:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH
        +KDHLRLIKVLM LRPEYESVRAALLHRSPLPSLDAAIQEILFEE+ LGIN +K  D  L STY+    S+ FCKNCKL GHKFIN PKIECRYCHK GH
Subjt:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH

Query:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI
        ILDNCP +PP+P   ST+ KNFTKP  SS      D S++   +ISDLQ LLNQ+ISS SSAL VS GNRWLLDS CCNHMTSD+SLM+T SPTKSLPPI
Subjt:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI

Query:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------
        YAADG                                              S +  +  D         G + G++                        
Subjt:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------

Query:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE
        WHLRLGHAS EKLRHLIS+NNLN++TKFVPFNCLNCKLAKQ ALSFS S S CDKPFDL+HSDIWGPAP +TVHGYRYYVLFIDD+SRFTWIYFLKHRSE
Subjt:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE

Query:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL
        LSRTYIEFANMIRTQFS PIK LRTDNALEYKDS LLSFLSQQGT+VQRSCPHTSQQNGRAERKHRHILDS+ ALLLSASCP+KFWGEAALTSVYTINRL
Subjt:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL

Query:  PSSVLQNTSPFERLYGISPDYSNLKDF
        PSSVLQN SPFERLYG  P+YSNLK F
Subjt:  PSSVLQNTSPFERLYGISPDYSNLKDF

XP_031745067.1 uncharacterized protein LOC116405245 [Cucumis sativus] (e value: 1.3e-233; % identity: 68.32)
Query:  PLDKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQA
        P  KE DN FIERLEDWDSKNHQII WLGNTSIPAIHAQFDAFDD                                    SVNEYLAVLQPIWTQLDQA
Subjt:  PLDKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQA

Query:  SINKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKH
        +I+KDHLRLIKVLM LRPEYESVRAALLHR+PLPSLDAAIQEIL                    TYT    +NMFCKNCKL GHKF N PKIECRYCHKH
Subjt:  SINKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKH

Query:  GHILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSLIEISDLQILLNQIISSSSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPIY
        GHILDNCPTRPP+P GTSTKEK FTK G SSVVAATSDDSSLI+ISDLQ LLNQ+ISSSSAL+               H   D     T    + +  ++
Subjt:  GHILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSLIEISDLQILLNQIISSSSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPIY

Query:  -------AADGSGSADETDDWNGSQSGKIWHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPIT
               ++  S SA  TD          WHLRLGHASSEKLRHLISVNNL NLTKFVPFNCLNCKLAKQ ALSFSQSISNCDKPFDLVHSDIWGPAPIT
Subjt:  -------AADGSGSADETDDWNGSQSGKIWHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPIT

Query:  TVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDS
        TVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDS
Subjt:  TVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDS

Query:  ICALLLSASCPKKFWGEAALTSVYTINRLPSSVLQNTSPFERLYGISPDYSNLKDF---VVGTLFPTDSG----------YLG-----------------
        + ALLLSASCP+KFWGEAALTSVYTINRLPSSVLQNTSPFE+LYGISPDYS LK F       L P +            +LG                 
Subjt:  ICALLLSASCPKKFWGEAALTSVYTINRLPSSVLQNTSPFERLYGISPDYSNLKDF---VVGTLFPTDSG----------YLG-----------------

Query:  ------MSLFGNTLCSLVCPPSTPLSLVLILSLPIHLLTFFLSLNPPWILSLHNLHLLLRIRIHHLSSMMFLNLHLLLLFVALPG
              ++ + +T+ S +    T  SLVL LSL IHLLTFFLSLNPPWILSLHNLHLLL+I IH LS MMFLN HLLLLFVALPG
Subjt:  ------MSLFGNTLCSLVCPPSTPLSLVLILSLPIHLLTFFLSLNPPWILSLHNLHLLLRIRIHHLSSMMFLNLHLLLLFVALPG

TrEMBL top hits (e value, % identity, alignment)
A0A5A7VIT8 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.5e-219; % identity: 66.03)
Query:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI
        DKE+D+KFIERLE+WDSKNHQII WL NTSIPAIH QFDAF++                                    SVNEYLAVLQPIWTQLDQA+I
Subjt:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI

Query:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH
        +KDHLRLIKVLM LRPEYESVRAALLHRSPLPSLDAAIQEILFEE+ LGIN +K  D  L STY+    S+ FCKNCKL GHKFIN PKIECRYCHK GH
Subjt:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH

Query:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI
        ILDNCP +PP+P   ST+ KNFTKP  SS      D S++   +ISDLQ LLNQ+ISS SSAL VS GNRWLLDS CCNHMTSD+SLM+T SPTKSLPPI
Subjt:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI

Query:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------
        YAADG                                              S +  +  D         G + G++                        
Subjt:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------

Query:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE
        WHLRLGHAS EKLRHLIS+NNLN++TKFVPFNCLNCKLAKQ ALSFS S S CDKPFDL+HSDIWGPAP +TVHGYRYYVLFIDD+SRFTWIYFLKHRSE
Subjt:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE

Query:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL
        LSRTYIEFANMIRTQFS PIK LRTDNALEYKDS LLSFLSQQGT+VQRSCPHTSQQNGRAERKHRHILDS+ ALLLSASCP+KFWGEAALTSVYTINRL
Subjt:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL

Query:  PSSVLQNTSPFERLYGISPDYSNLKDF
        PSSVLQN SPFERLYG  P+YSNLK F
Subjt:  PSSVLQNTSPFERLYGISPDYSNLKDF

A0A5D3C0D7 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.5e-219; % identity: 66.03)
Query:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI
        DKE+D+KFIERLE+WDSKNHQII WL NTSIPAIH QFDAF++                                    SVNEYLAVLQPIWTQLDQA+I
Subjt:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI

Query:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH
        +KDHLRLIKVLM LRPEYESVRAALLHRSPLPSLDAAIQEILFEE+ LGIN +K  D  L STY+    S+ FCKNCKL GHKFIN PKIECRYCHK GH
Subjt:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH

Query:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI
        ILDNCP +PP+P   ST+ KNFTKP  SS      D S++   +ISDLQ LLNQ+ISS SSAL VS GNRWLLDS CCNHMTSD+SLM+T SPTKSLPPI
Subjt:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI

Query:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------
        YAADG                                              S +  +  D         G + G++                        
Subjt:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------

Query:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE
        WHLRLGHAS EKLRHLIS+NNLN++TKFVPFNCLNCKLAKQ ALSFS S S CDKPFDL+HSDIWGPAP +TVHGYRYYVLFIDD+SRFTWIYFLKHRSE
Subjt:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE

Query:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL
        LSRTYIEFANMIRTQFS PIK LRTDNALEYKDS LLSFLSQQGT+VQRSCPHTSQQNGRAERKHRHILDS+ ALLLSASCP+KFWGEAALTSVYTINRL
Subjt:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL

Query:  PSSVLQNTSPFERLYGISPDYSNLKDF
        PSSVLQN SPFERLYG  P+YSNLK F
Subjt:  PSSVLQNTSPFERLYGISPDYSNLKDF

A0A5D3CT53 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 8.1e-221; % identity: 66.35)
Query:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI
        DKE+D+KFIERLE+WDSKNHQII WL NTSIPAIH QFDAFD+                                    SVNEYLAVLQPIWTQLDQA+I
Subjt:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI

Query:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH
        +KDHLRLIKVLM LRPEYESVRAALLHRSPLPSLDAAIQEILFEE+ LGIN +K  DV L STY+    S+ FCKNCKL GHKFIN PKIECRYCHK GH
Subjt:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH

Query:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI
        ILDNCP +PP+P   ST+ KNFTKP  SS      D+S+    +ISDLQ LLNQ+ISS SSAL VS GNRWLLDS CCNHMTSD+SLM+T SPTKSLPPI
Subjt:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI

Query:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------
        YAADG                                              S +  +  D         G + G++                        
Subjt:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------

Query:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE
        WHLRLGHAS EKLRHLIS+NNLN++TKFVPFNCLNCKLAKQ ALSFS+S S CDKPFDL+HSDIWGPAP TTVHGYRYYVLFIDD+SRFTWIYFLKHRSE
Subjt:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE

Query:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL
        LSRTYIEFANMIRTQFS PIK LRTDNALEYKDS LLSFLSQQGT+VQRSCPHTSQQNGRAERKHRHILDS+ ALLLSASCP+KFWGEAALTSVYTINRL
Subjt:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL

Query:  PSSVLQNTSPFERLYGISPDYSNLKDF
        PSSVLQN SPFE+LYG  P+YSNLK F
Subjt:  PSSVLQNTSPFERLYGISPDYSNLKDF

A0A5D3D7V8 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.5e-219; % identity: 66.03)
Query:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI
        DKE+D+KFIERLE+WDSKNHQII WL NTSIPAIH QFDAF++                                    SVNEYLAVLQPIWTQLDQA+I
Subjt:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI

Query:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH
        +KDHLRLIKVLM LRPEYESVRAALLHRSPLPSLDAAIQEILFEE+ LGIN +K  D  L STY+    S+ FCKNCKL GHKFIN PKIECRYCHK GH
Subjt:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH

Query:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI
        ILDNCP +PP+P   ST+ KNFTKP  SS      D S++   +ISDLQ LLNQ+ISS SSAL VS GNRWLLDS CCNHMTSD+SLM+T SPTKSLPPI
Subjt:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI

Query:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------
        YAADG                                              S +  +  D         G + G++                        
Subjt:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------

Query:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE
        WHLRLGHAS EKLRHLIS+NNLN++TKFVPFNCLNCKLAKQ ALSFS S S CDKPFDL+HSDIWGPAP +TVHGYRYYVLFIDD+SRFTWIYFLKHRSE
Subjt:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE

Query:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL
        LSRTYIEFANMIRTQFS PIK LRTDNALEYKDS LLSFLSQQGT+VQRSCPHTSQQNGRAERKHRHILDS+ ALLLSASCP+KFWGEAALTSVYTINRL
Subjt:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL

Query:  PSSVLQNTSPFERLYGISPDYSNLKDF
        PSSVLQN SPFERLYG  P+YSNLK F
Subjt:  PSSVLQNTSPFERLYGISPDYSNLKDF

A0A5D3DT06 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.5e-219; % identity: 66.03)
Query:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI
        DKE+D+KFIERLE+WDSKNHQII WL NTSIPAIH QFDAF++                                    SVNEYLAVLQPIWTQLDQA+I
Subjt:  DKEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDD------------------------------------SVNEYLAVLQPIWTQLDQASI

Query:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH
        +KDHLRLIKVLM LRPEYESVRAALLHRSPLPSLDAAIQEILFEE+ LGIN +K  D  L STY+    S+ FCKNCKL GHKFIN PKIECRYCHK GH
Subjt:  NKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKHLGINSTKQPDVGLGSTYT----SNMFCKNCKLFGHKFINGPKIECRYCHKHGH

Query:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI
        ILDNCP +PP+P   ST+ KNFTKP  SS      D S++   +ISDLQ LLNQ+ISS SSAL VS GNRWLLDS CCNHMTSD+SLM+T SPTKSLPPI
Subjt:  ILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSL--IEISDLQILLNQIISS-SSALVVSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPI

Query:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------
        YAADG                                              S +  +  D         G + G++                        
Subjt:  YAADG----------------------------------------------SGSADETDD-------WNGSQSGKI------------------------

Query:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE
        WHLRLGHAS EKLRHLIS+NNLN++TKFVPFNCLNCKLAKQ ALSFS S S CDKPFDL+HSDIWGPAP +TVHGYRYYVLFIDD+SRFTWIYFLKHRSE
Subjt:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSE

Query:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL
        LSRTYIEFANMIRTQFS PIK LRTDNALEYKDS LLSFLSQQGT+VQRSCPHTSQQNGRAERKHRHILDS+ ALLLSASCP+KFWGEAALTSVYTINRL
Subjt:  LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRL

Query:  PSSVLQNTSPFERLYGISPDYSNLKDF
        PSSVLQN SPFERLYG  P+YSNLK F
Subjt:  PSSVLQNTSPFERLYGISPDYSNLKDF

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 1.6e-27; % identity: 31.09)
Query:  KIWHLRLGHASSEKL-----RHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQ--SISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTW
        ++WH R GH S  KL     +++ S  +L N  +     C  C   KQ  L F Q    ++  +P  +VHSD+ GP    T+    Y+V+F+D ++ +  
Subjt:  KIWHLRLGHASSEKL-----RHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQ--SISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTW

Query:  IYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAAL
         Y +K++S++   + +F       F+  +  L  DN  EY  + +  F  ++G     + PHT Q NG +ER  R I +    ++  A   K FWGEA L
Subjt:  IYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAAL

Query:  TSVYTINRLPSSVLQNTS--PFERLYGISPDYSNLKDF
        T+ Y INR+PS  L ++S  P+E  +   P   +L+ F
Subjt:  TSVYTINRLPSSVLQNTS--PFERLYGISPDYSNLKDF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 5.2e-31; % identity: 34.65)
Query:  IWHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS
        +WH R+GH S + L+ L   + ++         C  C   KQ  +SF  S        DLV+SD+ GP  I ++ G +Y+V FIDD SR  W+Y LK + 
Subjt:  IWHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS

Query:  ELSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINR
        ++ + + +F  ++  +    +K LR+DN  EY       + S  G   +++ P T Q NG AER +R I++ + ++L  A  PK FWGEA  T+ Y INR
Subjt:  ELSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINR

Query:  LPSSVLQNTSPFERLYGISPDYSNLKDF
         PS  L    P          YS+LK F
Subjt:  LPSSVLQNTSPFERLYGISPDYSNLKDF

Q12490 Transposon Ty1-BL Gag-Pol polyprotein (e value: 3.7e-13; % identity: 25.23)
Query:  HLRLGHASSEKLRHLISVNNLN-------NLTKFVPFNCLNC---KLAKQLALSFSQ-SISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFT
        H  L HA+++ +R+ +  N +        + +  + + C +C   K  K   +  S+    N  +PF  +H+DI+GP          Y++ F D+ ++F 
Subjt:  HLRLGHASSEKLRHLISVNNLN-------NLTKFVPFNCLNC---KLAKQLALSFSQ-SISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFT

Query:  WIYFLKHRSE--LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGE
        W+Y L  R E  +   +      I+ QF + + +++ D   EY +  L  FL + G     +    S+ +G AER +R +LD     L  +  P   W  
Subjt:  WIYFLKHRSE--LSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGE

Query:  AALTSVYTINRLPS
        A   S    N L S
Subjt:  AALTSVYTINRLPS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 4.1e-44; % identity: 40.79)
Query:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPF-NCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS
        WH RLGH +   L  +IS  +L+ L     F +C +C + K   + FSQS  N  +P + ++SD+W  +PI +   YRYYV+F+D ++R+TW+Y LK +S
Subjt:  WHLRLGHASSEKLRHLISVNNLNNLTKFVPF-NCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS

Query:  ELSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINR
        ++  T+I F N++  +F + I    +DN  E+    L  + SQ G     S PHT + NG +ERKHRHI+++   LL  AS PK +W  A   +VY INR
Subjt:  ELSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINR

Query:  LPSSVLQNTSPFERLYGISPDYSNLKDF
        LP+ +LQ  SPF++L+G SP+Y  L+ F
Subjt:  LPSSVLQNTSPFERLYGISPDYSNLKDF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 7.7e-43; % identity: 40.79)
Query:  WHLRLGHASSEKLRHLISVNNLNNLT-KFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS
        WH RLGH S   L  +IS ++L  L       +C +C + K   + FS S     KP + ++SD+W  +PI ++  YRYYV+F+D ++R+TW+Y LK +S
Subjt:  WHLRLGHASSEKLRHLISVNNLNNLT-KFVPFNCLNCKLAKQLALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS

Query:  ELSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINR
        ++  T+I F +++  +F + I  L +DN  E+   +L  +LSQ G     S PHT + NG +ERKHRHI++    LL  AS PK +W  A   +VY INR
Subjt:  ELSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRAERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINR

Query:  LPSSVLQNTSPFERLYGISPDYSNLKDF
        LP+ +LQ  SPF++L+G  P+Y  LK F
Subjt:  LPSSVLQNTSPFERLYGISPDYSNLKDF

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGACGTCCAATAAATTGAATCTTTCAAGAAATAGTACTCCCCCAGGTGCTTTCAGTTCTCTCAGCTATTCAAGCCAGAAACCCTATATCTACTTTAATCACCACCGTCG
CCGCCAACCGTCTGTGAACCGTCGCGTCTCCGTTAACTCCAAGAATTGCTGCGTCCCGAGCCGTCTCCGTCACAGCCTCGGCGTCGCGTCGTTCACATCGTCGCGTCGTC
GCGTCCGTTCCTGCTCCGTTCTGTCTCCATCCGCGAAGCGCCAGTCGTCGCCACTGAGTTTTGCGCCTATATGTCCTTCATCAAACGACCTCCTCTGTTGTTCCAAGCTG
CATGCGACTCCTATTCCAGCCGTTTGTCTTCATTTTTTTGTCTCCGTTCACAGCCGTGAACGTCTGTCCGACCCAGCCATTAAGACAAATCCTGGCTGCGAACCCCCGAA
TCCAGTCGTGTCCTCGCCGTCGCAACTCAGCCGCGACCTAGACGTGAAGACGCTCACTTGTCACGAATCCCAGCCTTGCGAATCCCAGTCACAAAATACAGTAAAACCTA
GATCTGACGAATCCAGTCACGAATCATTCGAAACTCAGCCGAAAATCCTCATTGTTAATCTGAACTGCGAATCCAAATGGAGAGAGACCATATCTTTAGACCCATTGGAC
AAGGAAGAAGATAACAAATTCATTGAACGCCTCGAAGATTGGGACAGTAAAAATCATCAAATTATCATCTGGCTTGGTAACACTTCTATTCCTGCTATACATGCACAGTT
TGATGCTTTTGATGATTCTGTTAATGAATATTTGGCAGTTCTTCAACCCATTTGGACTCAGCTTGACCAAGCGAGCATCAACAAAGATCATCTTCGCCTTATTAAAGTCC
TTATGAGATTACGCCCAGAATATGAATCTGTCAGAGCTGCTTTACTACACCGGAGTCCCTTACCCTCATTGGATGCAGCTATTCAAGAAATTTTGTTTGAAGAAAAGCAC
CTTGGCATAAATTCTACTAAACAACCTGATGTTGGCCTTGGCAGCACATACACCTCAAATATGTTTTGTAAGAATTGTAAGCTCTTTGGTCACAAATTTATTAACGGTCC
TAAAATAGAGTGCAGGTACTGCCATAAGCATGGTCACATTCTGGATAACTGCCCTACCAGACCACCCCAACCTCTTGGTACTTCCACAAAAGAGAAAAACTTTACTAAAC
CTGGTCCATCATCTGTTGTTGCTGCAACCTCGGATGATTCATCCCTTATTGAGATAAGTGATCTTCAAATCCTATTGAATCAAATAATTTCATCATCCTCTGCTCTTGTT
GTCTCGTCAGGTAATCGATGGCTTCTTGATTCTACCTGTTGTAATCATATGACCTCTGACTTTTCTCTTATGTCTACTTCTAGCCCTACAAAATCTTTACCTCCTATTTA
TGCTGCTGATGGTTCAGGATCCGCAGACGAGACAGACGATTGGAACGGGTCGCAAAGTGGGAAGATTTGGCATCTTCGTCTTGGTCATGCTTCTTCTGAAAAACTTCGTC
ATTTAATTTCTGTTAACAATTTGAATAATCTTACTAAGTTTGTTCCTTTTAATTGTTTGAATTGCAAACTTGCTAAACAACTTGCCTTGTCTTTTTCTCAATCCATCTCT
AATTGTGATAAACCTTTTGATTTAGTGCATTCTGATATTTGGGGTCCTGCCCCAATTACTACTGTTCATGGTTATCGCTACTATGTTTTATTCATTGATGACTACTCTCG
TTTTACATGGATTTACTTTCTAAAACATCGTTCTGAATTATCTCGCACATATATTGAGTTTGCTAACATGATTCGCACTCAATTTTCCTCTCCCATCAAAATTCTTCGCA
CTGATAATGCTTTGGAATATAAAGATTCCATCCTTCTTTCTTTTCTTTCCCAACAGGGCACTATTGTTCAGCGCTCTTGCCCTCATACCTCTCAACAAAATGGACGTGCT
GAGCGCAAACATCGTCACATTCTTGACTCAATATGTGCCCTCCTTCTTTCTGCCTCTTGTCCTAAAAAATTCTGGGGTGAAGCTGCCCTTACATCAGTATATACCATCAA
TCGTCTCCCTTCTTCTGTTCTTCAAAACACATCTCCATTCGAAAGACTATATGGTATTTCTCCCGACTACTCTAACCTCAAAGATTTTGTTGTTGGGACCCTCTTTCCAA
CCGACTCCGGATATCTCGGCATGTCACTTTTTGGGAACACACTATGTTCTCTCGTTTGTCCTCCTTCCACACCTCTTTCTCTAGTCCTCATTCTTTCTTTACCAATACAT
CTGTTGACCTTTTTCCTCTCTCTGAACCCACCATGGATACTGAGCTTGCATAATCTTCACCTGCTACTGCGAATCCGGATCCACCATCTGTCTTCGATGATGTTCCTGAA
TCTCCACCTGCTACTCCTCTTCGTCGCTCTACCTGGATAA
mRNA sequence
ATGACGTCCAATAAATTGAATCTTTCAAGAAATAGTACTCCCCCAGGTGCTTTCAGTTCTCTCAGCTATTCAAGCCAGAAACCCTATATCTACTTTAATCACCACCGTCG
CCGCCAACCGTCTGTGAACCGTCGCGTCTCCGTTAACTCCAAGAATTGCTGCGTCCCGAGCCGTCTCCGTCACAGCCTCGGCGTCGCGTCGTTCACATCGTCGCGTCGTC
GCGTCCGTTCCTGCTCCGTTCTGTCTCCATCCGCGAAGCGCCAGTCGTCGCCACTGAGTTTTGCGCCTATATGTCCTTCATCAAACGACCTCCTCTGTTGTTCCAAGCTG
CATGCGACTCCTATTCCAGCCGTTTGTCTTCATTTTTTTGTCTCCGTTCACAGCCGTGAACGTCTGTCCGACCCAGCCATTAAGACAAATCCTGGCTGCGAACCCCCGAA
TCCAGTCGTGTCCTCGCCGTCGCAACTCAGCCGCGACCTAGACGTGAAGACGCTCACTTGTCACGAATCCCAGCCTTGCGAATCCCAGTCACAAAATACAGTAAAACCTA
GATCTGACGAATCCAGTCACGAATCATTCGAAACTCAGCCGAAAATCCTCATTGTTAATCTGAACTGCGAATCCAAATGGAGAGAGACCATATCTTTAGACCCATTGGAC
AAGGAAGAAGATAACAAATTCATTGAACGCCTCGAAGATTGGGACAGTAAAAATCATCAAATTATCATCTGGCTTGGTAACACTTCTATTCCTGCTATACATGCACAGTT
TGATGCTTTTGATGATTCTGTTAATGAATATTTGGCAGTTCTTCAACCCATTTGGACTCAGCTTGACCAAGCGAGCATCAACAAAGATCATCTTCGCCTTATTAAAGTCC
TTATGAGATTACGCCCAGAATATGAATCTGTCAGAGCTGCTTTACTACACCGGAGTCCCTTACCCTCATTGGATGCAGCTATTCAAGAAATTTTGTTTGAAGAAAAGCAC
CTTGGCATAAATTCTACTAAACAACCTGATGTTGGCCTTGGCAGCACATACACCTCAAATATGTTTTGTAAGAATTGTAAGCTCTTTGGTCACAAATTTATTAACGGTCC
TAAAATAGAGTGCAGGTACTGCCATAAGCATGGTCACATTCTGGATAACTGCCCTACCAGACCACCCCAACCTCTTGGTACTTCCACAAAAGAGAAAAACTTTACTAAAC
CTGGTCCATCATCTGTTGTTGCTGCAACCTCGGATGATTCATCCCTTATTGAGATAAGTGATCTTCAAATCCTATTGAATCAAATAATTTCATCATCCTCTGCTCTTGTT
GTCTCGTCAGGTAATCGATGGCTTCTTGATTCTACCTGTTGTAATCATATGACCTCTGACTTTTCTCTTATGTCTACTTCTAGCCCTACAAAATCTTTACCTCCTATTTA
TGCTGCTGATGGTTCAGGATCCGCAGACGAGACAGACGATTGGAACGGGTCGCAAAGTGGGAAGATTTGGCATCTTCGTCTTGGTCATGCTTCTTCTGAAAAACTTCGTC
ATTTAATTTCTGTTAACAATTTGAATAATCTTACTAAGTTTGTTCCTTTTAATTGTTTGAATTGCAAACTTGCTAAACAACTTGCCTTGTCTTTTTCTCAATCCATCTCT
AATTGTGATAAACCTTTTGATTTAGTGCATTCTGATATTTGGGGTCCTGCCCCAATTACTACTGTTCATGGTTATCGCTACTATGTTTTATTCATTGATGACTACTCTCG
TTTTACATGGATTTACTTTCTAAAACATCGTTCTGAATTATCTCGCACATATATTGAGTTTGCTAACATGATTCGCACTCAATTTTCCTCTCCCATCAAAATTCTTCGCA
CTGATAATGCTTTGGAATATAAAGATTCCATCCTTCTTTCTTTTCTTTCCCAACAGGGCACTATTGTTCAGCGCTCTTGCCCTCATACCTCTCAACAAAATGGACGTGCT
GAGCGCAAACATCGTCACATTCTTGACTCAATATGTGCCCTCCTTCTTTCTGCCTCTTGTCCTAAAAAATTCTGGGGTGAAGCTGCCCTTACATCAGTATATACCATCAA
TCGTCTCCCTTCTTCTGTTCTTCAAAACACATCTCCATTCGAAAGACTATATGGTATTTCTCCCGACTACTCTAACCTCAAAGATTTTGTTGTTGGGACCCTCTTTCCAA
CCGACTCCGGATATCTCGGCATGTCACTTTTTGGGAACACACTATGTTCTCTCGTTTGTCCTCCTTCCACACCTCTTTCTCTAGTCCTCATTCTTTCTTTACCAATACAT
CTGTTGACCTTTTTCCTCTCTCTGAACCCACCATGGATACTGAGCTTGCATAATCTTCACCTGCTACTGCGAATCCGGATCCACCATCTGTCTTCGATGATGTTCCTGAA
TCTCCACCTGCTACTCCTCTTCGTCGCTCTACCTGGATAA
Protein sequence
MTSNKLNLSRNSTPPGAFSSLSYSSQKPYIYFNHHRRRQPSVNRRVSVNSKNCCVPSRLRHSLGVASFTSSRRRVRSCSVLSPSAKRQSSPLSFAPICPSSNDLLCCSKL
HATPIPAVCLHFFVSVHSRERLSDPAIKTNPGCEPPNPVVSSPSQLSRDLDVKTLTCHESQPCESQSQNTVKPRSDESSHESFETQPKILIVNLNCESKWRETISLDPLD
KEEDNKFIERLEDWDSKNHQIIIWLGNTSIPAIHAQFDAFDDSVNEYLAVLQPIWTQLDQASINKDHLRLIKVLMRLRPEYESVRAALLHRSPLPSLDAAIQEILFEEKH
LGINSTKQPDVGLGSTYTSNMFCKNCKLFGHKFINGPKIECRYCHKHGHILDNCPTRPPQPLGTSTKEKNFTKPGPSSVVAATSDDSSLIEISDLQILLNQIISSSSALV
VSSGNRWLLDSTCCNHMTSDFSLMSTSSPTKSLPPIYAADGSGSADETDDWNGSQSGKIWHLRLGHASSEKLRHLISVNNLNNLTKFVPFNCLNCKLAKQLALSFSQSIS
NCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTDNALEYKDSILLSFLSQQGTIVQRSCPHTSQQNGRA
ERKHRHILDSICALLLSASCPKKFWGEAALTSVYTINRLPSSVLQNTSPFERLYGISPDYSNLKDFVVGTLFPTDSGYLGMSLFGNTLCSLVCPPSTPLSLVLILSLPIH
LLTFFLSLNPPWILSLHNLHLLLRIRIHHLSSMMFLNLHLLLLFVALPG
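
The CDS and protein sequences above can be spot-checked against each other by translating the nucleotide sequence. The following is a minimal Python sketch that assumes the standard nuclear genetic code. The names CODON_TABLE and translate are hypothetical helpers, and only the first 36 and last 15 nucleotides of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 36 nt and last 15 nt of the CDS sequence listed in this record.
cds_start = "ATGACGTCCAATAAATTGAATCTTTCAAGAAATAGT"
cds_end = "GCTCTACCTGGATAA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein, plus stop
```

The two fragments translate to MTSNKLNLSRNS and ALPG followed by a stop, which agrees with the first and last residues of the protein sequence in this record.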