CuGenDBv2

CSPI03G25080 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G25080
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Gag-pol polyprotein
Genome location: Chr3:22290261..22291553
RNA-Seq Expression: CSPI03G25080
Synteny: CSPI03G25080
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
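
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, since the record does not state its convention), the length of the locus can be derived as follows. The function names `parse_location` and `span_length` are hypothetical helpers introduced only for this illustration.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr3:22290261..22291553' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr3:22290261..22291553")
print(chrom, span_length(start, end))  # Chr3 1293
```

Under that assumption the locus spans 1293 bp, which would be consistent with a single-exon gene whose coding sequence covers the whole interval.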


Homology
GenBank top hits (accession and description; e value; % identity; alignment)
EEC84282.1 hypothetical protein OsI_30754 [Oryza sativa Indica Group]; e value 3.1e-133; identity 60%
Query:  MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVIT
        +TE  ++V M ED+++  D+S  +L+M V++T NRLY+I LK    VCLLT +++P WLWH RLGHVNF  +KL+ +K +  G+P +T PN+LC+AC++ 
Subjt:  MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVIT

Query:  KQARLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQ
        KQ R PFP  + +RAE+PLELLH D+CGPI+P T+AGN+YF+LIVDD +RWMW+++++ K    EAF KFK L EN    +I+TLR DRGGEFLS EF Q
Subjt:  KQARLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQ

Query:  FCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP
         C++ GI+R LTAPYSPQQNG++ERRNR+VMA  RSL+K M VP +FWGEA+RH VYLLN LPTKA+G+RTPFEAW GRKP L HLRVFGC+A+ K TTP
Subjt:  FCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP

Query:  HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQV
        + KKLDDRS+P VY GVEEG KAHRL+DP  G++ +SRDV+F+EN+ W W+ VV+  +  TEF V
Subjt:  HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQV

KAB8107251.1 hypothetical protein EE612_041900 [Oryza sativa]; e value 3.1e-133; identity 60%
Query:  MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVIT
        +TE  ++V M ED+++  D+S  +L+M V++T NRLY+I LK    VCLLT +++P WLWH RLGHVNF  +KL+ +K +  G+P +T PN+LC+AC++ 
Subjt:  MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVIT

Query:  KQARLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQ
        KQ R PFP  + +RAE+PLELLH D+CGPI+P T+AGN+YF+LIVDD +RWMW+++++ K    EAF KFK L EN    +I+TLR DRGGEFLS EF Q
Subjt:  KQARLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQ

Query:  FCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP
         C++ GI+R LTAPYSPQQNG++ERRNR+VMA  RSL+K M VP +FWGEA+RH VYLLN LPTKA+G+RTPFEAW GRKP L HLRVFGC+A+ K TTP
Subjt:  FCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP

Query:  HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQV
        + KKLDDRS+P VY GVEEG KAHRL+DP  G++ +SRDV+F+EN+ W W+ VV+  +  TEF V
Subjt:  HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQV

XP_042396841.1 uncharacterized protein LOC121986987 [Zingiber officinale]; e value 1.1e-133; identity 58.41%
Query:  ENRNKVQMIEDIMKGSDRSGKLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQA
        E  N+V M +DIMK  DRSGKLLM VK+TQNRLY                                   KL+GEKKL V VP + QPNKLCE CV+ K A
Subjt:  ENRNKVQMIEDIMKGSDRSGKLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQA

Query:  RLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQFCK
        R PFP Q+ +RA KPLELLHADI GPISP TLA                      AK D F+AF KFK + EN TEYKI+TLR+DRGGEFLS EFT+FC+
Subjt:  RLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQFCK

Query:  KEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPHFK
         EGIER LTAPY+PQQNG++ER NRTVMA  RSLLK  H+PA+FWGEA+RHTVYLLN LPTK LG RTPFEAWMGRKPHLAHLRVFGCVAYVKN TPH K
Subjt:  KEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPHFK

Query:  KLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQVMDQFYSDEFENLEDAETWVENAFPHATEIPAIGETSS-
        KLDDRSSPMVY GVEEGCKAHRL+DP  GKLQ+SRDV+FQEN EW W    +  + + EF V D   +DE   + D E   E+  P  T +  +   SS 
Subjt:  KLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQVMDQFYSDEFENLEDAETWVENAFPHATEIPAIGETSS-

Query:  -----------SPPSTNTPVRLRSLSDIYANTEEVVGGDE
                   SP S   PVR RS++DIYANTEEVVG DE
Subjt:  -----------SPPSTNTPVRLRSLSDIYANTEEVVGGDE

XP_042404661.1 uncharacterized protein LOC121994812 [Zingiber officinale]; e value 6.9e-133; identity 57.95%
Query:  ENRNKVQMIEDIMKGSDRSGKLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQA
        E  N+V M +DIMK  DRSGKLLM VK+TQNRLY                                   KL+GEKKL V VP + QPNKLCE CV+ K A
Subjt:  ENRNKVQMIEDIMKGSDRSGKLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQA

Query:  RLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQFCK
        R PFP Q+ +RA KPLELLHADICGPISP TLA                      AK D F+AF KFK + EN TEYKI+TLR DRGGEFLS EFT+FC+
Subjt:  RLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQFCK

Query:  KEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPHFK
         E IER  TAPY+PQQNG++ERRNR VMA  RSLLK  H+PA+FWGEA+RH VYLLN LPTKALG+RTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPH K
Subjt:  KEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPHFK

Query:  KLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQVMDQFYSDEFENLEDAETWVENAFPHATEIPAIGETSS-
        KLDDRSSPMVY GVEEGCKAHRL+DP   KLQ+SRDV+FQEN EW W    +  + + EF V D   +DE   + D E   E+  P AT + ++   SS 
Subjt:  KLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQVMDQFYSDEFENLEDAETWVENAFPHATEIPAIGETSS-

Query:  -----------SPPSTNTPVRLRSLSDIYANTEEVVGGDE
                   SP S   PV  RS++DIYANTE+VVG DE
Subjt:  -----------SPPSTNTPVRLRSLSDIYANTEEVVGGDE

XP_042446579.1 uncharacterized protein LOC122031543 [Zingiber officinale]; e value 1.0e-147; identity 61.82%
Query:  ENRNKVQMIEDIMKGSDRSGKLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQA
        E  N+V M +DIMK  DR GKLLM VK+TQNRLY                                   KL+GEKKLVV VP +  PNKLCE CV+ K A
Subjt:  ENRNKVQMIEDIMKGSDRSGKLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQA

Query:  RLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQFCK
        R PF  Q+ +RA K LELLHADICGPISP TLAGNKYF LIVDD TRWMW+++L AK D F+ F KFK + EN TEYKI+TLR DRGGEFLS EFT+FC+
Subjt:  RLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQFCK

Query:  KEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPHFK
         EGIER LTAPY+PQQNG++ERRNRTVMA  RSLLK  H+PA+FWGEA+RH VYLLN LPTKALG+RTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPH K
Subjt:  KEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPHFK

Query:  KLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQVMDQFYSDEFENLEDAETWVENAFPHATEIPAIGETSS-
        KLDDRSSPMVY GVEEGCKAHRL+DP   KLQ+SRDV+FQEN EW W    +  + + EF V D   +DE   + D E   E+  P AT +  +   SS 
Subjt:  KLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQVMDQFYSDEFENLEDAETWVENAFPHATEIPAIGETSS-

Query:  -----------SPPSTNTPVRLRSLSDIYANTEEVVGGDE
                   SP S   PVR RS++DIYANTEEVVG DE
Subjt:  -----------SPPSTNTPVRLRSLSDIYANTEEVVGGDE

TrEMBL top hits (accession and description; e value; % identity; alignment)
A0A0P0XB91 Os08g0125300 protein; e value 1.5e-133; identity 60%
Query:  MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVIT
        +TE  ++V M ED+++  D+S  +L+M V++T NRLY+I LK    VCLLT +++P WLWH RLGHVNF  +KL+ +K +  G+P +T PN+LC+AC++ 
Subjt:  MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVIT

Query:  KQARLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQ
        KQ R PFP  + +RAE+PLELLH D+CGPI+P T+AGN+YF+LIVDD +RWMW+++++ K    EAF KFK L EN    +I+TLR DRGGEFLS EF Q
Subjt:  KQARLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQ

Query:  FCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP
         C++ GI+R LTAPYSPQQNG++ERRNR+VMA  RSL+K M VP +FWGEA+RH VYLLN LPTKA+G+RTPFEAW GRKP L HLRVFGC+A+ K TTP
Subjt:  FCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP

Query:  HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQV
        + KKLDDRS+P VY GVEEG KAHRL+DP  G++ +SRDV+F+EN+ W W+ VV+  +  TEF V
Subjt:  HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQV

B8BDZ6 Uncharacterized protein; e value 1.5e-133; identity 60%
Query:  MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVIT
        +TE  ++V M ED+++  D+S  +L+M V++T NRLY+I LK    VCLLT +++P WLWH RLGHVNF  +KL+ +K +  G+P +T PN+LC+AC++ 
Subjt:  MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVIT

Query:  KQARLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQ
        KQ R PFP  + +RAE+PLELLH D+CGPI+P T+AGN+YF+LIVDD +RWMW+++++ K    EAF KFK L EN    +I+TLR DRGGEFLS EF Q
Subjt:  KQARLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQ

Query:  FCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP
         C++ GI+R LTAPYSPQQNG++ERRNR+VMA  RSL+K M VP +FWGEA+RH VYLLN LPTKA+G+RTPFEAW GRKP L HLRVFGC+A+ K TTP
Subjt:  FCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP

Query:  HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQV
        + KKLDDRS+P VY GVEEG KAHRL+DP  G++ +SRDV+F+EN+ W W+ VV+  +  TEF V
Subjt:  HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQV

Q0J8A6 Os08g0125300 protein; e value 1.5e-133; identity 60%
Query:  MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVIT
        +TE  ++V M ED+++  D+S  +L+M V++T NRLY+I LK    VCLLT +++P WLWH RLGHVNF  +KL+ +K +  G+P +T PN+LC+AC++ 
Subjt:  MTENRNKVQMIEDIMKGSDRSG-KLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVIT

Query:  KQARLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQ
        KQ R PFP  + +RAE+PLELLH D+CGPI+P T+AGN+YF+LIVDD +RWMW+++++ K    EAF KFK L EN    +I+TLR DRGGEFLS EF Q
Subjt:  KQARLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQ

Query:  FCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP
         C++ GI+R LTAPYSPQQNG++ERRNR+VMA  RSL+K M VP +FWGEA+RH VYLLN LPTKA+G+RTPFEAW GRKP L HLRVFGC+A+ K TTP
Subjt:  FCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP

Query:  HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQV
        + KKLDDRS+P VY GVEEG KAHRL+DP  G++ +SRDV+F+EN+ W W+ VV+  +  TEF V
Subjt:  HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQV

Q10F84 Gag-pol polyprotein; e value 1.6e-127; identity 54.26%
Query:  MTENRNKVQMIEDIMKGSDRS-GKLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVIT
        +TE  ++V M  D ++  D++  +L+M V+++ NRLY+I L+    VCLL SL+DP WLWH RLGHVNFH LKL+ +K++  GVP V  PN+LC+AC++ 
Subjt:  MTENRNKVQMIEDIMKGSDRS-GKLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVIT

Query:  KQARLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQ
        KQ R PFP  + Y AE PLELLH D+CGPI+P T  GN+YF+LIVDD + WMW++++++K     AF KFK L EN     I+TLR DRGGEFLS EF +
Subjt:  KQARLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQ

Query:  FCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP
         C   GIER LT PYSPQQNG++ERRNRTVMA  RSLLK M VP + WGEA+RH V+LLN LPTKA+G RTPFEAW G+KPHL HLRVFGC A+ K T P
Subjt:  FCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP

Query:  HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQVMDQFYSDEFENLEDAET--W--VENAFPHATEIPA
        H KKLDDRS+P+VY GVEEG KAHRL+DP R ++ +SRDV+F EN  W W+    +    TEF+V +   +++    E A +  W     A   A + P 
Subjt:  HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQVMDQFYSDEFENLEDAET--W--VENAFPHATEIPA

Query:  IGETSSSPPST
        + E   +PP++
Subjt:  IGETSSSPPST

Q338J6 Retrotransposon protein, putative, unclassified; e value 9.5e-128; identity 53.36%
Query:  MTENRNKVQMIEDIMKGSDRS-GKLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVIT
        +TE  ++V M  D +K  D++  +L+M V++T NRLY+I L+   QVCLL SL++P WLWH R+GHVNFH LKL+ +K++  GVP V  PN+LC+AC++ 
Subjt:  MTENRNKVQMIEDIMKGSDRS-GKLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVIT

Query:  KQARLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQ
        KQ R PFP  + YRAE PLELLH D+CGPI+P T AGN+YF+LIVDD + WMW+++++ K      F KFK L +N     I+TLR DRGGEFLS +F +
Subjt:  KQARLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQ

Query:  FCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP
         C    IER LTAPYSPQQN ++ERRNRTVMA  RSLLK M VP + WGEA+RH ++LLN LPTKA+G RTPFEAW G+KPHL HLRVFGC A+ K T P
Subjt:  FCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTP

Query:  HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQVMDQFYSDEFENLEDAETWVENAFPHATEIPAIGET
        H KKLDDRS+P VY GVEEG KAHRL+DP R ++ +SRDV+F EN  W W+       E+T         S EFE  E          P   E PA+ E 
Subjt:  HFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQVMDQFYSDEFENLEDAETWVENAFPHATEIPAIGET

Query:  S--SSPPSTNT---PVRLRSLSDIYANTEEV
        +  +SP +  +   PVR RSL++I      V
Subjt:  S--SSPPSTNT---PVRLRSLSDIYANTEEV

SwissProt top hits (accession and description; e value; % identity; alignment)
P04146 Copia protein; e value 6.2e-52; identity 34.2%
Query:  LWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPN---KLCEACVITKQARLPFPR-QSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWL
        LWH R GH++   L  +  K +     L+       ++CE C+  KQARLPF + +     ++PL ++H+D+CGPI+P TL    YF++ VD  T +   
Subjt:  LWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPN---KLCEACVITKQARLPFPR-QSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWL

Query:  YMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRH
        Y+++ KSD F  F  F    E     K+  L +D G E+LS E  QFC K+GI   LT P++PQ NG+ ER  RT+  + R+++    +   FWGEA+  
Subjt:  YMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRH

Query:  TVYLLNCLPTKAL--GERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPHFK----KLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEW
          YL+N +P++AL    +TP+E W  +KP+L HLRVFG   YV     H K    K DD+S   ++ G E      +L+D    K  ++RDV+  E    
Subjt:  TVYLLNCLPTKAL--GERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPHFK----KLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEW

Query:  AWNEVVSDGKEITEFQVMDQFYSDEFENLEDAETWVENAFPHATE
          N V S   +     + D   S+      D+   ++  FP+ ++
Subjt:  AWNEVVSDGKEITEFQVMDQFYSDEFENLEDAETWVENAFPHATE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 1.3e-60; identity 35.29%
Query:  GKLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQARLPFPRQSTYRAEKPLELL
        G L+++    +  LY+   +  +        E    LWH R+GH++   L+++ +K L+      T   K C+ C+  KQ R+ F + S+ R    L+L+
Subjt:  GKLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQARLPFPRQSTYRAEKPLELL

Query:  HADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGI
        ++D+CGP+   ++ GNKYF+  +DD++R +W+Y+L+ K   F+ F KF  L+E +T  K++ LR D GGE+ S EF ++C   GI    T P +PQ NG+
Subjt:  HADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGI

Query:  IERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPHFKKLDDRSSPMVYFGVEEGCK
         ER NRT++ + RS+L+   +P  FWGEA++   YL+N  P+  L    P   W  ++   +HL+VFGC A+         KLDD+S P ++ G  +   
Subjt:  IERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPHFKKLDDRSSPMVYFGVEEGCK

Query:  AHRLYDPGRGKLQISRDVLFQEN
         +RL+DP + K+  SRDV+F+E+
Subjt:  AHRLYDPGRGKLQISRDVLFQEN

Q12491 Transposon Ty2-B Gag-Pol polyprotein; e value 4.1e-19; identity 21.83%
Query:  PTWLWHVRLGHVNFHDLKLMGEKKLVV-----GVPLVTQPNKLCEACVI---TKQARLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDD
        P  L H  LGH NF  ++   +K  V       +         C  C+I   TK   +   R     + +P + LH DI GP+     +   YF+   D+
Subjt:  PTWLWHVRLGHVNFHDLKLMGEKKLVV-----GVPLVTQPNKLCEACVI---TKQARLPFPRQSTYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDD

Query:  STRWMWLYMLEAKSDG--FEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPA
         TR+ W+Y L  + +      F      ++N+   ++  ++MDRG E+ +    +F    GI    T     + +G+ ER NRT++   R+LL    +P 
Subjt:  STRWMWLYMLEAKSDG--FEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPA

Query:  KFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPHFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGK-LQISRDVLFQE
          W  A+  +  + N L +    +     A +     +  +  FG    V N  P   K+  R  P            + +Y P   K +  +  V+ Q+
Subjt:  KFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPHFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGK-LQISRDVLFQE

Query:  N---LEWAWNEVVSDGKEITEFQVMDQFYSDEFENLEDAETWVENAFPHATEI-----PAIGETSSSPPS----TNTPV-RLRSLSDIYANTEE
        N   L+    + ++   ++      +Q + ++ E  +  +   E+   + +EI     P + + SS   +     + PV ++R+L ++ A+  E
Subjt:  N---LEWAWNEVVSDGKEITEFQVMDQFYSDEFENLEDAETWVENAFPHATEI-----PAIGETSSSPPS----TNTPV-RLRSLSDIYANTEE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value 3.2e-40; identity 33.23%
Query:  QTQNRLYKITLKTLKQVCLLTSLEDPTW--LWHVRLGHVNFHDL-KLMGEKKLVVGVPLVTQPNKLCEACVITKQARLPFPRQSTYRAEKPLELLHADIC
        +T++ LY+  + + + V L  S         WH RLGH     L  ++    L V  P  +     C  C+I K  ++PF  QST  + +PLE +++D+ 
Subjt:  QTQNRLYKITLKTLKQVCLLTSLEDPTW--LWHVRLGHVNFHDL-KLMGEKKLVVGVPLVTQPNKLCEACVITKQARLPFPRQSTYRAEKPLELLHADIC

Query:  GPISPRTLAGN-KYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERR
           SP     N +Y+++ VD  TR+ WLY L+ KS   E F  FK L+EN+ + +I T   D GGEF++    ++  + GI    + P++P+ NG+ ER+
Subjt:  GPISPRTLAGN-KYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNGIIERR

Query:  NRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPHFKKLDDRSSPMVYFGVEEGCKAHRL
        +R ++    +LL    +P  +W  A    VYL+N LPT  L   +PF+   G  P+   LRVFGC  Y      +  KLDD+S   V+ G      A+  
Subjt:  NRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPHFKKLDDRSSPMVYFGVEEGCKAHRL

Query:  YDPGRGKLQISRDVLFQEN
              +L ISR V F EN
Subjt:  YDPGRGKLQISRDVLFQEN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value 8.8e-38; identity 28.33%
Query:  NRNKVQMIEDIMKGSDRSGKLLMSVKQTQNRLYKITLKTLKQVCLLTS--LEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKL--CEACVIT
        NR  V+      +  D +  + +   +T++ LY+  + + + V +  S   +     WH RLGH +   L ++        +P++   +KL  C  C I 
Subjt:  NRNKVQMIEDIMKGSDRSGKLLMSVKQTQNRLYKITLKTLKQVCLLTS--LEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKL--CEACVIT

Query:  KQARLPFPRQSTYRAEKPLELLHADI-CGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFT
        K  ++PF   ST  + KPLE +++D+   PI   ++   +Y+++ VD  TR+ WLY L+ KS   + F  FK L+EN+ + +I TL  D GGEF+     
Subjt:  KQARLPFPRQSTYRAEKPLELLHADI-CGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFT

Query:  QFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTT
         +  + GI    + P++P+ NG+ ER++R ++    +LL    VP  +W  A    VYL+N LPT  L  ++PF+   G+ P+   L+VFGC  Y     
Subjt:  QFCKKEGIERPLTAPYSPQQNGIIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTT

Query:  PHFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQVMDQFYSDEFENLEDAETWVENAFPHATEI-----
         +  KL+D+S    + G      A+       G+L  SR V F E                T F V     + + +  + A  W  +     T +     
Subjt:  PHFKKLDDRSSPMVYFGVEEGCKAHRLYDPGRGKLQISRDVLFQENLEWAWNEVVSDGKEITEFQVMDQFYSDEFENLEDAETWVENAFPHATEI-----

Query:  PAIG---ETSSSPPSTNTPV
        P +G   +TS  PPS+ +P+
Subjt:  PAIG---ETSSSPPSTNTPV

Arabidopsis top hits (accession and description; e value; % identity; alignment)
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein; e value 2.3e-09; identity 31.87%
Query:  NRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVK----NTTPHFKKLDDRSSPMV
        NRT++ + RS+L    +P  F  +A    V+++N  P+ A+    P E W    P  ++LR FGCVAY+        P  KK +++ S ++
Subjt:  NRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVK----NTTPHFKKLDDRSSPMV


Sequences
CDS sequence
ATGACAGAAAACAGAAACAAGGTGCAAATGATAGAAGATATCATGAAAGGGTCTGATAGGAGTGGGAAGCTTTTGATGTCGGTGAAGCAAACTCAAAATCGTTTGTACAA
GATAACTTTGAAGACACTCAAGCAAGTCTGCCTTCTGACAAGCCTAGAAGATCCAACATGGTTATGGCACGTGAGACTTGGCCATGTAAATTTTCATGACTTGAAGCTCA
TGGGGGAGAAGAAATTGGTAGTTGGAGTACCACTAGTGACTCAACCGAACAAGTTATGTGAAGCGTGCGTGATTACCAAACAAGCCAGATTGCCCTTCCCCCGTCAATCA
ACATATAGAGCAGAGAAGCCATTAGAACTCCTCCATGCTGATATATGCGGACCGATTTCACCACGTACTCTTGCTGGAAACAAGTATTTTCTGTTGATCGTTGACGATTC
CACGAGATGGATGTGGTTGTATATGTTGGAGGCAAAAAGTGATGGATTTGAAGCATTCAATAAATTCAAACTCTTAATGGAGAACAAAACGGAGTACAAGATCAGAACGC
TCCGGATGGATCGAGGTGGTGAGTTCTTATCTGCAGAGTTCACTCAATTTTGCAAAAAAGAAGGAATCGAACGACCCCTCACCGCTCCATATTCACCACAACAAAATGGC
ATTATAGAGCGTCGTAACCGCACCGTAATGGCGAGGACGAGATCACTCCTCAAAAGCATGCATGTGCCTGCAAAATTTTGGGGAGAGGCATTGAGACACACGGTTTATTT
GTTAAATTGTCTTCCAACGAAGGCCCTTGGAGAACGCACACCATTTGAAGCTTGGATGGGGAGAAAGCCACATCTTGCACACTTGAGAGTCTTTGGTTGTGTGGCATATG
TAAAGAACACAACCCCTCACTTCAAGAAACTCGATGATCGAAGCTCACCAATGGTATATTTTGGTGTCGAAGAAGGATGCAAAGCCCATCGCTTATATGACCCAGGCCGT
GGAAAACTACAAATCAGTAGAGATGTTCTTTTTCAAGAGAATCTTGAATGGGCTTGGAATGAAGTTGTCAGTGACGGTAAGGAGATTACAGAGTTTCAGGTGATGGACCA
ATTTTATTCTGACGAGTTCGAAAACTTGGAGGATGCAGAAACTTGGGTTGAAAATGCCTTCCCACATGCAACTGAGATACCTGCGATTGGAGAGACCAGTTCATCTCCTC
CATCGACGAACACACCGGTTCGTCTAAGATCTCTCAGTGACATCTACGCCAACACAGAGGAAGTTGTAGGTGGTGATGAATAA
mRNA sequence
ATGACAGAAAACAGAAACAAGGTGCAAATGATAGAAGATATCATGAAAGGGTCTGATAGGAGTGGGAAGCTTTTGATGTCGGTGAAGCAAACTCAAAATCGTTTGTACAA
GATAACTTTGAAGACACTCAAGCAAGTCTGCCTTCTGACAAGCCTAGAAGATCCAACATGGTTATGGCACGTGAGACTTGGCCATGTAAATTTTCATGACTTGAAGCTCA
TGGGGGAGAAGAAATTGGTAGTTGGAGTACCACTAGTGACTCAACCGAACAAGTTATGTGAAGCGTGCGTGATTACCAAACAAGCCAGATTGCCCTTCCCCCGTCAATCA
ACATATAGAGCAGAGAAGCCATTAGAACTCCTCCATGCTGATATATGCGGACCGATTTCACCACGTACTCTTGCTGGAAACAAGTATTTTCTGTTGATCGTTGACGATTC
CACGAGATGGATGTGGTTGTATATGTTGGAGGCAAAAAGTGATGGATTTGAAGCATTCAATAAATTCAAACTCTTAATGGAGAACAAAACGGAGTACAAGATCAGAACGC
TCCGGATGGATCGAGGTGGTGAGTTCTTATCTGCAGAGTTCACTCAATTTTGCAAAAAAGAAGGAATCGAACGACCCCTCACCGCTCCATATTCACCACAACAAAATGGC
ATTATAGAGCGTCGTAACCGCACCGTAATGGCGAGGACGAGATCACTCCTCAAAAGCATGCATGTGCCTGCAAAATTTTGGGGAGAGGCATTGAGACACACGGTTTATTT
GTTAAATTGTCTTCCAACGAAGGCCCTTGGAGAACGCACACCATTTGAAGCTTGGATGGGGAGAAAGCCACATCTTGCACACTTGAGAGTCTTTGGTTGTGTGGCATATG
TAAAGAACACAACCCCTCACTTCAAGAAACTCGATGATCGAAGCTCACCAATGGTATATTTTGGTGTCGAAGAAGGATGCAAAGCCCATCGCTTATATGACCCAGGCCGT
GGAAAACTACAAATCAGTAGAGATGTTCTTTTTCAAGAGAATCTTGAATGGGCTTGGAATGAAGTTGTCAGTGACGGTAAGGAGATTACAGAGTTTCAGGTGATGGACCA
ATTTTATTCTGACGAGTTCGAAAACTTGGAGGATGCAGAAACTTGGGTTGAAAATGCCTTCCCACATGCAACTGAGATACCTGCGATTGGAGAGACCAGTTCATCTCCTC
CATCGACGAACACACCGGTTCGTCTAAGATCTCTCAGTGACATCTACGCCAACACAGAGGAAGTTGTAGGTGGTGATGAATAA
Protein sequence
MTENRNKVQMIEDIMKGSDRSGKLLMSVKQTQNRLYKITLKTLKQVCLLTSLEDPTWLWHVRLGHVNFHDLKLMGEKKLVVGVPLVTQPNKLCEACVITKQARLPFPRQS
TYRAEKPLELLHADICGPISPRTLAGNKYFLLIVDDSTRWMWLYMLEAKSDGFEAFNKFKLLMENKTEYKIRTLRMDRGGEFLSAEFTQFCKKEGIERPLTAPYSPQQNG
IIERRNRTVMARTRSLLKSMHVPAKFWGEALRHTVYLLNCLPTKALGERTPFEAWMGRKPHLAHLRVFGCVAYVKNTTPHFKKLDDRSSPMVYFGVEEGCKAHRLYDPGR
GKLQISRDVLFQENLEWAWNEVVSDGKEITEFQVMDQFYSDEFENLEDAETWVENAFPHATEIPAIGETSSSPPSTNTPVRLRSLSDIYANTEEVVGGDE