; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G05940 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G05940
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr3:5024321..5028767
RNA-Seq ExpressionCSPI03G05940
SyntenyCSPI03G05940
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GEV40152.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Tanacetum cinerariifolium]1.5e-3736.05Show/hide
Query:  STPSLSGSRKSGITRHKALTYTSQQNGVAERLNRTIMETTLNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGCVGTEGKSDDTESYTTQIEVENTRNNVQ
        +TP  +  +K GI RH  + +T QQNGVAER+N+T+M     RSP T++GL TP+E W   P + +DL +FGC      +D   +           N   
Subjt:  STPSLSGSRKSGITRHKALTYTSQQNGVAERLNRTIMETTLNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGCVGTEGKSDDTESYTTQIEVENTRNNVQ

Query:  LTEEPTVTEQEQVQNLSEEHAKIIEQQPDLSQYSLARDKQRRVIVPPARYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVK
        +  +          ++  +  K  ++    S+ ++ + K+R   V PAR+KARLVA  F+Q+EGIDY    SPVVK  +I++L ++V   NLEL+QLDVK
Subjt:  LTEEPTVTEQEQVQNLSEEHAKIIEQQPDLSQYSLARDKQRRVIVPPARYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVK

Query:  TAFLHG---------------------------YLEETIYMVQPKGFESSIGKLMYLMISTRPDLSYSASLVNRYMANSGRRHWEATKWIIRYL
        TAFLH                             L++ +  ++   + S +G LMY M+ TRPDL+++ S+V+ +MAN G+ HW+  KWI+RYL
Subjt:  TAFLHG---------------------------YLEETIYMVQPKGFESSIGKLMYLMISTRPDLSYSASLVNRYMANSGRRHWEATKWIIRYL

KAG8472304.1 hypothetical protein CXB51_034358 [Gossypium anomalum]3.4e-3729.85Show/hide
Query:  RKSGITRHKALTYTSQQNGVAERLNRTIMETT------------------------LNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGCVG----TEGKS
        +  GI RH  + +T QQNGVAER+NRTIME                          +NRSP  ++   TP+E WS +P + +DL +FGC        GK 
Subjt:  RKSGITRHKALTYTSQQNGVAERLNRTIMETT------------------------LNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGCVG----TEGKS

Query:  DDTE------SYTTQIE------VENTRNNVQLTEEPTVTEQEQVQNLS------EEHAKIIEQQ------PDLS-----------QYSLARDKQRRVIV
        +          Y   ++       EN +  V ++ +    E   + NLS      +E+ K +E Q      P +S           QYS+A+++ +R I 
Subjt:  DDTE------SYTTQIE------VENTRNNVQLTEEPTVTEQEQVQNLS------EEHAKIIEQQ------PDLS-----------QYSLARDKQRRVIV

Query:  PPARY------------------------------------------------------------KARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLS
        PP +Y                                                            KARLVA  ++Q  G+D++  FSPVVK +SIR LL 
Subjt:  PPARY------------------------------------------------------------KARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLS

Query:  LVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGF-----------------------------------------------------------------
        +VA  +LEL+QLDVKTAFLHG LEE IYM QP+GF                                                                 
Subjt:  LVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGF-----------------------------------------------------------------

Query:  ---ESSIGKLMYLMISTRPDLSYSASLVNRYMANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSDIS
            S++G LMY M+ +RPDLSY+ S V+RYMAN G+ HW+A +WI+RYL    +  L + R   TE  +IGYVD+D +
Subjt:  ---ESSIGKLMYLMISTRPDLSYSASLVNRYMANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSDIS

KAG8483260.1 hypothetical protein CXB51_022251 [Gossypium anomalum]5.3e-3828.98Show/hide
Query:  RKSGITRHKALTYTSQQNGVAERLNRTIMET--------TLNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGCVGTEGKSDDTESYTTQIEVENTRNNVQ
        +  GI RH  + +T QQNGVAER+NRTIME          +NRSP  ++   TP+E WS +P + +DL +FGC      +D     T  +   + +++  
Subjt:  RKSGITRHKALTYTSQQNGVAERLNRTIMET--------TLNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGCVGTEGKSDDTESYTTQIEVENTRNNVQ

Query:  LTEEPTVTEQEQVQNLSEEHAKIIEQQPDLSQYSLARDKQRRVIVPP------------------------------ARYKARLVANSFTQREGIDYSGN
           +  V  Q   ++  +   KI  +     QYS+A+++ +R I PP                               +YKARLVA  ++Q  G+D++  
Subjt:  LTEEPTVTEQEQVQNLSEEHAKIIEQQPDLSQYSLARDKQRRVIVPP------------------------------ARYKARLVANSFTQREGIDYSGN

Query:  FSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGF--------------------------------------------------
        FSPVVK +SIR LL +VA ++LEL+QLDVKTAFLHG LEE IYM QP+GF                                                  
Subjt:  FSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGF--------------------------------------------------

Query:  -----------------------------------------------------------------------------------ESSIGKLMYLMISTRPD
                                                                                            S++G LMY M+ +RPD
Subjt:  -----------------------------------------------------------------------------------ESSIGKLMYLMISTRPD

Query:  LSYSASLVNRYMANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSDIS
        LSY+ S V+RYMAN G+ HW+A +WI+RYL    +  L + R   TE  +IGYVD+D +
Subjt:  LSYSASLVNRYMANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSDIS

KAG8490786.1 hypothetical protein CXB51_014006 [Gossypium anomalum]4.5e-3730.28Show/hide
Query:  RKSGITRHKALTYTSQQNGVAERLNRTIMETT------------------------LNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGCVG----TEGKS
        +  GI RH  + +T QQNGVAER+NRTIME                          +NRSP  ++   TP+E WS +P + +DL +FGC        GK 
Subjt:  RKSGITRHKALTYTSQQNGVAERLNRTIMETT------------------------LNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGCVG----TEGKS

Query:  DDTE------SYTTQIE------VENTRNNVQLTEEPTVTEQEQVQNLS------EEHAKIIEQQ------PDLS-----------QYSLARDKQRRVI-
        +          Y   ++       EN +  V ++ +    E   + NLS      +E+ K +E Q      P +S           QYS+A++K +  + 
Subjt:  DDTE------SYTTQIE------VENTRNNVQLTEEPTVTEQEQVQNLS------EEHAKIIEQQ------PDLS-----------QYSLARDKQRRVI-

Query:  ---------------------------------------------------------------------VPPARYKARLVANSFTQREGIDYSGNFSPVV
                                                                             V   +YKARLVA  ++Q  G+D++  FSPVV
Subjt:  ---------------------------------------------------------------------VPPARYKARLVANSFTQREGIDYSGNFSPVV

Query:  KQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPK--------------------------------------GFESSIGKLMYLMISTRPD
        K +SIR LL +VA ++LEL+QLDVKTAFLHG LEE IYM QP+                                       + S++G LMY M+ +RPD
Subjt:  KQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPK--------------------------------------GFESSIGKLMYLMISTRPD

Query:  LSYSASLVNRYMANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSDIS
        LSY+ S V+RYMAN G+ HW+A +WI+RYL    +  L + R   TE  +IGYVD+D +
Subjt:  LSYSASLVNRYMANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSDIS

KAG8501848.1 hypothetical protein CXB51_004653 [Gossypium anomalum]1.8e-3830.47Show/hide
Query:  RKSGITRHKALTYTSQQNGVAERLNRTIMETT------------------------LNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGCVG----TEGKS
        +  GI RH  + +T QQNGVAER+NRTIME                          +NRSP  ++   TP+E WS +P + +DL +FGC      + GK 
Subjt:  RKSGITRHKALTYTSQQNGVAERLNRTIMETT------------------------LNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGCVG----TEGKS

Query:  DDTE------SYTTQIE------VENTRNNVQLTEEPTVTEQEQVQNLS------EEHAKIIEQQ------PDLS-----------QYSLARDKQRRVIV
        +          Y   ++       EN +  V ++ +    E   + NLS      +E+ K +E Q      P +S           QYS+A+++ +R I 
Subjt:  DDTE------SYTTQIE------VENTRNNVQLTEEPTVTEQEQVQNLS------EEHAKIIEQQ------PDLS-----------QYSLARDKQRRVIV

Query:  PP--------------------------------------------------------------ARYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLL
        PP                                                               +YKARLVA  ++Q  G+D++  FSPVVK +SIR L
Subjt:  PP--------------------------------------------------------------ARYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLL

Query:  LSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGF-----------------------------------------------------ESSIGKLMYL
        L +VA ++LEL+QLDVKTAFLHG LEE IYM QP+GF                                                      S++G LMY 
Subjt:  LSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGF-----------------------------------------------------ESSIGKLMYL

Query:  MISTRPDLSYSASLVNRYMANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSDIS
        M+ +RPDLSY+ S V+RYMAN  + HW+A +WI+RYL    +  L + R   TE  +IGYVD+D +
Subjt:  MISTRPDLSYSASLVNRYMANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSDIS

TrEMBL top hitse value%identityAlignment
A0A2N9FXF2 Integrase catalytic domain-containing protein8.0e-3231.25Show/hide
Query:  KSGITRHKALTYTSQQNGVAERLNRTIMETT------------------------LNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGC------------
        K GI     +  T QQNGV ER NRT+M+                          LNR P  ++   TP E W+K  PSL  L V+GC            
Subjt:  KSGITRHKALTYTSQQNGVAERLNRTIMETT------------------------LNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGC------------

Query:  -----------VGTEGKSDDTESYTTQ-----IEVENTR--------NNVQLTEEPTVTEQEQVQNLSEEHAKIIEQQP--DLSQ-------------YS
                   +G   KS     Y        +E +N R        + +  ++     +  +  N  ++  K ++Q    DL +             Y 
Subjt:  -----------VGTEGKSDDTESYTTQ-----IEVENTR--------NNVQLTEEPTVTEQEQVQNLSEEHAKIIEQQP--DLSQ-------------YS

Query:  LARDKQRRVIVPPARYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKG---------------
           D +  +     R+KARLVA  FTQ+ GIDY   FSPV K+ S+R++++LVA  +LEL Q+DVKTAFL+G LEE +YM Q +G               
Subjt:  LARDKQRRVIVPPARYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKG---------------

Query:  -------FESSIGKLMYLMISTRPDLSYSASLVNRYMANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSDIS
               + S++G LMY    TRPD+S++ S++ RY  N G  HW+A K ++RYL   K+  L ++R+    LE+IGY D D +
Subjt:  -------FESSIGKLMYLMISTRPDLSYSASLVNRYMANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSDIS

A0A2N9HJ44 Integrase catalytic domain-containing protein6.3e-3733.42Show/hide
Query:  GITRHKALTYTSQQNGVAERLNRTIMETT------------------------LNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGC------VGTEGKSD
        GI RH  +  T QQNGVAERLNRTI ET                         +NRSP  +L     EE W+      + + +FGC       G +    
Subjt:  GITRHKALTYTSQQNGVAERLNRTIMETT------------------------LNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGC------VGTEGKSD

Query:  DTES-------------------------YTTQIEVENTRNNVQLTEEPTVTEQEQVQNLSE---------------EHAKIIEQQPDLSQYSLARDKQR
        D +S                           ++  VE      Q  EEP   +QEQ    S+               E  K +E   + ++ SL+++K  
Subjt:  DTES-------------------------YTTQIEVENTRNNVQLTEEPTVTEQEQVQNLSE---------------EHAKIIEQQPDLSQYSLARDKQR

Query:  RVIVPP-----------------------ARYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPK
         +   P                        R+K RLVA  + QR  IDY   FSPVV+ TSIR +L+LV   +LEL+QLDVKTAFLHG LEE I+MVQP+
Subjt:  RVIVPP-----------------------ARYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPK

Query:  GFE-------------------------SSIGKLMYLMISTRPDLSYSASLVNRYMANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDS
        GF+                         S++G LMY M+ TRPDL+++ S VNRYMAN GR HW A KWI RYL       + + R   T   ++GY+D+
Subjt:  GFE-------------------------SSIGKLMYLMISTRPDLSYSASLVNRYMANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDS

Query:  D
        D
Subjt:  D

A0A438GEX0 Retrovirus-related Pol polyprotein from transposon TNT 1-946.6e-3452.7Show/hide
Query:  RYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGFESSIGKLMYLMISTRPDLSYSASLVNRY
        RYKARLVA  FTQ+EGIDY+  FSPV K+ S+R++L+LVA  +LEL Q DVKTAFL+G LEE +YM QP+GF SS G+    ++ TRPD++++  ++ RY
Subjt:  RYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGFESSIGKLMYLMISTRPDLSYSASLVNRY

Query:  MANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSDIS
         +N G  HW+A K ++RYL   K+ KL Y+R   + LE++GY DSD +
Subjt:  MANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSDIS

A0A5A7UJ23 Integrase catalytic domain-containing protein3.7e-3730.54Show/hide
Query:  GPTSTPSLSGSRKSGITRHKALTYTSQQNGVAERLNRTIMET------------------------TLNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGC
        GP  T S  GS      RH+ + YT QQNGVAER+NRT+ME                         T+ RS   S+ + TPEE+W+   P L++L  FGC
Subjt:  GPTSTPSLSGSRKSGITRHKALTYTSQQNGVAERLNRTIMET------------------------TLNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGC

Query:  V----------------------GTEGKSDDTESYTTQI--EVENTRNNVQLTEEPTVTEQEQVQNLSEEHAKIIEQQPDLSQYSLARDKQRRVIVPPAR
                                TE   D+ +SY   +  +  N        E  ++ + +  + + + H K I     + +  +  D   +V     +
Subjt:  V----------------------GTEGKSDDTESYTTQI--EVENTRNNVQLTEEPTVTEQEQVQNLSEEHAKIIEQQPDLSQYSLARDKQRRVIVPPAR

Query:  YKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGFE----------------------------
         KARLVA  F Q+EG+DY+  FSPVVK T IR+LL++VA  NL+L+QLDV TAFLHG LEE I+M QPKGFE                            
Subjt:  YKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGFE----------------------------

Query:  -------------------------------------------------------------------SSIGKLMYLMISTRPDLSYSASLVNRYMANSGR
                                                                            + G LMYLM+ TR DL++S+SLV+RYM N  +
Subjt:  -------------------------------------------------------------------SSIGKLMYLMISTRPDLSYSASLVNRYMANSGR

Query:  RHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSDIS
         HWEATKW+ RYL   +N  L Y+   +++L + G+VD+D +
Subjt:  RHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSDIS

A0A6A3AHM7 Uncharacterized protein7.3e-3333.23Show/hide
Query:  RKSGITRHKALTYTSQQNGVAERLNRTIMETTLNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGCVGTEGKSDDTESYTTQIE------VENTRNNVQLT
        +  GI RH  +  T QQNGVAER+NRT++E    R   ++ GL   E+ W++   ++N     G +    K      Y   ++      +        ++
Subjt:  RKSGITRHKALTYTSQQNGVAERLNRTIMETTLNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGCVGTEGKSDDTESYTTQIE------VENTRNNVQLT

Query:  EEPTVTEQEQVQNLSEEHAKIIEQQPDLSQYSLARDKQRRVI----------VPPARYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNL
         +    E   +++ +       E    +  + + +  + RV           V   R+KA LV   F+Q+EGIDY+  FSPVVK +SI +LL++VA+ +L
Subjt:  EEPTVTEQEQVQNLSEEHAKIIEQQPDLSQYSLARDKQRRVI----------VPPARYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNL

Query:  ELDQLDVKTAFLHGYLEETIYMVQPKGF-----------------------------------ESSIGKLMYLMISTRPDLSYSASLVNRYMANSGRRHW
        EL+QLDVKTAFLHG LEETIYM QP+GF                                   + ++G +MY M+ TRP++S++ S+VNRYM   G+ HW
Subjt:  ELDQLDVKTAFLHGYLEETIYMVQPKGF-----------------------------------ESSIGKLMYLMISTRPDLSYSASLVNRYMANSGRRHW

Query:  EATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSD
        +A KWI+RYL    +  L Y + +     + GYVDSD
Subjt:  EATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSD

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.3e-1436.09Show/hide
Query:  EEKWSKHPPSLNDLTVFGCVGTEGKS----DDTESYTTQIEVENTRNNVQLTEEPTVTEQEQVQNLSEEHAKIIEQQPDLSQYSLARDKQRRVIVPPARY
        EE  S +   LN  T+F  V          DD  S+   I  E   + +  T   T+T++ + +N+ +       +  +L                P RY
Subjt:  EEKWSKHPPSLNDLTVFGCVGTEGKS----DDTESYTTQIEVENTRNNVQLTEEPTVTEQEQVQNLSEEHAKIIEQQPDLSQYSLARDKQRRVIVPPARY

Query:  KARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKG
        KARLVA  FTQ+  IDY   F+PV + +S R +LSLV Q NL++ Q+DVKTAFL+G L+E IYM  P+G
Subjt:  KARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKG

P04146 Copia protein2.0e-0839.73Show/hide
Query:  SSIGKLMYLMISTRPDLSYSASLVNRYMANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSD
        S IG LMY+M+ TRPDL+ + ++++RY + +    W+  K ++RYL    + KL +++    E ++IGYVDSD
Subjt:  SSIGKLMYLMISTRPDLSYSASLVNRYMANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSD

P0CV72 Secreted RxLR effector protein 1616.6e-0735.62Show/hide
Query:  SSIGKLMYLMISTRPDLSYSASLVNRYMANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSD
        S++G +MYLM+ TRPDL+ +  +++++ ++    HW+A K ++RYL   +   L + RA     +L+GY D+D
Subjt:  SSIGKLMYLMISTRPDLSYSASLVNRYMANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.8e-2068.49Show/hide
Query:  RYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGFE
        RYKARLV   F Q++GID+   FSPVVK TSIR +LSL A  +LE++QLDVKTAFLHG LEE IYM QP+GFE
Subjt:  RYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGFE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-1033.33Show/hide
Query:  VVKQTSIRLLLSLVAQNNLELDQLDVKTAF-----LHGYLEETIYM----VQPKG------FESSIGKLMYLMISTRPDLSYSASLVNRYMANSGRRHWE
        V ++TS +L LS        L++ ++K A      L G+L+ +  M    V+ KG      + S++G LMY M+ TRPD++++  +V+R++ N G+ HWE
Subjt:  VVKQTSIRLLLSLVAQNNLELDQLDVKTAF-----LHGYLEETIYM----VQPKG------FESSIGKLMYLMISTRPDLSYSASLVNRYMANSGRRHWE

Query:  ATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSDIS
        A KWI+RYL       L +  +      L GY D+D++
Subjt:  ATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSDIS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.5e-1452.78Show/hide
Query:  RYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGF
        RYKARLVA  + QR G+DY+  FSPV+K TSIR++L +    +  + QLDV  AFL G L + +YM QP GF
Subjt:  RYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.5e-1452.78Show/hide
Query:  RYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGF
        RYKARLVA  + QR G+DY+  FSPV+K TSIR++L +    +  + QLDV  AFL G L + +YM QP GF
Subjt:  RYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGF

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.0e-1653.25Show/hide
Query:  RYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGFESSIG
        RYKARLVA  +TQ+EGID+   FSPV K TS++L+L++ A  N  L QLD+  AFL+G L+E IYM  P G+ +  G
Subjt:  RYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYMVQPKGFESSIG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCGGGGTCCAACATCCACTCCAAGCCTAAGTGGCTCAAGAAAAAGTGGAATTACAAGACACAAGGCTCTAACCTACACATCTCAACAAAATGGAGTTGCAGAAAG
ACTAAACAGAACAATAATGGAAACGACACTGAACAGAAGCCCTCATACCTCCTTAGGACTACTAACTCCTGAGGAGAAATGGTCCAAACATCCACCAAGTCTAAATGACC
TTACGGTGTTTGGATGTGTAGGCACTGAAGGGAAATCAGATGATACAGAATCCTATACTACTCAAATTGAGGTGGAGAATACAAGGAACAATGTCCAACTTACTGAGGAA
CCTACAGTGACTGAACAAGAACAAGTGCAGAACCTAAGTGAAGAACATGCTAAAATAATTGAACAACAACCTGACTTGAGCCAATATTCCCTAGCAAGAGATAAACAAAG
AAGGGTAATTGTTCCTCCAGCTAGATACAAGGCAAGACTGGTAGCAAATAGTTTCACTCAAAGAGAGGGTATTGACTATTCTGGAAATTTTTCCCCTGTTGTTAAACAAA
CTTCCATTAGACTTCTCCTATCCCTAGTTGCTCAAAACAATCTAGAATTAGATCAACTTGATGTAAAAACTGCTTTTCTCCATGGCTATCTAGAAGAAACAATTTATATG
GTTCAACCTAAGGGTTTTGAGTCAAGCATTGGGAAGTTGATGTACCTAATGATCTCAACCAGACCTGATTTATCCTATTCAGCTAGCCTAGTCAACAGGTATATGGCTAA
TTCTGGAAGAAGACACTGGGAAGCTACTAAGTGGATAATCAGATACCTAAATTGGTTAAAGAATGCTAAATTGAACTACCAAAGGGCCACCGAGACAGAGCTAGAACTAA
TAGGTTATGTGGATTCAGATATAAGCATAATACATAGCATTATCAAAAGCTGTAAAATAGGGACTATGGCTCAAAGGACTGATGAAGGACTTTGGAATCAAACAGTTGAT
TTTACAAATCTTCTGGACAAATTCTTACCTCCATACAAGTGTAAGGAATTTCTGGTCGAGTTTGATTTTGAGTTCGAACATAATAAAAGAGCGAGCAACCGAGCAGCTGA
TGTCTTGTGGCTTCTGGAAAATGTCAAATGGGGACAAAGGGAGGGGGAGAGGGGGAGAACTATTTACCTTCTTGGTGACAAGTCTGAATATATCCTCAAGCTACCATCCT
TAGAGACGAATAGACCGAGTGATTCAACTGAAATGGAAGCAGACAACAAACATCGTTCGGACCTTTCTAGAGAAGCCTCGAATCGGATGAAGAAAGTGGCGTTGCCCACA
TGGATGAAAATTTATCTAGTAATTCATGTGAGTAACTTGAAGCTCTGCCATCAAGATCTTGATGACATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCCGGGGTCCAACATCCACTCCAAGCCTAAGTGGCTCAAGAAAAAGTGGAATTACAAGACACAAGGCTCTAACCTACACATCTCAACAAAATGGAGTTGCAGAAAG
ACTAAACAGAACAATAATGGAAACGACACTGAACAGAAGCCCTCATACCTCCTTAGGACTACTAACTCCTGAGGAGAAATGGTCCAAACATCCACCAAGTCTAAATGACC
TTACGGTGTTTGGATGTGTAGGCACTGAAGGGAAATCAGATGATACAGAATCCTATACTACTCAAATTGAGGTGGAGAATACAAGGAACAATGTCCAACTTACTGAGGAA
CCTACAGTGACTGAACAAGAACAAGTGCAGAACCTAAGTGAAGAACATGCTAAAATAATTGAACAACAACCTGACTTGAGCCAATATTCCCTAGCAAGAGATAAACAAAG
AAGGGTAATTGTTCCTCCAGCTAGATACAAGGCAAGACTGGTAGCAAATAGTTTCACTCAAAGAGAGGGTATTGACTATTCTGGAAATTTTTCCCCTGTTGTTAAACAAA
CTTCCATTAGACTTCTCCTATCCCTAGTTGCTCAAAACAATCTAGAATTAGATCAACTTGATGTAAAAACTGCTTTTCTCCATGGCTATCTAGAAGAAACAATTTATATG
GTTCAACCTAAGGGTTTTGAGTCAAGCATTGGGAAGTTGATGTACCTAATGATCTCAACCAGACCTGATTTATCCTATTCAGCTAGCCTAGTCAACAGGTATATGGCTAA
TTCTGGAAGAAGACACTGGGAAGCTACTAAGTGGATAATCAGATACCTAAATTGGTTAAAGAATGCTAAATTGAACTACCAAAGGGCCACCGAGACAGAGCTAGAACTAA
TAGGTTATGTGGATTCAGATATAAGCATAATACATAGCATTATCAAAAGCTGTAAAATAGGGACTATGGCTCAAAGGACTGATGAAGGACTTTGGAATCAAACAGTTGAT
TTTACAAATCTTCTGGACAAATTCTTACCTCCATACAAGTGTAAGGAATTTCTGGTCGAGTTTGATTTTGAGTTCGAACATAATAAAAGAGCGAGCAACCGAGCAGCTGA
TGTCTTGTGGCTTCTGGAAAATGTCAAATGGGGACAAAGGGAGGGGGAGAGGGGGAGAACTATTTACCTTCTTGGTGACAAGTCTGAATATATCCTCAAGCTACCATCCT
TAGAGACGAATAGACCGAGTGATTCAACTGAAATGGAAGCAGACAACAAACATCGTTCGGACCTTTCTAGAGAAGCCTCGAATCGGATGAAGAAAGTGGCGTTGCCCACA
TGGATGAAAATTTATCTAGTAATTCATGTGAGTAACTTGAAGCTCTGCCATCAAGATCTTGATGACATGTAG
Protein sequenceShow/hide protein sequence
MGRGPTSTPSLSGSRKSGITRHKALTYTSQQNGVAERLNRTIMETTLNRSPHTSLGLLTPEEKWSKHPPSLNDLTVFGCVGTEGKSDDTESYTTQIEVENTRNNVQLTEE
PTVTEQEQVQNLSEEHAKIIEQQPDLSQYSLARDKQRRVIVPPARYKARLVANSFTQREGIDYSGNFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTAFLHGYLEETIYM
VQPKGFESSIGKLMYLMISTRPDLSYSASLVNRYMANSGRRHWEATKWIIRYLNWLKNAKLNYQRATETELELIGYVDSDISIIHSIIKSCKIGTMAQRTDEGLWNQTVD
FTNLLDKFLPPYKCKEFLVEFDFEFEHNKRASNRAADVLWLLENVKWGQREGERGRTIYLLGDKSEYILKLPSLETNRPSDSTEMEADNKHRSDLSREASNRMKKVALPT
WMKIYLVIHVSNLKLCHQDLDDM