CuGenDBv2

CSPI06G16070 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI06G16070
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Reverse transcriptase Ty1/copia-type domain-containing protein
Genome location: Chr6:14500694..14501563
RNA-Seq Expression: CSPI06G16070
Synteny: CSPI06G16070
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR043502 - DNA/RNA polymerase superfamily
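
As a quick sanity check on the genome location listed above, the following minimal sketch parses the location string and reports the span length. It assumes the `Chr:start..end` coordinates are 1-based and inclusive, which this page does not state explicitly; `parse_location` is a hypothetical helper name, not part of any CuGenDB tooling.

```python
import re

def parse_location(location):
    """Split a 'Chr6:14500694..14501563' style string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Chr6:14500694..14501563")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
length = end - start + 1
print(chrom, start, end, length)
```

Under that assumption the locus spans 870 bp, which is consistent with a 289-residue protein plus one stop codon (289 × 3 + 3 = 870).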


Homology
GenBank top hits (e-value, % identity, alignment)
CAN71189.1 hypothetical protein VITISV_005044 [Vitis vinifera] | e-value: 4.0e-116 | identity: 73.05%
Query:  MMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKE
        MMGEL+ FLGLQIKQLK+G FI+Q KY +DLLK+F + E KV KTPMS++ KLD DEKGK +D   YRGMIGSLLYLTASRPDIM+SVCLCARFQSCPKE
Subjt:  MMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKE

Query:  SHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLCDF
        SH   VKRIL+YL GT+++GLWYP+   F L+G+SDADFAG  ++RKSTSGTC FLG SLVSW SKKQNSVALST EAEYIA   CCAQILWMKQTL DF
Subjt:  SHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLCDF

Query:  GLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGII
         L F++VPI CDNTSAIN++KNP+ HSRTKHI+IRHHF+R+H Q G ITLEFVS+ +QLADIFTKPLSEE F   R +LG+I
Subjt:  GLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGII

KAF7119794.1 hypothetical protein RHSIM_Rhsim13G0158900 [Rhododendron simsii] | e-value: 3.8e-119 | identity: 75.62%
Query:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCP
        MSMMGEL++FLGLQIKQ  DGIFI+Q KY +DLLK+F + + K   TPMST+TKLDKDE GK  D K YRGMIGSLLYLTASRPDIMFSVCLCARFQS P
Subjt:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCP

Query:  KESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLC
        KESH  AVKRI KYL+GT ++GLWYPR    +L+GYSDADFAG  +DRKSTSGTCQFLG SLVSWFSKKQ+SVALST EAEY+AV SCCAQILWMKQ L 
Subjt:  KESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLC

Query:  DFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGI
        DFGL FD++PI CDNTSAINLTKNPI HSRTKHIDIRHHFIR+HVQ G + ++FV ++NQLADIFTKPLSE+ FC  R E+G+
Subjt:  DFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGI

KAG8633904.1 hypothetical protein MANES_18G147001v8 [Manihot esculenta] | e-value: 3.5e-120 | identity: 75%
Query:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCP
        MSMMGEL FFLGLQIKQ KDGIFI+Q KYT++L+K+F +   K ++TPMST TKLDKDEKGK +D K YRGMIGSLLYLTASRPDIMFSVCLCARFQSCP
Subjt:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCP

Query:  KESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLC
        KESH HAVKRIL+YL GT+ +GLWYPR+  F+L  YSDADFAGS+LDRKSTSGTCQ LG SLVSW SKKQNSVALST EAEY+A   CC+QILW+KQ L 
Subjt:  KESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLC

Query:  DFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGIIRWDA
        DF +  D++PI CDNTSAINLTKNPI HSRTKHIDIRHHFIR+HV NG + LEFV +NNQLADIFTKPL+EE F   + ELG++  DA
Subjt:  DFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGIIRWDA

RVW71911.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e-value: 3.2e-118 | identity: 73.59%
Query:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCP
        MSMMGEL++FLGLQIKQLK+G FI+Q KY +DLLK+F + E KV KTPMS++ KLD DEKGK +D   YRGMIGSLLYLTASRPDIM+SVCLCARFQSCP
Subjt:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCP

Query:  KESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLC
        KESH  AVKRIL+YL GT+++GLWYP+   F L+G+SDADFAG  ++RKSTSGTC FLG SLVSW SKKQNSVALST EAEYIA   CCAQILWMKQTL 
Subjt:  KESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLC

Query:  DFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGII
        DF L F++VPI CDNTSAIN++KNP+ HSRTKHI+IRHHF+R+H Q G ITLEFVS+ +QLADIFTKPLSEE F   R +LG+I
Subjt:  DFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGII

XP_020113283.1 uncharacterized protein LOC109727552, partial [Ananas comosus] | e-value: 6.1e-117 | identity: 73.94%
Query:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCP
        MSMMGELSFFLGLQ+KQ+KDGIFI+Q KY +DL+K+F L   K   TPMS +TKLDKDEKGK VDIK YRGMIGSLLYLTASRPDIMFSVCLCA+FQ+ P
Subjt:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCP

Query:  KESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLC
        KESH  AVKRIL+Y+ GT+++GLWYP    FNL+G+SDADFAG  +DRKSTSGTCQFLG SLVSW SKKQNSVALST EAEYIA   CCAQ+LWMKQTL 
Subjt:  KESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLC

Query:  DFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGII
        DFG+    VPI CDNTSAIN++KNPI HSRTKHI+IRHHFIR+H+Q   +T+EFV + NQLADIFTKPLSE+ F + R ELG+I
Subjt:  DFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGII

TrEMBL top hits (e-value, % identity, alignment)
A0A151SMP1 Copia protein | e-value: 4.3e-116 | identity: 72.54%
Query:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCP
        MSMMGEL+FFLGLQI+Q K+GIFI+Q KY ++LLK+F +   K   TPMSTT  LDKDE GK +D+K YRGMIGSLLYL+ASRPDIMFSVCLCAR+QS P
Subjt:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCP

Query:  KESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLC
        KESH  AVKRI++YLLGT ++GLWYP+N+ FNLVGYSD+DFAG   DRKSTSGTC F+GS+LVSW SKKQNSVALST EAEYIA  SCCAQILWMKQ L 
Subjt:  KESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLC

Query:  DFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGII
        D+GL  D++PI CDNTSAINL+KNP+ HSRTKHI+IRHHF+R+HVQ G   LEFV + NQLADIFTKPL +E+F   R ELGI+
Subjt:  DFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGII

A0A438GI90 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 1.6e-118 | identity: 73.59%
Query:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCP
        MSMMGEL++FLGLQIKQLK+G FI+Q KY +DLLK+F + E KV KTPMS++ KLD DEKGK +D   YRGMIGSLLYLTASRPDIM+SVCLCARFQSCP
Subjt:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCP

Query:  KESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLC
        KESH  AVKRIL+YL GT+++GLWYP+   F L+G+SDADFAG  ++RKSTSGTC FLG SLVSW SKKQNSVALST EAEYIA   CCAQILWMKQTL 
Subjt:  KESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLC

Query:  DFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGII
        DF L F++VPI CDNTSAIN++KNP+ HSRTKHI+IRHHF+R+H Q G ITLEFVS+ +QLADIFTKPLSEE F   R +LG+I
Subjt:  DFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGII

A0A6P5HB69 uncharacterized protein LOC109727552 | e-value: 3.0e-117 | identity: 73.94%
Query:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCP
        MSMMGELSFFLGLQ+KQ+KDGIFI+Q KY +DL+K+F L   K   TPMS +TKLDKDEKGK VDIK YRGMIGSLLYLTASRPDIMFSVCLCA+FQ+ P
Subjt:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCP

Query:  KESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLC
        KESH  AVKRIL+Y+ GT+++GLWYP    FNL+G+SDADFAG  +DRKSTSGTCQFLG SLVSW SKKQNSVALST EAEYIA   CCAQ+LWMKQTL 
Subjt:  KESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLC

Query:  DFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGII
        DFG+    VPI CDNTSAIN++KNPI HSRTKHI+IRHHFIR+H+Q   +T+EFV + NQLADIFTKPLSE+ F + R ELG+I
Subjt:  DFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGII

A5AEG4 Reverse transcriptase Ty1/copia-type domain-containing protein | e-value: 1.9e-116 | identity: 73.05%
Query:  MMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKE
        MMGEL+ FLGLQIKQLK+G FI+Q KY +DLLK+F + E KV KTPMS++ KLD DEKGK +D   YRGMIGSLLYLTASRPDIM+SVCLCARFQSCPKE
Subjt:  MMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKE

Query:  SHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLCDF
        SH   VKRIL+YL GT+++GLWYP+   F L+G+SDADFAG  ++RKSTSGTC FLG SLVSW SKKQNSVALST EAEYIA   CCAQILWMKQTL DF
Subjt:  SHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLCDF

Query:  GLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGII
         L F++VPI CDNTSAIN++KNP+ HSRTKHI+IRHHF+R+H Q G ITLEFVS+ +QLADIFTKPLSEE F   R +LG+I
Subjt:  GLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGII

A5BLV7 Reverse transcriptase Ty1/copia-type domain-containing protein | e-value: 3.3e-116 | identity: 73.24%
Query:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCP
        MSMMGEL+FFLGLQIKQLK+G FI+Q KY RDLLK+F + E K  KTPMS++ KLD DEK K V+   YRGMIGSLLYLT SRPDIM+SVCLCARFQSCP
Subjt:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCP

Query:  KESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLC
        K+SH  AVKRIL+YL GT+D+GLWYP+   F L+GYSDADF G  ++RKSTS TC FLG SLVSW+SKKQNSVALST EAEYIAV  CCAQILWMKQTL 
Subjt:  KESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLC

Query:  DFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGII
        DF L F++VPI CDNTSAIN++KNP+ HSRTKHI+IRHHF+R+H Q G ITLEFVS+ +QLADIFTKPLSEE F   R +LG+I
Subjt:  DFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGII

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein | e-value: 8.5e-53 | identity: 38.98%
Query:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKT-YRGMIGSLLY-LTASRPDIMFSVCLCARFQS
        M+ + E+  F+G++I+  +D I++SQ  Y + +L KF +       TP+   +K++ +      D  T  R +IG L+Y +  +RPD+  +V + +R+ S
Subjt:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKT-YRGMIGSLLY-LTASRPDIMFSVCLCARFQS

Query:  CPKESHFHAVKRILKYLLGTIDVGLWYPRNVEF--NLVGYSDADFAGSLLDRKSTSG-TCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWM
              +  +KR+L+YL GTID+ L + +N+ F   ++GY D+D+AGS +DRKST+G   +    +L+ W +K+QNSVA S+TEAEY+A+     + LW+
Subjt:  CPKESHFHAVKRILKYLLGTIDVGLWYPRNVEF--NLVGYSDADFAGSLLDRKSTSG-TCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWM

Query:  KQTLCDFGLKFDN-VPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGIIRWDAS
        K  L    +K +N + I+ DN   I++  NP  H R KHIDI++HF RE VQN  I LE++ + NQLADIFTKPL    F + R +LG+++ D S
Subjt:  KQTLCDFGLKFDN-VPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGIIRWDAS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 1.9e-44 | identity: 37.85%
Query:  MSMMGELSFFLGLQI--KQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDK-------DEKGKCVDIKTYRGMIGSLLY-LTASRPDIMFSV
        M  +G     LG++I  ++    +++SQEKY   +L++F +   K   TP++   KL K       +EKG    +  Y   +GSL+Y +  +RPDI  +V
Subjt:  MSMMGELSFFLGLQI--KQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDK-------DEKGKCVDIKTYRGMIGSLLY-LTASRPDIMFSV

Query:  CLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCA
         + +RF   P + H+ AVK IL+YL GT    L +  +    L GY+DAD AG + +RKS++G         +SW SK Q  VALSTTEAEYIA      
Subjt:  CLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCA

Query:  QILWMKQTLCDFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTK--PLSEESFCK
        +++W+K+ L + GL      ++CD+ SAI+L+KN ++H+RTKHID+R+H+IRE V +  + +  +S+N   AD+ TK  P ++   CK
Subjt:  QILWMKQTLCDFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTK--PLSEESFCK

P92519 Uncharacterized mitochondrial protein AtMg00810 | e-value: 4.1e-31 | identity: 37.56%
Query:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEK---GKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQ
        M  +G + +FLG+QIK    G+F+SQ KY   +L     N G +   PMST   L  +      K  D   +R ++G+L YLT +RPDI ++V +  +  
Subjt:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEK---GKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQ

Query:  SCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILW
          P  + F  +KR+L+Y+ GTI  GL+  +N + N+  + D+D+AG    R+ST+G C FLG +++SW +K+Q +V+ S+TE EY A+A   A++ W
Subjt:  SCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e-value: 6.3e-56 | identity: 41.64%
Query:  ELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHF
        EL +FLG++ K++  G+ +SQ +Y  DLL +  +   K   TPM+ + KL      K  D   YRG++GSL YL  +RPDI ++V   ++F   P E H 
Subjt:  ELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHF

Query:  HAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLCDFGLK
         A+KRIL+YL GT + G++  +    +L  YSDAD+AG   D  ST+G   +LG   +SW SKKQ  V  S+TEAEY +VA+  +++ W+   L + G++
Subjt:  HAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLCDFGLK

Query:  FDNVP-IFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGIIR
            P I+CDN  A  L  NP+ HSR KHI I +HFIR  VQ+G + +  VS+++QLAD  TKPLS  +F     ++G+ R
Subjt:  FDNVP-IFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGIIR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e-value: 5.7e-57 | identity: 40.57%
Query:  ELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHF
        +L +FLG++ K++  G+ +SQ +YT DLL +  +   K   TPM+T+ KL      K  D   YRG++GSL YL  +RPD+ ++V   +++   P + H+
Subjt:  ELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHF

Query:  HAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLCDFGLK
        +A+KR+L+YL GT D G++  +    +L  YSDAD+AG   D  ST+G   +LG   +SW SKKQ  V  S+TEAEY +VA+  +++ W+   L + G++
Subjt:  HAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLCDFGLK

Query:  FDNVP-IFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGIIR
          + P I+CDN  A  L  NP+ HSR KHI + +HFIR  VQ+G + +  VS+++QLAD  TKPLS  +F     ++G+I+
Subjt:  FDNVP-IFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGIIR

Arabidopsis top hits (e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e-value: 5.1e-45 | identity: 39.41%
Query:  MGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKES
        +G L +FLGL+I +   GI I Q KY  DLL +  L   K +  PM  +        G  VD K YR +IG L+YL  +R DI F+V   ++F   P+ +
Subjt:  MGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKES

Query:  HFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLCDFG
        H  AV +IL Y+ GT+  GL+Y    E  L  +SDA F      R+ST+G C FLG+SL+SW SKKQ  V+ S+ EAEY A++    +++W+ Q   +  
Subjt:  HFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLCDFG

Query:  LKFDN-VPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREH-VQNGHITLEFVSSNNQLADIFTKPLS
        L       +FCDNT+AI++  N + H RTKHI+   H +RE  V    ++  F + + Q  D FT+ LS
Subjt:  LKFDN-VPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREH-VQNGHITLEFVSSNNQLADIFTKPLS

ATMG00810.1 DNA/RNA polymerases superfamily protein | e-value: 2.9e-32 | identity: 37.56%
Query:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEK---GKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQ
        M  +G + +FLG+QIK    G+F+SQ KY   +L     N G +   PMST   L  +      K  D   +R ++G+L YLT +RPDI ++V +  +  
Subjt:  MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEK---GKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQ

Query:  SCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILW
          P  + F  +KR+L+Y+ GTI  GL+  +N + N+  + D+D+AG    R+ST+G C FLG +++SW +K+Q +V+ S+TE EY A+A   A++ W
Subjt:  SCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILW
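
The % identity values reported for the hits above can be approximated from the unlabeled middle row of each alignment block. The sketch below is an illustration under stated assumptions, not the database's own calculation: it assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), and `percent_identity` is a hypothetical helper. Because the Subjt rows on this page appear to repeat the query sequence, the sketch reads identities from the midline row rather than comparing Query against Subjt.

```python
def percent_identity(midline, aligned_length=None):
    """Estimate % identity from a BLAST-style midline string.

    aligned_length defaults to len(midline); pass it explicitly if trailing
    spaces were stripped from the midline when the text was copied.
    """
    if aligned_length is None:
        aligned_length = len(midline)
    # Letters mark identical residues; '+' (similar) and ' ' (mismatch/gap)
    # are not counted as identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length

# Toy fragment in the style of the alignments above, not a full hit.
print(round(percent_identity("MMGEL+ FLGLQIKQLK+G"), 2))
```

To reproduce a reported value, the midline rows of all blocks belonging to one hit would need to be concatenated first, keeping their original widths.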


Sequences
CDS sequence
ATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTCTTCAAATCAAACAACTCAAGGATGGCATCTTCATAAGTCAAGAAAAATACACAAGGGATTTGCTCAAGAAATT
CAAATTAAATGAAGGTAAAGTTGCAAAAACTCCTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAAGGTAAGTGTGTGGATATAAAGACTTATCGAGGTATGATCG
GATCTTTACTTTATTTGACCGCTAGTAGACCCGATATCATGTTTAGTGTATGTCTTTGTGCTAGATTTCAATCTTGTCCTAAAGAATCACATTTCCATGCCGTTAAAAGG
ATACTTAAATATTTGCTTGGAACTATTGATGTTGGATTATGGTATCCTAGAAATGTTGAGTTTAATTTGGTAGGATATTCCGATGCGGACTTTGCCGGTAGTTTACTTGA
CCGTAAAAGTACTAGTGGGACTTGTCAATTTCTTGGTAGTTCCTTAGTATCTTGGTTTAGTAAAAAGCAAAATTCGGTTGCCTTATCCACTACCGAAGCGGAATATATTG
CGGTTGCTAGTTGTTGTGCACAAATTCTTTGGATGAAACAAACTCTTTGTGATTTTGGATTAAAATTTGATAATGTGCCTATATTTTGTGATAATACTAGTGCCATAAAT
TTGACTAAGAATCCTATTCATCATTCTAGAACTAAGCATATAGATATTAGGCATCACTTTATTAGAGAGCATGTACAAAATGGTCATATTACTCTTGAGTTTGTAAGCTC
CAATAATCAATTAGCGGATATATTTACCAAGCCTTTGAGTGAAGAAAGCTTTTGTAAAAATAGGCTTGAGCTTGGAATTATTCGTTGGGATGCATCTTGA
mRNA sequence
ATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTCTTCAAATCAAACAACTCAAGGATGGCATCTTCATAAGTCAAGAAAAATACACAAGGGATTTGCTCAAGAAATT
CAAATTAAATGAAGGTAAAGTTGCAAAAACTCCTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAAGGTAAGTGTGTGGATATAAAGACTTATCGAGGTATGATCG
GATCTTTACTTTATTTGACCGCTAGTAGACCCGATATCATGTTTAGTGTATGTCTTTGTGCTAGATTTCAATCTTGTCCTAAAGAATCACATTTCCATGCCGTTAAAAGG
ATACTTAAATATTTGCTTGGAACTATTGATGTTGGATTATGGTATCCTAGAAATGTTGAGTTTAATTTGGTAGGATATTCCGATGCGGACTTTGCCGGTAGTTTACTTGA
CCGTAAAAGTACTAGTGGGACTTGTCAATTTCTTGGTAGTTCCTTAGTATCTTGGTTTAGTAAAAAGCAAAATTCGGTTGCCTTATCCACTACCGAAGCGGAATATATTG
CGGTTGCTAGTTGTTGTGCACAAATTCTTTGGATGAAACAAACTCTTTGTGATTTTGGATTAAAATTTGATAATGTGCCTATATTTTGTGATAATACTAGTGCCATAAAT
TTGACTAAGAATCCTATTCATCATTCTAGAACTAAGCATATAGATATTAGGCATCACTTTATTAGAGAGCATGTACAAAATGGTCATATTACTCTTGAGTTTGTAAGCTC
CAATAATCAATTAGCGGATATATTTACCAAGCCTTTGAGTGAAGAAAGCTTTTGTAAAAATAGGCTTGAGCTTGGAATTATTCGTTGGGATGCATCTTGA
Protein sequence
MSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKR
ILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLCDFGLKFDNVPIFCDNTSAIN
LTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGIIRWDAS
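
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code (the page does not say which table was used) and uses only the first 30 nucleotides of the CDS for brevity; `translate` and `CODON_TABLE` are names introduced here for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[i:i + 3].upper()]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)

# First 30 nt of the CDS sequence listed above.
cds_start = "ATGAGTATGATGGGAGAACTTAGTTTCTTC"
print(translate(cds_start))  # MSMMGELSFF
```

Passing the full CDS (with line breaks removed) to `translate` should reproduce the protein sequence listed above if the standard-code assumption holds.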