CuGenDBv2

Sgr018078 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr018078
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE2
Genome location: tig00153092:1154301..1155281
RNA-Seq Expression: Sgr018078
Synteny: Sgr018078
Gene Ontology terms: GO:0009987 - cellular process (biological process)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase; IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAE8696580.1 putative disease resistance protein [Hibiscus syriacus] (e value: 1.7e-83; % identity: 58.18)
Query:  NTLEASHCPV-VKNSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVAR
        +T +++H PV V+N HPM TRAK GI KP+V   E L+ EP S++EA+K PHW KA QEE+DALI N T SLVS  +D+  + CKW+FK+KRN+DG+V+R
Subjt:  NTLEASHCPV-VKNSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVAR

Query:  YKARLVAKGFLQEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGSPSLVCRLKKALYGLKQAPRAWFDR
        YK+RLVAKGFLQ+  +DY E FSPVVK  T+RV+L+LAL   W LRQVDINN FL+G L EEV+M+QPPG + AG  SLVC+L KA+YGLKQAPRAWF+R
Subjt:  YKARLVAKGFLQEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGSPSLVCRLKKALYGLKQAPRAWFDR

Query:  LSTFLNSLSFVNSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEV
        L  +L S  F  S  N+SL VR  +    Y+LVYVD I++ GS  S +  ++  L+ +F+LKDLGSLN+F G+EV
Subjt:  LSTFLNSLSFVNSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEV

KAG8480384.1 hypothetical protein CXB51_024547 [Gossypium anomalum] (e value: 5.5e-82; % identity: 56.88)
Query:  VKNSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFL
        V N+HPM TR+KSGI+KPK+FT+   + EP S+ EAL+ P W  A Q EY AL+ N TW LV     +K +GCKW+F++KRN+DG+VARYK RLV KG+L
Subjt:  VKNSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFL

Query:  QEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAG--SPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLS
        QE  +D+ +TFSPVVK  TIRV+L+LA+ F WSLRQVDINN FL+G L EE++M QPPG    G     LVCRL+KALYGLKQAPRAWF +L  FL +  
Subjt:  QEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAG--SPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLS

Query:  FVNSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVSVTPAGGLFL
        FV S  + SL V   D    Y+L+YVDDI+I G++  AI+  V  L   FSLKDLG L+YF G+EV  T   GLFL
Subjt:  FVNSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVSVTPAGGLFL

KAG8491997.1 hypothetical protein CXB51_015327 [Gossypium anomalum] (e value: 8.5e-83; % identity: 57.25)
Query:  VKNSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFL
        V N+HPM TR+KSGI+KPK+FT+   + EP S+ EAL+ P W  A Q EY AL+ N TW LV     +K +GCKW+F++KRN+DG+VARYK RLV KG+L
Subjt:  VKNSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFL

Query:  QEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAG--SPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLS
        QE  +D+ +TFSPVVK  TIRV+L+LA+ F WSLRQVDINN FL+G L EE++M QPPG    G     LVCRL+KALYGLKQAPRAWF +L  FL +  
Subjt:  QEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAG--SPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLS

Query:  FVNSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVSVTPAGGLFL
        FV S  + SL V   D    Y+L+YVDDI+I G++  AID  V  L   FSLKDLG L+YF G+EV  T   GLFL
Subjt:  FVNSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVSVTPAGGLFL

PNX93770.1 histone deacetylase [Trifolium pratense] (e value: 6.5e-83; % identity: 56.99)
Query:  NSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQE
        N+HPM TR K+G  KPK FT     +EP SVK AL  P W+KAMQ EY AL+ N+TWSLVS    KK IGCKW+F++K N DGT+ +YKARLVAKGFLQ 
Subjt:  NSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQE

Query:  DDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGSPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSFVNS
           D+ ETFSPV+K  TIRV+L+LA+ + WS++Q+D+NN FL+G L EEV+MSQPPG   A   SLVC+L K+LYGLKQAPRAW++RL+  L  + FV S
Subjt:  DDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGSPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSFVNS

Query:  PPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVSVTPAGGLFL
          + SLLV H+ G   Y+L+YVDDI+I GS+   I  L+  L+ +FSLK LG ++YF G+EV   P+GGL L
Subjt:  PPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVSVTPAGGLFL

XP_012441791.1 PREDICTED: uncharacterized protein LOC105766761 [Gossypium raimondii] (e value: 9.4e-82; % identity: 58.27)
Query:  NSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQE
        N HPMTTR+K+GI+KPKV   E    EP +++EA   P W  A Q EYDALI+N TW LVS    +KVIGCKW+FK+K+N DGTV R KARLVAKG  Q 
Subjt:  NSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQE

Query:  DDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMI---VAGSPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSF
           D+ ETFSPVVK  TIRV+LS+A+   WSLRQVD+NN FL+G ++ EVFM QP G +   V G P LVCRLKKALYGL QAPRAWF++L TFL ++ F
Subjt:  DDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMI---VAGSPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSF

Query:  VNSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVSVTPAGGLFLVWK
        V+S  +ASL VR       Y+LVYVDDI+I GS  S+ID  V +L+D+FSLKD+GS++YF G+EV+ +  G L L  K
Subjt:  VNSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVSVTPAGGLFLVWK

TrEMBL top hits (e value, % identity, alignment)
A0A2K3MSH2 Histone deacetylase (e value: 3.1e-83; % identity: 56.99)
Query:  NSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQE
        N+HPM TR K+G  KPK FT     +EP SVK AL  P W+KAMQ EY AL+ N+TWSLVS    KK IGCKW+F++K N DGT+ +YKARLVAKGFLQ 
Subjt:  NSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQE

Query:  DDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGSPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSFVNS
           D+ ETFSPV+K  TIRV+L+LA+ + WS++Q+D+NN FL+G L EEV+MSQPPG   A   SLVC+L K+LYGLKQAPRAW++RL+  L  + FV S
Subjt:  DDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGSPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSFVNS

Query:  PPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVSVTPAGGLFL
          + SLLV H+ G   Y+L+YVDDI+I GS+   I  L+  L+ +FSLK LG ++YF G+EV   P+GGL L
Subjt:  PPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVSVTPAGGLFL

A0A2N9FGX9 Reverse transcriptase Ty1/copia-type domain-containing protein (e value: 7.8e-82; % identity: 52.68)
Query:  PDASTNQADVIDILPNTLEASHCPVVKNSHPMTTRAKSGIYKPKVFTT---EYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIG
        P A T+   V + +PN L +       N+HPM TR KSGI K K F T   +YL+ EPPS   A     WV AM++E+ AL +  TW+LV P   + V+G
Subjt:  PDASTNQADVIDILPNTLEASHCPVVKNSHPMTTRAKSGIYKPKVFTT---EYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIG

Query:  CKWVFKIKRNSDGTVARYKARLVAKGFLQEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGSPSLVCRL
        CKWV+KIKRNSDGTV+RYKARLVAKGF Q+  +DY ETFSPVVK  T+R++L+LA  F W LRQ+DI+N FLHGFL E+V M+QP G +    P  VC+L
Subjt:  CKWVFKIKRNSDGTVARYKARLVAKGFLQEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGSPSLVCRL

Query:  KKALYGLKQAPRAWFDRLSTFLNSLSFVNSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVSVTPAG
         K+LYGLKQAPRAWFDR +T L SL F  S  + SL V H  G   Y+L+YVDDI++ G+S + I  L+S L   F LKDLG L+YF G+++    +G
Subjt:  KKALYGLKQAPRAWFDRLSTFLNSLSFVNSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVSVTPAG

A0A6A2ZXB1 Putative disease resistance protein (e value: 8.3e-84; % identity: 58.18)
Query:  NTLEASHCPV-VKNSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVAR
        +T +++H PV V+N HPM TRAK GI KP+V   E L+ EP S++EA+K PHW KA QEE+DALI N T SLVS  +D+  + CKW+FK+KRN+DG+V+R
Subjt:  NTLEASHCPV-VKNSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVAR

Query:  YKARLVAKGFLQEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGSPSLVCRLKKALYGLKQAPRAWFDR
        YK+RLVAKGFLQ+  +DY E FSPVVK  T+RV+L+LAL   W LRQVDINN FL+G L EEV+M+QPPG + AG  SLVC+L KA+YGLKQAPRAWF+R
Subjt:  YKARLVAKGFLQEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGSPSLVCRLKKALYGLKQAPRAWFDR

Query:  LSTFLNSLSFVNSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEV
        L  +L S  F  S  N+SL VR  +    Y+LVYVD I++ GS  S +  ++  L+ +F+LKDLGSLN+F G+EV
Subjt:  LSTFLNSLSFVNSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEV

A0A6A3D3Y0 Integrase catalytic domain-containing protein (e value: 5.9e-82; % identity: 55.85)
Query:  VKNSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFL
        + N HPM TR+K GI+KPKV+     + EP ++ EAL  PHW +A Q EYDAL++N TW+LV   RD+K + CKW+F++KRN+DG+++RYKARLVAKGFL
Subjt:  VKNSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFL

Query:  QEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGSPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSFV
        Q+  +D+ E FSPVVK  TIRV++SLAL   W LRQVD+NN FL+G L EEV+MSQPPG    GS  LVC+L KA+YGLKQAPRAWF+RL  +L S  F 
Subjt:  QEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGSPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSFV

Query:  NSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVS
         +  +ASL VR       Y+LVYVDDI++ G  +  +  ++  L+ +FSLKDLG L++F GVEVS
Subjt:  NSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVS

A0A6A3D5Y8 Reverse transcriptase Ty1/copia-type domain-containing protein (e value: 5.9e-82; % identity: 55.85)
Query:  VKNSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFL
        + N HPM TR+K GI+KPKV+     + EP ++ EAL  PHW +A Q EYDAL++N TW+LV   RD+K + CKW+F++KRN+DG+++RYKARLVAKGFL
Subjt:  VKNSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFL

Query:  QEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGSPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSFV
        Q+  +D+ E FSPVVK  TIRV++SLAL   W LRQVD+NN FL+G L EEV+MSQPPG    GS  LVC+L KA+YGLKQAPRAWF+RL  +L S  F 
Subjt:  QEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGSPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSFV

Query:  NSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVS
         +  +ASL VR       Y+LVYVDDI++ G  +  +  ++  L+ +FSLKDLG L++F GVEVS
Subjt:  NSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVS

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 1.2e-42; % identity: 36.12)
Query:  WVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINN
        W +A+  E +A   N+TW++     +K ++  +WVF +K N  G   RYKARLVA+GF Q+  +DY ETF+PV ++ + R +LSL + ++  + Q+D+  
Subjt:  WVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINN

Query:  TFLHGFLNEEVFMSQPPGMIVAGSPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSFVNSPPNASLLVRHKD--GHRCYILVYVDDIVIIGSSQSAIDH
         FL+G L EE++M  P G  ++ +   VC+L KA+YGLKQA R WF+     L    FVNS  +  + +  K       Y+L+YVDD+VI     + +++
Subjt:  TFLHGFLNEEVFMSQPPGMIVAGSPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSFVNSPPNASLLVRHKD--GHRCYILVYVDDIVIIGSSQSAIDH

Query:  LVSMLHDKFSLKDLGSLNYFFGVEVSV
            L +KF + DL  + +F G+ + +
Subjt:  LVSMLHDKFSLKDLGSLNYFFGVEVSV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 4.3e-53; % identity: 40.28)
Query:  PNTLEASHCPVVKNSHPMTTRAKSGIYKPKVFTTEYLEI----EPPSVKEALKCP---HWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRN
        P   E  H P+ ++  P   R +S  Y     +TEY+ I    EP S+KE L  P     +KAMQEE ++L KN T+ LV   + K+ + CKWVFK+K++
Subjt:  PNTLEASHCPVVKNSHPMTTRAKSGIYKPKVFTTEYLEI----EPPSVKEALKCP---HWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRN

Query:  SDGTVARYKARLVAKGFLQEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGSPSLVCRLKKALYGLKQA
         D  + RYKARLV KGF Q+  +D+ E FSPVVK+ +IR +LSLA      + Q+D+   FLHG L EE++M QP G  VAG   +VC+L K+LYGLKQA
Subjt:  SDGTVARYKARLVAKGFLQEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGSPSLVCRLKKALYGLKQA

Query:  PRAWFDRLSTFLNSLSFVNSPPNASL-LVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEV
        PR W+ +  +F+ S +++ +  +  +   R  + +   +L+YVDD++I+G  +  I  L   L   F +KDLG      G+++
Subjt:  PRAWFDRLSTFLNSLSFVNSPPNASL-LVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEV

P92520 Uncharacterized mitochondrial protein AtMg00820 (e value: 2.8e-28; % identity: 54.4)
Query:  MTTRAKSGIYK--PK--VFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQE
        M TR+K+GI K  PK  +  T  ++ EP SV  ALK P W +AMQEE DAL +N TW LV P  ++ ++GCKWVFK K +SDGT+ R KARLVAKGF QE
Subjt:  MTTRAKSGIYK--PK--VFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQE

Query:  DDLDYTETFSPVVKLRTIRVLLSLA
        + + + ET+SPVV+  TIR +L++A
Subjt:  DDLDYTETFSPVVKLRTIRVLLSLA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 4.4e-66; % identity: 42.77)
Query:  STSAPSSLPDASTNQADVIDILPNTLEASHCPVVK----------NSHPMTTRAKSGIYKP----KVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDAL
        S+S+PS  P  S + +      P+ L     P+ +          N+H M TRAK+GI KP     +  +   E EP +  +ALK   W  AM  E +A 
Subjt:  STSAPSSLPDASTNQADVIDILPNTLEASHCPVVK----------NSHPMTTRAKSGIYKP----KVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDAL

Query:  IKNDTWSLVSPSRDK-KVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEV
        I N TW LV P      ++GC+W+F  K NSDG++ RYKARLVAKG+ Q   LDY ETFSPV+K  +IR++L +A+  SW +RQ+D+NN FL G L ++V
Subjt:  IKNDTWSLVSPSRDK-KVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEV

Query:  FMSQPPGMIVAGSPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSFVNSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKD
        +MSQPPG I    P+ VC+L+KALYGLKQAPRAW+  L  +L ++ FVNS  + SL V  +     Y+LVYVDDI+I G+  + + + +  L  +FS+KD
Subjt:  FMSQPPGMIVAGSPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSFVNSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKD

Query:  LGSLNYFFGVEVSVTPAGGLFLVWKSLPLSSLLYLLFLV
           L+YF G+E    P G        L LS   Y+L L+
Subjt:  LGSLNYFFGVEVSVTPAGGLFLVWKSLPLSSLLYLLFLV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 6.1e-68; % identity: 44.66)
Query:  STSAPSSLPDASTNQADVIDILP-NTLEASHCPVVKNSHPMTTRAKSGIYKP----KVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLV
        S S P+S   +ST+   +  +LP   +   +     N+H M TRAK GI KP       T+     EP +  +A+K   W +AM  E +A I N TW LV
Subjt:  STSAPSSLPDASTNQADVIDILP-NTLEASHCPVVKNSHPMTTRAKSGIYKP----KVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLV

Query:  SPSRDK-KVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMI
         P      ++GC+W+F  K NSDG++ RYKARLVAKG+ Q   LDY ETFSPV+K  +IR++L +A+  SW +RQ+D+NN FL G L +EV+MSQPPG +
Subjt:  SPSRDK-KVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMI

Query:  VAGSPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSFVNSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFG
            P  VCRL+KA+YGLKQAPRAW+  L T+L ++ FVNS  + SL V  +     Y+LVYVDDI+I G+    + H +  L  +FS+K+   L+YF G
Subjt:  VAGSPSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSFVNSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFG

Query:  VEVSVTPAG
        +E    P G
Subjt:  VEVSVTPAG

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 1.0e-57; % identity: 43.9)
Query:  EPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQEDDLDYTETFSPVVKLRTIRVLLSLAL
        EP +  EA +   W  AM +E  A+    TW + +   +KK IGCKWV+KIK NSDGT+ RYKARLVAKG+ Q++ +D+ ETFSPV KL +++++L+++ 
Subjt:  EPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQEDDLDYTETFSPVVKLRTIRVLLSLAL

Query:  VFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGS----PSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSFVNSPPNASLLVRHKDGHRCYILVYV
        +++++L Q+DI+N FL+G L+EE++M  PPG          P+ VC LKK++YGLKQA R WF + S  L    FV S  + +  ++        +LVYV
Subjt:  VFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGS----PSLVCRLKKALYGLKQAPRAWFDRLSTFLNSLSFVNSPPNASLLVRHKDGHRCYILVYV

Query:  DDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVSVTPAG
        DDI+I  ++ +A+D L S L   F L+DLG L YF G+E++ + AG
Subjt:  DDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVSVTPAG

ATMG00810.1 DNA/RNA polymerases superfamily protein (e value: 1.9e-08; % identity: 46.43)
Query:  YILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVSVTPAGGLFL
        Y+L+YVDDI++ GSS + ++ L+  L   FS+KDLG ++YF G+++   P+ GLFL
Subjt:  YILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVSVTPAGGLFL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 2.0e-29; % identity: 54.4)
Query:  MTTRAKSGIYK--PK--VFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQE
        M TR+K+GI K  PK  +  T  ++ EP SV  ALK P W +AMQEE DAL +N TW LV P  ++ ++GCKWVFK K +SDGT+ R KARLVAKGF QE
Subjt:  MTTRAKSGIYK--PK--VFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWVFKIKRNSDGTVARYKARLVAKGFLQE

Query:  DDLDYTETFSPVVKLRTIRVLLSLA
        + + + ET+SPVV+  TIR +L++A
Subjt:  DDLDYTETFSPVVKLRTIRVLLSLA


Sequences
CDS sequence
ATGTCAACTTCAGCTCCATCTTCTTTGCCTGATGCTAGTACTAATCAAGCTGATGTAATTGATATATTACCAAACACTCTTGAGGCATCTCATTGTCCAGTAGTTAAAAA
TTCTCACCCTATGACTACTCGAGCCAAGAGTGGTATTTACAAACCCAAGGTGTTTACTACTGAATATTTGGAGATTGAACCACCATCGGTGAAGGAAGCTTTAAAGTGTC
CTCATTGGGTGAAGGCGATGCAAGAAGAGTATGATGCCTTAATAAAAAATGATACTTGGTCCTTGGTGTCTCCTTCTAGGGATAAGAAGGTTATTGGATGCAAATGGGTG
TTCAAAATTAAAAGGAATTCTGATGGCACTGTGGCTAGGTACAAGGCACGCCTTGTTGCTAAGGGTTTCCTTCAAGAGGATGATCTTGATTATACAGAAACATTTAGTCC
TGTTGTGAAGTTACGTACTATTCGAGTTCTTCTTAGCCTTGCTCTTGTATTTAGTTGGTCTTTAAGGCAAGTTGACATCAACAATACCTTTCTCCATGGATTTTTGAACG
AAGAGGTGTTTATGTCTCAACCTCCTGGAATGATTGTAGCTGGTTCGCCTTCTCTAGTATGTAGGTTGAAGAAAGCACTCTATGGTCTTAAACAGGCGCCTCGGGCTTGG
TTTGACAGGTTGAGTACCTTCCTAAATTCCCTTAGTTTTGTTAATTCTCCTCCTAATGCTTCACTTCTTGTGCGTCATAAGGATGGTCATCGTTGTTATATTTTAGTTTA
TGTTGATGACATCGTCATTATCGGGAGTTCTCAATCTGCTATTGATCATCTTGTCAGTATGCTTCATGATAAATTTTCATTGAAGGATCTGGGGTCTTTGAACTATTTTT
TTGGTGTGGAAGTCTCTGTTACACCTGCTGGTGGTTTATTTTTGGTGTGGAAGTCTCTGCCCTTATCTTCTTTACTTTACCTTTTGTTTCTTGTGATTTAA
mRNA sequence
ATGTCAACTTCAGCTCCATCTTCTTTGCCTGATGCTAGTACTAATCAAGCTGATGTAATTGATATATTACCAAACACTCTTGAGGCATCTCATTGTCCAGTAGTTAAAAA
TTCTCACCCTATGACTACTCGAGCCAAGAGTGGTATTTACAAACCCAAGGTGTTTACTACTGAATATTTGGAGATTGAACCACCATCGGTGAAGGAAGCTTTAAAGTGTC
CTCATTGGGTGAAGGCGATGCAAGAAGAGTATGATGCCTTAATAAAAAATGATACTTGGTCCTTGGTGTCTCCTTCTAGGGATAAGAAGGTTATTGGATGCAAATGGGTG
TTCAAAATTAAAAGGAATTCTGATGGCACTGTGGCTAGGTACAAGGCACGCCTTGTTGCTAAGGGTTTCCTTCAAGAGGATGATCTTGATTATACAGAAACATTTAGTCC
TGTTGTGAAGTTACGTACTATTCGAGTTCTTCTTAGCCTTGCTCTTGTATTTAGTTGGTCTTTAAGGCAAGTTGACATCAACAATACCTTTCTCCATGGATTTTTGAACG
AAGAGGTGTTTATGTCTCAACCTCCTGGAATGATTGTAGCTGGTTCGCCTTCTCTAGTATGTAGGTTGAAGAAAGCACTCTATGGTCTTAAACAGGCGCCTCGGGCTTGG
TTTGACAGGTTGAGTACCTTCCTAAATTCCCTTAGTTTTGTTAATTCTCCTCCTAATGCTTCACTTCTTGTGCGTCATAAGGATGGTCATCGTTGTTATATTTTAGTTTA
TGTTGATGACATCGTCATTATCGGGAGTTCTCAATCTGCTATTGATCATCTTGTCAGTATGCTTCATGATAAATTTTCATTGAAGGATCTGGGGTCTTTGAACTATTTTT
TTGGTGTGGAAGTCTCTGTTACACCTGCTGGTGGTTTATTTTTGGTGTGGAAGTCTCTGCCCTTATCTTCTTTACTTTACCTTTTGTTTCTTGTGATTTAA
Protein sequence
MSTSAPSSLPDASTNQADVIDILPNTLEASHCPVVKNSHPMTTRAKSGIYKPKVFTTEYLEIEPPSVKEALKCPHWVKAMQEEYDALIKNDTWSLVSPSRDKKVIGCKWV
FKIKRNSDGTVARYKARLVAKGFLQEDDLDYTETFSPVVKLRTIRVLLSLALVFSWSLRQVDINNTFLHGFLNEEVFMSQPPGMIVAGSPSLVCRLKKALYGLKQAPRAW
FDRLSTFLNSLSFVNSPPNASLLVRHKDGHRCYILVYVDDIVIIGSSQSAIDHLVSMLHDKFSLKDLGSLNYFFGVEVSVTPAGGLFLVWKSLPLSSLLYLLFLVI
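
Consistency check (illustrative)
The coordinates and sequences in this record can be cross-checked with a few lines of code. The sketch below is not part of CuGenDB. The helper names `span_length` and `translate` are hypothetical, the genome location is assumed to use 1-based inclusive coordinates, and the standard nuclear genetic code is assumed. It computes the length of the span given under Genome location and translates the first codons of the CDS so they can be compared with the start of the protein sequence.

```python
# Illustrative sketch; helper names are hypothetical and not a CuGenDB API.
from itertools import product

# Standard genetic code in the conventional TCAG codon order (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def span_length(location: str) -> int:
    """Length of a 'contig:start..end' span, assuming 1-based inclusive coordinates."""
    _, coords = location.split(":")
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


def translate(cds: str) -> str:
    """Translate complete codons of a coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


location = "tig00153092:1154301..1155281"  # from the Genome location field
cds_prefix = "ATGTCAACTTCAGCTCCATCTTCTTTGCCTGATGCT"  # first 36 bases of the CDS

print(span_length(location))
print(translate(cds_prefix))
```

For this record the span covers 981 positions, and the 36-base prefix translates to MSTSAPSSLPDA, which matches the first twelve residues of the protein sequence. Passing the full CDS to `translate` in the same way would allow the whole protein sequence to be compared.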