; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G08330 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G08330
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr7:6092900..6094937
RNA-Seq ExpressionCSPI07G08330
SyntenyCSPI07G08330
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]3.6e-7931.51Show/hide
Query:  MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK---------------------SGSPVPLCALVSQVLLGLD--------------
        MT E+A QL+    +K LW+  + L G  +R++  +L+  F + RK                     +G+PV    L+ Q L GLD              
Subjt:  MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK---------------------SGSPVPLCALVSQVLLGLD--------------

Query:  --------------EKRLEHQNSQRNTGNAVRNVTINVAQRTNSNEHKSHNGQQFYGQRGNSNNG--RGRGRGRGNKPTCQVCDKYDHYALACYNQFNRD
                      E R+E  N   N  N   N T NVA R++     S+N       RG+++ G   GRGRG+  K  CQVC   +H A+ C+++F++ 
Subjt:  --------------EKRLEHQNSQRNTGNAVRNVTINVAQRTNSNEHKSHNGQQFYGQRGNSNNG--RGRGRGRGNKPTCQVCDKYDHYALACYNQFNRD

Query:  FMSPLVQDRGQNLSASANPNPSAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLAL
        + S      G +   S N    AF+ +Q+        +V D +WY DSGA+NHVT ++    + TE+ GK    VGNG  L I   G S L    + L L
Subjt:  FMSPLVQDRGQNLSASANPNPSAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLAL

Query:  KNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVSINV
         ++L VP+ITKNL+S SKLA DN+I +EF    C VKDK TG+++LK  LKDGLY                                 LSG K   S  V
Subjt:  KNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVSINV

Query:  AV---WHKCLGHHSSKILNTLVKK-------------CNACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSF
        +V   WH+ LGH ++K+L+ +++              C ACQ  K H LPF  S S A E  +++++D+WGP PI +S GF+YY+ FVDD+SR+T     
Subjt:  AV---WHKCLGHHSSKILNTLVKK-------------CNACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSF

Query:  EQKSTAIDAFKHFITYVKNQFNKSIKVFQSDNGGEYKKIHLLCTTLGIN----------------------------------------WDAHTYFT---
        +QKS  + AF  F    +NQFNK IKV Q D GGEYK +  L    GI                                         W+A +      
Subjt:  EQKSTAIDAFKHFITYVKNQFNKSIKVFQSDNGGEYKKIHLLCTTLGIN----------------------------------------WDAHTYFT---

Query:  --------------------------------------RPYQSNKFQQRSEKCVLLGPSPIHKGFKCLSKSGKVFISRHVQFNEHEFPFSQMFPPTHLP-
                                              +PY  +K Q  + +CV LG S  HKG+KCL+  G++FISRHV FNE  FPF   F  T  P 
Subjt:  --------------------------------------RPYQSNKFQQRSEKCVLLGPSPIHKGFKCLSKSGKVFISRHVQFNEHEFPFSQMFPPTHLP-

Query:  SAQSNPPSLSLAIPINFNSMAQNNSIPSAQFYSHSTPITNSSQCLLSFT
            N PS S  +     ++  + S+P  +  + +   T  SQ + S T
Subjt:  SAQSNPPSLSLAIPINFNSMAQNNSIPSAQFYSHSTPITNSSQCLLSFT

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]2.1e-8734.5Show/hide
Query:  MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK---------------------SGSPVPLCALVSQVLLGLD--------------
        MT EVA QL+    ++ +WE  + L G  +R+   FL+  F  TRK                     +GS V    LV+Q L GLD              
Subjt:  MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK---------------------SGSPVPLCALVSQVLLGLD--------------

Query:  --------------EKRLEHQNSQRN-TGNAVRNV-TINVAQRTNSNEHKSHNGQQFYGQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFNRD
                      E RLE  N+Q N T N   N+ TI   +R  SN      G Q    RG +  GRGRGR   ++  CQVC K  H A  CY++FN++
Subjt:  --------------EKRLEHQNSQRN-TGNAVRNV-TINVAQRTNSNEHKSHNGQQFYGQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFNRD

Query:  FMSPLVQDRGQNLSASANPNPSAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLAL
        ++     ++        N N +A+V        A+  TV D +WY DSGA+NHVT     +    E  GK   TVGNG NL I   G S L   ++ L L
Subjt:  FMSPLVQDRGQNLSASANPNPSAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLAL

Query:  KNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKK-QLIHKNKETSTFILSGGKNLVSIN
        K++L VP ITKNL+S SKL  DN I++EFH+  C VKDK TG ILL+  +KDGLY L             ST K+  +    KET               
Subjt:  KNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKK-QLIHKNKETSTFILSGGKNLVSIN

Query:  VAVWHKCLGHHSSKILNTLVKKCN-------------ACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQ
           WH+ LGH +SK+LN ++K CN             ACQ  KAHNLPF  S S A E  D+++SD+WGP PI S  GF+YY++F+DD+SR+T     +Q
Subjt:  VAVWHKCLGHHSSKILNTLVKKCN-------------ACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQ

Query:  KSTAIDAFKHFITYVKNQFNKSIKVFQSDNGGEYKKIHLLCTTLGIN----------------------------------------WDAHTYFT-----
        KS    AF  F   V+NQFNK IK  Q D GGE+K +  +    GI                                         W+A +        
Subjt:  KSTAIDAFKHFITYVKNQFNKSIKVFQSDNGGEYKKIHLLCTTLGIN----------------------------------------WDAHTYFT-----

Query:  ------------------------------------RPYQSNKFQQRSEKCVLLGPSPIHKGFKCLSKSGKVFISRHVQFNEHEFPFSQMFPPTHLPSAQ
                                            +PY  +K Q  + KCV LG S  HKG+KCL+ +G++FISRHV FNEH FPF   F  T  P+  
Subjt:  ------------------------------------RPYQSNKFQQRSEKCVLLGPSPIHKGFKCLSKSGKVFISRHVQFNEHEFPFSQMFPPTHLPSAQ

Query:  SNPPSLSLAIPIN
           P+ SL  PI+
Subjt:  SNPPSLSLAIPIN

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]3.4e-7731.35Show/hide
Query:  MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK---------------------SGSPVPLCALVSQVLLGLD--------------
        MT  +A QL+    +  LW+  + L G  +R++  +L+  F +TRK                     +G+P+    L+ Q L GLD              
Subjt:  MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK---------------------SGSPVPLCALVSQVLLGLD--------------

Query:  --------------EKRLEHQNSQRNTGNAVRNVTINVAQRTNSNEHKSHNGQQFYGQRGNSNNGR---------GRGRGRGNKPTCQVCDKYDHYALAC
                      E R+E  NS     N   N T NVA++   ++H+ +        RG++NN R         GRGRGR  K TCQVC   +H A+ C
Subjt:  --------------EKRLEHQNSQRNTGNAVRNVTINVAQRTNSNEHKSHNGQQFYGQRGNSNNGR---------GRGRGRGNKPTCQVCDKYDHYALAC

Query:  YNQFNRDFMSPLVQDRGQNLSASANPNPS--AFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYL
        + +F++ +          N SA+ +   S  AF+ +Q+        ++ D +WY DSGA+NHVT ++    N +E+ GK    VGNG  L I   G S L
Subjt:  YNQFNRDFMSPLVQDRGQNLSASANPNPS--AFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYL

Query:  TDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSG
            + L L ++L VP ITKNL+S SKLA DN+I +EF    C VKDK TG+ +L+  LKDGLY L                        K++S +    
Subjt:  TDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSG

Query:  GKNLVSINVAVWHKCLGHHSSKILNTLVKKCN-------------ACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRY
            VSI  + WH+ LGH ++K+L+ ++K CN             ACQ  K H LPF  S S A E  +++++D+WGP PI SS GF+YY+ F+DD++R+
Subjt:  GKNLVSINVAVWHKCLGHHSSKILNTLVKKCN-------------ACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRY

Query:  TLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQSDNGGEYKKIHLLCTTLGIN----------------------------------------WDAHT
        T     +QKS    AF  F   V+NQF+K IK  Q D GGEYK +       GI                                         W+A +
Subjt:  TLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQSDNGGEYKKIHLLCTTLGIN----------------------------------------WDAHT

Query:  Y-----------------------------------------FTRPYQSNKFQQRSEKCVLLGPSPIHKGFKCLSKSGKVFISRHVQFNEHEFPFSQMFP
                                                  F +PY  +K Q  + +CV LG S  HKG+KC++  G++FISRHV FNE  FPF   F 
Subjt:  Y-----------------------------------------FTRPYQSNKFQQRSEKCVLLGPSPIHKGFKCLSKSGKVFISRHVQFNEHEFPFSQMFP

Query:  PTHLP
         T +P
Subjt:  PTHLP

TYK05754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.4e-9950.35Show/hide
Query:  VAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK---------------------SGSPVPLCALVSQVLLGLD------------------
        +AIQLMGFTNAKDLWE T+DLFG+QSRAEEDFLRQ FQTTRK                     +GSPVP  A +SQ LLGLD                  
Subjt:  VAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK---------------------SGSPVPLCALVSQVLLGLD------------------

Query:  ----------EKRLEHQNSQRNTGNAVRNVTINVAQRTNSNEHKSHNGQQFY--------GQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFN
                  EKRLEHQ++Q+NT N ++NV +N+AQ  NS++ + ++  QF+        GQRG  N GRGRG+GRGNKPTCQVC+KY H AL CYN+FN
Subjt:  ----------EKRLEHQNSQRNTGNAVRNVTINVAQRTNSNEHKSHNGQQFY--------GQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFN

Query:  RDFMSPLVQDRG-QNLSASANPNPSAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQF
        ++F+SPLVQDRG Q+ + S + N +  VT QS   FAT++TVI+ NWYIDSGATNH+T +   L+NP+EYSG EK  VGNG++L+IS +G++YLTD    
Subjt:  RDFMSPLVQDRG-QNLSASANPNPSAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQF

Query:  LALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVS
        L LKNVLCVPDITKNLVS SKLAQDN++++EFH  YC +KDK TG  LL  T+KDGLYHL+                   I K K        G +N   
Subjt:  LALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVS

Query:  INVAVWHKCLGHHSSKILNTLVKKC
          + V H  L  H  +IL  LV +C
Subjt:  INVAVWHKCLGHHSSKILNTLVKKC

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]3.8e-8956.83Show/hide
Query:  MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK-------------SGSPVPLCALVSQVLLGLDEKRLEHQNSQ-RNTGNAVRNVT
        MTP+VAIQLMGFTN +DLW+ T+D FG+QSRAEEDFLRQ  QTTRK              G P  +  L  Q  L + EKRL+HQN+Q +NTGN  ++  
Subjt:  MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK-------------SGSPVPLCALVSQVLLGLDEKRLEHQNSQ-RNTGNAVRNVT

Query:  INVAQRTNSNEHKSHNGQQFY--------GQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFNRDFMSPLVQDRGQNLS-ASANPNPSAFVTTQ
        +N+AQR   N  ++ + ++FY        GQRGN NNG          PTCQ+C KY H AL CYN+FN++F SPLVQ+R ++ S  S +PNP+ FV+TQ
Subjt:  INVAQRTNSNEHKSHNGQQFY--------GQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFNRDFMSPLVQDRGQNLS-ASANPNPSAFVTTQ

Query:  SPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLE
        + TPFAT +TV+DPNWYIDSGATNHVT +   +TNPTEYSG EK TVGNGN LNIS VG++ LTD  + L LKN+LCVPDI KNL+S SKLAQDNHI++E
Subjt:  SPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLE

Query:  FHNYYCLVKDKATGE
        FH Y C +KDK+TG+
Subjt:  FHNYYCLVKDKATGE

TrEMBL top hitse value%identityAlignment
A0A151S6M8 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-8734.5Show/hide
Query:  MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK---------------------SGSPVPLCALVSQVLLGLD--------------
        MT EVA QL+    ++ +WE  + L G  +R+   FL+  F  TRK                     +GS V    LV+Q L GLD              
Subjt:  MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK---------------------SGSPVPLCALVSQVLLGLD--------------

Query:  --------------EKRLEHQNSQRN-TGNAVRNV-TINVAQRTNSNEHKSHNGQQFYGQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFNRD
                      E RLE  N+Q N T N   N+ TI   +R  SN      G Q    RG +  GRGRGR   ++  CQVC K  H A  CY++FN++
Subjt:  --------------EKRLEHQNSQRN-TGNAVRNV-TINVAQRTNSNEHKSHNGQQFYGQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFNRD

Query:  FMSPLVQDRGQNLSASANPNPSAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLAL
        ++     ++        N N +A+V        A+  TV D +WY DSGA+NHVT     +    E  GK   TVGNG NL I   G S L   ++ L L
Subjt:  FMSPLVQDRGQNLSASANPNPSAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLAL

Query:  KNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKK-QLIHKNKETSTFILSGGKNLVSIN
        K++L VP ITKNL+S SKL  DN I++EFH+  C VKDK TG ILL+  +KDGLY L             ST K+  +    KET               
Subjt:  KNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKK-QLIHKNKETSTFILSGGKNLVSIN

Query:  VAVWHKCLGHHSSKILNTLVKKCN-------------ACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQ
           WH+ LGH +SK+LN ++K CN             ACQ  KAHNLPF  S S A E  D+++SD+WGP PI S  GF+YY++F+DD+SR+T     +Q
Subjt:  VAVWHKCLGHHSSKILNTLVKKCN-------------ACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQ

Query:  KSTAIDAFKHFITYVKNQFNKSIKVFQSDNGGEYKKIHLLCTTLGIN----------------------------------------WDAHTYFT-----
        KS    AF  F   V+NQFNK IK  Q D GGE+K +  +    GI                                         W+A +        
Subjt:  KSTAIDAFKHFITYVKNQFNKSIKVFQSDNGGEYKKIHLLCTTLGIN----------------------------------------WDAHTYFT-----

Query:  ------------------------------------RPYQSNKFQQRSEKCVLLGPSPIHKGFKCLSKSGKVFISRHVQFNEHEFPFSQMFPPTHLPSAQ
                                            +PY  +K Q  + KCV LG S  HKG+KCL+ +G++FISRHV FNEH FPF   F  T  P+  
Subjt:  ------------------------------------RPYQSNKFQQRSEKCVLLGPSPIHKGFKCLSKSGKVFISRHVQFNEHEFPFSQMFPPTHLPSAQ

Query:  SNPPSLSLAIPIN
           P+ SL  PI+
Subjt:  SNPPSLSLAIPIN

A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X11.9e-8956.83Show/hide
Query:  MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK-------------SGSPVPLCALVSQVLLGLDEKRLEHQNSQ-RNTGNAVRNVT
        MTP+VAIQLMGFTN +DLW+ T+D FG+QSRAEEDFLRQ  QTTRK              G P  +  L  Q  L + EKRL+HQN+Q +NTGN  ++  
Subjt:  MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK-------------SGSPVPLCALVSQVLLGLDEKRLEHQNSQ-RNTGNAVRNVT

Query:  INVAQRTNSNEHKSHNGQQFY--------GQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFNRDFMSPLVQDRGQNLS-ASANPNPSAFVTTQ
        +N+AQR   N  ++ + ++FY        GQRGN NNG          PTCQ+C KY H AL CYN+FN++F SPLVQ+R ++ S  S +PNP+ FV+TQ
Subjt:  INVAQRTNSNEHKSHNGQQFY--------GQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFNRDFMSPLVQDRGQNLS-ASANPNPSAFVTTQ

Query:  SPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLE
        + TPFAT +TV+DPNWYIDSGATNHVT +   +TNPTEYSG EK TVGNGN LNIS VG++ LTD  + L LKN+LCVPDI KNL+S SKLAQDNHI++E
Subjt:  SPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLE

Query:  FHNYYCLVKDKATGE
        FH Y C +KDK+TG+
Subjt:  FHNYYCLVKDKATGE

A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-946.8e-10050.35Show/hide
Query:  VAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK---------------------SGSPVPLCALVSQVLLGLD------------------
        +AIQLMGFTNAKDLWE T+DLFG+QSRAEEDFLRQ FQTTRK                     +GSPVP  A +SQ LLGLD                  
Subjt:  VAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK---------------------SGSPVPLCALVSQVLLGLD------------------

Query:  ----------EKRLEHQNSQRNTGNAVRNVTINVAQRTNSNEHKSHNGQQFY--------GQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFN
                  EKRLEHQ++Q+NT N ++NV +N+AQ  NS++ + ++  QF+        GQRG  N GRGRG+GRGNKPTCQVC+KY H AL CYN+FN
Subjt:  ----------EKRLEHQNSQRNTGNAVRNVTINVAQRTNSNEHKSHNGQQFY--------GQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFN

Query:  RDFMSPLVQDRG-QNLSASANPNPSAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQF
        ++F+SPLVQDRG Q+ + S + N +  VT QS   FAT++TVI+ NWYIDSGATNH+T +   L+NP+EYSG EK  VGNG++L+IS +G++YLTD    
Subjt:  RDFMSPLVQDRG-QNLSASANPNPSAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQF

Query:  LALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVS
        L LKNVLCVPDITKNLVS SKLAQDN++++EFH  YC +KDK TG  LL  T+KDGLYHL+                   I K K        G +N   
Subjt:  LALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVS

Query:  INVAVWHKCLGHHSSKILNTLVKKC
          + V H  L  H  +IL  LV +C
Subjt:  INVAVWHKCLGHHSSKILNTLVKKC

A0A803PEH4 Uncharacterized protein1.4e-8936.12Show/hide
Query:  MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK---------------------SGSPVPLCALVSQVLLGLD--------------
        MT  +A ++MG  +A +L      L+G  S+++ D  R   QTTRK                     +G P P   LV+ VL GLD              
Subjt:  MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRK---------------------SGSPVPLCALVSQVLLGLD--------------

Query:  --------------EKRLEH-QNSQRNTGNAV-RNVTINVAQRTNSN---------EHKSHNGQQFYGQRGNSN--NGRGRGRGRGNKPTCQVCDKYDHY
                      + ++E  QN   N+  A   +   N+A +TN+N            +++G  F   RG SN   GRGRG G G++PTCQV  KY H 
Subjt:  --------------EKRLEH-QNSQRNTGNAV-RNVTINVAQRTNSN---------EHKSHNGQQFYGQRGNSN--NGRGRGRGRGNKPTCQVCDKYDHY

Query:  ALACYNQFNRDFM-SPLVQDRGQNLSASANPNPSAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGH
        A  CYN+F+  +M S       QN +   N N SAFV        AT E +    W+ DSGA+NH+T+    LT   +Y+GKE   VGNG+ L I+ +G+
Subjt:  ALACYNQFNRDFM-SPLVQDRGQNLSASANPNPSAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGH

Query:  SYLT-DAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHS------TIKKQLIHKN
          L  ++  +L LK++L VP I KNLVS SKLA DN++ +EF++ +CLVKDK T ++LL   LKD LY L++     +   + S      TI        
Subjt:  SYLT-DAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHS------TIKKQLIHKN

Query:  KETSTFILSGGKNLVSINVAVWHKCLGHHSSKILNTLVKK-------------CNACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYY
         +T + ++S         + V H+ LGH S K+LN +++              C+ACQ  KAH LPF  S +RA    D+I++DLWGP PI S+    YY
Subjt:  KETSTFILSGGKNLVSINVAVWHKCLGHHSSKILNTLVKK-------------CNACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYY

Query:  IIFVDDYSRYTLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQSDNGGEYKKIHLLCTTLGINWDAHTYFTRP-------------------------
        I FVDDYSRYT     + KS A+ AF  F   V+NQF K IK  +SD+GGEYK    L  T GI +      T P                         
Subjt:  IIFVDDYSRYTLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQSDNGGEYKKIHLLCTTLGINWDAHTYFTRP-------------------------

Query:  ------YQSNKFQQRSEKCVLLGPSPIHKGFKCLSKSGKVFISRHVQFNEHEFPFSQMF
              YQS+KFQ  S KCV LG S  +KG+KCLS +G+++IS+ V FNE  FPF   F
Subjt:  ------YQSNKFQQRSEKCVLLGPSPIHKGFKCLSKSGKVFISRHVQFNEHEFPFSQMF

A0A803QCY3 Uncharacterized protein9.9e-8332.58Show/hide
Query:  MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRKSGSPV---------------------PLCALVSQVLLGLDEKR----LEHQNSQ
        MT  +A ++MG T+A  LW     L+G  S+++ D  R   QTT+K G+P+                     P   L + VL  LD       L+ +   
Subjt:  MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRKSGSPV---------------------PLCALVSQVLLGLDEKR----LEHQNSQ

Query:  RNTGNAVRNVTINVAQRTNSNEHKSHNGQQFYGQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFNRDFMSPLVQDRGQNLSASANPNPSAFVT
        + +   ++ + ++   +       ++N +  +  RGN    RGRGR   +KPTCQVC KYDH A+ CYN F+  +M        QN +   N NPSAF+ 
Subjt:  RNTGNAVRNVTINVAQRTNSNEHKSHNGQQFYGQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFNRDFMSPLVQDRGQNLSASANPNPSAFVT

Query:  TQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYL-TDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHI
               AT E +    W+ DSGA+N++T     +    EY GKEK TVGNG+ L IS  G+  L T   Q+L L  +L VP I KN +S SKL  DN +
Subjt:  TQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYL-TDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHI

Query:  FLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVSINVAVWHKCLGHHSSKILNTLVKKCNAC
         +EFH+  C VKD AT  +LL+  LKDGLY L+             T + +  +    TS FI+S                          T+   C+AC
Subjt:  FLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVSINVAVWHKCLGHHSSKILNTLVKKCNAC

Query:  QLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQSDNGGEYKKIHL
        Q  K+H+LPF  S S+A +  D++++DLWGP+PI S+  F+YY+ FVDD +R+T     + KS A DAF  F +  +NQF + IK  ++D GGEY+ +  
Subjt:  QLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQSDNGGEYKKIHL

Query:  LCTTLGIN----------------------------------------WDAHTYFT-----------------------------------------RPY
           T GIN                                        WDA +                                            RPY
Subjt:  LCTTLGIN----------------------------------------WDAHTYFT-----------------------------------------RPY

Query:  QSNKFQQRSEKCVLLGPSPIHKGFKCLSKSGKVFISRHVQFNEHEFPFSQMF----PPTHLPSAQSNP----PSLSLAIPINFNSMAQNNSIPSAQFYSH
        Q++KFQ  S KCV LG S  HKG+KCLS +G+++I R V FNE EFPF   F       +L   QS+     PS+S A    F S    N   S+Q    
Subjt:  QSNKFQQRSEKCVLLGPSPIHKGFKCLSKSGKVFISRHVQFNEHEFPFSQMF----PPTHLPSAQSNP----PSLSLAIPINFNSMAQNNSIPSAQFYSH

Query:  STPITNSSQ
        STP + + Q
Subjt:  STPITNSSQ

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-2125.71Show/hide
Query:  RTNSNEHKSHNGQQFYGQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYN----------QFNRDFMSPLVQDRGQNLSASANPNPSAFVTTQSPTP
        R  S +  S+N    YG+ G    G+ + R +     C  C++  H+   C N          Q N D  + +VQ+         N N   F+  +    
Subjt:  RTNSNEHKSHNGQQFYGQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYN----------QFNRDFMSPLVQDRGQNLSASANPNPSAFVTTQSPTP

Query:  FATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATV--GNGNNLNISCVGHSYL-TDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEF
          +     +  W +D+ A++H T           Y   +  TV  GN +   I+ +G   + T+    L LK+V  VPD+  NL+SG  L +D      +
Subjt:  FATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATV--GNGNNLNISCVGHSYL-TDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEF

Query:  HNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLV--SINVAVWHKCLGHHSSKILNTLVKK------
         +Y+   K                 + L    L++A  +   T+ +        T+  I  G  N     I+V +WHK +GH S K L  L KK      
Subjt:  HNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLV--SINVAVWHKCLGHHSSKILNTLVKK------

Query:  -------CNACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQS
               C+ C   K H + F  S  R     D++YSD+ GP  I S  G +Y++ F+DD SR       + K      F+ F   V+ +  + +K  +S
Subjt:  -------CNACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQS

Query:  DNGGEY--KKIHLLCTTLGI
        DNGGEY  ++    C++ GI
Subjt:  DNGGEY--KKIHLLCTTLGI

Q03494 Transposon Ty2-DR2 Gag-Pol polyprotein5.3e-0921.38Show/hide
Query:  PTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEF
        PT    S   +  +  IDSGA+  +   +  L + T  S +         ++ I+ +G+ +             L  P+I  +L+S S+LA  N      
Subjt:  PTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEF

Query:  HNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTI----KKQLIHKNKETSTFILSGGKNLVSINVAVWHKCLGHHSSKIL---NTLVKKC
         N      +++ G + L   +K G ++  +   ++ + +   TI    K + ++K        + G  N  SI  ++    + +     +   N    +C
Subjt:  HNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTI----KKQLIHKNKETSTFILSGGKNLVSINVAVWHKCLGHHSSKIL---NTLVKKC

Query:  NACQLEKA----HNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRY--TLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQSDN
          C + K+    H     +    + E F  +++D++GP          Y+I F D+ +R+     L   ++ + ++ F   + ++KNQFN  + V Q D 
Subjt:  NACQLEKA----HNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRY--TLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQSDN

Query:  GGEY--KKIHLLCTTLGI
        G EY  K +H   T  GI
Subjt:  GGEY--KKIHLLCTTLGI

Q12491 Transposon Ty2-B Gag-Pol polyprotein5.3e-0921.38Show/hide
Query:  PTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEF
        PT    S   +  +  IDSGA+  +   +  L + T  S +         ++ I+ +G+ +             L  P+I  +L+S S+LA  N      
Subjt:  PTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEF

Query:  HNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTI----KKQLIHKNKETSTFILSGGKNLVSINVAVWHKCLGHHSSKIL---NTLVKKC
         N      +++ G + L   +K G ++  +   ++ + +   TI    K + ++K        + G  N  SI  ++    + +     +   N    +C
Subjt:  HNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTI----KKQLIHKNKETSTFILSGGKNLVSINVAVWHKCLGHHSSKIL---NTLVKKC

Query:  NACQLEKA----HNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRY--TLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQSDN
          C + K+    H     +    + E F  +++D++GP          Y+I F D+ +R+     L   ++ + ++ F   + ++KNQFN  + V Q D 
Subjt:  NACQLEKA----HNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRY--TLDLSFEQKSTAIDAFKHFITYVKNQFNKSIKVFQSDN

Query:  GGEY--KKIHLLCTTLGI
        G EY  K +H   T  GI
Subjt:  GGEY--KKIHLLCTTLGI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.0e-4125.42Show/hide
Query:  LCALVSQVLLGLDEKRLEHQNSQRNTGNAVRNVTINVAQRTNSNEHKSHNGQQFYGQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFNRDFMS
        + A+ S  ++ +    + H+N+     N       N   R N  +++++N      Q+ ++N      + +     CQ+C    H A  C     + F+S
Subjt:  LCALVSQVLLGLDEKRLEHQNSQRNTGNAVRNVTINVAQRTNSNEHKSHNGQQFYGQRGNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFNRDFMS

Query:  PLVQDRGQNLSASANPNPSAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLALKNV
                  S ++   PS F   Q     A        NW +DSGAT+H+T+    L+    Y+G +   V +G+ + IS  G + L+   + L L N+
Subjt:  PLVQDRGQNLSASANPNPSAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLALKNV

Query:  LCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVSINVAVW
        L VP+I KNL+S  +L   N + +EF      VKD  TG  LL+   KD LY                      I  ++  S F     K   S     W
Subjt:  LCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVSINVAVW

Query:  HKCLGHHSSKILNTLVK--------------KCNACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQKST
        H  LGH +  ILN+++                C+ C + K++ +PF+ S   +    + IYSD+W  +PI S D +RYY+IFVD ++RYT     +QKS 
Subjt:  HKCLGHHSSKILNTLVK--------------KCNACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQKST

Query:  AIDAFKHFITYVKNQFNKSIKVFQSDNGGEY------------------------------KKIHLLCTTLGI---------------------------
          + F  F   ++N+F   I  F SDNGGE+                              K  H++ T L +                           
Subjt:  AIDAFKHFITYVKNQFNKSIKVFQSDNGGEY------------------------------KKIHLLCTTLGI---------------------------

Query:  -----------------NWDAHTYF-------TRPYQSNKFQQRSEKCVLLGPSPIHKGFKCLS-KSGKVFISRHVQFNEHEFPFSQMF-----------
                         N+D    F        RPY  +K   +S +CV LG S     + CL  ++ +++ISRHV+F+E+ FPFS              
Subjt:  -----------------NWDAHTYF-------TRPYQSNKFQQRSEKCVLLGPSPIHKGFKCLS-KSGKVFISRHVQFNEHEFPFSQMF-----------

Query:  -------PPTHLPSAQSNPPSLSLAIPINFNSMAQNNSIPSAQFYSHSTPITN
               P T LP+     P+ S + P   +  A   S PSA F +     +N
Subjt:  -------PPTHLPSAQSNPPSLSLAIPINFNSMAQNNSIPSAQFYSHSTPITN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.0e-4125.53Show/hide
Query:  TNAKDLWEVTRDLFGIQSRAEEDFLR--QTFQTTRKSGSPVPLCALVSQVLLGLDE----------------------KRLEHQNSQRNTGNA--VRNVT
        T A  +WE  R ++   S      LR    F      G P+     V +VL  L +                      +RL ++ S+    N+  V  +T
Subjt:  TNAKDLWEVTRDLFGIQSRAEEDFLR--QTFQTTRKSGSPVPLCALVSQVLLGLDE----------------------KRLEHQNSQRNTGNA--VRNVT

Query:  INVAQRTNSNEHKSHNGQQFYGQRGNSNN---------GRGRGRGRGNKP---TCQVCDKYDHYALAC--YNQFNRDFMSPLVQDRGQNLSASANPNPSA
         NV    N+N +++ N +       N+NN            R   R  KP    CQ+C    H A  C   +QF                + +   + S 
Subjt:  INVAQRTNSNEHKSHNGQQFYGQRGNSNN---------GRGRGRGRGNKP---TCQVCDKYDHYALAC--YNQFNRDFMSPLVQDRGQNLSASANPNPSA

Query:  FVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLALKNVLCVPDITKNLVSGSKLAQDN
        F   Q     A +      NW +DSGAT+H+T+    L+    Y+G +   + +G+ + I+  G + L  + + L L  VL VP+I KNL+S  +L   N
Subjt:  FVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKEKATVGNGNNLNISCVGHSYLTDAKQFLALKNVLCVPDITKNLVSGSKLAQDN

Query:  HIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVSINVAVWHKCLGHHSSKILNTLVK---
         + +EF      VKD  TG  LL+   KD LY                      I  ++  S F     K   S     WH  LGH S  ILN+++    
Subjt:  HIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKNKETSTFILSGGKNLVSINVAVWHKCLGHHSSKILNTLVK---

Query:  -----------KCNACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQKSTAIDAFKHFITYVKNQFNKSI
                    C+ C + K+H +PF+ S   +++  + IYSD+W  +PI S D +RYY+IFVD ++RYT     +QKS   D F  F + V+N+F   I
Subjt:  -----------KCNACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQKSTAIDAFKHFITYVKNQFNKSI

Query:  KVFQSDNGGEY----------------------------KKIHLLCTTLGINWDAH-----TY-------------------------------------
            SDNGGE+                            ++ H     +G+   +H     TY                                     
Subjt:  KVFQSDNGGEY----------------------------KKIHLLCTTLGINWDAH-----TY-------------------------------------

Query:  -----------FTRPYQSNKFQQRSEKCVLLGPSPIHKGFKCLS-KSGKVFISRHVQFNEHEFPFS
                   + RPY  +K + +S++C  +G S     + CL   +G+++ SRHVQF+E  FPFS
Subjt:  -----------FTRPYQSNKFQQRSEKCVLLGPSPIHKGFKCLS-KSGKVFISRHVQFNEHEFPFS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACCTGAAGTAGCCATCCAACTTATGGGTTTTACAAATGCTAAGGACCTATGGGAAGTCACTCGAGATTTGTTTGGAATCCAGTCAAGAGCTGAGGAAGATTTTCT
TCGCCAAACATTCCAAACGACGAGGAAATCCGGCAGTCCTGTACCTCTGTGTGCTCTTGTTTCACAGGTGTTGTTGGGGTTAGATGAGAAAAGATTAGAGCATCAGAATA
GCCAACGAAACACTGGAAATGCAGTCCGAAATGTCACTATCAATGTAGCTCAGAGGACAAATTCAAACGAACATAAATCCCACAATGGCCAACAATTTTATGGACAGCGA
GGAAATTCAAACAATGGCAGAGGACGCGGTCGGGGAAGAGGAAACAAACCCACATGTCAAGTATGCGATAAGTATGATCACTATGCACTTGCTTGTTATAATCAATTTAA
CAGAGATTTTATGAGTCCTTTAGTTCAAGATCGAGGACAAAACTTGAGTGCATCTGCAAACCCAAATCCTTCTGCCTTTGTGACTACCCAGAGTCCCACTCCGTTTGCTA
CTTCTGAAACTGTAATTGATCCAAATTGGTATATTGATAGCGGAGCTACAAATCACGTCACGACAAAATCGTTTGGCTTGACCAATCCTACTGAATACTCAGGTAAAGAA
AAAGCAACAGTAGGAAATGGAAATAACTTGAATATCTCTTGTGTTGGACATTCTTATTTGACTGATGCAAAACAGTTTTTAGCTCTTAAAAATGTACTTTGTGTTCCTGA
TATTACAAAGAACTTGGTTAGTGGTTCAAAACTTGCCCAAGATAATCATATATTTCTAGAGTTTCATAATTATTATTGTCTTGTTAAGGACAAAGCTACAGGGGAAATAC
TGCTGAAAGAAACACTTAAGGATGGTCTATATCACCTGGAAAATGTTGGTTTGATGGTTGCTGCTGAGTTAGAACATAGTACTATCAAGAAACAGCTTATACATAAAAAT
AAGGAGACATCGACATTTATTTTGTCAGGAGGAAAAAATCTTGTCAGTATCAATGTTGCTGTTTGGCATAAATGTTTAGGCCATCATTCTTCAAAGATTTTGAACACTTT
AGTAAAGAAATGTAATGCATGTCAACTGGAAAAAGCACACAACCTCCCCTTTACTATTTCTCAATCTAGAGCTGCTGAATCTTTTGATATTATTTACTCTGATTTGTGGG
GTCCTACACCAATTTGCTCATCCGATGGATTCCGTTATTACATAATTTTTGTGGATGATTATAGCAGATATACTTTAGATTTATCTTTTGAGCAGAAAAGTACTGCTATT
GATGCTTTTAAGCACTTTATTACATATGTCAAAAACCAATTCAATAAATCCATCAAAGTTTTTCAGTCAGACAATGGGGGAGAGTATAAGAAGATACATCTACTGTGCAC
TACCTTAGGGATCAATTGGGATGCCCACACCTATTTTACAAGACCTTACCAATCCAATAAGTTTCAACAACGTTCTGAAAAGTGTGTGTTACTTGGCCCAAGCCCAATAC
ACAAGGGGTTCAAGTGTTTGTCCAAGTCGGGCAAAGTGTTTATCTCACGACATGTTCAATTTAACGAGCATGAATTCCCCTTTTCTCAAATGTTCCCTCCAACCCATTTA
CCGTCGGCCCAATCTAATCCACCTTCTCTTTCCCTTGCCATCCCAATTAACTTCAACTCCATGGCCCAAAACAATTCAATTCCATCGGCCCAATTTTATTCCCACTCCAC
CCCCATCACAAATTCCTCACAATGTCTCCTCTCATTCACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACACCTGAAGTAGCCATCCAACTTATGGGTTTTACAAATGCTAAGGACCTATGGGAAGTCACTCGAGATTTGTTTGGAATCCAGTCAAGAGCTGAGGAAGATTTTCT
TCGCCAAACATTCCAAACGACGAGGAAATCCGGCAGTCCTGTACCTCTGTGTGCTCTTGTTTCACAGGTGTTGTTGGGGTTAGATGAGAAAAGATTAGAGCATCAGAATA
GCCAACGAAACACTGGAAATGCAGTCCGAAATGTCACTATCAATGTAGCTCAGAGGACAAATTCAAACGAACATAAATCCCACAATGGCCAACAATTTTATGGACAGCGA
GGAAATTCAAACAATGGCAGAGGACGCGGTCGGGGAAGAGGAAACAAACCCACATGTCAAGTATGCGATAAGTATGATCACTATGCACTTGCTTGTTATAATCAATTTAA
CAGAGATTTTATGAGTCCTTTAGTTCAAGATCGAGGACAAAACTTGAGTGCATCTGCAAACCCAAATCCTTCTGCCTTTGTGACTACCCAGAGTCCCACTCCGTTTGCTA
CTTCTGAAACTGTAATTGATCCAAATTGGTATATTGATAGCGGAGCTACAAATCACGTCACGACAAAATCGTTTGGCTTGACCAATCCTACTGAATACTCAGGTAAAGAA
AAAGCAACAGTAGGAAATGGAAATAACTTGAATATCTCTTGTGTTGGACATTCTTATTTGACTGATGCAAAACAGTTTTTAGCTCTTAAAAATGTACTTTGTGTTCCTGA
TATTACAAAGAACTTGGTTAGTGGTTCAAAACTTGCCCAAGATAATCATATATTTCTAGAGTTTCATAATTATTATTGTCTTGTTAAGGACAAAGCTACAGGGGAAATAC
TGCTGAAAGAAACACTTAAGGATGGTCTATATCACCTGGAAAATGTTGGTTTGATGGTTGCTGCTGAGTTAGAACATAGTACTATCAAGAAACAGCTTATACATAAAAAT
AAGGAGACATCGACATTTATTTTGTCAGGAGGAAAAAATCTTGTCAGTATCAATGTTGCTGTTTGGCATAAATGTTTAGGCCATCATTCTTCAAAGATTTTGAACACTTT
AGTAAAGAAATGTAATGCATGTCAACTGGAAAAAGCACACAACCTCCCCTTTACTATTTCTCAATCTAGAGCTGCTGAATCTTTTGATATTATTTACTCTGATTTGTGGG
GTCCTACACCAATTTGCTCATCCGATGGATTCCGTTATTACATAATTTTTGTGGATGATTATAGCAGATATACTTTAGATTTATCTTTTGAGCAGAAAAGTACTGCTATT
GATGCTTTTAAGCACTTTATTACATATGTCAAAAACCAATTCAATAAATCCATCAAAGTTTTTCAGTCAGACAATGGGGGAGAGTATAAGAAGATACATCTACTGTGCAC
TACCTTAGGGATCAATTGGGATGCCCACACCTATTTTACAAGACCTTACCAATCCAATAAGTTTCAACAACGTTCTGAAAAGTGTGTGTTACTTGGCCCAAGCCCAATAC
ACAAGGGGTTCAAGTGTTTGTCCAAGTCGGGCAAAGTGTTTATCTCACGACATGTTCAATTTAACGAGCATGAATTCCCCTTTTCTCAAATGTTCCCTCCAACCCATTTA
CCGTCGGCCCAATCTAATCCACCTTCTCTTTCCCTTGCCATCCCAATTAACTTCAACTCCATGGCCCAAAACAATTCAATTCCATCGGCCCAATTTTATTCCCACTCCAC
CCCCATCACAAATTCCTCACAATGTCTCCTCTCATTCACTTGA
Protein sequenceShow/hide protein sequence
MTPEVAIQLMGFTNAKDLWEVTRDLFGIQSRAEEDFLRQTFQTTRKSGSPVPLCALVSQVLLGLDEKRLEHQNSQRNTGNAVRNVTINVAQRTNSNEHKSHNGQQFYGQR
GNSNNGRGRGRGRGNKPTCQVCDKYDHYALACYNQFNRDFMSPLVQDRGQNLSASANPNPSAFVTTQSPTPFATSETVIDPNWYIDSGATNHVTTKSFGLTNPTEYSGKE
KATVGNGNNLNISCVGHSYLTDAKQFLALKNVLCVPDITKNLVSGSKLAQDNHIFLEFHNYYCLVKDKATGEILLKETLKDGLYHLENVGLMVAAELEHSTIKKQLIHKN
KETSTFILSGGKNLVSINVAVWHKCLGHHSSKILNTLVKKCNACQLEKAHNLPFTISQSRAAESFDIIYSDLWGPTPICSSDGFRYYIIFVDDYSRYTLDLSFEQKSTAI
DAFKHFITYVKNQFNKSIKVFQSDNGGEYKKIHLLCTTLGINWDAHTYFTRPYQSNKFQQRSEKCVLLGPSPIHKGFKCLSKSGKVFISRHVQFNEHEFPFSQMFPPTHL
PSAQSNPPSLSLAIPINFNSMAQNNSIPSAQFYSHSTPITNSSQCLLSFT