; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038626 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038626
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr2:21801575..21808206
RNA-Seq ExpressionLag0038626
SyntenyLag0038626
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.1e-14468.87Show/hide
Query:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG
        PSDV+P GCKWIYKRKRDQ GKVQTFKA+LVAKGY  +EG+D++ETFSP+AMLKSIRIL+SIATFYNYEIWQMDVKT FLNGNL+ESIYMVQP+GFI Q 
Subjt:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG

Query:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN--------------------------------
        +EQKVCKL KSIY LKQASRSWN R D  IKSYG +QN+DEPCVYK+I++S V FLILYVDDILLIGN                                
Subjt:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN--------------------------------

Query:  ---------------------------NSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWT
                                   NSK+G LPFRHG+HLSKEQ PKTPQEVEDM+ IPY+SAVGS+MYA LCTRPDICY++ +VSRYQSNPG+ HWT
Subjt:  ---------------------------NSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWT

Query:  AVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC
        AVKNILKYLR+TR+Y LVYGAKDLILT YTD DFQ+D D+R+STSGSVFTLNGGAVVWRS++Q CIADSTMEAEYVAAC
Subjt:  AVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]9.2e-14469.13Show/hide
Query:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG
        P  V+P GCKWIYKRKRD  GKVQTFKA+LVAKGY  REG+D++ETFSP+AMLKSIRIL+SIATFY+YEIWQMDVKT FLNGNL+ESI+M QP+GFI QG
Subjt:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG

Query:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN--------------------------------
        +EQKVCKLN+SIY LKQASRSWN R D  IKSYG  QN+DEPCVYK+I    V FL+LYVDDILLIGN                                
Subjt:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN--------------------------------

Query:  ---------------------------NSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWT
                                   NSK+GLLPFRHG+HLSKEQSPKTPQEVEDM++IPYASAVGS+MYA LCTRPDICYA+ +VSRYQSNPG  HWT
Subjt:  ---------------------------NSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWT

Query:  AVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC
        AVK +LKYLR+TRDY LVYGAKDLILT YTD DFQTD DSR+STSGSVFTLNGGAVVWRSI+QGCIADSTMEAEYVAAC
Subjt:  AVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-14068.07Show/hide
Query:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG
        P  V+P GCKWIYKRKRD  GKVQTFKA+LVAKGY  +EG+D++ETFS +AMLKSIRIL+SIA FY+YEIWQMDVKT FLNGNL+ESI+M QP+GFI QG
Subjt:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG

Query:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN--------------------------------
        +EQKVCKLN+SIY LKQASRSWN R D  IKSYG  QN+DEPCVYK+I    V FL+LYVDDILLIGN                                
Subjt:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN--------------------------------

Query:  ---------------------------NSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWT
                                   NSK+GLLPFRHG+HLSKEQSPKTPQEVEDM++IPYASAVGS+MYA LCTRPDICYA+ +VSRYQSNPG  HWT
Subjt:  ---------------------------NSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWT

Query:  AVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC
        AVK ILKYLR+TRDY LVYGAKDLILT YT+ DFQTD DSR+STS SVFTLNGGAVVWRSI+QGCIADSTMEAEYVAAC
Subjt:  AVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC

KAA0050103.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-14172.89Show/hide
Query:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG
        P  V+P GCKWIYKRKRD  GK+QTFK +LVAKGY  REG++++ETF P+AMLKSIRIL+SI TFY+YEIWQMDVKT FLNGNL+ESI+M QP+GFI QG
Subjt:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG

Query:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN-----------------------NSKRGLLPF
        ++QKVCKLN+SIY LK+ASRSWN R D  IKSYG  QN+DEPCVYK+I    V FL+LYV+DILLIGN                       NSK+GLLPF
Subjt:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN-----------------------NSKRGLLPF

Query:  RHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWTAVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQT
        RHG+HLSKEQ PKTPQEV+DM++ PYASAVGS+MY  LCTRPDICYA+ +VSRYQSNPG  HWT VK ILKYLR+TRDY LVYGAKDLILT YTD DFQT
Subjt:  RHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWTAVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQT

Query:  DIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC
        D DSR+STSGSVFTLNGGAVVWRSI+QGCIADSTMEAEYVAAC
Subjt:  DIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]9.2e-14469.13Show/hide
Query:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG
        P  V+P GCKWIYKRKRD  GKVQTFKA+LVAKGY  REG+D++ETFSP+AMLKSIRIL+SIATFY+YEIWQMDVKT FLNGNL+ESI+M QP+GFI QG
Subjt:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG

Query:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN--------------------------------
        +EQKVCKLN+SIY LKQASRSWN R D  IKSYG  QN+DEPCVYK+I    V FL+LYVDDILLIGN                                
Subjt:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN--------------------------------

Query:  ---------------------------NSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWT
                                   NSK+GLLPFRHG+HLSKEQSPKTPQEVEDM++IPYASAVGS+MYA LCTRPDICYA+ +VSRYQSNPG  HWT
Subjt:  ---------------------------NSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWT

Query:  AVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC
        AVK +LKYLR+TRDY LVYGAKDLILT YTD DFQTD DSR+STSGSVFTLNGGAVVWRSI+QGCIADSTMEAEYVAAC
Subjt:  AVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein1.0e-14068.07Show/hide
Query:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG
        P  V+P GCKWIYKRKRD  GKVQTFKA+LVAKGY  +EG+D++ETFS +AMLKSIRIL+SIA FY+YEIWQMDVKT FLNGNL+ESI+M QP+GFI QG
Subjt:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG

Query:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN--------------------------------
        +EQKVCKLN+SIY LKQASRSWN R D  IKSYG  QN+DEPCVYK+I    V FL+LYVDDILLIGN                                
Subjt:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN--------------------------------

Query:  ---------------------------NSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWT
                                   NSK+GLLPFRHG+HLSKEQSPKTPQEVEDM++IPYASAVGS+MYA LCTRPDICYA+ +VSRYQSNPG  HWT
Subjt:  ---------------------------NSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWT

Query:  AVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC
        AVK ILKYLR+TRDY LVYGAKDLILT YT+ DFQTD DSR+STS SVFTLNGGAVVWRSI+QGCIADSTMEAEYVAAC
Subjt:  AVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC

A0A5A7TZD0 Gag/pol protein4.4e-14469.13Show/hide
Query:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG
        P  V+P GCKWIYKRKRD  GKVQTFKA+LVAKGY  REG+D++ETFSP+AMLKSIRIL+SIATFY+YEIWQMDVKT FLNGNL+ESI+M QP+GFI QG
Subjt:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG

Query:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN--------------------------------
        +EQKVCKLN+SIY LKQASRSWN R D  IKSYG  QN+DEPCVYK+I    V FL+LYVDDILLIGN                                
Subjt:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN--------------------------------

Query:  ---------------------------NSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWT
                                   NSK+GLLPFRHG+HLSKEQSPKTPQEVEDM++IPYASAVGS+MYA LCTRPDICYA+ +VSRYQSNPG  HWT
Subjt:  ---------------------------NSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWT

Query:  AVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC
        AVK +LKYLR+TRDY LVYGAKDLILT YTD DFQTD DSR+STSGSVFTLNGGAVVWRSI+QGCIADSTMEAEYVAAC
Subjt:  AVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC

A0A5A7U2H0 Gag/pol protein9.2e-14272.89Show/hide
Query:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG
        P  V+P GCKWIYKRKRD  GK+QTFK +LVAKGY  REG++++ETF P+AMLKSIRIL+SI TFY+YEIWQMDVKT FLNGNL+ESI+M QP+GFI QG
Subjt:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG

Query:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN-----------------------NSKRGLLPF
        ++QKVCKLN+SIY LK+ASRSWN R D  IKSYG  QN+DEPCVYK+I    V FL+LYV+DILLIGN                       NSK+GLLPF
Subjt:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN-----------------------NSKRGLLPF

Query:  RHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWTAVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQT
        RHG+HLSKEQ PKTPQEV+DM++ PYASAVGS+MY  LCTRPDICYA+ +VSRYQSNPG  HWT VK ILKYLR+TRDY LVYGAKDLILT YTD DFQT
Subjt:  RHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWTAVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQT

Query:  DIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC
        D DSR+STSGSVFTLNGGAVVWRSI+QGCIADSTMEAEYVAAC
Subjt:  DIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC

A0A5A7UYE8 Gag/pol protein4.4e-14469.13Show/hide
Query:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG
        P  V+P GCKWIYKRKRD  GKVQTFKA+LVAKGY  REG+D++ETFSP+AMLKSIRIL+SIATFY+YEIWQMDVKT FLNGNL+ESI+M QP+GFI QG
Subjt:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG

Query:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN--------------------------------
        +EQKVCKLN+SIY LKQASRSWN R D  IKSYG  QN+DEPCVYK+I    V FL+LYVDDILLIGN                                
Subjt:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN--------------------------------

Query:  ---------------------------NSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWT
                                   NSK+GLLPFRHG+HLSKEQSPKTPQEVEDM++IPYASAVGS+MYA LCTRPDICYA+ +VSRYQSNPG  HWT
Subjt:  ---------------------------NSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWT

Query:  AVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC
        AVK +LKYLR+TRDY LVYGAKDLILT YTD DFQTD DSR+STSGSVFTLNGGAVVWRSI+QGCIADSTMEAEYVAAC
Subjt:  AVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC

E2GK51 Gag/pol protein (Fragment)5.2e-14568.87Show/hide
Query:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG
        PSDV+P GCKWIYKRKRDQ GKVQTFKA+LVAKGY  +EG+D++ETFSP+AMLKSIRIL+SIATFYNYEIWQMDVKT FLNGNL+ESIYMVQP+GFI Q 
Subjt:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG

Query:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN--------------------------------
        +EQKVCKL KSIY LKQASRSWN R D  IKSYG +QN+DEPCVYK+I++S V FLILYVDDILLIGN                                
Subjt:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGN--------------------------------

Query:  ---------------------------NSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWT
                                   NSK+G LPFRHG+HLSKEQ PKTPQEVEDM+ IPY+SAVGS+MYA LCTRPDICY++ +VSRYQSNPG+ HWT
Subjt:  ---------------------------NSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWT

Query:  AVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC
        AVKNILKYLR+TR+Y LVYGAKDLILT YTD DFQ+D D+R+STSGSVFTLNGGAVVWRS++Q CIADSTMEAEYVAAC
Subjt:  AVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-4330.77Show/hide
Query:  QPSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQ
        +P +      +W++  K ++ G    +KA+LVA+G+  +  ID++ETF+P+A + S R ++S+   YN ++ QMDVKT FLNG L E IYM  P+G  I 
Subjt:  QPSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQ

Query:  GEEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIID----STVVFLILYVDDILLIG------NNSKRGLL---------PFRH-
             VCKLNK+IY LKQA+R W +  +  +K      +  + C+Y  I+D    +  ++++LYVDD+++        NN KR L+           +H 
Subjt:  GEEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIID----STVVFLILYVDDILLIG------NNSKRGLL---------PFRH-

Query:  -GMHLSKEQ--------------------------SPKTPQEV-------EDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWTAVKN
         G+ +  ++                          S   P ++       ++    P  S +G +MY  LCTRPD+  A++++SRY S      W  +K 
Subjt:  -GMHLSKEQ--------------------------SPKTPQEV-------EDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWTAVKN

Query:  ILKYLRKTRDYSLVYG---AKDLILTEYTDLDFQTDIDSRRSTSGSVFTL-NGGAVVWRSIRQGCIADSTMEAEYVA
        +L+YL+ T D  L++    A +  +  Y D D+      R+ST+G +F + +   + W + RQ  +A S+ EAEY+A
Subjt:  ILKYLRKTRDYSLVYG---AKDLILTEYTDLDFQTDIDSRRSTSGSVFTL-NGGAVVWRSIRQGCIADSTMEAEYVA

P0CV72 Secreted RxLR effector protein 1611.8e-2548.36Show/hide
Query:  MKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWTAVKNILKYLRKTRDYSLVY-GAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGA
        MK +PY SAVG+IMY  + TRPD+  A+ V+S++ S+P   HW A+K +L+YL+ T+ Y L +  A    L  Y+D D+  D++SRRSTSG +F LNGG 
Subjt:  MKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWTAVKNILKYLRKTRDYSLVY-GAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGA

Query:  VVWRSIRQGCIADSTMEAEYVA
        V WRS +Q  +A S+ E EY+A
Subjt:  VVWRSIRQGCIADSTMEAEYVA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-7836.72Show/hide
Query:  VKKIRRTTGDDDDKRELLTSQLSRLYNSSDFRTPDLEFYLVVPNTPSETPTSLLP--------LEVSESYRCSHSSGVRN--NQPSDVQPTGCKWIYKRK
        V+++   T  ++  + L  S+  R+ +    R P  E+ L+  +   E+   +L           + E       +G       P   +P  CKW++K K
Subjt:  VKKIRRTTGDDDDKRELLTSQLSRLYNSSDFRTPDLEFYLVVPNTPSETPTSLLP--------LEVSESYRCSHSSGVRN--NQPSDVQPTGCKWIYKRK

Query:  RDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQGEEQKVCKLNKSIYELK
        +D   K+  +KA+LV KG+  ++GIDFDE FSP+  + SIR ++S+A   + E+ Q+DVKT FL+G+L+E IYM QP+GF + G++  VCKLNKS+Y LK
Subjt:  RDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQGEEQKVCKLNKSIYELK

Query:  QASRSWNKRLDPVIKSYGSKQNIDEPCVY-KRIIDSTVVFLILYVDDILLIG------------------------------------------------
        QA R W  + D  +KS    +   +PCVY KR  ++  + L+LYVDD+L++G                                                
Subjt:  QASRSWNKRLDPVIKSYGSKQNIDEPCVY-KRIIDSTVVFLILYVDDILLIG------------------------------------------------

Query:  -----------NNSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWTAVKNILKYLRKTRDY
                    N+K    P    + LSK+  P T +E  +M ++PY+SAVGS+MYA +CTRPDI +A+ VVSR+  NPG+ HW AVK IL+YLR T   
Subjt:  -----------NNSKRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWTAVKNILKYLRKTRDY

Query:  SLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAA
         L +G  D IL  YTD D   DID+R+S++G +FT +GGA+ W+S  Q C+A ST EAEY+AA
Subjt:  SLVYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.0e-4632.61Show/hide
Query:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG
        PS V   GC+WI+ +K +  G +  +KA+LVAKGYN R G+D+ ETFSP+    SIRI++ +A   ++ I Q+DV   FL G L + +YM QP GFI + 
Subjt:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG

Query:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGNN-----------SKR-------------GLLP
            VCKL K++Y LKQA R+W   L   + + G   ++ +  ++      ++V++++YVDDIL+ GN+           S+R             G+  
Subjt:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGNN-----------SKR-------------GLLP

Query:  FR--HGMHLSKEQ------------------SPKTPQEVEDM-------KQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWTAVKNILKY
         R   G+HLS+ +                  +P  P     +           Y   VGS+ Y    TRPDI YA++ +S++   P + H  A+K IL+Y
Subjt:  FR--HGMHLSKEQ------------------SPKTPQEVEDM-------KQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWTAVKNILKY

Query:  LRKTRDYSL-VYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEY
        L  T ++ + +     L L  Y+D D+  D D   ST+G +  L    + W S +Q  +  S+ EAEY
Subjt:  LRKTRDYSL-VYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-4732.88Show/hide
Query:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG
        P  V   GC+WI+ +K +  G +  +KA+LVAKGYN R G+D+ ETFSP+    SIRI++ +A   ++ I Q+DV   FL G L + +YM QP GF+ + 
Subjt:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQG

Query:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGNN-----------SKR-------------GLLP
            VC+L K+IY LKQA R+W   L   + + G   +I +  ++      +++++++YVDDIL+ GN+           S+R             G+  
Subjt:  EEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGNN-----------SKR-------------GLLP

Query:  FR--HGMHLSKEQ-----------------------SPKTPQEVEDMKQIP--YASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWTAVKNILKY
         R   G+HLS+ +                       SPK           P  Y   VGS+ Y    TRPD+ YA++ +S+Y   P   HW A+K +L+Y
Subjt:  FR--HGMHLSKEQ-----------------------SPKTPQEVEDMKQIP--YASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWTAVKNILKY

Query:  LRKTRDYSL-VYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEY
        L  T D+ + +     L L  Y+D D+  D D   ST+G +  L    + W S +Q  +  S+ EAEY
Subjt:  LRKTRDYSL-VYGAKDLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEY

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.8e-5032.45Show/hide
Query:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFII-Q
        P + +P GCKW+YK K +  G ++ +KA+LVAKGY  +EGIDF ETFSP+  L S++++++I+  YN+ + Q+D+   FLNG+LDE IYM  P G+   Q
Subjt:  PSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFII-Q

Query:  GEE---QKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGNNSK---------------RGLLPFRH--
        G+      VC L KSIY LKQASR W  +    +  +G  Q+  +   + +I  +  + +++YVDDI++  NN                 R L P ++  
Subjt:  GEE---QKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGNNSK---------------RGLLPFRH--

Query:  GMHLSKEQS---------------------------PKTPQ---------EVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWTAV
        G+ +++  +                           P  P          +  D K   Y   +G +MY ++ TR DI +A++ +S++   P  AH  AV
Subjt:  GMHLSKEQS---------------------------PKTPQ---------EVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWTAV

Query:  KNILKYLRKTRDYSLVYGAK-DLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVA
          IL Y++ T    L Y ++ ++ L  ++D  FQ+  D+RRST+G    L    + W+S +Q  ++ S+ EAEY A
Subjt:  KNILKYLRKTRDYSLVYGAK-DLILTEYTDLDFQTDIDSRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVA

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.0e-0436.11Show/hide
Query:  TRPDICYAISVVSRYQSNPGQAHWTAVKNILKYLRKTRDYSLVYGA-KDLILTEYTDLDFQTDIDSRRSTSG
        TRPD+ +A++ +S++ S    A   AV  +L Y++ T    L Y A  DL L  + D D+ +  D+RRS +G
Subjt:  TRPDICYAISVVSRYQSNPGQAHWTAVKNILKYLRKTRDYSLVYGA-KDLILTEYTDLDFQTDIDSRRSTSG

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)7.0e-0944.64Show/hide
Query:  GCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIA
        GCKW++K K    G +   KA+LVAKG++  EGI F ET+SP+    +IR ++++A
Subjt:  GCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILISIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGCCGCCACCATCATCCGTCGTCCACAAGCCTGTTCTAAGGCTGGAGAATAGCTGGGAAGATACTAGCGGTGGTCCACATGTTCGTGCTGAGAAACAACCTATAGA
GTCGGGGGAATTCGAGCTACAAGAGATATGGAGGCTCAAGTATGTCTTCTCCTCTCTCCCTCTTCACGATGTTGAGATCCCCATCCCATGTCGCTTCAACTTTTCGCGTC
TATCGACGTTCCCTTCACCGAGAGTACGCTGGATTCGCATGGTAGGTTTTTGGAGCATGGTGAGGGGTGGGAATGAAAGCCACCCTCGTCGAGACCCACGACTGAACTCC
CTCGTCTCCCTCGTTGAGACCCACGACCTCGCCACTATAGATCTGGACTCTCTTGTCGAGACCCACGACTCCATAGCTCGATTCCAACAAGTGACCAACGGTGTGAAACC
CACTTCATTTCAGATCGAGATTCCGTGGTCGGGTCGTGGGTCTCAACGAGAAAGACGAGGGAGCTTAGTTGTGGGTCTTGACGAGGGAGTTCAGTCGCAGCCCGTCGCTC
GCGGTTCTACTCGTTTCGTATCTGCTCGCGTAGGATCTGTGAAAAAAATCCGACGAACGACGGGCGACGACGACGACAAAAGAGAACTGTTGACTTCACAACTCTCCAGA
CTTTATAACTCATCGGACTTCAGAACTCCAGACTTAGAATTCTACCTCGTTGTCCCAAACACCCCTTCAGAGACTCCCACCAGTCTCCTGCCTCTAGAAGTCTCAGAGTC
ATACCGGTGTAGCCATAGTAGTGGTGTTCGCAATAATCAACCAAGTGATGTACAACCTACAGGTTGCAAGTGGATCTATAAGAGAAAACGAGACCAAACTGGTAAGGTAC
AAACCTTCAAAGCGCAATTAGTGGCTAAAGGTTATAACCATAGAGAAGGAATTGATTTCGATGAAACCTTCTCACCATTAGCGATGTTGAAATCGATAAGAATACTCATT
TCCATTGCCACTTTCTATAATTATGAAATTTGGCAAATGGATGTCAAAACTGTGTTTCTGAACGGTAATCTTGACGAAAGTATTTACATGGTCCAACCAAAAGGGTTCAT
AATCCAGGGAGAAGAACAAAAAGTTTGCAAGCTTAACAAATCCATCTATGAATTGAAACAAGCCTCAAGATCCTGGAATAAAAGACTTGACCCTGTAATCAAATCTTATG
GTTCTAAACAGAACATAGATGAACCTTGTGTATATAAAAGGATCATTGATTCTACTGTAGTTTTCTTAATTCTGTATGTTGATGACATCCTGCTTATAGGAAATAATTCC
AAAAGGGGATTGTTGCCCTTTAGACATGGAATGCACCTATCAAAGGAACAGAGTCCAAAAACACCTCAAGAAGTTGAGGATATGAAACAAATTCCTTACGCATCAGCGGT
CGGAAGTATTATGTATGCAAAGCTATGCACACGTCCCGACATATGCTATGCAATAAGTGTTGTCAGCAGATATCAATCCAATCCTGGTCAGGCACATTGGACTGCCGTTA
AGAATATCCTTAAATACTTGAGGAAAACAAGAGACTATTCGCTGGTGTATGGAGCTAAGGATTTGATCCTTACTGAATACACTGACTTAGATTTTCAAACCGACATAGAC
TCTAGGAGATCTACGTCGGGATCTGTGTTCACTCTCAATGGAGGAGCAGTAGTGTGGAGAAGTATTAGACAAGGATGTATTGCCGACTCCACAATGGAAGCTGAATATGT
TGCAGCCTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCGCCGCCACCATCATCCGTCGTCCACAAGCCTGTTCTAAGGCTGGAGAATAGCTGGGAAGATACTAGCGGTGGTCCACATGTTCGTGCTGAGAAACAACCTATAGA
GTCGGGGGAATTCGAGCTACAAGAGATATGGAGGCTCAAGTATGTCTTCTCCTCTCTCCCTCTTCACGATGTTGAGATCCCCATCCCATGTCGCTTCAACTTTTCGCGTC
TATCGACGTTCCCTTCACCGAGAGTACGCTGGATTCGCATGGTAGGTTTTTGGAGCATGGTGAGGGGTGGGAATGAAAGCCACCCTCGTCGAGACCCACGACTGAACTCC
CTCGTCTCCCTCGTTGAGACCCACGACCTCGCCACTATAGATCTGGACTCTCTTGTCGAGACCCACGACTCCATAGCTCGATTCCAACAAGTGACCAACGGTGTGAAACC
CACTTCATTTCAGATCGAGATTCCGTGGTCGGGTCGTGGGTCTCAACGAGAAAGACGAGGGAGCTTAGTTGTGGGTCTTGACGAGGGAGTTCAGTCGCAGCCCGTCGCTC
GCGGTTCTACTCGTTTCGTATCTGCTCGCGTAGGATCTGTGAAAAAAATCCGACGAACGACGGGCGACGACGACGACAAAAGAGAACTGTTGACTTCACAACTCTCCAGA
CTTTATAACTCATCGGACTTCAGAACTCCAGACTTAGAATTCTACCTCGTTGTCCCAAACACCCCTTCAGAGACTCCCACCAGTCTCCTGCCTCTAGAAGTCTCAGAGTC
ATACCGGTGTAGCCATAGTAGTGGTGTTCGCAATAATCAACCAAGTGATGTACAACCTACAGGTTGCAAGTGGATCTATAAGAGAAAACGAGACCAAACTGGTAAGGTAC
AAACCTTCAAAGCGCAATTAGTGGCTAAAGGTTATAACCATAGAGAAGGAATTGATTTCGATGAAACCTTCTCACCATTAGCGATGTTGAAATCGATAAGAATACTCATT
TCCATTGCCACTTTCTATAATTATGAAATTTGGCAAATGGATGTCAAAACTGTGTTTCTGAACGGTAATCTTGACGAAAGTATTTACATGGTCCAACCAAAAGGGTTCAT
AATCCAGGGAGAAGAACAAAAAGTTTGCAAGCTTAACAAATCCATCTATGAATTGAAACAAGCCTCAAGATCCTGGAATAAAAGACTTGACCCTGTAATCAAATCTTATG
GTTCTAAACAGAACATAGATGAACCTTGTGTATATAAAAGGATCATTGATTCTACTGTAGTTTTCTTAATTCTGTATGTTGATGACATCCTGCTTATAGGAAATAATTCC
AAAAGGGGATTGTTGCCCTTTAGACATGGAATGCACCTATCAAAGGAACAGAGTCCAAAAACACCTCAAGAAGTTGAGGATATGAAACAAATTCCTTACGCATCAGCGGT
CGGAAGTATTATGTATGCAAAGCTATGCACACGTCCCGACATATGCTATGCAATAAGTGTTGTCAGCAGATATCAATCCAATCCTGGTCAGGCACATTGGACTGCCGTTA
AGAATATCCTTAAATACTTGAGGAAAACAAGAGACTATTCGCTGGTGTATGGAGCTAAGGATTTGATCCTTACTGAATACACTGACTTAGATTTTCAAACCGACATAGAC
TCTAGGAGATCTACGTCGGGATCTGTGTTCACTCTCAATGGAGGAGCAGTAGTGTGGAGAAGTATTAGACAAGGATGTATTGCCGACTCCACAATGGAAGCTGAATATGT
TGCAGCCTGTTAA
Protein sequenceShow/hide protein sequence
MPPPPSSVVHKPVLRLENSWEDTSGGPHVRAEKQPIESGEFELQEIWRLKYVFSSLPLHDVEIPIPCRFNFSRLSTFPSPRVRWIRMVGFWSMVRGGNESHPRRDPRLNS
LVSLVETHDLATIDLDSLVETHDSIARFQQVTNGVKPTSFQIEIPWSGRGSQRERRGSLVVGLDEGVQSQPVARGSTRFVSARVGSVKKIRRTTGDDDDKRELLTSQLSR
LYNSSDFRTPDLEFYLVVPNTPSETPTSLLPLEVSESYRCSHSSGVRNNQPSDVQPTGCKWIYKRKRDQTGKVQTFKAQLVAKGYNHREGIDFDETFSPLAMLKSIRILI
SIATFYNYEIWQMDVKTVFLNGNLDESIYMVQPKGFIIQGEEQKVCKLNKSIYELKQASRSWNKRLDPVIKSYGSKQNIDEPCVYKRIIDSTVVFLILYVDDILLIGNNS
KRGLLPFRHGMHLSKEQSPKTPQEVEDMKQIPYASAVGSIMYAKLCTRPDICYAISVVSRYQSNPGQAHWTAVKNILKYLRKTRDYSLVYGAKDLILTEYTDLDFQTDID
SRRSTSGSVFTLNGGAVVWRSIRQGCIADSTMEAEYVAAC