; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026359 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026359
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr10:35424014..35430641
RNA-Seq ExpressionLag0026359
SyntenyLag0026359
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037371.1 gag/pol protein [Cucumis melo var. makuwa]4.5e-8643.79Show/hide
Query:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSLHHFRILGCPTQC--------------------------------------
        MVRS+MSY +LP+SFWGYAV+T VYILN VPSKSV ETP +LW G KGSL HFRI GCP                                         
Subjt:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSLHHFRILGCPTQC--------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------WCKTQRNWNDVQNRACLWDI-------------------LKKRKGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNI
                   W K      +      +WD+                   +++ +GSI+ G EQ+VCKL RSIYGL QASRSWNIRF+TAIK+YGF+Q +
Subjt:  -----------WCKTQRNWNDVQNRACLWDI-------------------LKKRKGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNI

Query:  DEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHG
        DEPCVYK+I+N+ VAFLVLYV DILL GN+VG L+DIKQWLATQFQMKDLG  QFVLG QI R+RKNK LALSQASYIDK++V+Y MQNSK+ LLPFRH 
Subjt:  DEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHG

Query:  VHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF
        V +SKEQ PKTPQ++E+MR IPYASAVGSLMYVMLC RP IC+
Subjt:  VHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF

KAA0046800.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-9348.7Show/hide
Query:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSLHHFRILGCPTQCWCKTQRN-------------------------------
        MVRSMMSYAQLP SFWGYAVETVV+ILN VPSKSV ETP +LWRG K SL HF+I GCPT       +N                               
Subjt:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSLHHFRILGCPTQCWCKTQRN-------------------------------

Query:  ---------------------------------------------------------------------------------------------------W
                                                                                                            
Subjt:  ---------------------------------------------------------------------------------------------------W

Query:  NDVQNRA---------------CLW---DILKKRKGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVYKKIVNSTVAFLVLY
        NDV                    +W   D+ +  +G I QG EQKVCKL RSIYGL QASRSWNIRF+ AIK+YGF++N+DEPCVYKKI    VAFLVLY
Subjt:  NDVQNRA---------------CLW---DILKKRKGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVYKKIVNSTVAFLVLY

Query:  VDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRR
        VDDILL GN+VG+L+++K WLA QFQMKDLGEAQ+VLG QIIR+ KNKTLALSQA YIDKMLVRY MQNSKK LLPFRHGVH+SKEQ PKTPQEVED+RR
Subjt:  VDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRR

Query:  IPYASAVGSLMYVMLCTRPDICF
        IPYAS +GSLMY MLCTRPDIC+
Subjt:  IPYASAVGSLMYVMLCTRPDICF

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]4.5e-8641.39Show/hide
Query:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSLHHFRILGCPTQC--------------------------------------
        MVRSMMSY  LP+SFWGYAV+T VYILN VPSKSV +TP +LW G KGSL HFRI GCP                                         
Subjt:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSLHHFRILGCPTQC--------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------WCKTQR------------------NWNDVQNRAC------------------LWDI----------------LKKRKGSIVQGDEQKVC
                   W K                     ++ ++ +                     +W +                +++ +G I+ G EQK+C
Subjt:  -----------WCKTQR------------------NWNDVQNRAC------------------LWDI----------------LKKRKGSIVQGDEQKVC

Query:  KLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKN
        KL RSIYGL QASRSWNIRF+TAIK+YGF+Q +DEPCVYK+I+N +VAFLVLYVDDILL GN++G L+DIKQWLATQFQMKDLGEAQFVLG QI R+RKN
Subjt:  KLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKN

Query:  KTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF
        KTLALSQASYIDK++V+Y MQNSK+ LLPFRHGV +SKEQ PKTPQ+VE+MR IPYASA+GSLMY MLCTRPDIC+
Subjt:  KTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF

TYK11833.1 putative Integrase core domain [Cucumis melo var. makuwa]1.5e-10260.18Show/hide
Query:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSLHHFRILGCPTQCWCKTQRN----------------------WNDVQNRAC
        MVRSMMSY QLP SFW YAVET VYILN VPSKSV ETP ELWRG K SL HF+I GCP     K  +                       ++  +NR  
Subjt:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSLHHFRILGCPTQCWCKTQRN----------------------WNDVQNRAC

Query:  L---------------------------------------W---DILKKRKGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPC
        +                                       W   D+ +  KG I +G EQKVCKL RSIYGL QASRSWNIRF+T IK+Y F+QN+DEPC
Subjt:  L---------------------------------------W---DILKKRKGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPC

Query:  VYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVS
        VYKKI    VAFLVL+V+DILL GN+VG+L++IK WLA QFQMKDLGEAQ+VLG QIIR+ KNKTLALSQA+YIDKMLVRY MQNSKK+LLPFRHGVH+S
Subjt:  VYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVS

Query:  KEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF
        KEQ PKTPQEVEDMRRIPYASA+GSLMY MLCTRPDIC+
Subjt:  KEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF

TYK11909.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-8854.52Show/hide
Query:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSL-------------------HHFRILGCPTQCWCKT----------QRNW-
        M RSM+SYAQLP SF GY VE  V+ILN V SKSV ETP+ELWRG   +L                   +   +       W K              W 
Subjt:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSL-------------------HHFRILGCPTQCWCKT----------QRNW-

Query:  -----NDVQNRACLWDILKKR--------------------------------------------KGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFET
               V+   C W   +KR                                            +G I +G EQKVCKL  SIYGL QASRSWNIRF+T
Subjt:  -----NDVQNRACLWDILKKR--------------------------------------------KGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFET

Query:  AIKTYGFEQNIDEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQN
        AIK+YGF+QN++EPCVYKKI    V FLVLYVDDILL GN VG+L+D+K WLA QFQMKDLGEAQ+VL  QIIR+RKNKTLALSQA+YI+KMLV+Y MQN
Subjt:  AIKTYGFEQNIDEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQN

Query:  SKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF
        SKK LLPFRHGVH+SKEQ PKTPQEVEDMRRIPYASAVGSLMYVML TRPDIC+
Subjt:  SKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF

TrEMBL top hitse value%identityAlignment
A0A5A7T706 Gag/pol protein2.2e-8643.79Show/hide
Query:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSLHHFRILGCPTQC--------------------------------------
        MVRS+MSY +LP+SFWGYAV+T VYILN VPSKSV ETP +LW G KGSL HFRI GCP                                         
Subjt:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSLHHFRILGCPTQC--------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------WCKTQRNWNDVQNRACLWDI-------------------LKKRKGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNI
                   W K      +      +WD+                   +++ +GSI+ G EQ+VCKL RSIYGL QASRSWNIRF+TAIK+YGF+Q +
Subjt:  -----------WCKTQRNWNDVQNRACLWDI-------------------LKKRKGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNI

Query:  DEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHG
        DEPCVYK+I+N+ VAFLVLYV DILL GN+VG L+DIKQWLATQFQMKDLG  QFVLG QI R+RKNK LALSQASYIDK++V+Y MQNSK+ LLPFRH 
Subjt:  DEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHG

Query:  VHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF
        V +SKEQ PKTPQ++E+MR IPYASAVGSLMYVMLC RP IC+
Subjt:  VHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF

A0A5A7TUI8 Gag/pol protein8.2e-9448.7Show/hide
Query:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSLHHFRILGCPTQCWCKTQRN-------------------------------
        MVRSMMSYAQLP SFWGYAVETVV+ILN VPSKSV ETP +LWRG K SL HF+I GCPT       +N                               
Subjt:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSLHHFRILGCPTQCWCKTQRN-------------------------------

Query:  ---------------------------------------------------------------------------------------------------W
                                                                                                            
Subjt:  ---------------------------------------------------------------------------------------------------W

Query:  NDVQNRA---------------CLW---DILKKRKGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVYKKIVNSTVAFLVLY
        NDV                    +W   D+ +  +G I QG EQKVCKL RSIYGL QASRSWNIRF+ AIK+YGF++N+DEPCVYKKI    VAFLVLY
Subjt:  NDVQNRA---------------CLW---DILKKRKGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVYKKIVNSTVAFLVLY

Query:  VDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRR
        VDDILL GN+VG+L+++K WLA QFQMKDLGEAQ+VLG QIIR+ KNKTLALSQA YIDKMLVRY MQNSKK LLPFRHGVH+SKEQ PKTPQEVED+RR
Subjt:  VDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRR

Query:  IPYASAVGSLMYVMLCTRPDICF
        IPYAS +GSLMY MLCTRPDIC+
Subjt:  IPYASAVGSLMYVMLCTRPDICF

A0A5A7U869 Gag/pol protein2.2e-8641.39Show/hide
Query:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSLHHFRILGCPTQC--------------------------------------
        MVRSMMSY  LP+SFWGYAV+T VYILN VPSKSV +TP +LW G KGSL HFRI GCP                                         
Subjt:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSLHHFRILGCPTQC--------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------WCKTQR------------------NWNDVQNRAC------------------LWDI----------------LKKRKGSIVQGDEQKVC
                   W K                     ++ ++ +                     +W +                +++ +G I+ G EQK+C
Subjt:  -----------WCKTQR------------------NWNDVQNRAC------------------LWDI----------------LKKRKGSIVQGDEQKVC

Query:  KLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKN
        KL RSIYGL QASRSWNIRF+TAIK+YGF+Q +DEPCVYK+I+N +VAFLVLYVDDILL GN++G L+DIKQWLATQFQMKDLGEAQFVLG QI R+RKN
Subjt:  KLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKN

Query:  KTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF
        KTLALSQASYIDK++V+Y MQNSK+ LLPFRHGV +SKEQ PKTPQ+VE+MR IPYASA+GSLMY MLCTRPDIC+
Subjt:  KTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF

A0A5D3CIT1 Putative Integrase core domain7.4e-10360.18Show/hide
Query:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSLHHFRILGCPTQCWCKTQRN----------------------WNDVQNRAC
        MVRSMMSY QLP SFW YAVET VYILN VPSKSV ETP ELWRG K SL HF+I GCP     K  +                       ++  +NR  
Subjt:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSLHHFRILGCPTQCWCKTQRN----------------------WNDVQNRAC

Query:  L---------------------------------------W---DILKKRKGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPC
        +                                       W   D+ +  KG I +G EQKVCKL RSIYGL QASRSWNIRF+T IK+Y F+QN+DEPC
Subjt:  L---------------------------------------W---DILKKRKGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPC

Query:  VYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVS
        VYKKI    VAFLVL+V+DILL GN+VG+L++IK WLA QFQMKDLGEAQ+VLG QIIR+ KNKTLALSQA+YIDKMLVRY MQNSKK+LLPFRHGVH+S
Subjt:  VYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVS

Query:  KEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF
        KEQ PKTPQEVEDMRRIPYASA+GSLMY MLCTRPDIC+
Subjt:  KEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF

A0A5D3CNQ8 Gag/pol protein6.1e-8954.52Show/hide
Query:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSL-------------------HHFRILGCPTQCWCKT----------QRNW-
        M RSM+SYAQLP SF GY VE  V+ILN V SKSV ETP+ELWRG   +L                   +   +       W K              W 
Subjt:  MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSL-------------------HHFRILGCPTQCWCKT----------QRNW-

Query:  -----NDVQNRACLWDILKKR--------------------------------------------KGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFET
               V+   C W   +KR                                            +G I +G EQKVCKL  SIYGL QASRSWNIRF+T
Subjt:  -----NDVQNRACLWDILKKR--------------------------------------------KGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFET

Query:  AIKTYGFEQNIDEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQN
        AIK+YGF+QN++EPCVYKKI    V FLVLYVDDILL GN VG+L+D+K WLA QFQMKDLGEAQ+VL  QIIR+RKNKTLALSQA+YI+KMLV+Y MQN
Subjt:  AIKTYGFEQNIDEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQN

Query:  SKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF
        SKK LLPFRHGVH+SKEQ PKTPQEVEDMRRIPYASAVGSLMYVML TRPDIC+
Subjt:  SKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.3e-1931.69Show/hide
Query:  DEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVY---KKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLG
        +   VCKL ++IYGL QA+R W   FE A+K   F  +  + C+Y   K  +N  + +++LYVDD+++   ++  +++ K++L  +F+M DL E +  +G
Subjt:  DEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVY---KKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLG

Query:  NQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDI
         +I    +   + LSQ++Y+ K+L ++ M+N      P    ++     S       ++    P  S +G LMY+MLCTRPD+
Subjt:  NQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.2e-3942.19Show/hide
Query:  LKKRKGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVY-KKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKD
        +++ +G  V G +  VCKL +S+YGL QA R W ++F++ +K+  + +   +PCVY K+   +    L+LYVDD+L+ G + G ++ +K  L+  F MKD
Subjt:  LKKRKGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVY-KKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKD

Query:  LGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDI
        LG AQ +LG +I+R R ++ L LSQ  YI+++L R+ M+N+K    P    + +SK+  P T +E  +M ++PY+SAVGSLMY M+CTRPDI
Subjt:  LGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDI

P25600 Putative transposon Ty5-1 protein YCL074W1.7e-1128.43Show/hide
Query:  LKKRKGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDL
        +K+  G + + +   V +L   +YGL QA   WN      +K  GF ++  E  +Y +  +    ++ +YVDD+L+   +      +KQ L   + MKDL
Subjt:  LKKRKGSIVQGDEQKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDL

Query:  GEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICFPYLL
        G+    LG   I    N  + LS   YI K      +   K    P  +    SK     T   ++D+   PY S VG L++     RPDI +P  L
Subjt:  GEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICFPYLL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-1532.02Show/hide
Query:  VCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNR
        VCKL++++YGL QA R+W +     + T GF  ++ +  ++      ++ ++++YVDDIL+ GN+   L +    L+ +F +KD  E  + LG  I   R
Subjt:  VCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNR

Query:  KNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF
            L LSQ  YI  +L R  M  +K    P      +S     K     E      Y   VGSL Y+   TRPDI +
Subjt:  KNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.5e-1530.9Show/hide
Query:  VCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNR
        VC+L+++IYGL QA R+W +   T + T GF  +I +  ++      ++ ++++YVDDIL+ GN+   L      L+ +F +K+  +  + LG  I   R
Subjt:  VCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNR

Query:  KNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF
          + L LSQ  Y   +L R  M  +K    P      ++     K P   E      Y   VGSL Y+   TRPD+ +
Subjt:  KNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.0e-1631.02Show/hide
Query:  QGDE---QKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFV
        QGD      VC LK+SIYGL QASR W ++F   +  +GF Q+  +   + KI  +    +++YVDDI++  NN   + ++K  L + F+++DLG  ++ 
Subjt:  QGDE---QKVCKLKRSIYGLIQASRSWNIRFETAIKTYGFEQNIDEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFV

Query:  LGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF
        LG +I R+     + + Q  Y   +L    +   K   +P    V  S         +  D +   Y   +G LMY+ + TR DI F
Subjt:  LGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICF

ATMG00810.1 DNA/RNA polymerases superfamily protein5.2e-0836.92Show/hide
Query:  FLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSK--KELLPFRHGVHVSKEQSPKTPQ
        +L+LYVDDILL G++   L+ +   L++ F MKDLG   + LG QI  +     L LSQ  Y +++L    M + K     LP +    VS  + P    
Subjt:  FLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQNSK--KELLPFRHGVHVSKEQSPKTPQ

Query:  EVEDMRRIPYASAVGSLMYVMLCTRPDICF
        +  D R     S VG+L Y+ L TRPDI +
Subjt:  EVEDMRRIPYASAVGSLMYVMLCTRPDICF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGATCAATGATGAGCTATGCTCAGTTGCCTGATTCATTCTGGGGATATGCAGTAGAGACTGTTGTATACATTTTGAACATGGTTCCCTCTAAGAGTGTTCCAGA
AACACCCTATGAGCTATGGAGAGGGAGTAAAGGTAGTTTACATCACTTCAGAATTTTGGGATGCCCGACACAGTGTTGGTGCAAAACCCAAAGAAATTGGAACGACGTTC
AAAACCGTGCCTGTTTGTGGGATATCCTAAAGAAACGAAAGGGGTCCATAGTCCAGGGTGATGAGCAAAAAGTTTGCAAACTTAAACGATCCATTTATGGGTTAATACAA
GCATCAAGATCCTGGAATATACGTTTTGAAACTGCGATCAAAACATATGGCTTTGAGCAAAATATTGACGAACCTTGTGTTTACAAGAAAATCGTCAATTCTACTGTAGC
TTTTCTAGTTCTGTACGTAGATGATATCCTGCTTTTTGGAAATAATGTAGGATTCCTGTCTGACATAAAACAATGGCTAGCGACTCAATTCCAAATGAAAGATTTGGGTG
AAGCTCAATTTGTTTTGGGCAATCAAATCATTAGAAATAGAAAGAACAAAACGTTAGCACTGTCTCAAGCATCATATATCGACAAGATGTTGGTACGATATTGGATGCAA
AACTCCAAGAAAGAATTATTACCTTTTAGGCACGGGGTTCACGTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGCATTCCCTATGCATC
AGCAGTTGGCAGTTTGATGTATGTCATGCTGTGTACTAGACCCGACATATGTTTCCCATATTTATTGAATAATAATAATACAGATGACTTTGATGATGTGGCAATTGAGG
TTGAGGTGGTCAACGTCTTCCTCTCTCTCTCCTTTCTCTCTTCGCCTTCCTCATTACTTCTTTTCCCTTATTCTCCATTGGCCAAGATGAGGGGGGAAGACGTGGAATGC
AAGACAATAAACCTGCACACCGGTGTGGTGCTCGCCACACCGGCTCCGATGCTTAAGTCAGCAAGCAGAACGGTAGGGAGTGAAAAAGTCAAACCAGACAAAACCGGGGC
TGACAGAGGCGGTAGGGGCCTAACGGCATCGGACGGACTCGGCTTGCGCGAGCGGGCCGAGGGCCGAGCCCGATTTCTCCCCGGTTGTCCTCGTCAGCTCCTTGTATATC
GGGGTGGTCCAAAATTGCCTATAACATTAAGCCCCACTCTTGAATTGGGATTCGGGGTGAGAGATTACAAAAGTGGTGCTGGTCGGCCTCGGCCTCGAGAAGTGGCCGAG
CACAGCTATCTCCCTTTCTGCTTGCCTGGTCGGCCTCGACCTCGGGAAGAGGCCGAGCGTTGCCCCCGAACCCCCACCGGGCGCTTCGCCCCTGGACCCCGACTCGCTGA
CCGGTCAGCGAGACCCCCGTACTTGGTTATTCGGAGCGCCTCAATCCAGTTTCTAGTTTGCAAGAGAGAAGTTAGGGAAGAGGCTAGGATCGGTCTCGGCCTCAAGCCCC
CGAGAATACCTTTAAAGTTTGTGCTTGGCCTCGACAAGAGGTGCTTGGCCTTGGCAAGAGGTGCTCGGGCTTGGCAAGTGGTGCTCGGCCTTGGCAAGAGACTCGTTGGT
CCAGGCTCGCCTGATGTGGAGATGCTCAAGCCAGAGGATGTGAGATACTCAGGCCAGAGATGTGAGGTAGACTCGTTGGTCCAGGCTCACATGGTCTCACTCTGGATGAG
GCGAGGTAGACTCGGTAGTCCAGGCTCGCCTGATGTGGAGATGCTCAAGCCAGATGAGGCTCACATGGTCTCACTTTGGATGAGGCGAGGTAGACTCGGTAGTCCAGGCT
CGCCTGATGTGGAGATGCTCAAGCCAGATGAGGTGAGACTCGGTAGTCCAGGCTCGCCTGATGTGGAGATGCTCAAGCCAGATGAGGTGAGACTCGGTAGTCGAGGCTCG
CCTGATGTGGAGATGCTCAAGCCAGAGGAGATGAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTCGATCAATGATGAGCTATGCTCAGTTGCCTGATTCATTCTGGGGATATGCAGTAGAGACTGTTGTATACATTTTGAACATGGTTCCCTCTAAGAGTGTTCCAGA
AACACCCTATGAGCTATGGAGAGGGAGTAAAGGTAGTTTACATCACTTCAGAATTTTGGGATGCCCGACACAGTGTTGGTGCAAAACCCAAAGAAATTGGAACGACGTTC
AAAACCGTGCCTGTTTGTGGGATATCCTAAAGAAACGAAAGGGGTCCATAGTCCAGGGTGATGAGCAAAAAGTTTGCAAACTTAAACGATCCATTTATGGGTTAATACAA
GCATCAAGATCCTGGAATATACGTTTTGAAACTGCGATCAAAACATATGGCTTTGAGCAAAATATTGACGAACCTTGTGTTTACAAGAAAATCGTCAATTCTACTGTAGC
TTTTCTAGTTCTGTACGTAGATGATATCCTGCTTTTTGGAAATAATGTAGGATTCCTGTCTGACATAAAACAATGGCTAGCGACTCAATTCCAAATGAAAGATTTGGGTG
AAGCTCAATTTGTTTTGGGCAATCAAATCATTAGAAATAGAAAGAACAAAACGTTAGCACTGTCTCAAGCATCATATATCGACAAGATGTTGGTACGATATTGGATGCAA
AACTCCAAGAAAGAATTATTACCTTTTAGGCACGGGGTTCACGTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGCATTCCCTATGCATC
AGCAGTTGGCAGTTTGATGTATGTCATGCTGTGTACTAGACCCGACATATGTTTCCCATATTTATTGAATAATAATAATACAGATGACTTTGATGATGTGGCAATTGAGG
TTGAGGTGGTCAACGTCTTCCTCTCTCTCTCCTTTCTCTCTTCGCCTTCCTCATTACTTCTTTTCCCTTATTCTCCATTGGCCAAGATGAGGGGGGAAGACGTGGAATGC
AAGACAATAAACCTGCACACCGGTGTGGTGCTCGCCACACCGGCTCCGATGCTTAAGTCAGCAAGCAGAACGGTAGGGAGTGAAAAAGTCAAACCAGACAAAACCGGGGC
TGACAGAGGCGGTAGGGGCCTAACGGCATCGGACGGACTCGGCTTGCGCGAGCGGGCCGAGGGCCGAGCCCGATTTCTCCCCGGTTGTCCTCGTCAGCTCCTTGTATATC
GGGGTGGTCCAAAATTGCCTATAACATTAAGCCCCACTCTTGAATTGGGATTCGGGGTGAGAGATTACAAAAGTGGTGCTGGTCGGCCTCGGCCTCGAGAAGTGGCCGAG
CACAGCTATCTCCCTTTCTGCTTGCCTGGTCGGCCTCGACCTCGGGAAGAGGCCGAGCGTTGCCCCCGAACCCCCACCGGGCGCTTCGCCCCTGGACCCCGACTCGCTGA
CCGGTCAGCGAGACCCCCGTACTTGGTTATTCGGAGCGCCTCAATCCAGTTTCTAGTTTGCAAGAGAGAAGTTAGGGAAGAGGCTAGGATCGGTCTCGGCCTCAAGCCCC
CGAGAATACCTTTAAAGTTTGTGCTTGGCCTCGACAAGAGGTGCTTGGCCTTGGCAAGAGGTGCTCGGGCTTGGCAAGTGGTGCTCGGCCTTGGCAAGAGACTCGTTGGT
CCAGGCTCGCCTGATGTGGAGATGCTCAAGCCAGAGGATGTGAGATACTCAGGCCAGAGATGTGAGGTAGACTCGTTGGTCCAGGCTCACATGGTCTCACTCTGGATGAG
GCGAGGTAGACTCGGTAGTCCAGGCTCGCCTGATGTGGAGATGCTCAAGCCAGATGAGGCTCACATGGTCTCACTTTGGATGAGGCGAGGTAGACTCGGTAGTCCAGGCT
CGCCTGATGTGGAGATGCTCAAGCCAGATGAGGTGAGACTCGGTAGTCCAGGCTCGCCTGATGTGGAGATGCTCAAGCCAGATGAGGTGAGACTCGGTAGTCGAGGCTCG
CCTGATGTGGAGATGCTCAAGCCAGAGGAGATGAGGTAG
Protein sequenceShow/hide protein sequence
MVRSMMSYAQLPDSFWGYAVETVVYILNMVPSKSVPETPYELWRGSKGSLHHFRILGCPTQCWCKTQRNWNDVQNRACLWDILKKRKGSIVQGDEQKVCKLKRSIYGLIQ
ASRSWNIRFETAIKTYGFEQNIDEPCVYKKIVNSTVAFLVLYVDDILLFGNNVGFLSDIKQWLATQFQMKDLGEAQFVLGNQIIRNRKNKTLALSQASYIDKMLVRYWMQ
NSKKELLPFRHGVHVSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICFPYLLNNNNTDDFDDVAIEVEVVNVFLSLSFLSSPSSLLLFPYSPLAKMRGEDVEC
KTINLHTGVVLATPAPMLKSASRTVGSEKVKPDKTGADRGGRGLTASDGLGLRERAEGRARFLPGCPRQLLVYRGGPKLPITLSPTLELGFGVRDYKSGAGRPRPREVAE
HSYLPFCLPGRPRPREEAERCPRTPTGRFAPGPRLADRSARPPYLVIRSASIQFLVCKREVREEARIGLGLKPPRIPLKFVLGLDKRCLALARGARAWQVVLGLGKRLVG
PGSPDVEMLKPEDVRYSGQRCEVDSLVQAHMVSLWMRRGRLGSPGSPDVEMLKPDEAHMVSLWMRRGRLGSPGSPDVEMLKPDEVRLGSPGSPDVEMLKPDEVRLGSRGS
PDVEMLKPEEMR