; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022006 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022006
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr7:15781196..15782856
RNA-Seq ExpressionLag0022006
SyntenyLag0022006
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]7.2e-5440.11Show/hide
Query:  SSTASKDLVSPIFLLSNICNLVSIRLDSSN-----------------------------------SVVP------YEDWIAKDHAFMTLINAILSLAAFA
        SS+A KD +SPIFLLSNICNL+S+RLDS+N                                   S VP      YEDWIAKD A MT+INA LS  A A
Subjt:  SSTASKDLVSPIFLLSNICNLVSIRLDSSN-----------------------------------SVVP------YEDWIAKDHAFMTLINAILSLAAFA

Query:  YVVGHETSKQMWDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPITFV
        YVVG  +SKQ+WD L   YSS SR+N+V+LKSDLQ+I KKP E+ID Y KRIKE+KDKL +V   I+ E L+IYALNGLP EY TFRTSMRT + P+TF 
Subjt:  YVVGHETSKQMWDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPITFV

Query:  ELHVLV-SEEIAIEKQIKRDKAFSLSTALFVAHNS--------NQNY------NSNFGRRR---------------------------------------
        ELHVL+ +EE A+ KQ K D +++  T L  +  S        N N+        N+G  R                                       
Subjt:  ELHVLV-SEEIAIEKQIKRDKAFSLSTALFVAHNS--------NQNY------NSNFGRRR---------------------------------------

Query:  --------------------------------------DSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTG
                                              DSGCN+H+T D++  +   +Y+GEEQVGVG+GQ+ PI+H+G
Subjt:  --------------------------------------DSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTG

KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia]1.3e-5543.26Show/hide
Query:  SSSTASKDLVSPIFLLSNICNLVSIRLDSSNSVV-----------------------------PYEDWIAKDHAFMTLINAILSLAAFAYVVGHETSKQM
        SSS+   +L+SPI LLSNICNL+SI+LDS+N V+                              Y+DW AKD A MT+INA LS  A AYVVG  TSKQ+
Subjt:  SSSTASKDLVSPIFLLSNICNLVSIRLDSSNSVV-----------------------------PYEDWIAKDHAFMTLINAILSLAAFAYVVGHETSKQM

Query:  WDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPITFVELHVLV-SEEI
        W+ L   YSSSSR+N+V+LKSDLQ+I+KK  E+ID Y KRIKE+KDKL +V  V+++E L+IYALNGLPTEY TFRTSMRT +TP+TF ELHVL+ +EE 
Subjt:  WDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPITFVELHVLV-SEEI

Query:  AIEKQIKRDKAFSLSTALFVAHNSNQNY----NSNF------GRRRDSGC---------------------NSH--------------------------
        A+ KQ KRD      TAL  +  S  +Y    N+NF      GR    GC                     N+H                          
Subjt:  AIEKQIKRDKAFSLSTALFVAHNSNQNY----NSNF------GRRRDSGC---------------------NSH--------------------------

Query:  ---------------------VTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTGQ
                             V  D++  +  S Y GEEQVGVGSGQSLPI+H+GQ
Subjt:  ---------------------VTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTGQ

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma]6.5e-5543.1Show/hide
Query:  SSSTASKDLVSPIFLLSNICNLVSIRLDSSNSVV-----------------------------PYEDWIAKDHAFMTLINAILSLAAFAYVVGHETSKQM
        SSS+   +L+SPI LLSNICNL+SI+LDS+N V+                              Y+DW AKD A MT+INA LS  A AYVVG  TSKQ+
Subjt:  SSSTASKDLVSPIFLLSNICNLVSIRLDSSNSVV-----------------------------PYEDWIAKDHAFMTLINAILSLAAFAYVVGHETSKQM

Query:  WDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPITFVELHVLV-SEEI
        W+ L   YSSSSR+N+V+LKSDLQ+I+KK  E+ID Y KRIKE+KDKL +V  V+++E L+IYALNGLPTEY TFRTSMRT +TP+TF ELHVL+ +EE 
Subjt:  WDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPITFVELHVLV-SEEI

Query:  AIEKQIKRDKAFSLSTALFVAHNSNQNY----NSNF------GRRRDSGC---------------------NSH--------------------------
        A+ KQ KRD      TAL  +  S  +Y    N+NF      GR    GC                     N+H                          
Subjt:  AIEKQIKRDKAFSLSTALFVAHNSNQNY----NSNF------GRRRDSGC---------------------NSH--------------------------

Query:  ---------------------VTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTG
                             V  D++  +  S Y GEEQVGVGSGQSLPI+H+G
Subjt:  ---------------------VTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTG

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]1.2e-5339.58Show/hide
Query:  SSTASKDLVSPIFLLSNICNLVSIRLDSSN-------------------------------------SVVP------YEDWIAKDHAFMTLINAILSLAA
        SS+A KD +SPIFLLSNICNL+S+RLDS+N                                     S VP      YEDWIAKD A MT+INA LS  A
Subjt:  SSTASKDLVSPIFLLSNICNLVSIRLDSSN-------------------------------------SVVP------YEDWIAKDHAFMTLINAILSLAA

Query:  FAYVVGHETSKQMWDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPIT
         AYVVG  +SKQ+WD L   YSS SR+N+V+LKSDLQ+I KKP E+ID Y KRIKE+KDKL +V   I+ E L+IYALNGLP EY TFRTSMRT + P+T
Subjt:  FAYVVGHETSKQMWDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPIT

Query:  FVELHVLV-SEEIAIEKQIKRDKAFSLSTAL-----------------FVAHN-----------------------------------------------
        F ELHVL+ +EE A+ KQ K D +++  T L                 FV  N                                               
Subjt:  FVELHVLV-SEEIAIEKQIKRDKAFSLSTAL-----------------FVAHN-----------------------------------------------

Query:  ---SNQNYN------------------------SNFGRRRDSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTGQIF
           +  NYN                         N     DSGCN+ +T D++  +   +Y+GEEQVG+G+GQ+ P++H+GQ+F
Subjt:  ---SNQNYN------------------------SNFGRRRDSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTGQIF

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]1.3e-5540.31Show/hide
Query:  SSTASKDLVSPIFLLSNICNLVSIRLDSSN-----------------------------------SVVP------YEDWIAKDHAFMTLINAILSLAAFA
        SS+A KD +SPIFLLSNICNL+S+RLDS+N                                   S VP      YEDWIAKD A MT+INA LS  A A
Subjt:  SSTASKDLVSPIFLLSNICNLVSIRLDSSN-----------------------------------SVVP------YEDWIAKDHAFMTLINAILSLAAFA

Query:  YVVGHETSKQMWDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPITFV
        YVVG  +SKQ+WD L   YSS SR+N+V+LKSDLQ+I KKP E+ID Y KRIKE+KDKL +V   I+ E L+IYALNGLP EY TFRTSMRT + P+TF 
Subjt:  YVVGHETSKQMWDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPITFV

Query:  ELHVLV-SEEIAIEKQIKRDKAFSLSTALFVAHNS--------NQNY------NSNFGRRR---------------------------------------
        ELHVL+ +EE A+ KQ K D +++  T L  +  S        N N+        N+G  R                                       
Subjt:  ELHVLV-SEEIAIEKQIKRDKAFSLSTALFVAHNS--------NQNY------NSNFGRRR---------------------------------------

Query:  --------------------------------------DSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTGQIF
                                              DSGCN+H+T D++  +   +Y+GEEQVGVG+GQ+ PI+H+GQ+F
Subjt:  --------------------------------------DSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTGQIF

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X23.3e-5239.37Show/hide
Query:  SSTASKDLVSPIFLLSNICNLVSIRLDSSN-------------------------------------SVVP------YEDWIAKDHAFMTLINAILSLAA
        SS+A KD +SPIFLLSNICNL+S+RLDS+N                                     S VP      YEDWIAKD A MT+INA LS  A
Subjt:  SSTASKDLVSPIFLLSNICNLVSIRLDSSN-------------------------------------SVVP------YEDWIAKDHAFMTLINAILSLAA

Query:  FAYVVGHETSKQMWDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPIT
         AYVVG  +SKQ+WD L   YSS SR+N+V+LKSDLQ+I KKP E+ID Y KRIKE+KDKL +V   I+ E L+IYALNGLP EY TFRTSMRT + P+T
Subjt:  FAYVVGHETSKQMWDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPIT

Query:  FVELHVLV-SEEIAIEKQIKRDKAFSLSTAL-----------------FVAHN-----------------------------------------------
        F ELHVL+ +EE A+ KQ K D +++  T L                 FV  N                                               
Subjt:  FVELHVLV-SEEIAIEKQIKRDKAFSLSTAL-----------------FVAHN-----------------------------------------------

Query:  ---SNQNYN------------------------SNFGRRRDSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTG
           +  NYN                         N     DSGCN+ +T D++  +   +Y+GEEQVG+G+GQ+ P++H+G
Subjt:  ---SNQNYN------------------------SNFGRRRDSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTG

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X36.0e-5439.58Show/hide
Query:  SSTASKDLVSPIFLLSNICNLVSIRLDSSN-------------------------------------SVVP------YEDWIAKDHAFMTLINAILSLAA
        SS+A KD +SPIFLLSNICNL+S+RLDS+N                                     S VP      YEDWIAKD A MT+INA LS  A
Subjt:  SSTASKDLVSPIFLLSNICNLVSIRLDSSN-------------------------------------SVVP------YEDWIAKDHAFMTLINAILSLAA

Query:  FAYVVGHETSKQMWDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPIT
         AYVVG  +SKQ+WD L   YSS SR+N+V+LKSDLQ+I KKP E+ID Y KRIKE+KDKL +V   I+ E L+IYALNGLP EY TFRTSMRT + P+T
Subjt:  FAYVVGHETSKQMWDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPIT

Query:  FVELHVLV-SEEIAIEKQIKRDKAFSLSTAL-----------------FVAHN-----------------------------------------------
        F ELHVL+ +EE A+ KQ K D +++  T L                 FV  N                                               
Subjt:  FVELHVLV-SEEIAIEKQIKRDKAFSLSTAL-----------------FVAHN-----------------------------------------------

Query:  ---SNQNYN------------------------SNFGRRRDSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTGQIF
           +  NYN                         N     DSGCN+ +T D++  +   +Y+GEEQVG+G+GQ+ P++H+GQ+F
Subjt:  ---SNQNYN------------------------SNFGRRRDSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTGQIF

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X13.3e-5239.37Show/hide
Query:  SSTASKDLVSPIFLLSNICNLVSIRLDSSN-------------------------------------SVVP------YEDWIAKDHAFMTLINAILSLAA
        SS+A KD +SPIFLLSNICNL+S+RLDS+N                                     S VP      YEDWIAKD A MT+INA LS  A
Subjt:  SSTASKDLVSPIFLLSNICNLVSIRLDSSN-------------------------------------SVVP------YEDWIAKDHAFMTLINAILSLAA

Query:  FAYVVGHETSKQMWDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPIT
         AYVVG  +SKQ+WD L   YSS SR+N+V+LKSDLQ+I KKP E+ID Y KRIKE+KDKL +V   I+ E L+IYALNGLP EY TFRTSMRT + P+T
Subjt:  FAYVVGHETSKQMWDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPIT

Query:  FVELHVLV-SEEIAIEKQIKRDKAFSLSTAL-----------------FVAHN-----------------------------------------------
        F ELHVL+ +EE A+ KQ K D +++  T L                 FV  N                                               
Subjt:  FVELHVLV-SEEIAIEKQIKRDKAFSLSTAL-----------------FVAHN-----------------------------------------------

Query:  ---SNQNYN------------------------SNFGRRRDSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTG
           +  NYN                         N     DSGCN+ +T D++  +   +Y+GEEQVG+G+GQ+ P++H+G
Subjt:  ---SNQNYN------------------------SNFGRRRDSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTG

A0A5D3CLI6 T4.51.1e-5237.56Show/hide
Query:  SSTASKDLVSPIFLLSNICNLVSIRLDSSN-------------------------------------SVVP------YEDWIAKDHAFMTLINAILSLAA
        SS+A KD +SPIFLLSNICNL+S+RLDS+N                                     S VP      YEDWIAKD A MT+INA LS  A
Subjt:  SSTASKDLVSPIFLLSNICNLVSIRLDSSN-------------------------------------SVVP------YEDWIAKDHAFMTLINAILSLAA

Query:  FAYVVGHETSKQMWDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPIT
         AYVVG  +SKQ+WD L   YSS SR+N+V+LKSDLQ+I KKP E+ID Y KRIKE+KDKL +V   I+ E L+IYALNGLP EY TFRTSMRT + P+T
Subjt:  FAYVVGHETSKQMWDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPIT

Query:  FVELHVLV-SEEIAIEKQIKRDKAFSLSTAL-----------------FVAHN-----------------------------------------------
        F ELHVL+ +EE A+ KQ K D +++  T L                 FV  N                                               
Subjt:  FVELHVLV-SEEIAIEKQIKRDKAFSLSTAL-----------------FVAHN-----------------------------------------------

Query:  ---SNQNYN------------------------SNFGRRRDSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTGQIFRNDPLLANHVAHVGIK
           +  NYN                         N     DSGCN+ +T D++  +   +Y+GEEQVG+G+GQ+ P++H+  +    P+  + VA+V  K
Subjt:  ---SNQNYN------------------------SNFGRRRDSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTGQIFRNDPLLANHVAHVGIK

Query:  ISSSLSHNRLGHLNTSVLNFVLRSLNLSPCNTSG
          SS +H  +      +L FV+  L+L   +  G
Subjt:  ISSSLSHNRLGHLNTSVLNFVLRSLNLSPCNTSG

A0A6J1D9L6 uncharacterized protein LOC1110188921.4e-5037.99Show/hide
Query:  VVDSSSTASKDLVSPIFLLSNICNLVSIRLDSSNSVV--------------------------------------------------PYEDWIAKDHAFM
        +  SS+   KDL SPIFLLSNICNLVSIRLDS++ ++                                                   +EDWIAKD A M
Subjt:  VVDSSSTASKDLVSPIFLLSNICNLVSIRLDSSNSVV--------------------------------------------------PYEDWIAKDHAFM

Query:  TLINAILSLAAFAYVVGHETSKQMWDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFR
        TLINA LS  A AYVV   TSKQ+W+ L  HYSS+SRTN+V+LKSDLQSI KK  E+ID Y KRIKE+KDK  +V + I++EYL+IYALNGL TEY T  
Subjt:  TLINAILSLAAFAYVVGHETSKQMWDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETIDRYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFR

Query:  TSMRTCATPITFVELHVLV-SEEIAIEKQIKRDKAFSLSTALFVAHNSNQNYNSNF----------------------------GRRRDSG---------
        TSMRT A  ++F ELHV + SEE AIEKQ+KR+   +   ALF +   +QN  S F                            GR R SG         
Subjt:  TSMRTCATPITFVELHVLV-SEEIAIEKQIKRDKAFSLSTALFVAHNSNQNYNSNF----------------------------GRRRDSG---------

Query:  --------------------------------------------------------------CNSHVTVDLSQF---TSTSKYSGEEQVGVGSGQSLPIN
                                                                      CN+H+T DLS     +  S Y+GEE + VGSGQS PI 
Subjt:  --------------------------------------------------------------CNSHVTVDLSQF---TSTSKYSGEEQVGVGSGQSLPIN

Query:  H--TGQIF
        H   GQ+F
Subjt:  H--TGQIF

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.7e-0533.98Show/hide
Query:  LSHNRLGHLNTSVL------NFVLRSLNLSPCNTSGCFCEHCIHGKLHKLPFPS-SDSVFL-QPLELVHADVWGLALEVSINGFNYYVSFVDDYSKYTWL
        L H R GH++   L      N       L+    S   CE C++GK  +LPF    D   + +PL +VH+DV G    V+++  NY+V FVD ++ Y   
Subjt:  LSHNRLGHLNTSVL------NFVLRSLNLSPCNTSGCFCEHCIHGKLHKLPFPS-SDSVFL-QPLELVHADVWGLALEVSINGFNYYVSFVDDYSKYTWL

Query:  YPI
        Y I
Subjt:  YPI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.3e-0934.34Show/hide
Query:  KISSSLSHNRLGHLNTSVLNFVLRSLNLSPC-NTSGCFCEHCIHGKLHKLPFPSSDSVFLQPLELVHADVWGLALEVSINGFNYYVSFVDDYSKYTWLY
        +IS  L H R+GH++   L  + +   +S    T+   C++C+ GK H++ F +S    L  L+LV++DV G     S+ G  Y+V+F+DD S+  W+Y
Subjt:  KISSSLSHNRLGHLNTSVLNFVLRSLNLSPC-NTSGCFCEHCIHGKLHKLPFPSSDSVFLQPLELVHADVWGLALEVSINGFNYYVSFVDDYSKYTWLY

P93293 Uncharacterized mitochondrial protein AtMg003001.7e-0533.33Show/hide
Query:  KISSSLSHNRLGHLNTSVLNFVLRSLNLSPCNTSGC-FCEHCIHGKLHKLPFPSSDSVFLQPLELVHADVWG
        K  + L H+RL H++   +  +++   L     S   FCE CI+GK H++ F +       PL+ VH+D+WG
Subjt:  KISSSLSHNRLGHLNTSVLNFVLRSLNLSPCNTSGC-FCEHCIHGKLHKLPFPSSDSVFLQPLELVHADVWG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-1526.64Show/hide
Query:  DSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTGQI---FRNDPLLANHVAHV----------------------------------------
        DSG   H+T D +  +    Y+G + V V  G ++PI+HTG      ++ PL  +++ +V                                        
Subjt:  DSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTGQI---FRNDPLLANHVAHV----------------------------------------

Query:  -------------------------GIKISSSLSHNRLGHLNTSVLNFVLRSLNLSPCNTSGCF--CEHCIHGKLHKLPFPSSDSVFLQPLELVHADVWG
                                   K + S  H RLGH   S+LN V+ + +LS  N S  F  C  C+  K +K+PF  S     +PLE +++DVW 
Subjt:  -------------------------GIKISSSLSHNRLGHLNTSVLNFVLRSLNLSPCNTSGCF--CEHCIHGKLHKLPFPSSDSVFLQPLELVHADVWG

Query:  LALEVSINGFNYYVSFVDDYSKYTWLYPI
          + +S + + YYV FVD +++YTWLYP+
Subjt:  LALEVSINGFNYYVSFVDDYSKYTWLYPI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.1e-1526.97Show/hide
Query:  NQNYNSNFGRRRDSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTG---------------------------QIFR----------------
        N  YN+N     DSG   H+T D +  +    Y+G + V +  G ++PI HTG                            ++R                
Subjt:  NQNYNSNFGRRRDSGCNSHVTVDLSQFTSTSKYSGEEQVGVGSGQSLPINHTG---------------------------QIFR----------------

Query:  -------NDPLL------------------ANHVAHVGIKISSSLSHNRLGHLNTSVLNFVLRSLNLSPCNTSG--CFCEHCIHGKLHKLPFPSSDSVFL
                 PLL                   +  A    K + S  H+RLGH + ++LN V+ + +L   N S     C  C   K HK+PF +S     
Subjt:  -------NDPLL------------------ANHVAHVGIKISSSLSHNRLGHLNTSVLNFVLRSLNLSPCNTSG--CFCEHCIHGKLHKLPFPSSDSVFL

Query:  QPLELVHADVWGLALEVSINGFNYYVSFVDDYSKYTWLYPI
        +PLE +++DVW   + +SI+ + YYV FVD +++YTWLYP+
Subjt:  QPLELVHADVWGLALEVSINGFNYYVSFVDDYSKYTWLYPI

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein1.2e-0633.33Show/hide
Query:  KISSSLSHNRLGHLNTSVLNFVLRSLNLSPCNTSGC-FCEHCIHGKLHKLPFPSSDSVFLQPLELVHADVWG
        K  + L H+RL H++   +  +++   L     S   FCE CI+GK H++ F +       PL+ VH+D+WG
Subjt:  KISSSLSHNRLGHLNTSVLNFVLRSLNLSPCNTSGC-FCEHCIHGKLHKLPFPSSDSVFLQPLELVHADVWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGTTGATTCTTCATCCACCGCTTCGAAAGATCTTGTTTCACCCATCTTTCTCTTATCAAACATCTGCAACCTTGTTTCCATTCGCTTGGACTCTTCCAATTCTGT
CGTACCTTATGAAGACTGGATTGCCAAGGATCATGCTTTCATGACCTTGATCAACGCGATTCTTTCTCTAGCTGCTTTTGCCTATGTTGTTGGACACGAGACCTCAAAGC
AAATGTGGGATACTCTTGTAACTCACTACTCCTCGAGTTCGCGAACAAACATTGTAAGTCTCAAATCTGATCTTCAGTCAATCACTAAAAAGCCTTCTGAAACAATTGAT
CGATATTTCAAAAGAATTAAAGAACTCAAAGATAAATTGGTGCATGTCTTTGTTGTTATTGACAATGAATATTTGGTAATCTATGCCTTAAATGGTTTGCCTACTGAGTA
CATTACCTTTAGGACGTCTATGAGAACTTGCGCTACTCCGATTACTTTTGTTGAATTACATGTCCTTGTTTCCGAAGAAATTGCGATTGAGAAACAAATCAAAAGAGACA
AAGCATTTTCCTTGTCTACTGCTCTTTTTGTAGCACATAATTCTAATCAGAATTATAATTCAAATTTTGGAAGAAGACGAGACTCTGGATGTAACTCTCATGTTACTGTT
GATCTAAGCCAATTTACCTCTACTTCTAAGTATTCTGGTGAAGAACAAGTTGGTGTAGGAAGTGGTCAATCTCTTCCAATAAATCATACAGGACAAATCTTCAGGAACGA
TCCTCTATTAGCCAATCATGTTGCTCATGTTGGAATAAAGATTTCTTCATCTCTTTCACATAACCGATTAGGACATCTGAATACTTCTGTTCTAAATTTTGTACTTCGTT
CTTTGAATTTGTCTCCTTGTAATACTTCTGGCTGTTTCTGTGAACATTGTATTCATGGCAAATTGCATAAGTTGCCCTTTCCATCATCTGATTCTGTTTTTTTACAACCT
TTAGAACTTGTTCATGCTGATGTATGGGGTCTTGCCCTTGAAGTCTCTATTAATGGCTTCAATTACTACGTATCTTTTGTAGATGATTACTCTAAATATACATGGCTTTA
TCCTATCTGTCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGTTGATTCTTCATCCACCGCTTCGAAAGATCTTGTTTCACCCATCTTTCTCTTATCAAACATCTGCAACCTTGTTTCCATTCGCTTGGACTCTTCCAATTCTGT
CGTACCTTATGAAGACTGGATTGCCAAGGATCATGCTTTCATGACCTTGATCAACGCGATTCTTTCTCTAGCTGCTTTTGCCTATGTTGTTGGACACGAGACCTCAAAGC
AAATGTGGGATACTCTTGTAACTCACTACTCCTCGAGTTCGCGAACAAACATTGTAAGTCTCAAATCTGATCTTCAGTCAATCACTAAAAAGCCTTCTGAAACAATTGAT
CGATATTTCAAAAGAATTAAAGAACTCAAAGATAAATTGGTGCATGTCTTTGTTGTTATTGACAATGAATATTTGGTAATCTATGCCTTAAATGGTTTGCCTACTGAGTA
CATTACCTTTAGGACGTCTATGAGAACTTGCGCTACTCCGATTACTTTTGTTGAATTACATGTCCTTGTTTCCGAAGAAATTGCGATTGAGAAACAAATCAAAAGAGACA
AAGCATTTTCCTTGTCTACTGCTCTTTTTGTAGCACATAATTCTAATCAGAATTATAATTCAAATTTTGGAAGAAGACGAGACTCTGGATGTAACTCTCATGTTACTGTT
GATCTAAGCCAATTTACCTCTACTTCTAAGTATTCTGGTGAAGAACAAGTTGGTGTAGGAAGTGGTCAATCTCTTCCAATAAATCATACAGGACAAATCTTCAGGAACGA
TCCTCTATTAGCCAATCATGTTGCTCATGTTGGAATAAAGATTTCTTCATCTCTTTCACATAACCGATTAGGACATCTGAATACTTCTGTTCTAAATTTTGTACTTCGTT
CTTTGAATTTGTCTCCTTGTAATACTTCTGGCTGTTTCTGTGAACATTGTATTCATGGCAAATTGCATAAGTTGCCCTTTCCATCATCTGATTCTGTTTTTTTACAACCT
TTAGAACTTGTTCATGCTGATGTATGGGGTCTTGCCCTTGAAGTCTCTATTAATGGCTTCAATTACTACGTATCTTTTGTAGATGATTACTCTAAATATACATGGCTTTA
TCCTATCTGTCTTTAA
Protein sequenceShow/hide protein sequence
MVVDSSSTASKDLVSPIFLLSNICNLVSIRLDSSNSVVPYEDWIAKDHAFMTLINAILSLAAFAYVVGHETSKQMWDTLVTHYSSSSRTNIVSLKSDLQSITKKPSETID
RYFKRIKELKDKLVHVFVVIDNEYLVIYALNGLPTEYITFRTSMRTCATPITFVELHVLVSEEIAIEKQIKRDKAFSLSTALFVAHNSNQNYNSNFGRRRDSGCNSHVTV
DLSQFTSTSKYSGEEQVGVGSGQSLPINHTGQIFRNDPLLANHVAHVGIKISSSLSHNRLGHLNTSVLNFVLRSLNLSPCNTSGCFCEHCIHGKLHKLPFPSSDSVFLQP
LELVHADVWGLALEVSINGFNYYVSFVDDYSKYTWLYPICL