CuGenDBv2

Lag0021126 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021126
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr7:4847850..4859167
RNA-Seq Expression: Lag0021126
Synteny: Lag0021126
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036397 - Ribonuclease H superfamily
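The genome location field above can be split into chromosome, start, and end for downstream use. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr7:4847850..4859167")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 11318 bp on chr7.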


Homology
GenBank top hits | e value | % identity
XP_012837652.1 PREDICTED: uncharacterized protein LOC105958190 [Erythranthe guttata] | 1.8e-77 | 35.57
Query:  SVNFAESAREIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSC---GRCSCGGVKELTTYFQTEYVMAFLMGLNDS
        SV ++ SA E+W DL+ R+Q+ N PR+FQLR E++NLAQD  ++T Y+ KL ++W+ELT  R  C+C    +C+CGG+ ++  +F  EYVM FLMGLN+S
Subjt:  SVNFAESAREIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSC---GRCSCGGVKELTTYFQTEYVMAFLMGLNDS

Query:  FAQVRSQLLLMEPERIIQRAFSLVAQEVEQRASTTSPSSTTIPATTLLVKTNSANPSSSR--ATSSN---KKKERPMCTHYNIQGHTVDRCYKIHGYP--
        F   RSQ+LLMEP   I + FSL+ QE  QR+ + + S       T+L +  S+     R    +SN   KKK+RP CTH N  GHT++ CYK+HGYP  
Subjt:  FAQVRSQLLLMEPERIIQRAFSLVAQEVEQRASTTSPSSTTIPATTLLVKTNSANPSSSR--ATSSN---KKKERPMCTHYNIQGHTVDRCYKIHGYP--

Query:  ------------------------------------------------LEQCQGLLTLLQSHLNKAKTDSADTSNNTH----IAGTYLSDISSDIMQNTW
                                                          QCQ L++L  SHL+         S N H    + G   + I   +   +W
Subjt:  ------------------------------------------------LEQCQGLLTLLQSHLNKAKTDSADTSNNTH----IAGTYLSDISSDIMQNTW

Query:  VLDSGASAHICCSKKFFVNLKAISGMSISLPNRE----------------------------RIIDKYSLRTIGGVKIWQGQYLL-----QTDAMVDSQS
        ++DSGAS HI   K  F +L++I   S++LP+                               ++    L+ IG  +   G Y+L     ++ A+ DS S
Subjt:  VLDSGASAHICCSKKFFVNLKAISGMSISLPNRE----------------------------RIIDKYSLRTIGGVKIWQGQYLL-----QTDAMVDSQS

Query:  RCNSVSVSKKFHNCKNLATWHDRLRHPSDKHLDVLK--------------------------------GVTHQFSCVERPEQNSVVERRHQHLLNIARPL
         CN VS S          TWH+RL H S+K LD +K                                GV HQ+SCV+ P+QNSVVER+HQHLLN+AR L
Subjt:  RCNSVSVSKKFHNCKNLATWHDRLRHPSDKHLDVLK--------------------------------GVTHQFSCVERPEQNSVVERRHQHLLNIARPL

Query:  LFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHPDYHHLRV
         F+S++PI  W E +LTA Y++NRTPSR+LN  TP+  L  +   Y+ LRV
Subjt:  LFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHPDYHHLRV

XP_012856897.1 PREDICTED: uncharacterized protein LOC105976150 [Erythranthe guttata] | 1.6e-62 | 28.1
Query:  VDLYINLYYLHYSYGTSLVLVSKPLIESNYASWSQAMIIGLTVKNKM------------------------------------------SVNFAESAREI
        VD   + +YLH S G  LVLVS+ L E N+ASWS+AM I LTVKNK+                                          S+ +++SA E+
Subjt:  VDLYINLYYLHYSYGTSLVLVSKPLIESNYASWSQAMIIGLTVKNKM------------------------------------------SVNFAESAREI

Query:  WLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQVRSQLLLMEPE
        W DL  R+ + N PRIFQLR E+SNL QD Q++  YF KL ++W+EL+  R  C+CG C+CGGV++L  ++  E+VMAFLMGLN+S    R Q+LLM+P 
Subjt:  WLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQVRSQLLLMEPE

Query:  RIIQRAFSLVAQEVEQRASTTSPSSTTIPATTLLVKTNSANPS--SSRATSSNKKKERPMCTHYNIQGHTVDRCYKIHGYP-------------------
          I + F+LV+QE  QR+  +S +         +    S   S  +   TS+ K+KER  CTH NI GHT+D+CYK+HGYP                   
Subjt:  RIIQRAFSLVAQEVEQRASTTSPSSTTIPATTLLVKTNSANPS--SSRATSSNKKKERPMCTHYNIQGHTVDRCYKIHGYP-------------------

Query:  ----------------------------------LEQCQGLLTLLQSHLNKAKT--------DSADTSNNTHIAGTYL-SDISSDIMQNTWVLDSGASAH
                                            QCQ L+    + +   K         D A+ ++ + ++G  L + +      + W++DSGAS H
Subjt:  ----------------------------------LEQCQGLLTLLQSHLNKAKT--------DSADTSNNTHIAGTYL-SDISSDIMQNTWVLDSGASAH

Query:  ICCSKKFFVNLKAISGMSISLPNRERIIDKYS----------LRTIGGVKIWQGQYL--------LQTDAMVDSQSRCNSVSVSKKFHNCKNL----ATW
        IC  K  F +L  ++   + LP+   ++ +Y           L+ +  V  ++   +        L    + DS S        KK      +      W
Subjt:  ICCSKKFFVNLKAISGMSISLPNRERIIDKYS----------LRTIGGVKIWQGQYL--------LQTDAMVDSQSRCNSVSVSKKFHNCKNL----ATW

Query:  HDRLRHPSDKHLDVLK------------------------------------------------------------------------------------
        H+RL H     LD+L                                                                                     
Subjt:  HDRLRHPSDKHLDVLK------------------------------------------------------------------------------------

Query:  ---------------------------------------GVTHQFSCVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRV
                                               GV HQFSCV  P+QN+VVER+HQH+LN+AR LLF+S +PI  W E I TA +++NRTP+  
Subjt:  ---------------------------------------GVTHQFSCVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRV

Query:  LNWITPYYSLNQQHP----DYHHLRV
        LN ++P+  L   +P    DY+ L+V
Subjt:  LNWITPYYSLNQQHP----DYHHLRV

XP_012857659.1 PREDICTED: uncharacterized protein LOC105976934 [Erythranthe guttata] | 6.0e-65 | 29.14
Query:  SSSSVVDLYINLYYLHYSYGTSLVLVSKPLIESNYASWSQAMIIGLTVKNKM------------------------------------------SVNFAE
        ++ S +D   + YYLH S G  LVLVS  L E NYA+W++AM+I LTVKNK+                                          SV ++E
Subjt:  SSSSVVDLYINLYYLHYSYGTSLVLVSKPLIESNYASWSQAMIIGLTVKNKM------------------------------------------SVNFAE

Query:  SAREIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQVRSQLL
        SA +IW DL+ R+ + N PRIFQLR E++NL QDQQ++  YF KL ++W+EL   R  C+CGRCSCGGV +L  +   E+VM+FLMGLNDS A  R Q+L
Subjt:  SAREIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQVRSQLL

Query:  LMEPERIIQRAFSLVAQEVEQRASTTSPSSTTIPATTLLVKTNSANPSSSRAT------SSNKKKERPMCTHYNIQGHTVDRCYKIHGYP----------
        LM+P   I + F+LV+QE   R+   + SS    +     +    N    R        +++++K++  CTH +  GHTV++CY++HG+P          
Subjt:  LMEPERIIQRAFSLVAQEVEQRASTTSPSSTTIPATTLLVKTNSANPSSSRAT------SSNKKKERPMCTHYNIQGHTVDRCYKIHGYP----------

Query:  ----------------------------------------------LEQCQGLLTLLQSHL-NKA-------KTDSADTSNNTHIAGTYLSDI--SSDIM
                                                        QCQ LL+ + SHL NKA        ++  DTS+ + + G  L +   +   M
Subjt:  ----------------------------------------------LEQCQGLLTLLQSHL-NKA-------KTDSADTSNNTHIAGTYLSDI--SSDIM

Query:  QNTWVLDSGASAHICCSKKFFVNLKAISGMSISLPNRERII--------------------------------------------DKYS-------LRTI
         + W+LDSGAS HIC +K  F+N+K++S   + LP+   ++                                            D++S       +  I
Subjt:  QNTWVLDSGASAHICCSKKFFVNLKAISGMSISLPNRERII--------------------------------------------DKYS-------LRTI

Query:  GGVKIWQGQYLL--------------------------------------QTDAMVD--SQSRC---------------NSVSVS---------------
        G     QG Y+L                                      +    VD  S+S C               NS SVS               
Subjt:  GGVKIWQGQYLL--------------------------------------QTDAMVD--SQSRC---------------NSVSVS---------------

Query:  -----KKFHNCKNLA------TWHDRLRHPSD-----------------KHLDVLK----------------GVTHQFSCVERPEQNSVVERRHQHLLNI
               FH    L       TW   L+  S+                 K + V +                GV HQFSCV  P+QN++VER+HQH+LN+
Subjt:  -----KKFHNCKNLA------TWHDRLRHPSD-----------------KHLDVLK----------------GVTHQFSCVERPEQNSVVERRHQHLLNI

Query:  ARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHP-DYHHLR
        AR L F+S +PI  W E ILTA +++NR P+  LN ++PY  L    P DYH L+
Subjt:  ARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHP-DYHHLR

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia] | 3.3e-87 | 41.01
Query:  PSSSSVVDLYINLYYLHYSYGTSLVLVSKPLIESNYASWSQAMIIGLTVKNKM---------------------------------------SVNFAESA
        P +  VV+ + N Y+LH+S  TSLVLVS  L + NY SWS++++I LTVKNK+                                       SV F++SA
Subjt:  PSSSSVVDLYINLYYLHYSYGTSLVLVSKPLIESNYASWSQAMIIGLTVKNKM---------------------------------------SVNFAESA

Query:  REIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQVRSQLLLM
         EIWLDL++R+QR+NRPRIFQLR E+SNL QDQ ++T YF +L +LW+EL   R  CSCGRCS GGVK +  ++Q EYVMAFLMGLN SF+Q+R+QLLLM
Subjt:  REIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQVRSQLLLM

Query:  EPERIIQRAFSLVAQEVEQRASTTSPSSTTIPATTLLVKTNSANPS-SSRATSSNKKKERPMCTHYNIQGHTVDRCYKIHGYPLEQCQGLLTLLQSHLNK
        EP   I RAF+LVAQE++QR S + PS T+  A+ +   +NS+N   +S + S  K+K++ +CTH  I GHTVD+CYK+H YP           +S ++K
Subjt:  EPERIIQRAFSLVAQEVEQRASTTSPSSTTIPATTLLVKTNSANPS-SSRATSSNKKKERPMCTHYNIQGHTVDRCYKIHGYPLEQCQGLLTLLQSHLNK

Query:  AKTDSADTSNNTHIAGTYLSDISSDIMQNTWVLDSGASAHICCSKKFFVNLKAISGMSISLPNRERIIDKYSLRTIGGVKIWQGQYLLQTDAMVDSQSRC
          + +A +S +       +S   S I  +   L    +A  C                      +R++                 +L  T    D+ S  
Subjt:  AKTDSADTSNNTHIAGTYLSDISSDIMQNTWVLDSGASAHICCSKKFFVNLKAISGMSISLPNRERIIDKYSLRTIGGVKIWQGQYLLQTDAMVDSQSRC

Query:  NSVSVSKKFHNCKNLATWHDRLRHPSDKHLDVLKGVTHQFSCVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWIT
        + V+ ++     K   +  D+    S       KGV HQFSCV  PEQNSVVER+HQHLLN+AR L F+SR+P   WGE +LTAAY++NRTP+ VL+W T
Subjt:  NSVSVSKKFHNCKNLATWHDRLRHPSDKHLDVLKGVTHQFSCVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWIT

Query:  PYYSLNQQHPDYHHLRV
        PY  L     DY  L+V
Subjt:  PYYSLNQQHPDYHHLRV

XP_038904477.1 uncharacterized protein LOC120090845 [Benincasa hispida] | 1.2e-62 | 58.19
Query:  SVNFAESAREIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQ
        S+NF++SA+EIW+DLQ+RYQR+NRPR+FQLR E SNL+Q+Q ++TTY+AKL +LWNEL + R  CSCG+C+CGGVK L TYFQTEYV+AFLMGLNDS A 
Subjt:  SVNFAESAREIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQ

Query:  VRSQLLLMEPERIIQRAFSLVAQEVEQRASTTS--PSSTTIPATTLLVKTNSAN-----PSSSRATSSNKKKERPMCTHYNIQGHTVDRCYKIHGYP---
        +RSQLLLMEP+  I RAFSLVAQE++Q+A ++S   + ++  AT LLVK  S+      PS S  T+ NKKK+RP+CTH +IQGHTVDRCYK+HGYP   
Subjt:  VRSQLLLMEPERIIQRAFSLVAQEVEQRASTTS--PSSTTIPATTLLVKTNSAN-----PSSSRATSSNKKKERPMCTHYNIQGHTVDRCYKIHGYP---

Query:  ---------------------LEQCQGLLTLL
                              EQCQGLL +L
Subjt:  ---------------------LEQCQGLLTLL

TrEMBL top hits | e value | % identity
A0A2N9GFH3 Integrase catalytic domain-containing protein | 4.2e-64 | 38.03
Query:  SVNFAESAREIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQ
        SV +A +A+E+W DL++R+ + N PRIF+++  IS+L QDQ  ++ YF KL SLW+EL   RS  +   CSCG +K L    Q E VM FLMGLNDSFA 
Subjt:  SVNFAESAREIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQ

Query:  VRSQLLLMEPERIIQRAFSLVAQEVEQRA--STTSPSSTTIPATTLLVKTNSANPSSS-RATSSNKKKERPMCTHYNIQGHTVDRCYKIHGYP-------
        VR+Q+L+MEP   I +AFSLV QE  QR+   TT   ST     ++ + T S  P ++      + KKERP+C+H  I GH VD+CYK+HG+P       
Subjt:  VRSQLLLMEPERIIQRAFSLVAQEVEQRA--STTSPSSTTIPATTLLVKTNSANPSSS-RATSSNKKKERPMCTHYNIQGHTVDRCYKIHGYP-------

Query:  ---------------------LEQCQGLLTLLQSHLNKAKTDSADTSNN---------THIAGTYLSDISS----DIMQNTWVLDSGASAHICCSKKFFV
                               QCQ LL +L S  + +   S  + N+         T +A +     SS     +   T    SGA+ H+  S   F 
Subjt:  ---------------------LEQCQGLLTLLQSHLNKAKTDSADTSNN---------THIAGTYLSDISS----DIMQNTWVLDSGASAHICCSKKFFV

Query:  NLKAISGMSISLPNRERIIDKYSLRTIGGVKIWQGQYLLQTDAMVDSQSR-----CNSVSVSKKFHNCKNLATWHDRLRHPSDKHLD---VLKGVTHQFS
        ++ +     I LPN ++  D  S R IG  K   G Y+LQ   + DSQ       C ++   K   N      WH R  +  +  +D     +G  HQ S
Subjt:  NLKAISGMSISLPNRERIIDKYSLRTIGGVKIWQGQYLLQTDAMVDSQSR-----CNSVSVSKKFHNCKNLATWHDRLRHPSDKHLD---VLKGVTHQFS

Query:  CVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHPDYHHLRV
        C+  P+QNS VER+HQHLL +AR L F++ LP+  WG  +LTA Y++NR PS +LN  +PY  L +  P Y HLRV
Subjt:  CVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHPDYHHLRV

A0A2N9HCW1 Integrase catalytic domain-containing protein | 2.3e-70 | 36.03
Query:  SSSVVDLYINLYYLHYSYGTSLVLVSKPLIESNYASWSQAMIIGLTVKNKM------------------------------------------SVNFAES
        SS   D   N ++LH+      VLVS+PL   NY +WS++MI+ LT KNK+                                          SV +A +
Subjt:  SSSVVDLYINLYYLHYSYGTSLVLVSKPLIESNYASWSQAMIIGLTVKNKM------------------------------------------SVNFAES

Query:  AREIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQVRSQLLL
        A+E+W DL++R+ + N PRIF+++  IS+L QDQ  ++ YF KL SLW+EL   RS  +   CSCG +K L    Q E VM FLMGLNDSFA VR+Q+L+
Subjt:  AREIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQVRSQLLL

Query:  MEPERIIQRAFSLVAQEVEQRA--STTSPSSTTIPATTLLVKTNSANPSSS-RATSSNKKKERPMCTHYNIQGHTVDRCYKIHGYP--------------
        MEP   I +AFSLV QE  QR+   TT   ST     ++ + T S  P ++      + KKERP+C+H  I GH VD+CYK+HG+P              
Subjt:  MEPERIIQRAFSLVAQEVEQRA--STTSPSSTTIPATTLLVKTNSANPSSS-RATSSNKKKERPMCTHYNIQGHTVDRCYKIHGYP--------------

Query:  --------------LEQCQGLLTLLQSHLNKAKTDSADTSNN---------THIAGTYLSDISS----DIMQNTWVLDSGASAHICCSKKFFVNLKAISG
                        QCQ LL +L S  + +   S  + N+         T +A +     SS     +   T    SGA+ H+  S   F ++ +   
Subjt:  --------------LEQCQGLLTLLQSHLNKAKTDSADTSNN---------THIAGTYLSDISS----DIMQNTWVLDSGASAHICCSKKFFVNLKAISG

Query:  MSISLPNRERIIDKYSLRTIGGVKIWQGQYLLQTDAMVDSQSR-----CNSVSVSKKFHNCKNLATWHDRLRHPSDKHLD---VLKGVTHQFSCVERPEQ
          I LPN ++  D  S R IG  K   G Y+LQ   + DSQ       C ++   K   N      WH R  +  +  +D     +G  HQ SC+  P+Q
Subjt:  MSISLPNRERIIDKYSLRTIGGVKIWQGQYLLQTDAMVDSQSR-----CNSVSVSKKFHNCKNLATWHDRLRHPSDKHLD---VLKGVTHQFSCVERPEQ

Query:  NSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHPDYHHLRV
        NS VER+HQHLL +AR L F++ LP+  WG  +LTAAY++NR PS +LN  +PY  L +  P Y HLRV
Subjt:  NSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHPDYHHLRV

A0A2N9HL88 Integrase catalytic domain-containing protein | 6.4e-65 | 34.77
Query:  NLYYLHYSYGTSLVLVSKPLIESNYASWSQAMIIGLTVKNK---MSVNFAESAREIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLW
        N + L       L LV++ L   N+ +WS+    GL  + +      ++     +   DL++R+ + N PR++QL+  I++L+QDQ +++T++ KL +LW
Subjt:  NLYYLHYSYGTSLVLVSKPLIESNYASWSQAMIIGLTVKNK---MSVNFAESAREIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLW

Query:  NELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQVRSQLLLMEPERIIQRAFSLVAQEVEQRASTTSPSSTTIPATTLLVKTNSANPSS
        +EL   R + +   C+CG +K L  Y   EYVM FL+GLNDS+A +R Q+LLMEP   I + F+LV+QE  QR   + P    + + +      +  P  
Subjt:  NELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQVRSQLLLMEPERIIQRAFSLVAQEVEQRASTTSPSSTTIPATTLLVKTNSANPSS

Query:  SRATSSNKKKERPMCTHYNIQGHTVDRCYKIHGYPLEQCQGLLTLLQSHLNKAKTDSADTSNNTHIAGTYLSDISSDIMQNTWVLDSGASAHICCSKKFF
                KKERP+C+H  I GHTV++CYKIHGYP                         + N   A T+   I +     TWV+D+GA+ H+  S K F
Subjt:  SRATSSNKKKERPMCTHYNIQGHTVDRCYKIHGYPLEQCQGLLTLLQSHLNKAKTDSADTSNNTHIAGTYLSDISSDIMQNTWVLDSGASAHICCSKKFF

Query:  VNLKAISGMSISLPNRERIIDKYSLRTIGGVKIWQGQYLLQTDAMVDSQSRCNSVSVSKKFHNCKNL-----------------ATWHDRLRHPSDKHLD
          + +    ++ LPN E  +D    + IG  K  +G YLL+     D  S C  V + +     +NL                 A   D  R  S     
Subjt:  VNLKAISGMSISLPNRERIIDKYSLRTIGGVKIWQGQYLLQTDAMVDSQSRCNSVSVSKKFHNCKNL-----------------ATWHDRLRHPSDKHLD

Query:  VLKGVTHQFSCVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHPDYHHLRV
          +GV H  SCV  P+QNSVVER+HQH+LN+AR L F+S +P++ W + IL A Y++NRTPS VL   TP+  L    P Y HLRV
Subjt:  VLKGVTHQFSCVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHPDYHHLRV

A0A2N9I5J0 Integrase catalytic domain-containing protein | 2.3e-70 | 36.03
Query:  SSSVVDLYINLYYLHYSYGTSLVLVSKPLIESNYASWSQAMIIGLTVKNKM------------------------------------------SVNFAES
        SS   D   N ++LH+      VLVS+PL   NY +WS++MI+ LT KNK+                                          SV +A +
Subjt:  SSSVVDLYINLYYLHYSYGTSLVLVSKPLIESNYASWSQAMIIGLTVKNKM------------------------------------------SVNFAES

Query:  AREIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQVRSQLLL
        A+E+W DL++R+ + N PRIF+++  IS+L QDQ  ++ YF KL SLW+EL   RS  +   CSCG +K L    Q E VM FLMGLNDSFA VR+Q+L+
Subjt:  AREIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQVRSQLLL

Query:  MEPERIIQRAFSLVAQEVEQRA--STTSPSSTTIPATTLLVKTNSANPSSS-RATSSNKKKERPMCTHYNIQGHTVDRCYKIHGYP--------------
        MEP   I +AFSLV QE  QR+   TT   ST     ++ + T S  P ++      + KKERP+C+H  I GH VD+CYK+HG+P              
Subjt:  MEPERIIQRAFSLVAQEVEQRA--STTSPSSTTIPATTLLVKTNSANPSSS-RATSSNKKKERPMCTHYNIQGHTVDRCYKIHGYP--------------

Query:  --------------LEQCQGLLTLLQSHLNKAKTDSADTSNN---------THIAGTYLSDISS----DIMQNTWVLDSGASAHICCSKKFFVNLKAISG
                        QCQ LL +L S  + +   S  + N+         T +A +     SS     +   T    SGA+ H+  S   F ++ +   
Subjt:  --------------LEQCQGLLTLLQSHLNKAKTDSADTSNN---------THIAGTYLSDISS----DIMQNTWVLDSGASAHICCSKKFFVNLKAISG

Query:  MSISLPNRERIIDKYSLRTIGGVKIWQGQYLLQTDAMVDSQSR-----CNSVSVSKKFHNCKNLATWHDRLRHPSDKHLD---VLKGVTHQFSCVERPEQ
          I LPN ++  D  S R IG  K   G Y+LQ   + DSQ       C ++   K   N      WH R  +  +  +D     +G  HQ SC+  P+Q
Subjt:  MSISLPNRERIIDKYSLRTIGGVKIWQGQYLLQTDAMVDSQSR-----CNSVSVSKKFHNCKNLATWHDRLRHPSDKHLD---VLKGVTHQFSCVERPEQ

Query:  NSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHPDYHHLRV
        NS VER+HQHLL +AR L F++ LP+  WG  +LTAAY++NR PS +LN  +PY  L +  P Y HLRV
Subjt:  NSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHPDYHHLRV

A0A6J1DNP7 uncharacterized protein LOC111022065 | 1.6e-87 | 41.01
Query:  PSSSSVVDLYINLYYLHYSYGTSLVLVSKPLIESNYASWSQAMIIGLTVKNKM---------------------------------------SVNFAESA
        P +  VV+ + N Y+LH+S  TSLVLVS  L + NY SWS++++I LTVKNK+                                       SV F++SA
Subjt:  PSSSSVVDLYINLYYLHYSYGTSLVLVSKPLIESNYASWSQAMIIGLTVKNKM---------------------------------------SVNFAESA

Query:  REIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQVRSQLLLM
         EIWLDL++R+QR+NRPRIFQLR E+SNL QDQ ++T YF +L +LW+EL   R  CSCGRCS GGVK +  ++Q EYVMAFLMGLN SF+Q+R+QLLLM
Subjt:  REIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQVRSQLLLM

Query:  EPERIIQRAFSLVAQEVEQRASTTSPSSTTIPATTLLVKTNSANPS-SSRATSSNKKKERPMCTHYNIQGHTVDRCYKIHGYPLEQCQGLLTLLQSHLNK
        EP   I RAF+LVAQE++QR S + PS T+  A+ +   +NS+N   +S + S  K+K++ +CTH  I GHTVD+CYK+H YP           +S ++K
Subjt:  EPERIIQRAFSLVAQEVEQRASTTSPSSTTIPATTLLVKTNSANPS-SSRATSSNKKKERPMCTHYNIQGHTVDRCYKIHGYPLEQCQGLLTLLQSHLNK

Query:  AKTDSADTSNNTHIAGTYLSDISSDIMQNTWVLDSGASAHICCSKKFFVNLKAISGMSISLPNRERIIDKYSLRTIGGVKIWQGQYLLQTDAMVDSQSRC
          + +A +S +       +S   S I  +   L    +A  C                      +R++                 +L  T    D+ S  
Subjt:  AKTDSADTSNNTHIAGTYLSDISSDIMQNTWVLDSGASAHICCSKKFFVNLKAISGMSISLPNRERIIDKYSLRTIGGVKIWQGQYLLQTDAMVDSQSRC

Query:  NSVSVSKKFHNCKNLATWHDRLRHPSDKHLDVLKGVTHQFSCVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWIT
        + V+ ++     K   +  D+    S       KGV HQFSCV  PEQNSVVER+HQHLLN+AR L F+SR+P   WGE +LTAAY++NRTP+ VL+W T
Subjt:  NSVSVSKKFHNCKNLATWHDRLRHPSDKHLDVLKGVTHQFSCVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWIT

Query:  PYYSLNQQHPDYHHLRV
        PY  L     DY  L+V
Subjt:  PYYSLNQQHPDYHHLRV

SwissProt top hits | e value | % identity
P04146 Copia protein | 9.1e-08 | 36.36
Query:  VLKGVTHQFSCVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVL--NWITPYYSLNQQHPDYHHLRV
        V KG+++  +    P+ N V ER  + +   AR ++  ++L    WGE++LTA Y++NR PSR L  +  TPY   + + P   HLRV
Subjt:  VLKGVTHQFSCVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVL--NWITPYYSLNQQHPDYHHLRV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 4.5e-07 | 32.53
Query:  GVTHQFSCVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHPDYHHLRV
        G+ H+ +    P+ N V ER ++ ++   R +L  ++LP   WGE++ TA Y++NR+PS  L +  P      +   Y HL+V
Subjt:  GVTHQFSCVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHPDYHHLRV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 5.9e-07 | 32.53
Query:  GVTHQFSCVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHPDYHHLRV
        G++H  S    PE N + ER+H+H++     LL  + +P   W  +   A Y++NR P+ +L   +P+  L    P+Y  LRV
Subjt:  GVTHQFSCVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHPDYHHLRV

Q9SLE0 Gluconokinase | 5.0e-06 | 55.56
Query:  SDFQAMAIVLMGVSGSGKSTIGAMLANSMDSTFLDADDFHPISNK
        +D     I +MGVSG+GKSTIG ML  ++   FLDADDFH +SN+
Subjt:  SDFQAMAIVLMGVSGSGKSTIGAMLANSMDSTFLDADDFHPISNK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 3.5e-07 | 32.53
Query:  GVTHQFSCVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHPDYHHLRV
        G++H  S    PE N + ER+H+H++ +   LL  + +P   W  +   A Y++NR P+ +L   +P+  L  Q P+Y  L+V
Subjt:  GVTHQFSCVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIWGESILTAAYIVNRTPSRVLNWITPYYSLNQQHPDYHHLRV

Arabidopsis top hits | e value | % identity
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | 1.8e-11 | 31.33
Query:  YASWSQ--AMII-----GLTVKNKMSVNFAESAREIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSL--CSCGRCSCG
        Y  W Q  AM++      +T K   SV +AE+A ++W DL++ +      +I+QLR  ++ L Q   ++  YF KL+ +W EL+    +  C CG C+C 
Subjt:  YASWSQ--AMII-----GLTVKNKMSVNFAESAREIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSLWNELTASRSL--CSCGRCSCG

Query:  GVKELTTYFQTEYVMAFLMG--LNDSFAQVRSQLLLMEPERIIQRAFSLV
          K      + E    FLMG  LN  F  V ++++  +P   +  AF++V
Subjt:  GVKELTTYFQTEYVMAFLMG--LNDSFAQVRSQLLLMEPERIIQRAFSLV

AT2G16790.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein | 3.5e-07 | 55.56
Query:  SDFQAMAIVLMGVSGSGKSTIGAMLANSMDSTFLDADDFHPISNK
        +D     I +MGVSG+GKSTIG ML  ++   FLDADDFH +SN+
Subjt:  SDFQAMAIVLMGVSGSGKSTIGAMLANSMDSTFLDADDFHPISNK

AT2G16790.2 P-loop containing nucleoside triphosphate hydrolases superfamily protein | 1.0e-06 | 63.16
Query:  IVLMGVSGSGKSTIGAMLANSMDSTFLDADDFHPISNK
        I +MGVSG+GKSTIG ML  ++   FLDADDFH +SN+
Subjt:  IVLMGVSGSGKSTIGAMLANSMDSTFLDADDFHPISNK


Sequences
CDS sequence
ATGATTCAACCATCCAGTTCTTCCGTTGTAGATCTATACATCAACCTATACTATCTTCACTACTCATATGGCACCAGTTTGGTTTTGGTTTCAAAACCCTTAATAGAGTC
CAATTATGCATCTTGGAGTCAGGCTATGATCATAGGCCTCACCGTGAAGAATAAAATGAGCGTCAATTTTGCTGAATCTGCTCGTGAAATCTGGCTCGATCTCCAACAAC
GGTATCAGCGAAGGAATCGACCACGTATTTTTCAATTGCGGTGCGAAATCTCAAATCTCGCTCAAGATCAACAGACTCTTACCACCTATTTCGCCAAACTTAATTCTCTA
TGGAATGAACTCACTGCTTCTCGATCGTTATGTTCCTGCGGACGGTGTTCTTGTGGAGGAGTGAAGGAATTAACCACTTATTTTCAGACTGAATATGTTATGGCCTTCTT
AATGGGCTTAAACGACTCATTTGCTCAAGTTCGCTCCCAACTCTTACTTATGGAACCCGAACGCATAATCCAACGAGCCTTCTCTCTTGTTGCACAAGAAGTAGAGCAAA
GAGCTTCGACAACATCTCCTTCTTCTACAACCATTCCTGCTACCACTCTATTGGTAAAGACCAACTCTGCTAATCCAAGTTCTTCTCGAGCCACGAGTTCAAACAAAAAG
AAGGAGAGACCTATGTGCACTCACTACAACATCCAAGGTCATACTGTCGACCGTTGCTACAAAATTCACGGTTACCCCCTGGAACAATGCCAAGGATTACTAACCTTGTT
GCAATCTCACCTTAATAAGGCCAAGACTGATTCTGCTGATACTTCAAACAACACGCACATAGCAGGTACCTATCTCTCTGACATATCCTCTGATATTATGCAAAATACTT
GGGTTCTTGACTCCGGAGCTTCAGCTCATATCTGTTGTTCAAAAAAGTTTTTTGTCAACCTTAAGGCTATTTCTGGAATGTCTATATCTTTGCCAAACCGAGAGCGAATC
ATAGACAAGTACTCTTTGAGGACGATTGGCGGAGTTAAGATATGGCAAGGTCAGTATCTTCTTCAAACGGATGCTATGGTTGACTCACAGTCTCGTTGCAATTCTGTTTC
TGTTAGCAAGAAGTTTCATAATTGTAAAAATCTTGCTACTTGGCATGATAGACTAAGACACCCATCTGATAAACATTTAGATGTCTTGAAGGGTGTTACTCATCAATTTT
CCTGTGTGGAACGACCTGAACAGAATTCAGTGGTTGAGAGACGTCATCAACATCTACTTAACATTGCTCGGCCCTTACTCTTTAAGTCTCGGTTGCCCATTCAGATTTGG
GGAGAATCCATTTTAACAGCCGCATACATTGTTAACAGGACTCCTTCACGTGTTCTCAACTGGATAACTCCTTATTACAGTCTCAATCAGCAACATCCTGATTATCATCA
CCTAAGAGTCTCACTACAAGAAAACAGGAAATTCGGGATGCCCATACGGGAGGCATCCGAGGAGCTTTTGATGCCCTCGATATCGATGAAGAGGAAGAGGACTCGGAGAA
GACGAGGAGCTGGTGTTACGGACACTCGTGAAGGACTAACTAGTCGATATTGGTCTATATCCGTGGACACAGAAAATATGTCTGCAGTGAGAAGAGTGCAACTAAGAAAA
TTCCAGCGATTAAGAGGCGAGGAGGCTGCTGCGTTTTCGTTTGTTGGAGCGTCGTTGGCGAAGAACGGTCAAGTCTACAACGGAGGATCAATAACACACCAAGAAACGGT
GCATCAAGTTCATGAAGTCAACCAAACATCTGATTTTCAAGCTATGGCGATCGTGCTTATGGGAGTTAGTGGTTCTGGAAAATCTACTATTGGTGCGATGCTAGCCAACT
CCATGGACTCCACTTTTCTTGATGCTGATGATTTTCACCCAATTTCTAACAAGGGTATGCCACAACCATCGCACATTGTCAACAAGTATTATTGTGGAATGTCGGAACTC
AAATCATATGCTTCTCAAGTAGTTGATGGTACACTTGACATGAACGAGGATGAGATTTTTGTGCAAGTTTTTGGACCAGAGAAACATGGACGTGTTCGTGGTTATGGAGT
AGGGGTTCTTCTAATGTACGTGATCTTGATTGTCGTTTAA
mRNA sequence
ATGATTCAACCATCCAGTTCTTCCGTTGTAGATCTATACATCAACCTATACTATCTTCACTACTCATATGGCACCAGTTTGGTTTTGGTTTCAAAACCCTTAATAGAGTC
CAATTATGCATCTTGGAGTCAGGCTATGATCATAGGCCTCACCGTGAAGAATAAAATGAGCGTCAATTTTGCTGAATCTGCTCGTGAAATCTGGCTCGATCTCCAACAAC
GGTATCAGCGAAGGAATCGACCACGTATTTTTCAATTGCGGTGCGAAATCTCAAATCTCGCTCAAGATCAACAGACTCTTACCACCTATTTCGCCAAACTTAATTCTCTA
TGGAATGAACTCACTGCTTCTCGATCGTTATGTTCCTGCGGACGGTGTTCTTGTGGAGGAGTGAAGGAATTAACCACTTATTTTCAGACTGAATATGTTATGGCCTTCTT
AATGGGCTTAAACGACTCATTTGCTCAAGTTCGCTCCCAACTCTTACTTATGGAACCCGAACGCATAATCCAACGAGCCTTCTCTCTTGTTGCACAAGAAGTAGAGCAAA
GAGCTTCGACAACATCTCCTTCTTCTACAACCATTCCTGCTACCACTCTATTGGTAAAGACCAACTCTGCTAATCCAAGTTCTTCTCGAGCCACGAGTTCAAACAAAAAG
AAGGAGAGACCTATGTGCACTCACTACAACATCCAAGGTCATACTGTCGACCGTTGCTACAAAATTCACGGTTACCCCCTGGAACAATGCCAAGGATTACTAACCTTGTT
GCAATCTCACCTTAATAAGGCCAAGACTGATTCTGCTGATACTTCAAACAACACGCACATAGCAGGTACCTATCTCTCTGACATATCCTCTGATATTATGCAAAATACTT
GGGTTCTTGACTCCGGAGCTTCAGCTCATATCTGTTGTTCAAAAAAGTTTTTTGTCAACCTTAAGGCTATTTCTGGAATGTCTATATCTTTGCCAAACCGAGAGCGAATC
ATAGACAAGTACTCTTTGAGGACGATTGGCGGAGTTAAGATATGGCAAGGTCAGTATCTTCTTCAAACGGATGCTATGGTTGACTCACAGTCTCGTTGCAATTCTGTTTC
TGTTAGCAAGAAGTTTCATAATTGTAAAAATCTTGCTACTTGGCATGATAGACTAAGACACCCATCTGATAAACATTTAGATGTCTTGAAGGGTGTTACTCATCAATTTT
CCTGTGTGGAACGACCTGAACAGAATTCAGTGGTTGAGAGACGTCATCAACATCTACTTAACATTGCTCGGCCCTTACTCTTTAAGTCTCGGTTGCCCATTCAGATTTGG
GGAGAATCCATTTTAACAGCCGCATACATTGTTAACAGGACTCCTTCACGTGTTCTCAACTGGATAACTCCTTATTACAGTCTCAATCAGCAACATCCTGATTATCATCA
CCTAAGAGTCTCACTACAAGAAAACAGGAAATTCGGGATGCCCATACGGGAGGCATCCGAGGAGCTTTTGATGCCCTCGATATCGATGAAGAGGAAGAGGACTCGGAGAA
GACGAGGAGCTGGTGTTACGGACACTCGTGAAGGACTAACTAGTCGATATTGGTCTATATCCGTGGACACAGAAAATATGTCTGCAGTGAGAAGAGTGCAACTAAGAAAA
TTCCAGCGATTAAGAGGCGAGGAGGCTGCTGCGTTTTCGTTTGTTGGAGCGTCGTTGGCGAAGAACGGTCAAGTCTACAACGGAGGATCAATAACACACCAAGAAACGGT
GCATCAAGTTCATGAAGTCAACCAAACATCTGATTTTCAAGCTATGGCGATCGTGCTTATGGGAGTTAGTGGTTCTGGAAAATCTACTATTGGTGCGATGCTAGCCAACT
CCATGGACTCCACTTTTCTTGATGCTGATGATTTTCACCCAATTTCTAACAAGGGTATGCCACAACCATCGCACATTGTCAACAAGTATTATTGTGGAATGTCGGAACTC
AAATCATATGCTTCTCAAGTAGTTGATGGTACACTTGACATGAACGAGGATGAGATTTTTGTGCAAGTTTTTGGACCAGAGAAACATGGACGTGTTCGTGGTTATGGAGT
AGGGGTTCTTCTAATGTACGTGATCTTGATTGTCGTTTAA
Protein sequence
MIQPSSSSVVDLYINLYYLHYSYGTSLVLVSKPLIESNYASWSQAMIIGLTVKNKMSVNFAESAREIWLDLQQRYQRRNRPRIFQLRCEISNLAQDQQTLTTYFAKLNSL
WNELTASRSLCSCGRCSCGGVKELTTYFQTEYVMAFLMGLNDSFAQVRSQLLLMEPERIIQRAFSLVAQEVEQRASTTSPSSTTIPATTLLVKTNSANPSSSRATSSNKK
KERPMCTHYNIQGHTVDRCYKIHGYPLEQCQGLLTLLQSHLNKAKTDSADTSNNTHIAGTYLSDISSDIMQNTWVLDSGASAHICCSKKFFVNLKAISGMSISLPNRERI
IDKYSLRTIGGVKIWQGQYLLQTDAMVDSQSRCNSVSVSKKFHNCKNLATWHDRLRHPSDKHLDVLKGVTHQFSCVERPEQNSVVERRHQHLLNIARPLLFKSRLPIQIW
GESILTAAYIVNRTPSRVLNWITPYYSLNQQHPDYHHLRVSLQENRKFGMPIREASEELLMPSISMKRKRTRRRRGAGVTDTREGLTSRYWSISVDTENMSAVRRVQLRK
FQRLRGEEAAAFSFVGASLAKNGQVYNGGSITHQETVHQVHEVNQTSDFQAMAIVLMGVSGSGKSTIGAMLANSMDSTFLDADDFHPISNKGMPQPSHIVNKYYCGMSEL
KSYASQVVDGTLDMNEDEIFVQVFGPEKHGRVRGYGVGVLLMYVILIVV
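The protein sequence above should be recoverable from the CDS sequence by codon-wise translation. The sketch below illustrates that relationship under two assumptions that the record does not state: that the standard nuclear genetic code applies, and that translation starts at the first base of the CDS. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; an ambiguous base such as N
    raises KeyError rather than being guessed at.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and the final four codons of the CDS listed above.
print(translate("ATGATTCAACCATCCAGTTCTTCC"))
print(translate("ATTGTCGTTTAA"))
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result against the listed protein sequence is a quick consistency check on a downloaded record.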