CuGenDBv2

Lag0017913 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0017913
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr5:11518859..11520495
RNA-Seq Expression: Lag0017913
Synteny: Lag0017913
Gene Ontology terms: NA
InterPro domains: IPR025724 - GAG-pre-integrase domain
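The genome location above can be split into its parts programmatically. The following is a minimal sketch, assuming the `chr:start..end` notation uses 1-based, inclusive coordinates (the record itself does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start, end):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr5:11518859..11520495")
print(chrom, start, end, span_length(start, end))
# prints: chr5 11518859 11520495 1637
```

Under that assumption the gene spans 1637 bp on chr5.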


Homology
GenBank top hits | e value | % identity | Alignment
KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis] | 2.9e-48 | 30.16
Query:  NYLFWRFQVESILKAHSLFGVIDGSTPRPDKFSLQEDGTRSTTVNPALTQWTAQDNAMITLINAT---------VGYKTSREVWITLEKRFSSLT-----
        NY+ W+FQ+ SILKAHSL G IDG+ P P+KF   E G  +  +NP    W  QD A++TL+NAT         +GY TSRE W+ LE+RFS+ T     
Subjt:  NYLFWRFQVESILKAHSLFGVIDGSTPRPDKFSLQEDGTRSTTVNPALTQWTAQDNAMITLINAT---------VGYKTSREVWITLEKRFSSLT-----

Query:  ----------------------------------------------------------------SEAVTLEEFYALLKLEAKFIEQQNK-TTSIFNPTAM
                                                                        SE +TLEE YA+LK+E + IE  +K   S   P AM
Subjt:  ----------------------------------------------------------------SEAVTLEEFYALLKLEAKFIEQQNK-TTSIFNPTAM

Query:  TTTLGQGSSNSFRGRDRQSPSSLNFSPHGRGRG----IGG-----------------------------SESNSS------------------WSRPSSN
          T  + + +S RG    SPS  NFS  GRGRG     GG                               SN+S                  + R   +
Subjt:  TTTLGQGSSNSFRGRDRQSPSSLNFSPHGRGRG----IGG-----------------------------SESNSS------------------WSRPSSN

Query:  STGRHPPSQLAAMGTAVDPATNSS--FWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISV-----
          G+ P  QL AM    +  ++ S  +W +D+G  ++IT+D++NLN    Y G+D+I +A+GQ L ++  G  ++  +D+   L+NVLCVP ++      
Subjt:  STGRHPPSQLAAMGTAVDPATNSS--FWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISV-----

Query:  ----------------------KVTGKVLYKGQSKDGLYPLPFGGFKGGEVVS--------------SSDQAVSRPSVVYSCFHVYAFTLNRKTTDLWHF
                              K T ++L++G S  GLYPLP          S              ++   + R +  YS     A+   + +T LWH 
Subjt:  ----------------------KVTGKVLYKGQSKDGLYPLPFGGFKGGEVVS--------------SSDQAVSRPSVVYSCFHVYAFTLNRKTTDLWHF

Query:  RLGHPSPDVLRKILSQSSIYVGQSVNITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI
        RLGHPS   L+ ILS +SI   +  +   C  CL GKM+KL F  S   +T PL+L+HSD+WG +P  S     YY+
Subjt:  RLGHPSPDVLRKILSQSSIYVGQSVNITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI

KAA8535282.1 hypothetical protein F0562_030285 [Nyssa sinensis] | 1.2e-30 | 29.41
Query:  NYLFWRFQVESILKAHSLFGVIDGSTPRPDKFSLQEDGTRSTTVNPALTQWTAQDNAMITLINAT---------VGYKTSREVWITLEKRFSSLT-----
        NY+ W+FQ+ SILKAHSL G IDG+ P P+KF   E G  +  +NP    W  QD A++TL+NAT         +GY TSRE W+ LE+RFS+ T     
Subjt:  NYLFWRFQVESILKAHSLFGVIDGSTPRPDKFSLQEDGTRSTTVNPALTQWTAQDNAMITLINAT---------VGYKTSREVWITLEKRFSSLT-----

Query:  ----------------------------------------------------------------SEAVTLEEFYALLKLEAKFIEQQNK-TTSIFNPTAM
                                                                        SE +TLEE YA+LK+E + IE  +K   S   P AM
Subjt:  ----------------------------------------------------------------SEAVTLEEFYALLKLEAKFIEQQNK-TTSIFNPTAM

Query:  TTTLGQGSSNSFRGRDRQSPSSLNFSPHGRGRG----IGG-----------------------------SESNSS------------------WSRPSSN
          T  + + +S RG    SPS  NFS  GRGRG     GG                               SN+S                  + R   +
Subjt:  TTTLGQGSSNSFRGRDRQSPSSLNFSPHGRGRG----IGG-----------------------------SESNSS------------------WSRPSSN

Query:  STGRHPPSQLAAMGTAVDPATNSS--FWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISV-----
          G+ P  QL AM    +  ++ S  +W +D+G  ++IT+D++NLN    Y G+D+I +A+GQ L ++  G  ++  +D+   L+NVLCVP ++      
Subjt:  STGRHPPSQLAAMGTAVDPATNSS--FWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISV-----

Query:  ----------------------KVTGKVLYKGQSKDGLYPLP
                              K T ++L++G S  GLYPLP
Subjt:  ----------------------KVTGKVLYKGQSKDGLYPLP

RWR76373.1 putative polyprotein [Cinnamomum micranthum f. kanehirae] | 6.3e-43 | 28.54
Query:  NYLFWRFQVESILKAHSLFGVIDGSTPRPDKFSLQEDGTRSTTVNPALTQWTAQDNAMITLINAT---------VGYKTSREVWITLEKRFSSLTSEAVT
        NYL WR Q E +L +H L G +DGS   P+KF+   +   ++T+ PA  +W  QD  +++ I AT         +G +TSR  W+ +E+RF+SL S A T
Subjt:  NYLFWRFQVESILKAHSLFGVIDGSTPRPDKFSLQEDGTRSTTVNPALTQWTAQDNAMITLINAT---------VGYKTSREVWITLEKRFSSLTSEAVT

Query:  LEEFYALLKLE--AKFIEQQNKTTSI-------FNPTAMTTTLGQGSSN-----------------SFRGRDRQSPS----SLNFSP-------------
        +E    L  L+     +++ ++ T I       ++P  M       S +                 +      QSPS    +  F+P             
Subjt:  LEEFYALLKLE--AKFIEQQNKTTSI-------FNPTAMTTTLGQGSSN-----------------SFRGRDRQSPS----SLNFSP-------------

Query:  ---HGRGRGIGGSESNSSWS-RPSSNST-----------------------------------------GRHPPSQLAAMGTAVDPATNSSFWPSDSGCN
            GRGRG  G   + S++  P+SN T                                         G HPP++LAAM  +   +   + W +D+G  
Subjt:  ---HGRGRGIGGSESNSSWS-RPSSNST-----------------------------------------GRHPPSQLAAMGTAVDPATNSSFWPSDSGCN

Query:  SNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISV---------------------------KVTGKVLYKGQSK
         +ITS++ NL+L  +Y+  D + V +G  L ++ IG  ++S   ++  L+N+LCVP IS                            K +GK L++GQSK
Subjt:  SNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISV---------------------------KVTGKVLYKGQSK

Query:  DGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCFHVYAFTLNRKTTDLWHFRLGHPSPDVLRKILSQSSIYVGQSVNITE-CVGCLKGKMSKLHFQSSVI
        +GLYP P                   P+   +  H  AF   R T  +WH RLGHP+  V + + S   + V  S  ++  C  C  GK  KL F  S  
Subjt:  DGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCFHVYAFTLNRKTTDLWHFRLGHPSPDVLRKILSQSSIYVGQSVNITE-CVGCLKGKMSKLHFQSSVI

Query:  VTTKPLELLHSDVWGSSPVLSITGHRYYI
        +++ PL+L+H D+WGSSP LSI+G+ YY+
Subjt:  VTTKPLELLHSDVWGSSPVLSITGHRYYI

TQE01264.1 hypothetical protein C1H46_013171 [Malus baccata] | 4.2e-31 | 33.83
Query:  WSRPSSNSTGRHPPSQLAAMGTAVDP-ATNSSFWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDIS
        + R +    GR PPS L+AM T   P A    FW +D+G  S++TSD+SNLNL   ++G D++  ASG  LP++ IG   L        L N+L VP +S
Subjt:  WSRPSSNSTGRHPPSQLAAMGTAVDP-ATNSSFWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDIS

Query:  V---------------------------KVTGKVLYKGQSKDGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCFHVYAFTLNRKTTDLWHFRLGHPSPD
                                    K+TG+++ +G  ++GLYP+PF   +   + +    A  +    Y    V          +LWH RLGHPS  
Subjt:  V---------------------------KVTGKVLYKGQSKDGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCFHVYAFTLNRKTTDLWHFRLGHPSPD

Query:  VLRKILSQSSIYVGQSVNITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI
        V   +L QS I +      + C  CL+GK +KL F      T  PLE++HSDVWG S  +SI G+++Y+
Subjt:  VLRKILSQSSIYVGQSVNITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI

XP_022158189.1 uncharacterized protein LOC111024722 [Momordica charantia] | 3.6e-38 | 39.84
Query:  AMGTAVDPATNSSFWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISV------------------
        AM  +    T+++FW SDSGCN+++T+D+ NLNL  +YNGE+ + V +GQ+L ++  G G LS S +  ++SNVL  PD++                   
Subjt:  AMGTAVDPATNSSFWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISV------------------

Query:  ---------KVTGKVLYKGQSKDGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCFHVYAFTLNRKT-TDLWHFRLGHPSPDVLRKILSQSSIYVGQSVN
                 KVT   LYKG+S +GLYP+P          SSS  + +R  +     H     L  K  + LWH RLGH SP +LR  LS   + +  S N
Subjt:  ---------KVTGKVLYKGQSKDGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCFHVYAFTLNRKT-TDLWHFRLGHPSPDVLRKILSQSSIYVGQSVN

Query:  ITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI
          +C  CLK KMSKL F  S   +  PLE +HSDVWG SPV+S+TG RYY+
Subjt:  ITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI

TrEMBL top hits | e value | % identity | Alignment
A0A2N9G2N5 Uncharacterized protein | 5.2e-43 | 28.72
Query:  NYLFWRFQVESILKAHSLFGVIDGSTPRPDKFSLQEDGTRSTTVNPALTQWTAQDNAMITLINAT---------VGYKTSREVWITLEKRFSSLT-----
        N++ W+ Q+ SILKA+S+   +DG+ P P +F   E+GT ++  NP    W  +D A++TLIN+T         VG  +++ VW TLE+RF+S +     
Subjt:  NYLFWRFQVESILKAHSLFGVIDGSTPRPDKFSLQEDGTRSTTVNPALTQWTAQDNAMITLINAT---------VGYKTSREVWITLEKRFSSLT-----

Query:  ----------------------------------------------------------------SEAVTLEEFYALLKL-EAKFIEQQNKTTSI------
                                                                        +E V+ EE   LL+  E   +E  +    +      
Subjt:  ----------------------------------------------------------------SEAVTLEEFYALLKL-EAKFIEQQNKTTSI------

Query:  ---------FNPTAM-----TTTLGQGSSNSFRGRDRQSPSSLNFSPH-----------GRGRGIGGSESNSSWSRPSSN-------------------S
                 FN  +      + + G+G +NS RGR  ++ ++  +SPH           G+      S++ S  SRP                       
Subjt:  ---------FNPTAM-----TTTLGQGSSNSFRGRDRQSPSSLNFSPH-----------GRGRGIGGSESNSSWSRPSSN-------------------S

Query:  TGRHPPSQLAAMGTAVDPATNSSFWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNL----------------SISDNDLSL---
         GRHPP++LAAM +  +       W +D+G   ++T++MSNLN++  Y G D + V +GQ++P+  IG G L                 IS N LS+   
Subjt:  TGRHPPSQLAAMGTAVDPATNSSFWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNL----------------SISDNDLSL---

Query:  ---SNVLCVPD-----ISVKVTGKVLYKGQSKDGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCF--HVYAFTLNRKTTDLWHFRLGHPSPDVLRKIL-
           +N  C  D     I    +GKVLYKG S++GLYP+                  + PSV  +     V AF  ++    LWH RLGHPS  VL   L 
Subjt:  ---SNVLCVPD-----ISVKVTGKVLYKGQSKDGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCF--HVYAFTLNRKTTDLWHFRLGHPSPDVLRKIL-

Query:  SQSSIYVGQSVNIT-ECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI
        S SS    Q+ ++   C  CL GKM KL F+ S   +T+PLEL+ SDVWG +P+ S  G+RYYI
Subjt:  SQSSIYVGQSVNIT-ECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI

A0A2N9GCR2 Uncharacterized protein | 4.2e-45 | 28.52
Query:  NYLFWRFQVESILKAHSLFGVIDGSTPRPDKFSLQEDGTRSTTVNPALTQWTAQDNAMITLINAT---------VGYKTSREVWITLEKRFSSLT-----
        N++ W+ Q+ SILKA+S+   +DG+ P P +F   E+G  ++  NP    W  +D A++TLIN+T         VG  +++ VW TLE+RF+S +     
Subjt:  NYLFWRFQVESILKAHSLFGVIDGSTPRPDKFSLQEDGTRSTTVNPALTQWTAQDNAMITLINAT---------VGYKTSREVWITLEKRFSSLT-----

Query:  ----------------------------------------------------------------SEAVTLEEFYALLKLEAKFIEQQNKTTSIFNPTAM-
                                                                        +E V+ EE   LL+ E   + + + +       A+ 
Subjt:  ----------------------------------------------------------------SEAVTLEEFYALLKLEAKFIEQQNKTTSIFNPTAM-

Query:  --------------------TTTLGQGSSNSFRGRDRQSPSSLNFSPH-----------GRGRGIGGSESNSSWSRPSSN-------------------S
                            + + G+G +NS RGR  ++ ++  +SPH           G+      S++ S  SRP                       
Subjt:  --------------------TTTLGQGSSNSFRGRDRQSPSSLNFSPH-----------GRGRGIGGSESNSSWSRPSSN-------------------S

Query:  TGRHPPSQLAAMGTAVDPATNSSFWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSL---SNVLCVPDISVKVTGK
         GRHPP++LAAM +  + A     W +D+G   ++T++M+NLN++  Y G D + V +GQ++P+  IG  +    DN+      SN   + D+    +GK
Subjt:  TGRHPPSQLAAMGTAVDPATNSSFWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSL---SNVLCVPDISVKVTGK

Query:  VLYKGQSKDGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCF--HVYAFTLNRKTTDLWHFRLGHPSPDVLRKIL-SQSSIYVGQSVNIT-ECVGCLKGK
        VLYKG S++GLYP+                  + PSV  +     V AF  ++    LWH RLGHPS  VL   L S SS    Q+ ++   C  CL GK
Subjt:  VLYKGQSKDGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCF--HVYAFTLNRKTTDLWHFRLGHPSPDVLRKIL-SQSSIYVGQSVNIT-ECVGCLKGK

Query:  MSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI
        M KL F+ S   +T+PLEL+HSDVWG +P+ S  G+RYYI
Subjt:  MSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI

A0A2N9I8F3 Uncharacterized protein | 2.3e-43 | 28.18
Query:  NYLFWRFQVESILKAHSLFGVIDGSTPRPDKFSLQEDGTRSTTVNPALTQWTAQDNAMITLINAT---------VGYKTSREVWITLEKRFSSLT-----
        N++ W+ Q+ SILKA+S+   +DG+ P P +F +  DG  +TTVNP    W  +D  ++ LIN+T         VG+ +++EVW TLE RF+S +     
Subjt:  NYLFWRFQVESILKAHSLFGVIDGSTPRPDKFSLQEDGTRSTTVNPALTQWTAQDNAMITLINAT---------VGYKTSREVWITLEKRFSSLT-----

Query:  ----------------------------------------------------------------SEAVTLEEFYALLKLEAKFIEQQNKTTSIFNPTAM-
                                                                        +E VT EE   LL+ E +   + + +    +P AM 
Subjt:  ----------------------------------------------------------------SEAVTLEEFYALLKLEAKFIEQQNKTTSIFNPTAM-

Query:  -------------------TTTLGQGSSNSFRGRDRQ--SPSSLNFSPHGRGR-----------------GIGGSESNSSWSRPSSNSTGRHPPSQLAAM
                           T   G+G +NS RGR  +  + +   FS   +G                  G  G ++   + R      GRHPP++LAAM
Subjt:  -------------------TTTLGQGSSNSFRGRDRQ--SPSSLNFSPHGRGR-----------------GIGGSESNSSWSRPSSNSTGRHPPSQLAAM

Query:  GTAVDPATNSSFWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISVKV------------------
         +  + +     W +D+G   ++T++++NL     Y G + + V +GQ++P+  IG G LS  + +  L N+L    IS  +                  
Subjt:  GTAVDPATNSSFWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISVKV------------------

Query:  ---------TGKVLYKGQSKDGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCFHVYAFTLNRKTTDLWHFRLGHPSPDVLRKILSQSSIYVGQSVNITE
                 +GKVLYKG SK+GLYP+         + SSS  +   PS   S   V AF  ++    LWH RLGHPS  VL   +   S  +  S    +
Subjt:  ---------TGKVLYKGQSKDGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCFHVYAFTLNRKTTDLWHFRLGHPSPDVLRKILSQSSIYVGQSVNITE

Query:  --CVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI
          C  CL GKM +L F  S   +T+PLEL+HSDVWG +PV S  G++YY+
Subjt:  --CVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI

A0A443NCX3 Putative polyprotein | 3.0e-43 | 28.54
Query:  NYLFWRFQVESILKAHSLFGVIDGSTPRPDKFSLQEDGTRSTTVNPALTQWTAQDNAMITLINAT---------VGYKTSREVWITLEKRFSSLTSEAVT
        NYL WR Q E +L +H L G +DGS   P+KF+   +   ++T+ PA  +W  QD  +++ I AT         +G +TSR  W+ +E+RF+SL S A T
Subjt:  NYLFWRFQVESILKAHSLFGVIDGSTPRPDKFSLQEDGTRSTTVNPALTQWTAQDNAMITLINAT---------VGYKTSREVWITLEKRFSSLTSEAVT

Query:  LEEFYALLKLE--AKFIEQQNKTTSI-------FNPTAMTTTLGQGSSN-----------------SFRGRDRQSPS----SLNFSP-------------
        +E    L  L+     +++ ++ T I       ++P  M       S +                 +      QSPS    +  F+P             
Subjt:  LEEFYALLKLE--AKFIEQQNKTTSI-------FNPTAMTTTLGQGSSN-----------------SFRGRDRQSPS----SLNFSP-------------

Query:  ---HGRGRGIGGSESNSSWS-RPSSNST-----------------------------------------GRHPPSQLAAMGTAVDPATNSSFWPSDSGCN
            GRGRG  G   + S++  P+SN T                                         G HPP++LAAM  +   +   + W +D+G  
Subjt:  ---HGRGRGIGGSESNSSWS-RPSSNST-----------------------------------------GRHPPSQLAAMGTAVDPATNSSFWPSDSGCN

Query:  SNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISV---------------------------KVTGKVLYKGQSK
         +ITS++ NL+L  +Y+  D + V +G  L ++ IG  ++S   ++  L+N+LCVP IS                            K +GK L++GQSK
Subjt:  SNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISV---------------------------KVTGKVLYKGQSK

Query:  DGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCFHVYAFTLNRKTTDLWHFRLGHPSPDVLRKILSQSSIYVGQSVNITE-CVGCLKGKMSKLHFQSSVI
        +GLYP P                   P+   +  H  AF   R T  +WH RLGHP+  V + + S   + V  S  ++  C  C  GK  KL F  S  
Subjt:  DGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCFHVYAFTLNRKTTDLWHFRLGHPSPDVLRKILSQSSIYVGQSVNITE-CVGCLKGKMSKLHFQSSVI

Query:  VTTKPLELLHSDVWGSSPVLSITGHRYYI
        +++ PL+L+H D+WGSSP LSI+G+ YY+
Subjt:  VTTKPLELLHSDVWGSSPVLSITGHRYYI

A0A5J5A1U7 Integrase catalytic domain-containing protein | 1.4e-48 | 30.16
Query:  NYLFWRFQVESILKAHSLFGVIDGSTPRPDKFSLQEDGTRSTTVNPALTQWTAQDNAMITLINAT---------VGYKTSREVWITLEKRFSSLT-----
        NY+ W+FQ+ SILKAHSL G IDG+ P P+KF   E G  +  +NP    W  QD A++TL+NAT         +GY TSRE W+ LE+RFS+ T     
Subjt:  NYLFWRFQVESILKAHSLFGVIDGSTPRPDKFSLQEDGTRSTTVNPALTQWTAQDNAMITLINAT---------VGYKTSREVWITLEKRFSSLT-----

Query:  ----------------------------------------------------------------SEAVTLEEFYALLKLEAKFIEQQNK-TTSIFNPTAM
                                                                        SE +TLEE YA+LK+E + IE  +K   S   P AM
Subjt:  ----------------------------------------------------------------SEAVTLEEFYALLKLEAKFIEQQNK-TTSIFNPTAM

Query:  TTTLGQGSSNSFRGRDRQSPSSLNFSPHGRGRG----IGG-----------------------------SESNSS------------------WSRPSSN
          T  + + +S RG    SPS  NFS  GRGRG     GG                               SN+S                  + R   +
Subjt:  TTTLGQGSSNSFRGRDRQSPSSLNFSPHGRGRG----IGG-----------------------------SESNSS------------------WSRPSSN

Query:  STGRHPPSQLAAMGTAVDPATNSS--FWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISV-----
          G+ P  QL AM    +  ++ S  +W +D+G  ++IT+D++NLN    Y G+D+I +A+GQ L ++  G  ++  +D+   L+NVLCVP ++      
Subjt:  STGRHPPSQLAAMGTAVDPATNSS--FWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISV-----

Query:  ----------------------KVTGKVLYKGQSKDGLYPLPFGGFKGGEVVS--------------SSDQAVSRPSVVYSCFHVYAFTLNRKTTDLWHF
                              K T ++L++G S  GLYPLP          S              ++   + R +  YS     A+   + +T LWH 
Subjt:  ----------------------KVTGKVLYKGQSKDGLYPLPFGGFKGGEVVS--------------SSDQAVSRPSVVYSCFHVYAFTLNRKTTDLWHF

Query:  RLGHPSPDVLRKILSQSSIYVGQSVNITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI
        RLGHPS   L+ ILS +SI   +  +   C  CL GKM+KL F  S   +T PL+L+HSD+WG +P  S     YY+
Subjt:  RLGHPSPDVLRKILSQSSIYVGQSVNITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI

SwissProt top hits | e value | % identity | Alignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.0e-08 | 25.39
Query:  YNGEDSIVVASGQTLPVARIGFGNLSISDN---DLSLSNVLCVPDISVKVTGKVLYKGQSKDGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCFHVYAF
        Y   D   V  G T      G G++ I  N    L L +V  VPD+ + +   +       +  +         G +V +  + V+R ++  +   +   
Subjt:  YNGEDSIVVASGQTLPVARIGFGNLSISDN---DLSLSNVLCVPDISVKVTGKVLYKGQSKDGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCFHVYAF

Query:  TLNRK----TTDLWHFRLGHPSPDVLRKILSQSSIYVGQSVNITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI
         LN      + DLWH R+GH S   L+ +  +S I   +   +  C  CL GK  ++ FQ+S       L+L++SDV G   + S+ G++Y++
Subjt:  TLNRK----TTDLWHFRLGHPSPDVLRKILSQSSIYVGQSVNITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI

P93293 Uncharacterized mitochondrial protein AtMg00300 | 2.4e-05 | 30.14
Query:  TDLWHFRLGHPSPDVLRKILSQSSIYVGQSVNITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPV
        T LWH RL H S   +  ++ +  +   +  ++  C  C+ GK  +++F +    T  PL+ +HSD+WG+  V
Subjt:  TDLWHFRLGHPSPDVLRKILSQSSIYVGQSVNITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.9e-26 | 29.34
Query:  LLKLEAKFIEQQNKTTSIFNPTAMTTTLGQGSSNSFRGRDRQSPSSLNFSPHGR------GR----GIGGSESN--SSWSRPSSNSTGRHPPSQLAAMGT
        ++ + A  +  +N TT+  N            +N+   +  Q  SS NF P+        G+    G+ G  +   S      S+   + PPS       
Subjt:  LLKLEAKFIEQQNKTTSIFNPTAMTTTLGQGSSNSFRGRDRQSPSSLNFSPHGR------GR----GIGGSESN--SSWSRPSSNSTGRHPPSQLAAMGT

Query:  AVDPATNSSF----WPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISVKV----------------
          + A  S +    W  DSG   +ITSD +NL+L+  Y G D ++VA G T+P++  G  +LS     L+L N+L VP+I   +                
Subjt:  AVDPATNSSF----WPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISVKV----------------

Query:  -----------TGKVLYKGQSKDGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCFHVYAFTLNRKTTDLWHFRLGHPSPDVLRKILSQSSIYV-GQSVN
                   TG  L +G++KD LY  P           +S Q VS          ++A   ++ T   WH RLGHP+P +L  ++S  S+ V   S  
Subjt:  -----------TGKVLYKGQSKDGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCFHVYAFTLNRKTTDLWHFRLGHPSPDVLRKILSQSSIYV-GQSVN

Query:  ITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI
           C  CL  K +K+ F  S I +T+PLE ++SDVW SSP+LS   +RYY+
Subjt:  ITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 3.5e-28 | 33.59
Query:  STGRHPPSQLAAMGTAVDPATNSSFWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISVKV-----
        ST    P Q  A   AV+   N++ W  DSG   +ITSD +NL+ +  Y G D +++A G T+P+   G  +L  S   L L+ VL VP+I   +     
Subjt:  STGRHPPSQLAAMGTAVDPATNSSFWPSDSGCNSNITSDMSNLNLNGNYNGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISVKV-----

Query:  ----------------------TGKVLYKGQSKDGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCFHVYAFTLNRKTTDLWHFRLGHPSPDVLRKILSQ
                              TG  L +G++KD LY  P           +S QAVS          ++A   ++ T   WH RLGHPS  +L  ++S 
Subjt:  ----------------------TGKVLYKGQSKDGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCFHVYAFTLNRKTTDLWHFRLGHPSPDVLRKILSQ

Query:  SSIYV-GQSVNITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI
         S+ V   S  +  C  C   K  K+ F +S I ++KPLE ++SDVW SSP+LSI  +RYY+
Subjt:  SSIYV-GQSVNITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI

Arabidopsis top hits | e value | % identity | Alignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein | 1.7e-06 | 30.14
Query:  TDLWHFRLGHPSPDVLRKILSQSSIYVGQSVNITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPV
        T LWH RL H S   +  ++ +  +   +  ++  C  C+ GK  +++F +    T  PL+ +HSD+WG+  V
Subjt:  TDLWHFRLGHPSPDVLRKILSQSSIYVGQSVNITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPV


Sequences
CDS sequence
ATGAATTATCTTTTTTGGCGCTTCCAAGTTGAATCGATTCTGAAGGCACATTCACTTTTTGGAGTAATCGATGGATCTACTCCTCGGCCTGACAAATTTTCTCTTCAAGA
AGATGGAACACGATCTACTACAGTCAATCCAGCCTTGACTCAGTGGACTGCTCAAGACAATGCCATGATTACCTTGATTAATGCAACTGTTGGCTACAAAACCTCAAGAG
AAGTCTGGATAACTCTTGAGAAGCGATTTTCTTCTCTCACCAGCGAAGCTGTAACACTTGAGGAGTTTTATGCTCTACTCAAGCTTGAAGCCAAATTCATTGAGCAACAG
AATAAAACTACTTCAATTTTTAATCCAACGGCTATGACGACAACACTTGGACAAGGCTCCTCAAATTCCTTTCGTGGACGTGACCGTCAATCTCCTAGCAGCTTAAATTT
TTCACCACATGGTCGTGGACGTGGAATTGGAGGCTCTGAATCAAACTCTTCTTGGTCCCGACCTTCTTCGAATTCCACTGGACGTCATCCTCCGTCTCAACTTGCTGCTA
TGGGCACTGCAGTTGATCCTGCTACGAATTCTTCCTTTTGGCCTTCTGATAGTGGATGTAACTCCAACATTACATCTGATATGTCTAACCTGAATTTAAATGGAAACTAT
AATGGTGAAGATTCGATTGTTGTTGCTAGTGGCCAAACACTTCCTGTTGCTCGGATTGGCTTTGGTAATCTTTCGATATCTGATAATGACTTATCCTTATCCAATGTTCT
ATGTGTACCAGATATTTCGGTCAAGGTAACGGGCAAGGTTCTGTACAAAGGACAGAGTAAAGATGGTTTGTATCCCCTTCCATTTGGTGGTTTCAAAGGTGGTGAGGTTG
TTTCGTCCTCTGATCAGGCTGTTTCACGACCGTCAGTTGTTTACTCTTGTTTCCATGTTTATGCTTTTACTTTGAATAGAAAGACAACCGATTTGTGGCACTTTAGACTT
GGACATCCATCTCCCGATGTCCTTCGTAAGATTTTGTCTCAGTCTTCCATTTATGTTGGACAGTCTGTTAATATTACTGAATGTGTTGGTTGTCTTAAAGGCAAAATGAG
CAAATTACATTTTCAGTCTTCAGTAATAGTGACTACTAAGCCTCTTGAATTGTTACACAGTGATGTGTGGGGGTCTTCCCCTGTTTTATCTATTACTGGCCATAGATATT
ATATTTGA
mRNA sequence
ATGAATTATCTTTTTTGGCGCTTCCAAGTTGAATCGATTCTGAAGGCACATTCACTTTTTGGAGTAATCGATGGATCTACTCCTCGGCCTGACAAATTTTCTCTTCAAGA
AGATGGAACACGATCTACTACAGTCAATCCAGCCTTGACTCAGTGGACTGCTCAAGACAATGCCATGATTACCTTGATTAATGCAACTGTTGGCTACAAAACCTCAAGAG
AAGTCTGGATAACTCTTGAGAAGCGATTTTCTTCTCTCACCAGCGAAGCTGTAACACTTGAGGAGTTTTATGCTCTACTCAAGCTTGAAGCCAAATTCATTGAGCAACAG
AATAAAACTACTTCAATTTTTAATCCAACGGCTATGACGACAACACTTGGACAAGGCTCCTCAAATTCCTTTCGTGGACGTGACCGTCAATCTCCTAGCAGCTTAAATTT
TTCACCACATGGTCGTGGACGTGGAATTGGAGGCTCTGAATCAAACTCTTCTTGGTCCCGACCTTCTTCGAATTCCACTGGACGTCATCCTCCGTCTCAACTTGCTGCTA
TGGGCACTGCAGTTGATCCTGCTACGAATTCTTCCTTTTGGCCTTCTGATAGTGGATGTAACTCCAACATTACATCTGATATGTCTAACCTGAATTTAAATGGAAACTAT
AATGGTGAAGATTCGATTGTTGTTGCTAGTGGCCAAACACTTCCTGTTGCTCGGATTGGCTTTGGTAATCTTTCGATATCTGATAATGACTTATCCTTATCCAATGTTCT
ATGTGTACCAGATATTTCGGTCAAGGTAACGGGCAAGGTTCTGTACAAAGGACAGAGTAAAGATGGTTTGTATCCCCTTCCATTTGGTGGTTTCAAAGGTGGTGAGGTTG
TTTCGTCCTCTGATCAGGCTGTTTCACGACCGTCAGTTGTTTACTCTTGTTTCCATGTTTATGCTTTTACTTTGAATAGAAAGACAACCGATTTGTGGCACTTTAGACTT
GGACATCCATCTCCCGATGTCCTTCGTAAGATTTTGTCTCAGTCTTCCATTTATGTTGGACAGTCTGTTAATATTACTGAATGTGTTGGTTGTCTTAAAGGCAAAATGAG
CAAATTACATTTTCAGTCTTCAGTAATAGTGACTACTAAGCCTCTTGAATTGTTACACAGTGATGTGTGGGGGTCTTCCCCTGTTTTATCTATTACTGGCCATAGATATT
ATATTTGA
Protein sequence
MNYLFWRFQVESILKAHSLFGVIDGSTPRPDKFSLQEDGTRSTTVNPALTQWTAQDNAMITLINATVGYKTSREVWITLEKRFSSLTSEAVTLEEFYALLKLEAKFIEQQ
NKTTSIFNPTAMTTTLGQGSSNSFRGRDRQSPSSLNFSPHGRGRGIGGSESNSSWSRPSSNSTGRHPPSQLAAMGTAVDPATNSSFWPSDSGCNSNITSDMSNLNLNGNY
NGEDSIVVASGQTLPVARIGFGNLSISDNDLSLSNVLCVPDISVKVTGKVLYKGQSKDGLYPLPFGGFKGGEVVSSSDQAVSRPSVVYSCFHVYAFTLNRKTTDLWHFRL
GHPSPDVLRKILSQSSIYVGQSVNITECVGCLKGKMSKLHFQSSVIVTTKPLELLHSDVWGSSPVLSITGHRYYI
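
The relationship between the CDS and protein sequences listed above can be checked with a short translation routine. This is a sketch only, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and only the first 30 and last 24 nucleotides of the CDS are used here to keep the example short.

```python
# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon.

    Raises ValueError if the length is not a multiple of 3, and KeyError
    on any codon containing a base other than A, C, G or T.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 nt and last 24 nt of the CDS sequence shown in this record.
print(translate("ATGAATTATCTTTTTTGGCGCTTCCAAGTT"))  # prints: MNYLFWRFQV
print(translate("ACTGGCCATAGATATTATATTTGA"))        # prints: TGHRYYI*
```

Both fragments match the start (`MNYLFWRFQV`) and end (`TGHRYYI`) of the protein sequence above, with the final `TGA` codon translating to the stop marker.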