CuGenDBv2

Lag0011919 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0011919
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr1:34983714..34984995
RNA-Seq Expression: Lag0011919
Synteny: Lag0011919
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits | e value | %identity | Alignment
XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber] | 3.0e-81 | 39.44
Query:  GNFPQRIREAKQKVQLAIEGLRGAGSREPLSQAEAQLEDVLQEEELYWKQRSR---------------------------------EGEWRQDRAMVLQL
        GN P++I+E K+ +   +   R       ++    ++ ++L  EE+ W+QRSR                                  G W+     + ++
Subjt:  GNFPQRIREAKQKVQLAIEGLRGAGSREPLSQAEAQLEDVLQEEELYWKQRSR---------------------------------EGEWRQDRAMVLQL

Query:  VTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSINE
           YFQ ++S+S P+       L  +  +V  EMN  L++ FT EEI  AL Q HP KAPGPDG+S  F++ +W+IVG  ++   L VLN   S   IN+
Subjt:  VTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSINE

Query:  TMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG---------------------------------------RIEW
        T I L+PKIK P ++SDFRPISLCN  YKLISKV+ NR+K+ILP +IS NQSAF+ G                                       R+EW
Subjt:  TMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG---------------------------------------RIEW

Query:  SFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADDSL
         F++ VM++MGF ++W  L++ C++SVS+S  +NG   G++ P+RGLRQGDP+SPY+FLLCA+G SSLL    R+  ISG  + + CP I+HLFFADDSL
Subjt:  SFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADDSL

Query:  LFFKANVVEVGAIRDLLIRYERASGR
        LF KAN  E   + D+L  YE ASG+
Subjt:  LFFKANVVEVGAIRDLLIRYERASGR

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber] | 4.9e-84 | 41.3
Query:  MGNFPQRIREAKQKVQ----LAIEGLRGAGSREPLSQAEAQLEDVLQEEELYWKQRSR---------------------------------EGEWRQDRA
        +GN P++I E ++ +     +  +G RGA     ++Q   +L D+L  EE+ W+QRS+                                 +G W   + 
Subjt:  MGNFPQRIREAKQKVQ----LAIEGLRGAGSREPLSQAEAQLEDVLQEEELYWKQRSR---------------------------------EGEWRQDRA

Query:  MVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSP
         +      YF+ ++++S P+    +  +  + R V  EMN +L K FT EE+L+ALKQ HP KAPGPDG+S +F+ N+W IVGPS+    L VLN     
Subjt:  MVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSP

Query:  GSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG--------------------------------------
          IN+T I LIPK   P R+++FRPISLCN +YK+ISKV+ NR K ILPN+IS NQSAF P                                       
Subjt:  GSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG--------------------------------------

Query:  -RIEWSFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFF
         R+EWSF++ VM+++GF ++W  LI+ CVSSVS+S  +NGE  GN+ PSRG+RQGDPLSP LFLLCAEGLS+L+  A R   I+G  + + CP I+HLFF
Subjt:  -RIEWSFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFF

Query:  ADDSLLFFKANVVEVGAIRDLLIRYERASGR
        ADDSLLF KA   E  A+  +L RYE ASG+
Subjt:  ADDSLLFFKANVVEVGAIRDLLIRYERASGR

XP_030502823.1 uncharacterized protein LOC115717993 [Cannabis sativa] | 1.4e-78 | 40.14
Query:  GNFPQRIREAKQKVQLAIEGLRGAGSR-----EPLSQAEAQLEDVLQEEELYWKQRSR---------------------------------EGEWRQDRA
        GN  ++I +A    QL +E L  A +R     + L ++EA LED+L++E++YW QRSR                                  G W   ++
Subjt:  GNFPQRIREAKQKVQLAIEGLRGAGSR-----EPLSQAEAQLEDVLQEEELYWKQRSR---------------------------------EGEWRQDRA

Query:  MVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSP
         +L  + DYF  +F+ S   ++    +L  +  SV ++MN +L+KPFT  E+  AL    P  +PG DG+S  FY+++W  VG  + ++ L+VLN+G   
Subjt:  MVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSP

Query:  GSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG--------------------------------------
          +N T+I LIPKIK P+R++D+RPISLCN   KLI+KV+VNR KH+LP++IS  QSAF+P                                       
Subjt:  GSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG--------------------------------------

Query:  -RIEWSFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFF
         R+EW F+  VM +MGFA +W  LI+ C+ + SFSF LNGE LG++IPSRGLRQG PLSPYLFL+C+EGLS LL+  E    + GF++ +  P ISHLFF
Subjt:  -RIEWSFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFF

Query:  ADDSLLFFKANVVEVGAIRDLLIRYERASGR
        ADDSLLF +AN     A++  L  Y+RASG+
Subjt:  ADDSLLFFKANVVEVGAIRDLLIRYERASGR

XP_030942013.1 uncharacterized protein LOC115967068 [Quercus lobata] | 3.6e-79 | 39.85
Query:  EPLSQAEAQLEDVLQEEELYWKQRSR---------------------------------EGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQ
        E +   + ++ +V   EE+ W QRSR                                 EG W++D+  V  ++ +YFQ++FSTS P   +F  SL  ++
Subjt:  EPLSQAEAQLEDVLQEEELYWKQRSR---------------------------------EGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQ

Query:  RSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFS
        R V  +MN DLL+ F EEE+ RALKQ HP K+PGP+ +S  F++++W +VGP V+   L  L  G  P  +N+T I LIPK+  P+++S+FRPISLCN  
Subjt:  RSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFS

Query:  YKLISKVVVNRMKHILPNLISSNQSAFIPG---------------------------------------RIEWSFLRAVMDRMGFAQQWTDLILRCVSSV
        YK++SKV+ NR+K +LP +IS  QSAF+PG                                       R+EW++L A+M R+GF ++W  L++ CV+SV
Subjt:  YKLISKVVVNRMKHILPNLISSNQSAFIPG---------------------------------------RIEWSFLRAVMDRMGFAQQWTDLILRCVSSV

Query:  SFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADDSLLFFKANVVEVGAIRDLLIRYERASGR
        S+S  LNGE  G ++P+RGLRQGDP+SPYLFLLCAEGLS++LR  E + +  G  V +  P +SHL FADD ++F  A+  E   +  +L  YER SG+
Subjt:  SFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADDSLLFFKANVVEVGAIRDLLIRYERASGR

XP_030964220.1 uncharacterized protein LOC115985421 [Quercus lobata] | 6.0e-82 | 39.95
Query:  MGNFPQRIREAKQKVQLAIEGLRGAGSR-EPLSQAEAQLEDVLQEEELYWKQRSR---------------------------------EGEWRQDRAMVL
        +G  P++I E K+++  +I  +   G R   ++Q   ++ D+L  EE  W+QRS+                                 +G W + +  + 
Subjt:  MGNFPQRIREAKQKVQLAIEGLRGAGSR-EPLSQAEAQLEDVLQEEELYWKQRSR---------------------------------EGEWRQDRAMVL

Query:  QLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSI
             YF+++++TS P+  +  +S   + R V  EMN++L K FT EE+L+AL+Q HP+KAPGPDG+S  F+ N+W IVG ++I   L+VLN       I
Subjt:  QLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSI

Query:  NETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIP---------------------------------------GRI
        N+T I LIPK   P ++++F PISLCN +YK+ISKV+ NR+K ILPN+IS NQSAF P                                        R+
Subjt:  NETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIP---------------------------------------GRI

Query:  EWSFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADD
        EW+F++ VM ++GF  +W DL++ CVSSVS+S  +NGE  GN+ PSRG+RQGDPLSP LFLLCAEG S+L+  A R   I+G  + + CP I+H FFADD
Subjt:  EWSFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADD

Query:  SLLFFKANVVEVGAIRDLLIRYERASGR
        SLLF KA V E  A+  +L +YE ASG+
Subjt:  SLLFFKANVVEVGAIRDLLIRYERASGR

TrEMBL top hits | e value | %identity | Alignment
A0A2N9F7A6 Uncharacterized protein | 6.1e-80 | 39.66
Query:  LRGAGSREPLSQAEAQLEDVLQEEELYWKQRSR-----EGE----------------------------WRQDRAMVLQLVTDYFQQLFSTSEPSDQDFD
        L  +G    L + + +L  +L++EE++W+QRSR     EG+                            W+ ++  +  +   YFQ +F++S+P     +
Subjt:  LRGAGSREPLSQAEAQLEDVLQEEELYWKQRSR-----EGE----------------------------WRQDRAMVLQLVTDYFQQLFSTSEPSDQDFD

Query:  VSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSINETMIVLIPKIKAPRRVSDFRP
          L  +   V ++MN +LL  FTEEE+  AL+Q +P KAPGPDG+S  FY+ +W +VGP V Q+ L++++ G     IN T I L+PKI +P +++DFRP
Subjt:  VSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSINETMIVLIPKIKAPRRVSDFRP

Query:  ISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG---------------------------------------RIEWSFLRAVMDRMGFAQQWTDLI
        I+LCN  YK+ISKV+ NR+K  LP ++S +QSAF+PG                                       R+EWSFL A+M R+GFA++W  L+
Subjt:  ISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG---------------------------------------RIEWSFLRAVMDRMGFAQQWTDLI

Query:  LRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADDSLLFFKANVVEVGAIRDLLIRY
        + C+ SVS+S  +NGE+ G    SRG+RQGD LSPYLFLLCAEGLS LLR AE    I+G   ++S P ++HLFFADDSLLF +AN+     + ++L +Y
Subjt:  LRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADDSLLFFKANVVEVGAIRDLLIRY

Query:  ERASGR
        E ASG+
Subjt:  ERASGR

A0A2N9GQ35 Reverse transcriptase domain-containing protein | 7.1e-81 | 39.95
Query:  EAKQKVQLAIEGLRGAGSREPLSQAEAQLEDVLQEEELYWKQRSR-----EGE----------------------------WRQDRAMVLQLVTDYFQQL
        + + +  L+I GL    SR  L + + +L  +L++EE++W+QRSR     EG+                            W+ ++  +  +   YFQ +
Subjt:  EAKQKVQLAIEGLRGAGSREPLSQAEAQLEDVLQEEELYWKQRSR-----EGE----------------------------WRQDRAMVLQLVTDYFQQL

Query:  FSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSINETMIVLIPK
        F++S+P     +  L  +   V ++MN +LL  FTEEE+  AL+Q +P KAPGPDG+S  FY+ +W +VGP V Q+ L++++ G     IN T I L+PK
Subjt:  FSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSINETMIVLIPK

Query:  IKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG---------------------------------------RIEWSFLRAVMD
        I +P +++DFRPI+LCN  YK+ISKV+ NR+K ILP ++S +QSAF+PG                                       R+EWSFL A+M 
Subjt:  IKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG---------------------------------------RIEWSFLRAVMD

Query:  RMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADDSLLFFKANVV
        R+GFA++W  LI+ C+ SVS+S  +NGE+ G    SRG+RQGD LSPYLFLLCAEGLS LLR AE    I+G   ++S P ++HLFFADDSLLF +AN+ 
Subjt:  RMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADDSLLFFKANVV

Query:  EVGAIRDLLIRYERASGR
            + ++L +YE ASG+
Subjt:  EVGAIRDLLIRYERASGR

A0A2N9I335 Reverse transcriptase domain-containing protein | 8.4e-82 | 40.28
Query:  MGNFPQRIREAKQKVQLAIEGLRGAGSREPLSQAEAQLEDVLQEEELYWKQRSR---------------------------------EGEWRQDRAMVLQ
        +G+   +I+E +  +Q  ++   G  S E +++ + +L  +L+ EE+YW+QRSR                                 EG  + D+  +  
Subjt:  MGNFPQRIREAKQKVQLAIEGLRGAGSREPLSQAEAQLEDVLQEEELYWKQRSR---------------------------------EGEWRQDRAMVLQ

Query:  LVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSIN
        +  DYFQ +FS+S P D+  +  L  L+R V  EMN  LL+ F  EE+ +ALKQ +P KAPGPDG+S  FY+ +W IVGP V Q+ L++L+ G     IN
Subjt:  LVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSIN

Query:  ETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG---------------------------------------RIE
         T I LIPK+K P R++DFRPISLCN  YK++SK++ NR+K +LP +IS +QSAF+PG                                       R+E
Subjt:  ETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG---------------------------------------RIE

Query:  WSFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADDS
        W F+ A+M R+GFA+ W +LI+ C+ SVS+S  +NGE+ G    SRG+RQGD LSPYLFLLCAEGLS LLR A     ISG   ++  P ++HLFFADDS
Subjt:  WSFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADDS

Query:  LLFFKANVVEVGAIRDLLIRYERASGR
        LLF +A +    A+  +L +YE  SG+
Subjt:  LLFFKANVVEVGAIRDLLIRYERASGR

A0A7N2L6Z9 Reverse transcriptase domain-containing protein | 6.1e-80 | 47.49
Query:  GEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLA
        G W +DR  +      YF+ ++STS PS  D +V+   +   +  EMN +L + FT EEI+ ALKQ HP K+PGPDG+S  F++ +W IVG +V    L 
Subjt:  GEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLA

Query:  VLNDGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIP-------------------------------
        VLN G S   IN+T IVLIPK   P+R++DFRPISLCN  YKLISK + NR+K  LP +I+ NQSAF                                 
Subjt:  VLNDGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIP-------------------------------

Query:  --------GRIEWSFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSC
                 R+EW F+  VM +MGF + W  LI+RC+SSVS+S  +NGE  GN++P+RGLRQGDPLSPYLFLLCAEGLS+LL  A R  L++G  + + C
Subjt:  --------GRIEWSFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSC

Query:  PPISHLFFADDSLLFFKANVVEVGAIRDLLIRYERASGR
        P I+HLFFADDSLLF KAN  E   ++++L +YE ASG+
Subjt:  PPISHLFFADDSLLFFKANVVEVGAIRDLLIRYERASGR

A0A7N2R0C3 Reverse transcriptase domain-containing protein | 6.1e-80 | 39.15
Query:  SREPLSQAEAQLEDVLQEEELYWKQRSR---------------------------------EGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRD
        S E + + + ++ +V+  EE+ W QRSR                                 EG WR++   V +++ +YF++++S++ P+  +F   L  
Subjt:  SREPLSQAEAQLEDVLQEEELYWKQRSR---------------------------------EGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRD

Query:  LQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCN
        + R V  +MN DLL+ F EEE+ +AL Q HP K+PGPDG+S  F++ +W +VGP V+QS +  L  G  P  +NET I LIPK+K P++++++RPISLCN
Subjt:  LQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCN

Query:  FSYKLISKVVVNRMKHILPNLISSNQSAFIPG---------------------------------------RIEWSFLRAVMDRMGFAQQWTDLILRCVS
          YKL+SKV+ NR+K +LP+++   QSAF+PG                                       R+EW +L A+M RMGF ++W  L++ CV+
Subjt:  FSYKLISKVVVNRMKHILPNLISSNQSAFIPG---------------------------------------RIEWSFLRAVMDRMGFAQQWTDLILRCVS

Query:  SVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADDSLLFFKANVVEVGAIRDLLIRYERASG
        +VSFS  +NGE  G ++P+RGLRQGDP+SPYLFLLCAEGLS++LR  E    +SG ++ +  P ISHL FADD ++F KA++ E   +  +L  YER SG
Subjt:  SVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADDSLLFFKANVVEVGAIRDLLIRYERASG

Query:  R
        +
Subjt:  R

SwissProt top hits | e value | %identity | Alignment
O00370 LINE-1 retrotransposable element ORF2 protein | 7.7e-24 | 23.91
Query:  RSREGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSL--RDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSV
        ++ +G+   D   +   + +Y++ L++    + ++ D  L    L R    E+   L +P T  EI+  +      K+PGPDG +  FY+ +   + P +
Subjt:  RSREGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSL--RDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSV

Query:  IQSCLAVLNDGCSPGSINETMIVLIPKI-KAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG-----------------------
        ++   ++  +G  P S  E  I+LIPK  +   +  +FRPISL N   K+++K++ NR++  +  LI  +Q  FIPG                       
Subjt:  IQSCLAVLNDGCSPGSINETMIVLIPKI-KAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG-----------------------

Query:  --------------RIEWSFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFR
                      +I+  F+   ++++G    +  +I       + +  LNG++L       G RQG PLSP LF +  E L+  +R  +    I G +
Subjt:  --------------RIEWSFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFR

Query:  VAQSCPPISHLFFADDSLLFFKANVVEVGAIRDLLIRYERASG
        + +    +S   FADD +++ +  +V    +  L+  + + SG
Subjt:  VAQSCPPISHLFFADDSLLFFKANVVEVGAIRDLLIRYERASG

P08548 LINE-1 reverse transcriptase homolog | 3.8e-23 | 24.85
Query:  RSREGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLL-KPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVI
        R+   E   D + + +++ +Y+++L+S    + ++ D  L        S+  V++L +P +  EI   ++     K+PGPDG +  FY+     + P ++
Subjt:  RSREGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLL-KPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVI

Query:  QSCLAVLNDGCSPGSINETMIVLIPKI-KAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPGRIEW--------------------
             +  +G  P +  E  I LIPK  K P R  ++RPISL N   K+++K++ NR++  +  +I  +Q  FIPG   W                    
Subjt:  QSCLAVLNDGCSPGSINETMIVLIPKI-KAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPGRIEW--------------------

Query:  -----------------SFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRV
                          F+   + ++G    +  LI    S  + +  LNG +L +     G RQG PLSP LF +  E L+  +R  E +A I G  +
Subjt:  -----------------SFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRV

Query:  AQSCPPISHLFFADDSLLFFKANVVEVGAIRDLLIRYERASG
              I    FADD +++ +        + +++  Y   SG
Subjt:  AQSCPPISHLFFADDSLLFFKANVVEVGAIRDLLIRYERASG

P11369 LINE-1 retrotransposable element ORF2 protein | 5.9e-24 | 26.87
Query:  AQLEDVLQEEELYWKQRSREGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVD-LLKPFTEEEILRALKQSHPHKAPGPDGLS
        A+L    +++ L  K R+ +G+   D   +   +  ++++L+ST   +  + D  L   Q    ++  VD L  P + +EI   +      K+PGPDG S
Subjt:  AQLEDVLQEEELYWKQRSREGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVD-LLKPFTEEEILRALKQSHPHKAPGPDGLS

Query:  GSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSINETMIVLIPK-IKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG--------
          FY+     + P + +    +  +G  P S  E  I LIPK  K P ++ +FRPISL N   K+++K++ NR++  +  +I  +Q  FIPG        
Subjt:  GSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSINETMIVLIPK-IKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG--------

Query:  -----------------------------RIEWSFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSS
                                     +I+  F+  V++R G    + ++I    S    +  +NGE+L  +    G RQG PLSPYLF +  E L+ 
Subjt:  -----------------------------RIEWSFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSS

Query:  LLRGAERRALISGFRVAQSCPPISHLFFADDSLLF
         +R   ++  I G ++ +    IS L  ADD +++
Subjt:  LLRGAERRALISGFRVAQSCPPISHLFFADDSLLF

P14381 Transposon TX1 uncharacterized 149 kDa protein | 1.8e-17 | 24.38
Query:  SREGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQS
        + +G   +D   +      ++Q LFS  +P   D    L D    V       L  P T +E+ +AL+    +K+PG DGL+  F++  W  +GP   + 
Subjt:  SREGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQS

Query:  CLAVLNDGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG---------------------------
               G  P S    ++ L+PK    R + ++RP+SL +  YK+++K +  R+K +L  +I  +QS  +PG                           
Subjt:  CLAVLNDGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLISSNQSAFIPG---------------------------

Query:  ---------RIEWSFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLR
                 R++  +L   +    F  Q+   +    +S      +N      +   RG+RQG PLS  L+ L  E    LLR
Subjt:  ---------RIEWSFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLR

P92555 Uncharacterized mitochondrial protein AtMg01250 | 1.7e-15 | 57.97
Query:  FNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADDS
        F +NG   G V PSRGLRQGDPLSPYLF+LC E LS L R A+ +  + G RV+ + P I+HL FADD+
Subjt:  FNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADDS

Arabidopsis top hits | e value | %identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | 3.2e-09 | 34.44
Query:  TEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLIS
        +++EI  A+     +KAPGPD  +  F+   W +V  S I +       G      N T I LIPK+    ++S FRP+S C   YK+I+
Subjt:  TEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLIS

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | 1.2e-16 | 57.97
Query:  FNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADDS
        F +NG   G V PSRGLRQGDPLSPYLF+LC E LS L R A+ +  + G RV+ + P I+HL FADD+
Subjt:  FNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQSCPPISHLFFADDS


Sequences
CDS sequence
ATGGGGAACTTCCCTCAGCGCATCAGGGAGGCCAAGCAGAAGGTACAGCTGGCCATTGAGGGATTGAGAGGGGCTGGGTCCCGTGAGCCACTTTCCCAGGCAGAG
GCCCAGTTGGAAGATGTGTTACAAGAGGAGGAACTTTACTGGAAGCAAAGATCCAGAGAGGGGGAATGGCGCCAGGACAGAGCTATGGTTCTTCAGTTGGTGACT
GATTATTTCCAGCAGCTTTTCTCGACATCAGAGCCGAGTGATCAGGATTTCGATGTATCCCTCCGGGACCTTCAGCGATCTGTGGATAGTGAAATGAATGTGGAT
CTGTTGAAACCTTTTACTGAGGAGGAGATTCTTCGGGCTTTGAAGCAGTCTCATCCTCATAAGGCCCCGGGTCCAGATGGGTTATCTGGCAGTTTCTATAAGAAT
CACTGGTCGATAGTGGGGCCTTCAGTGATCCAGAGTTGTCTGGCCGTTTTGAATGATGGATGCTCCCCAGGTTCGATTAATGAGACTATGATTGTTCTTATTCCG
AAGATCAAGGCCCCTCGACGAGTTTCTGATTTTCGTCCCATTTCCTTATGCAATTTTAGCTATAAGTTGATTTCGAAGGTTGTGGTTAATAGGATGAAACATATC
CTTCCAAATCTTATATCATCCAATCAGAGTGCCTTTATCCCTGGGAGGATAGAGTGGTCTTTTCTGCGGGCAGTTATGGATAGAATGGGTTTCGCTCAACAGTGG
ACTGATTTGATTCTCCGGTGTGTTAGCTCGGTCTCCTTTTCGTTTAACCTGAATGGGGAGAGGTTGGGGAATGTGATTCCTTCCCGTGGGCTCAGACAGGGAGAT
CCGTTGTCTCCGTATTTGTTTTTGCTCTGTGCGGAGGGTTTGTCGAGTCTGTTGCGAGGGGCAGAACGTCGAGCTTTGATATCTGGGTTTCGGGTTGCGCAGAGT
TGTCCTCCGATTTCTCACCTTTTTTTCGCAGATGATAGCCTCCTTTTCTTCAAAGCAAACGTGGTGGAGGTAGGGGCTATCAGGGATTTGTTGATCCGTTATGAA
CGAGCTTCGGGCAGATGA
mRNA sequence
ATGGGGAACTTCCCTCAGCGCATCAGGGAGGCCAAGCAGAAGGTACAGCTGGCCATTGAGGGATTGAGAGGGGCTGGGTCCCGTGAGCCACTTTCCCAGGCAGAG
GCCCAGTTGGAAGATGTGTTACAAGAGGAGGAACTTTACTGGAAGCAAAGATCCAGAGAGGGGGAATGGCGCCAGGACAGAGCTATGGTTCTTCAGTTGGTGACT
GATTATTTCCAGCAGCTTTTCTCGACATCAGAGCCGAGTGATCAGGATTTCGATGTATCCCTCCGGGACCTTCAGCGATCTGTGGATAGTGAAATGAATGTGGAT
CTGTTGAAACCTTTTACTGAGGAGGAGATTCTTCGGGCTTTGAAGCAGTCTCATCCTCATAAGGCCCCGGGTCCAGATGGGTTATCTGGCAGTTTCTATAAGAAT
CACTGGTCGATAGTGGGGCCTTCAGTGATCCAGAGTTGTCTGGCCGTTTTGAATGATGGATGCTCCCCAGGTTCGATTAATGAGACTATGATTGTTCTTATTCCG
AAGATCAAGGCCCCTCGACGAGTTTCTGATTTTCGTCCCATTTCCTTATGCAATTTTAGCTATAAGTTGATTTCGAAGGTTGTGGTTAATAGGATGAAACATATC
CTTCCAAATCTTATATCATCCAATCAGAGTGCCTTTATCCCTGGGAGGATAGAGTGGTCTTTTCTGCGGGCAGTTATGGATAGAATGGGTTTCGCTCAACAGTGG
ACTGATTTGATTCTCCGGTGTGTTAGCTCGGTCTCCTTTTCGTTTAACCTGAATGGGGAGAGGTTGGGGAATGTGATTCCTTCCCGTGGGCTCAGACAGGGAGAT
CCGTTGTCTCCGTATTTGTTTTTGCTCTGTGCGGAGGGTTTGTCGAGTCTGTTGCGAGGGGCAGAACGTCGAGCTTTGATATCTGGGTTTCGGGTTGCGCAGAGT
TGTCCTCCGATTTCTCACCTTTTTTTCGCAGATGATAGCCTCCTTTTCTTCAAAGCAAACGTGGTGGAGGTAGGGGCTATCAGGGATTTGTTGATCCGTTATGAA
CGAGCTTCGGGCAGATGA
Protein sequence
MGNFPQRIREAKQKVQLAIEGLRGAGSREPLSQAEAQLEDVLQEEELYWKQRSREGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVD
LLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNDGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHI
LPNLISSNQSAFIPGRIEWSFLRAVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVAQS
CPPISHLFFADDSLLFFKANVVEVGAIRDLLIRYERASGR
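
The protein sequence above appears to be the standard-code translation of the CDS sequence. As a minimal sketch (not part of the database record), the relationship can be checked by translating the first and last codons of the CDS and comparing them with the start and end of the protein. The `translate` helper and the variable names are hypothetical, and the use of the standard nuclear genetic code is an assumption.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (assumption: this nuclear gene uses the standard code).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 and last 18 nucleotides of the CDS sequence listed above.
first_codons = "ATGGGGAACTTCCCTCAGCGC"
last_codons = "CGAGCTTCGGGCAGATGA"

print(translate(first_codons))  # start of the protein: MGNFPQR
print(translate(last_codons))   # end of the protein:   RASGR
```

Passing the full CDS text (line breaks included) to `translate` should give the whole protein for comparison with the listed sequence.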