CuGenDBv2

Lag0021974 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021974
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr7:15250899..15252457
RNA-Seq Expression: Lag0021974
Synteny: Lag0021974
Gene Ontology terms:
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily
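
The genome location field uses a chromosome:start..end notation. The sketch below shows one way to split it into its parts and compute the length of the gene span. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr7:15250899..15252457")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```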


Homology
GenBank top hits (e value, % identity, alignment)
XP_017250619.1 PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus] (e value: 2.5e-72, % identity: 41.21)
Query:  LLTDYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSIN
        ++  + +  F +  P+    D  L  +   V  +MN +L + FT  E+ RA+    P K+PGPDG++  F++ HW IVGP V ++ L  LN+      IN
Subjt:  LLTDYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSIN

Query:  ETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDMIE
         T++ L+ KVK P+ V  FRPISL NV YK I+K L+NR+K ILP +I+  QSAF+PGR + DNA++ +EC+H LR    GK  +  +KLDMSKAYD +E
Subjt:  ETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDMIE

Query:  WSFLRIVMARMGFAQQ---------------------------------QGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDS
        W F+  ++ ++GF+QQ                                 QGDP S YLFL+C EG S+LLR AE +  I G + AR++P ISHLFFA+DS
Subjt:  WSFLRIVMARMGFAQQ---------------------------------QGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDS

Query:  LLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKDTQ
        LLF +A+   + +I+++   Y   SGQ++N+ KS++ FSPNT  D +
Subjt:  LLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKDTQ

XP_018815621.1 uncharacterized protein LOC108987197 [Juglans regia] (e value: 9.2e-75, % identity: 37.44)
Query:  NCFCVDNRGKSGGLALLWNSSVTFSLLSYSNNHIDGWITWDDYHWRLTCIGFPTADMRDQTTKRRVAEINALFIL-------------------------
        +C  VD  G+ GG+ALLW   V+ S+LSYS+ HID  I  D           P   +     ++ + ++NAL  L                         
Subjt:  NCFCVDNRGKSGGLALLWNSSVTFSLLSYSNNHIDGWITWDDYHWRLTCIGFPTADMRDQTTKRRVAEINALFIL-------------------------

Query:  -----------QLLTDYVQQFFLSLDP-NDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCL
                   +++ DY ++ F S +P    DF   L  L   V S MN  L QP+TE E+  +L Q HP KAPGPDG+S  FY+ +W +VG SV ++ L
Subjt:  -----------QLLTDYVQQFFLSLDP-NDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCL

Query:  AVLNQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWAT
          LN G  P ++N T I+L+ K K P +V  +RPISL NV+YKLI+K + NR+K +LP +I  +QSAF+PGR + DN ++ +E +H L  +T GK  + +
Subjt:  AVLNQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWAT

Query:  LKLDMSKAYDMIEWSFLRIVMARMGFAQ---------------------------------QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARS
        +KLDMSKAYD +EW F+R VM  MGF +                                 +Q DP S YLFLLC EGL +LL+ A  +  IS  +  + 
Subjt:  LKLDMSKAYDMIEWSFLRIVMARMGFAQ---------------------------------QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARS

Query:  SPPISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPN
        +P I+HL FA+DS++F RA+      ++ LL  YE  SGQ +N EK+ + FS N
Subjt:  SPPISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPN

XP_028068804.1 uncharacterized protein LOC114271378 [Camellia sinensis] (e value: 1.5e-72, % identity: 46.15)
Query:  LQLLTDYVQQFFLSLDPNDQDFDIS--LRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSP
        L+ L   V  +F  L    Q  DI   L  +   V  E NV L +P+T +E+  AL Q HP KAPGPDG    FY+  W IVG  V ++ L VLN+G + 
Subjt:  LQLLTDYVQQFFLSLDPNDQDFDIS--LRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSP

Query:  GSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAY
         ++N+T IVL+ KVK+P+R+S FRPISL NV YKL+SK L NRM+ ILP +IS NQSAF+ GR + DN +  FE  H L+ +  GK     LKLDMSKAY
Subjt:  GSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAY

Query:  DMIEWSFLRIVMARMGFAQ---------------------------------QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFF
        D +EWSFLR VM RMGF Q                                 +QGDP S YLF+LC EGLS+L++ AE +  ++G    R SP +SHL F
Subjt:  DMIEWSFLRIVMARMGFAQ---------------------------------QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFF

Query:  ANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKDTQ
        A+DSLLF  AN   +V ++D+L +YE  SGQ +N EKS + FS N   D Q
Subjt:  ANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKDTQ

XP_030505314.1 uncharacterized protein LOC115720302 [Cannabis sativa] (e value: 3.3e-72, % identity: 44.54)
Query:  DYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETM
        DY    F +   +    D+ L  +P  +  EMN +L QPFT DEIL AL      K+PGPDG+S  F+ NHWS +GP +  + L VLN    P  IN+T+
Subjt:  DYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETM

Query:  IVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDMIEWSF
        I L+ KVK P  V+ +RPISL NV YKLISKA+V RMK +L  +IS  QSAFI  R ++DN ++ +E +H LR +T G+  +A LKLDMSKA+D +EW F
Subjt:  IVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDMIEWSF

Query:  LRIVMARMGFAQ---------------------------------QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDSLLF
        L  V+ +MGF                                    QGDP S YLFL+C E LS LL+  EQQ +++G    R +P +SHL FANDSLLF
Subjt:  LRIVMARMGFAQ---------------------------------QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDSLLF

Query:  FRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNT
         R +A  +  ++  L  Y   SGQ++NY+KSV++FSPNT
Subjt:  FRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNT

XP_030505314.1 uncharacterized protein LOC115720302 [Cannabis sativa] (e value: 1.8e-06, % identity: 33.06)
Query:  GVGVSPFIPTLGQVGASKIPLVLFLSETKLSSNRMAPAKRALGFENCFCVDNRGKSGGLALLWNSSVTFSLLSYSNNHIDGWITWDDY--HWRLTCIGFP
        G+G       L  + + + P +LFL ETKL +  +   + AL F N F V  RG  GGL LLW  ++  +LL+YS NHI  ++  D+    +     G P
Subjt:  GVGVSPFIPTLGQVGASKIPLVLFLSETKLSSNRMAPAKRALGFENCFCVDNRGKSGGLALLWNSSVTFSLLSYSNNHIDGWITWDDY--HWRLTCIGFP

Query:  TADMRDQTTK--RRVAEINAL
           +R  T +  +R+A+I+ +
Subjt:  TADMRDQTTK--RRVAEINAL

XP_030505314.1 uncharacterized protein LOC115720302 [Cannabis sativa] (e value: 2.1e-71, % identity: 43.39)
Query:  QLLTDYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSI
        + + +Y +Q F S  P+  +FD  L+ +   V   MN  L + FT DE+  ALKQ  P  APGPDG+S  FYK+ W+ +G  VI + LA+LN G  P S+
Subjt:  QLLTDYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSI

Query:  NETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDMI
        N T I L+ K+K+P + + FRPISL NV YK++SK + NR+K +LPKL+S +QSAF+  R + DN ++ FE +H L+ +T GK+ +  +KLDMSKAYD +
Subjt:  NETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDMI

Query:  EWSFLRIVMARMGF---------------------------------AQQQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFAND
        EW+FL  VM ++GF                                   +QGDP S YLFLLC EGL SL++  E    I G     + P +SHLFFA+D
Subjt:  EWSFLRIVMARMGF---------------------------------AQQQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFAND

Query:  SLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKDTQ
        SLLF RAN+  V +I ++L +YE  SGQ +N EK+ + FSPNT    Q
Subjt:  SLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKDTQ

TrEMBL top hits (e value, % identity, alignment)
A0A2N9GJ35 Uncharacterized protein (e value: 6.9e-76, % identity: 43.87)
Query:  MRDQTTKRRVAEINALFILQLLTDYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGP
        +RDQ +  R      L + Q+  DY    F S +P  +  D  L ++   V   MN VLM+PFT++EI RAL Q HP K+PGPDG+S  F++ +W IV  
Subjt:  MRDQTTKRRVAEINALFILQLLTDYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGP

Query:  SVIQSCLAVLNQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTG
         V  + L  L  G   GSIN T +VL+ KV AP  ++ FRPISL NV YK++SK LVNRMK ILP++IS +QSAF+PGR + DN I+ FE IH L+    
Subjt:  SVIQSCLAVLNQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTG

Query:  GKSKWATLKLDMSKAYDMIEWSFLRIVMARMGFAQQ---------------------------------QGDPRSLYLFLLCVEGLSSLLRGAEQQYLIS
        G +    +KLDMSKAYD +EW +L+ +M ++GF  Q                                 QGDP S YLFLLC EGLS++LR AE++ L+ 
Subjt:  GKSKWATLKLDMSKAYDMIEWSFLRIVMARMGFAQQ---------------------------------QGDPRSLYLFLLCVEGLSSLLRGAEQQYLIS

Query:  GFRFARSSPPISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKDTQ
        G    R  P +SHLFFA+DS++F RA     V +++LL +Y   SGQVVN +K+ + FSPNT + ++
Subjt:  GFRFARSSPPISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKDTQ

A0A2N9GJ35 Uncharacterized protein (e value: 3.5e-11, % identity: 37.5)
Query:  LKCSGVGVSPFIPTLGQVGASKIPLVLFLSETKLSSNRMAPAKRALGFENCFCVDNRGKSGGLALLWNSSVTFSLLSYSNNHIDGWITWDD-YHWRLT-C
        L C G+G    +  L  +   + P ++FL ET+L+   +   +  LG + C  V+  G+ GGLALLW+SSV  ++ SYS +HIDG +  +D   WRLT  
Subjt:  LKCSGVGVSPFIPTLGQVGASKIPLVLFLSETKLSSNRMAPAKRALGFENCFCVDNRGKSGGLALLWNSSVTFSLLSYSNNHIDGWITWDD-YHWRLT-C

Query:  IGFPTADMRDQT
         G+P A +R ++
Subjt:  IGFPTADMRDQT

A0A2N9GJ35 Uncharacterized protein (e value: 4.4e-75, % identity: 37.44)
Query:  NCFCVDNRGKSGGLALLWNSSVTFSLLSYSNNHIDGWITWDDYHWRLTCIGFPTADMRDQTTKRRVAEINALFIL-------------------------
        +C  VD  G+ GG+ALLW   V+ S+LSYS+ HID  I  D           P   +     ++ + ++NAL  L                         
Subjt:  NCFCVDNRGKSGGLALLWNSSVTFSLLSYSNNHIDGWITWDDYHWRLTCIGFPTADMRDQTTKRRVAEINALFIL-------------------------

Query:  -----------QLLTDYVQQFFLSLDP-NDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCL
                   +++ DY ++ F S +P    DF   L  L   V S MN  L QP+TE E+  +L Q HP KAPGPDG+S  FY+ +W +VG SV ++ L
Subjt:  -----------QLLTDYVQQFFLSLDP-NDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCL

Query:  AVLNQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWAT
          LN G  P ++N T I+L+ K K P +V  +RPISL NV+YKLI+K + NR+K +LP +I  +QSAF+PGR + DN ++ +E +H L  +T GK  + +
Subjt:  AVLNQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWAT

Query:  LKLDMSKAYDMIEWSFLRIVMARMGFAQ---------------------------------QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARS
        +KLDMSKAYD +EW F+R VM  MGF +                                 +Q DP S YLFLLC EGL +LL+ A  +  IS  +  + 
Subjt:  LKLDMSKAYDMIEWSFLRIVMARMGFAQ---------------------------------QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARS

Query:  SPPISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPN
        +P I+HL FA+DS++F RA+      ++ LL  YE  SGQ +N EK+ + FS N
Subjt:  SPPISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPN

A0A2N9I335 Reverse transcriptase domain-containing protein (e value: 3.2e-73, % identity: 42.42)
Query:  VAEINALFILQLLTDYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAV
        V + + + +  +  DY Q  F S +P D+  +  L  L R V  EMN +L++ F  +E+ +ALKQ +P KAPGPDG+S  FY+ +W IVGP V Q+ L++
Subjt:  VAEINALFILQLLTDYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAV

Query:  LNQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLK
        L+ G     IN T I L+ KVK P R++ FRPISL NV YK++SK L NR+K +LP +IS +QSAF+PGR + DN ++ FE +H +  +  G+     LK
Subjt:  LNQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLK

Query:  LDMSKAYDMIEWSFLRIVMARMGFAQ---------------------------------QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSP
        LDMSKAYD +EW F+  +M R+GFA+                                 +QGD  S YLFLLC EGLS LLR A  +  ISG   +R  P
Subjt:  LDMSKAYDMIEWSFLRIVMARMGFAQ---------------------------------QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSP

Query:  PISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKD
         ++HLFFA+DSLLF +A     +A+  +L +YE  SGQ +N  K+ + F+ NT  D
Subjt:  PISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKD

A0A803PQT0 Uncharacterized protein (e value: 3.8e-74, % identity: 34.77)
Query:  ASKIPLVLFLSETKLSSNRMAPAKRALGFENCFCVDNRGKSGGLALLWNSSVTFSLLSYSNNHIDGWITWDD---YHWRLTCIGFPTADMRDQTTKR---
        + + P VLFL ETKL S  ++  + +L F +   V   G SGG+ LLW S    ++ +YS+NHID ++ + D   +H+     G P  + R  T  +   
Subjt:  ASKIPLVLFLSETKLSSNRMAPAKRALGFENCFCVDNRGKSGGLALLWNSSVTFSLLSYSNNHIDGWITWDD---YHWRLTCIGFPTADMRDQTTKR---

Query:  --RVAEINALFIL----QLLTDYVQQFFLSLDPNDQDFD---------------------ISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAP
           VA +    ++    +   D  Q  F +L  ++ + D                       L  +P  +  E + +L QP+T +++  ALK    +++P
Subjt:  --RVAEINALFIL----QLLTDYVQQFFLSLDPNDQDFD---------------------ISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAP

Query:  GPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCV
        G DG+S  FY N+W IVG  V  + L VLN GC P  +N T+I L+ KVK P +++ +RPISL NV YKL+SK +V R++  L  +IS  QSAF+    +
Subjt:  GPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCV

Query:  VDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDMIEWSFLRIVMARMGFAQ---------------------------------QQGDPRSLYLFLL
         DN ++ FE +H ++ +  G   +A +KLDMSKA+D +EW  ++ VM +MGF                                   +QGDP S YLFL+
Subjt:  VDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDMIEWSFLRIVMARMGFAQ---------------------------------QQGDPRSLYLFLL

Query:  CVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKDTQ
        C E  S LL+  E Q  + G   +R +PP+SHL FA+DS+LF RA+     A+   L  + R SGQV+N++K V++FSPNTR   Q
Subjt:  CVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKDTQ

A0A803QP43 Uncharacterized protein (e value: 4.2e-73, % identity: 37.84)
Query:  LSETKLSSNRMAPAKRALGFENCFCVDNRGKSGGLALLWNSSVTFSLLSYSNNHIDGWITWD-DYHWRLTCIGFPTADMRDQTTKRRVAEINALF-----
        +++   + N  A    +LG   CF VD +GKSGGLALLW       + SY+++HID  +  +  + WR T  GF  +     +T+++   I  LF     
Subjt:  LSETKLSSNRMAPAKRALGFENCFCVDNRGKSGGLALLWNSSVTFSLLSYSNNHIDGWITWD-DYHWRLTCIGFPTADMRDQTTKRRVAEINALF-----

Query:  -------ILQLLTDYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVL
               I  +   Y +Q F   +      +I  R +P  ++ + N  L++PFT DE+  ++   HP KAPG DGL G F++  W  VG  VI +CL +L
Subjt:  -------ILQLLTDYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVL

Query:  NQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKL
        N       INET+I L+ KV  P ++  FRPISL NV YK++SK L NRMK  L  +IS NQSAFI GR + DNAI+GFE +H +R+   G  +   +KL
Subjt:  NQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKL

Query:  DMSKAYDMIEWSFLRIVMARMGF---------------------------------AQQQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPP
        DMSKAYD +EW FL  +M  +GF                                   +QGDP SL+LFLLC EGLS ++  AE+   I G RF      
Subjt:  DMSKAYDMIEWSFLRIVMARMGF---------------------------------AQQQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPP

Query:  ISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKS
        +SHL FA++SL+F  A       ++++L  YE+ SGQ +N+EK+
Subjt:  ISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKS

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 1.8e-20, % identity: 24.07)
Query:  LTDYVQQFFLSLDPNDQDFD--ISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSI
        + +Y +  + +   N ++ D  +    LPR    E+   L +P T  EI+  +      K+PGPDG +  FY+ +   + P +++   ++  +G  P S 
Subjt:  LTDYVQQFFLSLDPNDQDFD--ISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSI

Query:  NETMIVLVSKV-KAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDM
         E  I+L+ K  +   +   FRPISL N+  K+++K L NR++  + KLI  +Q  FIPG     N       I  + R          + +D  KA+D 
Subjt:  NETMIVLVSKV-KAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDM

Query:  IEWSFLRIVMARMGF---------------------------------AQQQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFAN
        I+  F+   + ++G                                    +QG P S  LF + +E L+  +R   Q+  I G +  +    +S   FA+
Subjt:  IEWSFLRIVMARMGF---------------------------------AQQQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFAN

Query:  DSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKDTQ
        D +++     +    +  L+  + + SG  +N +KS  AF  N  + T+
Subjt:  DSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKDTQ

P08548 LINE-1 reverse transcriptase homolog (e value: 2.5e-19, % identity: 24)
Query:  ILQLLTDYVQQFFLSLDPNDQDFDISLR--DLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCS
        I ++L +Y ++ +     N ++ D  L    LPR    E+  +L +P +  EI   ++     K+PGPDG +  FY+     + P ++     +  +G  
Subjt:  ILQLLTDYVQQFFLSLDPNDQDFDISLR--DLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCS

Query:  PGSINETMIVLVSKV-KAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSK
        P +  E  I L+ K  K P R   +RPISL N+  K+++K L NR++  + K+I  +Q  FIPG     N       I  + +          L +D  K
Subjt:  PGSINETMIVLVSKV-KAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSK

Query:  AYDMIEWSFLRIVMARMGF---------------------------------AQQQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHL
        A+D I+  F+   + ++G                                    +QG P S  LF + +E L+  +R   ++  I G      S  I   
Subjt:  AYDMIEWSFLRIVMARMGF---------------------------------AQQQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHL

Query:  FFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRK
         FA+D +++          + +++  Y   SG  +N  KSV     N  +
Subjt:  FFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRK

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 1.3e-18, % identity: 26.51)
Query:  LMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSK-VKAPRRVSYFRPISLYNVSYKLISKALV
        L  P +  EI   +      K+PGPDG S  FY+     + P + +    +  +G  P S  E  I L+ K  K P ++  FRPISL N+  K+++K L 
Subjt:  LMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSK-VKAPRRVSYFRPISLYNVSYKLISKALV

Query:  NRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDMIEWSFLRIVMARMGF------------------------
        NR++  +  +I P+Q  FIPG     N       IH + +          + LD  KA+D I+  F+  V+ R G                         
Subjt:  NRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDMIEWSFLRIVMARMGF------------------------

Query:  ---------AQQQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSV
                   +QG P S YLF + +E L+  +R   QQ  I G +  +    IS L  A+D +++          + +L+  +    G  +N  KS+
Subjt:  ---------AQQQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSV

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 9.4e-22, % identity: 29)
Query:  QLLTDYVQQFFLSL---DPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSP
        + + D  + F+ +L   DP   D    L D    V       L  P T DE+ +AL+    +K+PG DGL+  F++  W  +GP   +       +G  P
Subjt:  QLLTDYVQQFFLSL---DPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSP

Query:  GSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAY
         S    ++ L+ K    R +  +RP+SL +  YK+++KA+  R+K +L ++I P+QS  +PGR + DN  L  + +H  RR TG     A L LD  KA+
Subjt:  GSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAY

Query:  DMIEWSFLRIVMARMGFAQQ-QGDPRSLYLFLLCVEGLSSLL-------RGAEQQYLISGFRFARSSPP
        D ++  +L   +    F  Q  G  +++Y    C+  ++  L       RG  Q   +SG  ++ +  P
Subjt:  DMIEWSFLRIVMARMGFAQQ-QGDPRSLYLFLLCVEGLSSLL-------RGAEQQYLISGFRFARSSPP

P92555 Uncharacterized mitochondrial protein AtMg01250 (e value: 2.9e-07, % identity: 51.92)
Query:  QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDS
        +QGDP S YLF+LC E LS L R A++Q  + G R + +SP I+HL FA+D+
Subjt:  QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDS

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 3.9e-07, % identity: 33.33)
Query:  TEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLIS
        ++ EI  A+     +KAPGPD  +  F+   W +V  S I +       G      N T I L+ KV    ++S FRP+S   V YK+I+
Subjt:  TEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLIS

AT4G20520.1 RNA binding;RNA-directed DNA polymerases (e value: 1.6e-13, % identity: 42.5)
Query:  LVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDMIEWSFLRIVMARMGFAQ
        +V R+K ++  LI P Q++FIPGR   DN +   E +H +RR+ G K  W  LKLD+ KAYD I W +L   +   GF +
Subjt:  LVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDMIEWSFLRIVMARMGFAQ

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) (e value: 2.1e-08, % identity: 51.92)
Query:  QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDS
        +QGDP S YLF+LC E LS L R A++Q  + G R + +SP I+HL FA+D+
Subjt:  QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDS
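
The % identity values listed with each hit appear to be consistent with counting the identical positions in the middle line of an alignment (letters are identities, `+` marks a conservative substitution, a space marks a mismatch or gap) and dividing by the alignment length. The sketch below applies that reading to the ATMG01250.1 alignment above. The function name `percent_identity` is hypothetical, and the formula is an assumption about how the database derives the column, not a documented definition.

```python
def percent_identity(midline: str, alignment_length: int) -> float:
    """Percentage of alignment columns whose midline character is a letter."""
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / alignment_length


# Query and middle lines copied from the ATMG01250.1 alignment.
QUERY = "QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDS"
MIDLINE = "+QGDP S YLF+LC E LS L R A++Q  + G R + +SP I+HL FA+D+"

# The query segment has no gap characters, so its length is the alignment length.
pct = percent_identity(MIDLINE, len(QUERY))
print(f"{pct:.2f}")
```

Under this assumption the result is 27 identical positions out of 52, which rounds to the 51.92 reported for that hit.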


Sequences
CDS sequence
ATGCGATGGATATGGCGGTGGCTGGATCCCAACCCCACCCGGGATTATGAGTCTGATATTTTGAAATGTTCGGGGGTTGGGGTCTCCCCGTTCATTCCGACGCTTGGCCA
AGTTGGTGCAAGCAAAATACCTTTGGTGCTTTTTCTGTCTGAGACTAAATTGTCATCAAACAGAATGGCGCCAGCGAAGCGAGCTCTAGGCTTCGAGAACTGTTTTTGTG
TTGACAATAGAGGGAAAAGTGGTGGTTTGGCTCTTTTGTGGAATTCGTCTGTCACTTTCAGCCTCCTGTCGTATTCGAATAATCACATTGATGGGTGGATCACGTGGGAT
GATTATCATTGGCGTCTCACTTGCATAGGTTTCCCTACAGCGGACATGCGAGACCAGACCACGAAAAGGAGGGTGGCAGAGATTAATGCCCTGTTTATTCTTCAGTTGTT
GACTGATTATGTCCAGCAGTTCTTCTTGTCATTAGATCCCAATGACCAGGATTTTGATATATCTCTCAGGGACCTTCCTCGCTTTGTGGATAGTGAGATGAATGTCGTTC
TAATGCAGCCTTTTACAGAGGATGAGATTCTTCGGGCTTTGAAGCAGTCTCACCCCCATAAGGCCCCAGGCCCAGATGGGTTATCTGGCAGTTTCTACAAAAACCACTGG
TCGATAGTGGGGCCTTCAGTGATCCAGAGTTGTTTGGCTGTGTTGAATCAGGGATGCTCCCCGGGGTCAATCAATGAGACTATGATTGTCCTCGTTTCGAAGGTCAAGGC
CCCTCGTCGGGTATCTTATTTTCGACCCATCTCTCTCTACAATGTTAGTTATAAGCTAATATCGAAGGCCTTGGTCAACAGGATGAAACATATTCTTCCAAAACTTATTT
CTCCCAACCAGAGTGCCTTTATCCCAGGGAGGTGTGTTGTGGATAATGCCATCTTGGGGTTTGAGTGCATCCATGAGTTGAGGAGGCAGACTGGAGGGAAATCTAAATGG
GCTACTCTAAAACTTGACATGAGCAAAGCTTATGACATGATAGAATGGTCTTTTTTGCGAATAGTTATGGCTAGAATGGGTTTCGCTCAGCAGCAGGGGGATCCGCGGTC
CCTATATCTGTTTTTACTCTGTGTTGAGGGTTTATCGAGCCTGTTGCGAGGAGCAGAGCAGCAATATTTGATATCTGGGTTTCGATTTGCACGGAGTAGCCCCCCGATTT
CTCATCTATTTTTTGCGAATGATAGCCTCCTGTTCTTCAGGGCAAATGCTATGGGAGTTGTGGCTATCCGGGACCTATTGATCCGCTATGAACGAACCTCAGGACAGGTG
GTCAATTATGAGAAGTCAGTGGTTGCATTCAGCCCAAATACTAGAAAGGACACACAATAG
mRNA sequence
ATGCGATGGATATGGCGGTGGCTGGATCCCAACCCCACCCGGGATTATGAGTCTGATATTTTGAAATGTTCGGGGGTTGGGGTCTCCCCGTTCATTCCGACGCTTGGCCA
AGTTGGTGCAAGCAAAATACCTTTGGTGCTTTTTCTGTCTGAGACTAAATTGTCATCAAACAGAATGGCGCCAGCGAAGCGAGCTCTAGGCTTCGAGAACTGTTTTTGTG
TTGACAATAGAGGGAAAAGTGGTGGTTTGGCTCTTTTGTGGAATTCGTCTGTCACTTTCAGCCTCCTGTCGTATTCGAATAATCACATTGATGGGTGGATCACGTGGGAT
GATTATCATTGGCGTCTCACTTGCATAGGTTTCCCTACAGCGGACATGCGAGACCAGACCACGAAAAGGAGGGTGGCAGAGATTAATGCCCTGTTTATTCTTCAGTTGTT
GACTGATTATGTCCAGCAGTTCTTCTTGTCATTAGATCCCAATGACCAGGATTTTGATATATCTCTCAGGGACCTTCCTCGCTTTGTGGATAGTGAGATGAATGTCGTTC
TAATGCAGCCTTTTACAGAGGATGAGATTCTTCGGGCTTTGAAGCAGTCTCACCCCCATAAGGCCCCAGGCCCAGATGGGTTATCTGGCAGTTTCTACAAAAACCACTGG
TCGATAGTGGGGCCTTCAGTGATCCAGAGTTGTTTGGCTGTGTTGAATCAGGGATGCTCCCCGGGGTCAATCAATGAGACTATGATTGTCCTCGTTTCGAAGGTCAAGGC
CCCTCGTCGGGTATCTTATTTTCGACCCATCTCTCTCTACAATGTTAGTTATAAGCTAATATCGAAGGCCTTGGTCAACAGGATGAAACATATTCTTCCAAAACTTATTT
CTCCCAACCAGAGTGCCTTTATCCCAGGGAGGTGTGTTGTGGATAATGCCATCTTGGGGTTTGAGTGCATCCATGAGTTGAGGAGGCAGACTGGAGGGAAATCTAAATGG
GCTACTCTAAAACTTGACATGAGCAAAGCTTATGACATGATAGAATGGTCTTTTTTGCGAATAGTTATGGCTAGAATGGGTTTCGCTCAGCAGCAGGGGGATCCGCGGTC
CCTATATCTGTTTTTACTCTGTGTTGAGGGTTTATCGAGCCTGTTGCGAGGAGCAGAGCAGCAATATTTGATATCTGGGTTTCGATTTGCACGGAGTAGCCCCCCGATTT
CTCATCTATTTTTTGCGAATGATAGCCTCCTGTTCTTCAGGGCAAATGCTATGGGAGTTGTGGCTATCCGGGACCTATTGATCCGCTATGAACGAACCTCAGGACAGGTG
GTCAATTATGAGAAGTCAGTGGTTGCATTCAGCCCAAATACTAGAAAGGACACACAATAG
Protein sequence
MRWIWRWLDPNPTRDYESDILKCSGVGVSPFIPTLGQVGASKIPLVLFLSETKLSSNRMAPAKRALGFENCFCVDNRGKSGGLALLWNSSVTFSLLSYSNNHIDGWITWD
DYHWRLTCIGFPTADMRDQTTKRRVAEINALFILQLLTDYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHW
SIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKW
ATLKLDMSKAYDMIEWSFLRIVMARMGFAQQQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQV
VNYEKSVVAFSPNTRKDTQ
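
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below is a minimal translator that assumes the standard nuclear genetic code (the record does not name a translation table); `translate` and the constant names are hypothetical. It is applied to the first 24 bases and to the final line of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGCGATGGATATGGCGGTGGCTG"
cds_last_line = "GTCAATTATGAGAAGTCAGTGGTTGCATTCAGCCCAAATACTAGAAAGGACACACAATAG"

print(translate(cds_start))
print(translate(cds_last_line))
```

The first call yields the opening residues of the protein (MRWIWRWL) and the second yields its final line followed by the stop codon.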