; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032226 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032226
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr11:27697012..27699250
RNA-Seq ExpressionLag0032226
SyntenyLag0032226
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5481481.1 hypothetical protein F2P56_002126 [Juglans regia]5.9e-2125.36Show/hide
Query:  AIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL-----------------
        +IPTY MSCFKLP SLC + +  +   FWWGQ K   R+ W+SW++LC SK   GM  +D++ FN A+LAK GWR +K   SL                 
Subjt:  AIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL-----------------

Query:  ---------------------------EVKE----NFSSIDVDL------------ILNTPPVDSKAKDEIIWTYDNK---------------KEEELEP
                                   +V E    N +  DV+             +LN      +  D ++W ++                  EE    
Subjt:  ---------------------------EVKE----NFSSIDVDL------------ILNTPPVDSKAKDEIIWTYDNK---------------KEEELEP

Query:  NGLLGWDWS------------IHNLKEEEIDKAIIILWKEEASKNRAMIEHQLQPPPEL-------------------KSLSSQGRWTPPSPNVWKINSD
         G      S            +   K  E++  ++  W     +N+ M E++   P +                    + L  Q RW PP   V K+N D
Subjt:  NGLLGWDWS------------IHNLKEEEIDKAIIILWKEEASKNRAMIEHQLQPPPEL-------------------KSLSSQGRWTPPSPNVWKINSD

Query:  ASWSEMQNRGGVGWIVCNSTG--------------SPIGRRF----------YP----PIEVESDAIRVINLLNLEVDDLSKSANLVEAILQMKSALEVV
         +    Q R GVG ++ +  G               PI   F          +P     +EVESD++ V+  LN E + +S   NLV  I +M      V
Subjt:  ASWSEMQNRGGVGWIVCNSTG--------------SPIGRRF----------YP----PIEVESDAIRVINLLNLEVDDLSKSANLVEAILQMKSALEVV

Query:  KFSHCPRQINGSAHRLAQ
           H  R  N  AH LA+
Subjt:  KFSHCPRQINGSAHRLAQ

XP_022145148.1 uncharacterized protein LOC111014662 [Momordica charantia]2.0e-2152.81Show/hide
Query:  INKAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSLEVK
        + +AIP Y+MSCF+ P++LC++ IN +   FWWG     K++HW SWKRLCVSK+ GG+  +D+ +FNQAMLAK  W+ +KNP SL V+
Subjt:  INKAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSLEVK

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]2.9e-2051.69Show/hide
Query:  INKAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSLEVK
        + +AIP YT+SCFKLP+S+C + ++++   FWWG     +++HW SWK LC+ KD GGM  RDI +FNQAMLAK  WR +++P SL  K
Subjt:  INKAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSLEVK

XP_023871998.1 uncharacterized protein LOC111984613 [Quercus suber]1.0e-2053.49Show/hide
Query:  INKAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL
        + KA+PTYTMSCFK+P S+C+++ + +S  FWWGQ K  +++ WLSW +LC+ KD GGM  RD++ FN+A+LAK GWR   +P SL
Subjt:  INKAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL

XP_030495270.1 uncharacterized protein LOC115711072 [Cannabis sativa]1.7e-2028.9Show/hide
Query:  KAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL---EVKENFSSIDVDL
        ++IPTY MSCFKL    C+ + + MS +FWWG  ++  ++HW  WK LC SK  GGM  R    FNQA+LAK  WR    P +L    +K  + S    L
Subjt:  KAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL---EVKENFSSIDVDL

Query:  ILNTPPVDSKAKDEIIWTYDNKKEEELE------------PNGLLGWDWSIHNLKEEEIDKAIIILWKEEASKNRAMIEH------------QLQPPPEL
          N     S     I W     KE  L+             +G   W  S +N K  +ID+ + I     A  +R +  H             L    E 
Subjt:  ILNTPPVDSKAKDEIIWTYDNKKEEELE------------PNGLLGWDWSIHNLKEEEIDKAIIILWKEEASKNRAMIEH------------QLQPPPEL

Query:  KSLSSQGR------WTPPSPNVWKINSDASWSEMQNRGGVGWIVCNSTGSP----------------------------IGRRFYPPIEVESDAIRVINL
        ++ SS         W PP  N + +N DA+ +  Q + G+G I+ +  G+                             + +  +P   +E+DA RV N 
Subjt:  KSLSSQGR------WTPPSPNVWKINSDASWSEMQNRGGVGWIVCNSTGSP----------------------------IGRRFYPPIEVESDAIRVINL

Query:  LNLEVDDLSKSANLVEAILQMKSALEVVKFSHCPRQINGSAHRLAQ
        LN    DLS  ++L+  I  + S    V  +H  R  N +AH LA+
Subjt:  LNLEVDDLSKSANLVEAILQMKSALEVVKFSHCPRQINGSAHRLAQ

TrEMBL top hitse value%identityAlignment
A0A2N9G3J3 Uncharacterized protein1.2e-2259.3Show/hide
Query:  INKAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL
        + ++IPTYTMSCFK+P  LC+D +N M  DFWWG     K+ HWL W +LC SKDSGGM  RD++ FN AMLAK GWR ++NP SL
Subjt:  INKAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL

A0A2N9H0Z5 CCHC-type domain-containing protein4.4e-2256.32Show/hide
Query:  IINKAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL
        ++ ++IPTYTMSCFKLP  LC+D +N M  DFWWG    +K+ HW+ W +LC SK++GG+  RD++ FN AMLAK GWR V+NP SL
Subjt:  IINKAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL

A0A2N9H6D3 Reverse transcriptase domain-containing protein9.8e-2258.14Show/hide
Query:  INKAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL
        + ++IPTYTMSCF LP  LC+D INKM   FWWG    +K+ HWL W +LC  KD GGM  RDI+ FN+A+LAK GWR ++NP SL
Subjt:  INKAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL

A0A6J1CV63 uncharacterized protein LOC1110146629.8e-2252.81Show/hide
Query:  INKAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSLEVK
        + +AIP Y+MSCF+ P++LC++ IN +   FWWG     K++HW SWKRLCVSK+ GG+  +D+ +FNQAMLAK  W+ +KNP SL V+
Subjt:  INKAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSLEVK

A0A803PKA4 Uncharacterized protein8.9e-2331.95Show/hide
Query:  KAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL---EVKENFSSIDVDL
        ++IPTY MSCFKLP   C ++ + MS +FWWG    +K++HW  WK LC SK  GG+  R+   FNQA+LAK  WR  +NP SL    +K  + S    L
Subjt:  KAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL---EVKENFSSIDVDL

Query:  ILNTPPVDSKAKDEIIWTYDNKKEE---ELEPNGLLGWDWSIHNLKEE----EIDKAIIILWKEEASKNRAMIEHQLQPPPELKSLSSQGR------WTP
           T    S     I W  +  K+    ++        +W+I  L  +    ++++ +++     A+ +   + H       L  L+  G       WTP
Subjt:  ILNTPPVDSKAKDEIIWTYDNKKEE---ELEPNGLLGWDWSIHNLKEE----EIDKAIIILWKEEASKNRAMIEHQLQPPPELKSLSSQGR------WTP

Query:  PSPNVWKINSDASWSEMQNRGGVGWIVCNSTGSPIGRRFYP
        P P   K+N DA++    NR G G I+ +STG  +    +P
Subjt:  PSPNVWKINSDASWSEMQNRGGVGWIVCNSTGSPIGRRFYP

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.2e-0829.67Show/hide
Query:  SLQDIINKAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL
        +L   +  ++P ++MS   LP S+  + ++++S  F WG    +K+ H + W ++C  K  GG+  R  +  N+A+++K+GWR ++   SL
Subjt:  SLQDIINKAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL

P93295 Uncharacterized mitochondrial protein AtMg003101.1e-1441.67Show/hide
Query:  AIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSK-DSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL
        A+P Y MSCF+L   LC+ + + M+ +FWW   ++++++ W++W++LC SK D GG+  RD+  FNQA+LAK  +R +  P +L
Subjt:  AIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSK-DSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL

Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.8e-0528.93Show/hide
Query:  EASKNRAMIEHQLQPPPELKSLSSQGRWTPPSPNVWKINSDASWSEMQNRGGVGWIVCNSTGS----------------------------PIGRRFYPP
        E    R  +E +   P   ++LS Q  W  P     K N+DA+W     R G+GWI+ N +G                              + R  Y  
Subjt:  EASKNRAMIEHQLQPPPELKSLSSQGRWTPPSPNVWKINSDASWSEMQNRGGVGWIVCNSTGS----------------------------PIGRRFYPP

Query:  IEVESDAIRVINLLNLEVDDLSKSANLVEAILQMKSALEVVKFSHCPRQINGSAHRLAQ
        I  ESDA  ++NLLN + D        +E I Q+    E VKF   PR  N  A R+A+
Subjt:  IEVESDAIRVINLLNLEVDDLSKSANLVEAILQMKSALEVVKFSHCPRQINGSAHRLAQ

AT4G29090.1 Ribonuclease H-like superfamily protein1.6e-1632.07Show/hide
Query:  AIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSLEVK----ENFSSIDVDL
        A+PTYTM+CF LP ++C+ II+ ++ DFWW   +  K +HW +W  L   K  GG+  +DI+ FN A+L K  WR +  P SL  K      F   D   
Subjt:  AIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSLEVK----ENFSSIDVDL

Query:  ILNTPPVDSKAKDEIIWTYDNKKEEELEPNGLLGWDWSIHNLKEEEIDKAIIILWKE---EASKNRAMIEHQLQPPPELKSLSS
         LN P     ++   +W   +  +E L      G    + N ++       II+W+    ++    A +  Q  PP E  S+SS
Subjt:  ILNTPPVDSKAKDEIIWTYDNKKEEELEPNGLLGWDWSIHNLKEEEIDKAIIILWKE---EASKNRAMIEHQLQPPPELKSLSS

AT4G29090.1 Ribonuclease H-like superfamily protein3.7e-0540.91Show/hide
Query:  SSQGRWTPPSPNVW-KINSDASWSEMQNRGGVGWIVCNSTGSP--IGRRFYPP----IEVESDAIR
        SS GRW PP P+ W K N+DA+W+    R G+GW++ N  G    +G R  P     +E E +A+R
Subjt:  SSQGRWTPPSPNVW-KINSDASWSEMQNRGGVGWIVCNSTGSP--IGRRFYPP----IEVESDAIR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein8.0e-1641.67Show/hide
Query:  AIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSK-DSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL
        A+P Y MSCF+L   LC+ + + M+ +FWW   ++++++ W++W++LC SK D GG+  RD+  FNQA+LAK  +R +  P +L
Subjt:  AIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLCVSK-DSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAGGGTAGAAGAAGAAGAAGCAGAGAACAAACAACTCGAAGATGAAGTAGTAAACAAGCAGCTTGAAAATCTCAAAATAACAGTTGAGGAAAAGGCCAAGAAGGT
GGCTATAGCAGACGAGGATCTAAACGTTGCAGATGAGGATTTGCAAGTCTTCTCTCTGCAAGATATTATCAACAAGGCTATACCGACCTATACCATGAGTTGTTTTAAGC
TTCCTATCTCCTTATGCGAAGATATTATTAACAAGATGAGTGTAGACTTCTGGTGGGGCCAAGGTAAATCAAGAAAAAGAGTGCATTGGTTAAGCTGGAAAAGACTTTGT
GTTAGTAAAGATTCGGGTGGGATGAGATCCAGAGATATCCAGCTGTTCAACCAAGCGATGCTAGCAAAGATCGGCTGGAGGCGTGTTAAAAATCCCACCAGCTTGGAAGT
CAAAGAAAATTTCTCCTCCATTGATGTGGATTTAATTCTTAACACGCCCCCGGTTGACTCTAAGGCAAAAGATGAGATAATTTGGACCTATGACAACAAGAAGGAGGAAG
AGTTGGAACCCAATGGATTACTGGGCTGGGATTGGAGCATTCATAATCTAAAAGAGGAAGAAATCGATAAAGCTATTATCATCTTGTGGAAAGAAGAAGCGAGCAAGAAC
AGAGCGATGATCGAGCACCAGCTGCAACCTCCCCCGGAATTGAAGAGCCTGTCGAGTCAAGGACGCTGGACCCCTCCTAGTCCGAATGTCTGGAAGATAAACTCAGATGC
CTCCTGGAGCGAAATGCAAAATAGAGGAGGAGTGGGGTGGATCGTTTGTAACTCTACAGGATCTCCAATTGGCAGGCGCTTCTATCCCCCTATTGAGGTTGAATCCGATG
CAATCAGAGTGATCAATCTCCTAAATCTCGAAGTCGATGACCTTTCGAAATCGGCCAATCTGGTTGAGGCCATCCTCCAGATGAAATCGGCCTTGGAAGTGGTCAAGTTC
AGCCACTGCCCTCGCCAGATCAATGGTTCAGCTCACCGGCTCGCGCAAATGGTCGTCGTCGGTCTGCCATTTAATTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAAGGGTAGAAGAAGAAGAAGCAGAGAACAAACAACTCGAAGATGAAGTAGTAAACAAGCAGCTTGAAAATCTCAAAATAACAGTTGAGGAAAAGGCCAAGAAGGT
GGCTATAGCAGACGAGGATCTAAACGTTGCAGATGAGGATTTGCAAGTCTTCTCTCTGCAAGATATTATCAACAAGGCTATACCGACCTATACCATGAGTTGTTTTAAGC
TTCCTATCTCCTTATGCGAAGATATTATTAACAAGATGAGTGTAGACTTCTGGTGGGGCCAAGGTAAATCAAGAAAAAGAGTGCATTGGTTAAGCTGGAAAAGACTTTGT
GTTAGTAAAGATTCGGGTGGGATGAGATCCAGAGATATCCAGCTGTTCAACCAAGCGATGCTAGCAAAGATCGGCTGGAGGCGTGTTAAAAATCCCACCAGCTTGGAAGT
CAAAGAAAATTTCTCCTCCATTGATGTGGATTTAATTCTTAACACGCCCCCGGTTGACTCTAAGGCAAAAGATGAGATAATTTGGACCTATGACAACAAGAAGGAGGAAG
AGTTGGAACCCAATGGATTACTGGGCTGGGATTGGAGCATTCATAATCTAAAAGAGGAAGAAATCGATAAAGCTATTATCATCTTGTGGAAAGAAGAAGCGAGCAAGAAC
AGAGCGATGATCGAGCACCAGCTGCAACCTCCCCCGGAATTGAAGAGCCTGTCGAGTCAAGGACGCTGGACCCCTCCTAGTCCGAATGTCTGGAAGATAAACTCAGATGC
CTCCTGGAGCGAAATGCAAAATAGAGGAGGAGTGGGGTGGATCGTTTGTAACTCTACAGGATCTCCAATTGGCAGGCGCTTCTATCCCCCTATTGAGGTTGAATCCGATG
CAATCAGAGTGATCAATCTCCTAAATCTCGAAGTCGATGACCTTTCGAAATCGGCCAATCTGGTTGAGGCCATCCTCCAGATGAAATCGGCCTTGGAAGTGGTCAAGTTC
AGCCACTGCCCTCGCCAGATCAATGGTTCAGCTCACCGGCTCGCGCAAATGGTCGTCGTCGGTCTGCCATTTAATTTTTAG
Protein sequenceShow/hide protein sequence
MERVEEEEAENKQLEDEVVNKQLENLKITVEEKAKKVAIADEDLNVADEDLQVFSLQDIINKAIPTYTMSCFKLPISLCEDIINKMSVDFWWGQGKSRKRVHWLSWKRLC
VSKDSGGMRSRDIQLFNQAMLAKIGWRRVKNPTSLEVKENFSSIDVDLILNTPPVDSKAKDEIIWTYDNKKEEELEPNGLLGWDWSIHNLKEEEIDKAIIILWKEEASKN
RAMIEHQLQPPPELKSLSSQGRWTPPSPNVWKINSDASWSEMQNRGGVGWIVCNSTGSPIGRRFYPPIEVESDAIRVINLLNLEVDDLSKSANLVEAILQMKSALEVVKF
SHCPRQINGSAHRLAQMVVVGLPFNF