; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041102 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041102
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr13:12023278..12029325
RNA-Seq ExpressionLag0041102
SyntenyLag0041102
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BAT06703.1 Os08g0561250, partial [Oryza sativa Japonica Group]5.1e-1644.74Show/hide
Query:  GCGSSARCESPEGVSLYWGRLSVVFLPNKAPRTHATSATVKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW--
        G  S AR E   G+ +Y GR               T + +KDRVWK LQGWKE+L S  GKE+LIK+V Q+IP Y MSCF L KT+CN++  +  RFW  
Subjt:  GCGSSARCESPEGVSLYWGRLSVVFLPNKAPRTHATSATVKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW--

Query:  -SKEESVVH-IIWD
          + E+ VH + W+
Subjt:  -SKEESVVH-IIWD

KAB8109613.1 hypothetical protein EE612_045933, partial [Oryza sativa]5.1e-1644.74Show/hide
Query:  GCGSSARCESPEGVSLYWGRLSVVFLPNKAPRTHATSATVKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW--
        G  S AR E   G+ +Y GR               T + +KDRVWK LQGWKE+L S  GKE+LIK+V Q+IP Y MSCF L KT+CN++  +  RFW  
Subjt:  GCGSSARCESPEGVSLYWGRLSVVFLPNKAPRTHATSATVKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW--

Query:  -SKEESVVH-IIWD
          + E+ VH + W+
Subjt:  -SKEESVVH-IIWD

XP_018821989.1 uncharacterized protein LOC108992010 [Juglans regia]3.0e-1659.72Show/hide
Query:  VKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW---SKEESVVHII
        +K RVW+ LQGWKEK+ S GGKEVLIKAV QAIP Y+MSCFKLP ++CN++ +M  +FW     EE  +H I
Subjt:  VKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW---SKEESVVHII

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]2.1e-1753.12Show/hide
Query:  RLSVVFLPNKAPRTHATSAT-VKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW---SKEESVVH-IIWD
        +L  + LP   PR        +KDRVWK LQGWK KLFS+GGKEVLIKAV QAIP YTMSCF+LPK +  + + +  RFW   SKE+  +H + W+
Subjt:  RLSVVFLPNKAPRTHATSAT-VKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW---SKEESVVH-IIWD

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]8.5e-1950.78Show/hide
Query:  LPNKAPRTHATSATVKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW---SKEESVVHIIWDCKVFRQLDPSPR
        L NK+ R+   SA V DRVWK LQGWK KLFSMGGKEVLIK V QAIPNYT+SCFKLP +ICN+++++  RFW   S E+  +H  W             
Subjt:  LPNKAPRTHATSATVKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW---SKEESVVHIIWDCKVFRQLDPSPR

Query:  QSWKINTDAAWAEDKGTGGIGWIARDLS
        QSWK+         K  GG+G+  RD+S
Subjt:  QSWKINTDAAWAEDKGTGGIGWIARDLS

TrEMBL top hitse value%identityAlignment
A0A0P0XJF7 Os08g0561250 protein (Fragment)2.5e-1644.74Show/hide
Query:  GCGSSARCESPEGVSLYWGRLSVVFLPNKAPRTHATSATVKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW--
        G  S AR E   G+ +Y GR               T + +KDRVWK LQGWKE+L S  GKE+LIK+V Q+IP Y MSCF L KT+CN++  +  RFW  
Subjt:  GCGSSARCESPEGVSLYWGRLSVVFLPNKAPRTHATSATVKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW--

Query:  -SKEESVVH-IIWD
          + E+ VH + W+
Subjt:  -SKEESVVH-IIWD

A0A2I4ERH4 uncharacterized protein LOC1089920101.5e-1659.72Show/hide
Query:  VKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW---SKEESVVHII
        +K RVW+ LQGWKEK+ S GGKEVLIKAV QAIP Y+MSCFKLP ++CN++ +M  +FW     EE  +H I
Subjt:  VKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW---SKEESVVHII

A0A2N9J7Z2 Reverse transcriptase domain-containing protein7.7e-1831.25Show/hide
Query:  LPNKAPRTHATSAT-VKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW----SKEESVVHIIW-----------
        LP+   R  A S T +K+RVW  L+GWKEKL S   +EVLIKAV QAIP Y+MSCF+LP  +C DI  M  RFW      +  +  + W           
Subjt:  LPNKAPRTHATSAT-VKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW----SKEESVVHIIW-----------

Query:  ----DCKVFRQL---------------DPSPRQSWKINTDAAWAEDKGTGGIGWIARDLSESFICVGFKKFDMNWPIKCLEMKTIMESLKSLLERISSPF
            D + F +                 P     +K N D A  ++    GIG + RD     +    +K      ++C+E     +++K + E      
Subjt:  ----DCKVFRQL---------------DPSPRQSWKINTDAAWAEDKGTGGIGWIARDLSESFICVGFKKFDMNWPIKCLEMKTIMESLKSLLERISSPF

Query:  SWIIIESDSIEVVAILNHESEDLS
        + +  E DS  +VA LN+ S  L+
Subjt:  SWIIIESDSIEVVAILNHESEDLS

A0A6J1DAR4 uncharacterized protein LOC1110189541.0e-1753.12Show/hide
Query:  RLSVVFLPNKAPRTHATSAT-VKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW---SKEESVVH-IIWD
        +L  + LP   PR        +KDRVWK LQGWK KLFS+GGKEVLIKAV QAIP YTMSCF+LPK +  + + +  RFW   SKE+  +H + W+
Subjt:  RLSVVFLPNKAPRTHATSAT-VKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW---SKEESVVH-IIWD

A0A6J1DRA0 uncharacterized protein LOC1110224234.1e-1950.78Show/hide
Query:  LPNKAPRTHATSATVKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW---SKEESVVHIIWDCKVFRQLDPSPR
        L NK+ R+   SA V DRVWK LQGWK KLFSMGGKEVLIK V QAIPNYT+SCFKLP +ICN+++++  RFW   S E+  +H  W             
Subjt:  LPNKAPRTHATSATVKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRFW---SKEESVVHIIWDCKVFRQLDPSPR

Query:  QSWKINTDAAWAEDKGTGGIGWIARDLS
        QSWK+         K  GG+G+  RD+S
Subjt:  QSWKINTDAAWAEDKGTGGIGWIARDLS

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.2e-0428.32Show/hide
Query:  TSATVKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRF-W--SKEESVVHIIWDCKVFRQLDPSPRQSWKINTDAA
        T   + +RV   + GW+EK  S  G+  L KAV  ++P ++MS   LP++I N ++++   F W  + E+   H++   KV      SP++   +   AA
Subjt:  TSATVKDRVWKVLQGWKEKLFSMGGKEVLIKAVTQAIPNYTMSCFKLPKTICNDINRMCFRF-W--SKEESVVHIIWDCKVFRQLDPSPRQSWKINTDAA

Query:  WAEDKG-TGGIGW
         + ++     +GW
Subjt:  WAEDKG-TGGIGW

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein1.3e-0429.67Show/hide
Query:  PSPRQSWKINTDAAWAEDKGTGGIGWIARDLSESFICVGFKKFDMNWPIKCLEMKTIMESLKSLLERISSPFSWIIIESDSIEVVAILNHE
        P P Q  K NTDA W  D    GIGW+ R+       +G +       +   E++ +  ++ SL       ++++I ESDS  ++ ILN++
Subjt:  PSPRQSWKINTDAAWAEDKGTGGIGWIARDLSESFICVGFKKFDMNWPIKCLEMKTIMESLKSLLERISSPFSWIIIESDSIEVVAILNHE

AT4G29090.1 Ribonuclease H-like superfamily protein4.1e-0348.57Show/hide
Query:  AIPNYTMSCFKLPKTICNDINRMCFRFW--SKEES
        A+P YTM+CF LPKT+C  I  +   FW  +K+E+
Subjt:  AIPNYTMSCFKLPKTICNDINRMCFRFW--SKEES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTTCTCAGACGGGGCATGGGGAAAATAAGGTGTTCCGAAAGTATTCGCGTAGAAGGAGAATCGATGGAGGATGTGAGGTGAGCAAGAGTCTCTACAGCGCGGCGGA
AGAAGGTCGTCGGCACGGCGGATGTGGGTCGTCGGCGCGATGCGAGAGTCCCGAAGGTGTTTCTCTCTACTGGGGGAGGTTGAGCGTTGTATTCCTCCCCAATAAAGCCC
CCCGAACCCATGCTACCTCCGCCACTGTGAAGGATAGGGTTTGGAAAGTTCTGCAGGGGTGGAAAGAAAAACTCTTCTCTATGGGAGGAAAAGAAGTCCTTATTAAGGCA
GTGACCCAAGCTATTCCCAACTACACCATGAGCTGCTTCAAGCTCCCCAAGACTATATGTAATGATATTAATAGGATGTGCTTCAGATTCTGGTCCAAGGAGGAATCAGT
GGTGCACATTATTTGGGATTGTAAAGTCTTCAGACAGTTGGACCCCTCCCCACGACAAAGCTGGAAAATCAACACTGATGCAGCTTGGGCAGAAGACAAAGGAACAGGGG
GAATTGGATGGATTGCTCGTGACTTAAGCGAATCTTTTATCTGTGTTGGATTCAAGAAATTTGATATGAATTGGCCGATCAAATGCCTGGAGATGAAAACGATCATGGAG
AGTTTGAAAAGCTTACTTGAAAGAATCAGTAGCCCTTTCTCGTGGATCATCATCGAATCAGACTCAATTGAAGTCGTTGCAATCCTCAACCATGAATCTGAAGACCTTTC
AGAGATCAGGCAATTTTGGACCACCCCGATGTACAAGGAGCTGACGAGGACAACCGAGGAGAAATCGGACTGGGAGATGGACCCAAGAGGCGAAACCGGCAAGTGGGATG
GGCCAAGACCGAAGGGGTCGGGCTCTTGGCCCGACCCCCTACTCGGCCGAGGCCGAGCCTCATGGACGCTAGCGCGGGCCGAGCCCGTCCGGCTCCGTTTGGTCCTTATC
GCCTCTGGCCGCCCTGGTTTCGCCTGGTTTGTCCCGAAGTGCCTCCGAATTCCTAAAAACCCTAGGAGCATGAGCATGTATTTATACCCCTCTTCGCCACTGAAGAAGGG
GACCCGAATTCTATCCCTAAACTCTACTCTCTATTTTCTACTTTCTCCTCTTGCTCTTACTTTCTTGCTTCCCACAGTTCTGTTTGCTGACTTAAGCATCGGAGCCGGTG
TGGCAAGCACCACACCGGTGTGCAGGTTTACCGTCTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTTCTCAGACGGGGCATGGGGAAAATAAGGTGTTCCGAAAGTATTCGCGTAGAAGGAGAATCGATGGAGGATGTGAGGTGAGCAAGAGTCTCTACAGCGCGGCGGA
AGAAGGTCGTCGGCACGGCGGATGTGGGTCGTCGGCGCGATGCGAGAGTCCCGAAGGTGTTTCTCTCTACTGGGGGAGGTTGAGCGTTGTATTCCTCCCCAATAAAGCCC
CCCGAACCCATGCTACCTCCGCCACTGTGAAGGATAGGGTTTGGAAAGTTCTGCAGGGGTGGAAAGAAAAACTCTTCTCTATGGGAGGAAAAGAAGTCCTTATTAAGGCA
GTGACCCAAGCTATTCCCAACTACACCATGAGCTGCTTCAAGCTCCCCAAGACTATATGTAATGATATTAATAGGATGTGCTTCAGATTCTGGTCCAAGGAGGAATCAGT
GGTGCACATTATTTGGGATTGTAAAGTCTTCAGACAGTTGGACCCCTCCCCACGACAAAGCTGGAAAATCAACACTGATGCAGCTTGGGCAGAAGACAAAGGAACAGGGG
GAATTGGATGGATTGCTCGTGACTTAAGCGAATCTTTTATCTGTGTTGGATTCAAGAAATTTGATATGAATTGGCCGATCAAATGCCTGGAGATGAAAACGATCATGGAG
AGTTTGAAAAGCTTACTTGAAAGAATCAGTAGCCCTTTCTCGTGGATCATCATCGAATCAGACTCAATTGAAGTCGTTGCAATCCTCAACCATGAATCTGAAGACCTTTC
AGAGATCAGGCAATTTTGGACCACCCCGATGTACAAGGAGCTGACGAGGACAACCGAGGAGAAATCGGACTGGGAGATGGACCCAAGAGGCGAAACCGGCAAGTGGGATG
GGCCAAGACCGAAGGGGTCGGGCTCTTGGCCCGACCCCCTACTCGGCCGAGGCCGAGCCTCATGGACGCTAGCGCGGGCCGAGCCCGTCCGGCTCCGTTTGGTCCTTATC
GCCTCTGGCCGCCCTGGTTTCGCCTGGTTTGTCCCGAAGTGCCTCCGAATTCCTAAAAACCCTAGGAGCATGAGCATGTATTTATACCCCTCTTCGCCACTGAAGAAGGG
GACCCGAATTCTATCCCTAAACTCTACTCTCTATTTTCTACTTTCTCCTCTTGCTCTTACTTTCTTGCTTCCCACAGTTCTGTTTGCTGACTTAAGCATCGGAGCCGGTG
TGGCAAGCACCACACCGGTGTGCAGGTTTACCGTCTTGTAG
Protein sequenceShow/hide protein sequence
MPSQTGHGENKVFRKYSRRRRIDGGCEVSKSLYSAAEEGRRHGGCGSSARCESPEGVSLYWGRLSVVFLPNKAPRTHATSATVKDRVWKVLQGWKEKLFSMGGKEVLIKA
VTQAIPNYTMSCFKLPKTICNDINRMCFRFWSKEESVVHIIWDCKVFRQLDPSPRQSWKINTDAAWAEDKGTGGIGWIARDLSESFICVGFKKFDMNWPIKCLEMKTIME
SLKSLLERISSPFSWIIIESDSIEVVAILNHESEDLSEIRQFWTTPMYKELTRTTEEKSDWEMDPRGETGKWDGPRPKGSGSWPDPLLGRGRASWTLARAEPVRLRLVLI
ASGRPGFAWFVPKCLRIPKNPRSMSMYLYPSSPLKKGTRILSLNSTLYFLLSPLALTFLLPTVLFADLSIGAGVASTTPVCRFTVL