CuGenDBv2

Lag0011137 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0011137
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr1:15421949..15423359
RNA-Seq Expression: Lag0011137
Synteny: Lag0011137
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR044730 - Ribonuclease H-like domain, plant type


Homology
GenBank top hits (e value, % identity, alignment)
XP_021763631.1 uncharacterized protein LOC110728264 [Chenopodium quinoa] (e value: 5.3e-18; % identity: 28.08)
Query:  ESTIHAIWECKVAQKVWLDCIPQILMLFHLRRDDWVCSDYWSWCCGNL----TEEDLGKAALLIWCIWDYRNRNL--NIQITKNTDCTHFYNHISRIFQE
        +S +H +  C   Q +W       +    L    W   +++ WC   +     ++      + IW IW+ RN+ L   +++     C+   N        
Subjt:  ESTIHAIWECKVAQKVWLDCIPQILMLFHLRRDDWVCSDYWSWCCGNL----TEEDLGKAALLIWCIWDYRNRNL--NIQITKNTDCTHFYNHISRIFQE

Query:  NQRISETHLENSRQRRSENHQNHEIWVPPRPCRWKFNSDASWNDDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLD
        N R   T +E S +          +W+PP   R+K N+DAS + D + GG G VVRD+ G ++    +S  G   +S +EA A+  G+  +F        
Subjt:  NQRISETHLENSRQRRSENHQNHEIWVPPRPCRWKFNSDASWNDDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLD

Query:  LEVESDCLGLVRCLNKDCEDLSEIKNLVDAILDLAPKLGVLSFKHCPRIQNGVAHSIARM
        LEV SDCL ++  LN    + S  + +V  IL  A     + F  CPR+ N VAHS+A +
Subjt:  LEVESDCLGLVRCLNKDCEDLSEIKNLVDAILDLAPKLGVLSFKHCPRIQNGVAHSIARM

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia] (e value: 7.6e-17; % identity: 29.11)
Query:  EEDLGKAALLIWCIWDYRNRNL---------NIQITKNTDCTHFYNHISRIFQENQRISETHLENSRQRRSENHQNHEIWVPPRPCRWKFNSDASWNDDL
        EE+  ++ ++ W IW+ RN+++         +IQ+  +    +  N   R    N +   T+ +    RR E++   + W PP    WK N++A+W  D 
Subjt:  EEDLGKAALLIWCIWDYRNRNL---------NIQITKNTDCTHFYNHISRIFQENQRISETHLENSRQRRSENHQNHEIWVPPRPCRWKFNSDASWNDDL

Query:  KIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLDLEVESDCLGLVRCLNKDCEDLSEIKNLVDAILDLAPKLGVLSFKHC
          GG GW++RD  G +I A  +  + E  ++ LE  A+ EGL  +  R      + +ESD L  +  L++ C+D +EI  L++ I  +   + ++S +H 
Subjt:  KIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLDLEVESDCLGLVRCLNKDCEDLSEIKNLVDAILDLAPKLGVLSFKHC

Query:  PRIQNGVAHSIAR
         R  N VAH +AR
Subjt:  PRIQNGVAHSIAR

XP_022148549.1 uncharacterized protein LOC111017181 [Momordica charantia] (e value: 3.1e-18; % identity: 32.41)
Query:  DYWSWCCGNLTEEDLGKAALLIWCIWDYRNRNLNIQITKNTDCTHFYNHISRIFQENQRIS-ETHLENSRQRRSENHQNHEIWVPPRPCRWKFNSDASWN
        DY+ W   +  +       +L+W IW YRN+ ++  I +   C+       R F E++     T+L  +     +N      W PP    WK N DA+W 
Subjt:  DYWSWCCGNLTEEDLGKAALLIWCIWDYRNRNLNIQITKNTDCTHFYNHISRIFQENQRIS-ETHLENSRQRRSENHQNHEIWVPPRPCRWKFNSDASWN

Query:  DDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLDLEVESDCLGLVRCLNKDCEDLSEIKNLVDAILDLAPKLGVLSF
        D L  GG GW+VRDS G  I A                KA+ + L+   G     + +E+ESDCL +V  +NK    L+E+  +V+ I      L +  F
Subjt:  DDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLDLEVESDCLGLVRCLNKDCEDLSEIKNLVDAILDLAPKLGVLSF

Query:  KHCPRIQNGVAHSIAR
        KH P   NGVAH IAR
Subjt:  KHCPRIQNGVAHSIAR

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia] (e value: 1.3e-21; % identity: 27.95)
Query:  NPNLRDQKKKFESTIHAIWECKVAQKVWLDCIPQILMLFHLRRDDWVCSDYWSWCCGNLTEEDLGKAALLIWCIWDYRNRNLNIQITKNTDCTHFYNHIS
        N N  + +KK E+T H +WECKV + +W++C P     F++ R +W   +YW W      EE+  ++ ++   IW+ RN+++   +       H      
Subjt:  NPNLRDQKKKFESTIHAIWECKVAQKVWLDCIPQILMLFHLRRDDWVCSDYWSWCCGNLTEEDLGKAALLIWCIWDYRNRNLNIQITKNTDCTHFYNHIS

Query:  RIFQENQRISETHLENSRQRRSEN-HQNHEI-------WVPPRPCRWKFNSDASWNDDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEG
        ++  +   I+    + + +R+S++ H    I       W PP    WK N+DA+W  D    G GW++RD  G +I  G +  + E  ++ LE  A+ EG
Subjt:  RIFQENQRISETHLENSRQRRSEN-HQNHEI-------WVPPRPCRWKFNSDASWNDDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEG

Query:  LSCLFGRFGDNLDLEVESDCLGLVRCLNK
        L  +  R      + +ESD L  +  L++
Subjt:  LSCLFGRFGDNLDLEVESDCLGLVRCLNK

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia] (e value: 1.7e-16; % identity: 27.88)
Query:  EEDLGKAALLIWCIWDYRNRNLNIQITKNTDCTHFYNHISRIFQENQRISETHLENSRQR------RSENHQNHEI----------WVPPRPCRWKFNSD
        EE+  ++ ++ W IW+ RN+++   +   T             ++ Q + + ++ NS  R      +S N   H I          W PP    WK N+D
Subjt:  EEDLGKAALLIWCIWDYRNRNLNIQITKNTDCTHFYNHISRIFQENQRISETHLENSRQR------RSENHQNHEI----------WVPPRPCRWKFNSD

Query:  ASWNDDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLDLE------VESDCLGLVRCLNKDCEDLSEIKNLVDAILD
        A+W  D   GG GW++RD  G +I A  +  + E  ++ LE  A+ EGL  +       +  E      +ESD L  +  L++ C+D +EI  L++ I  
Subjt:  ASWNDDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLDLE------VESDCLGLVRCLNKDCEDLSEIKNLVDAILD

Query:  LAPKLGVLSFKHCPRIQNGVAHSIAR
        +   + ++S +H  R  N VAH +AR
Subjt:  LAPKLGVLSFKHCPRIQNGVAHSIAR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CP26 uncharacterized protein LOC111013412 (e value: 3.7e-17; % identity: 29.11)
Query:  EEDLGKAALLIWCIWDYRNRNL---------NIQITKNTDCTHFYNHISRIFQENQRISETHLENSRQRRSENHQNHEIWVPPRPCRWKFNSDASWNDDL
        EE+  ++ ++ W IW+ RN+++         +IQ+  +    +  N   R    N +   T+ +    RR E++   + W PP    WK N++A+W  D 
Subjt:  EEDLGKAALLIWCIWDYRNRNL---------NIQITKNTDCTHFYNHISRIFQENQRISETHLENSRQRRSENHQNHEIWVPPRPCRWKFNSDASWNDDL

Query:  KIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLDLEVESDCLGLVRCLNKDCEDLSEIKNLVDAILDLAPKLGVLSFKHC
          GG GW++RD  G +I A  +  + E  ++ LE  A+ EGL  +  R      + +ESD L  +  L++ C+D +EI  L++ I  +   + ++S +H 
Subjt:  KIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLDLEVESDCLGLVRCLNKDCEDLSEIKNLVDAILDLAPKLGVLSFKHC

Query:  PRIQNGVAHSIAR
         R  N VAH +AR
Subjt:  PRIQNGVAHSIAR

A0A6J1D4B6 uncharacterized protein LOC111017181 (e value: 1.5e-18; % identity: 32.41)
Query:  DYWSWCCGNLTEEDLGKAALLIWCIWDYRNRNLNIQITKNTDCTHFYNHISRIFQENQRIS-ETHLENSRQRRSENHQNHEIWVPPRPCRWKFNSDASWN
        DY+ W   +  +       +L+W IW YRN+ ++  I +   C+       R F E++     T+L  +     +N      W PP    WK N DA+W 
Subjt:  DYWSWCCGNLTEEDLGKAALLIWCIWDYRNRNLNIQITKNTDCTHFYNHISRIFQENQRIS-ETHLENSRQRRSENHQNHEIWVPPRPCRWKFNSDASWN

Query:  DDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLDLEVESDCLGLVRCLNKDCEDLSEIKNLVDAILDLAPKLGVLSF
        D L  GG GW+VRDS G  I A                KA+ + L+   G     + +E+ESDCL +V  +NK    L+E+  +V+ I      L +  F
Subjt:  DDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLDLEVESDCLGLVRCLNKDCEDLSEIKNLVDAILDLAPKLGVLSF

Query:  KHCPRIQNGVAHSIAR
        KH P   NGVAH IAR
Subjt:  KHCPRIQNGVAHSIAR

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X1 (e value: 6.5e-22; % identity: 27.95)
Query:  NPNLRDQKKKFESTIHAIWECKVAQKVWLDCIPQILMLFHLRRDDWVCSDYWSWCCGNLTEEDLGKAALLIWCIWDYRNRNLNIQITKNTDCTHFYNHIS
        N N  + +KK E+T H +WECKV + +W++C P     F++ R +W   +YW W      EE+  ++ ++   IW+ RN+++   +       H      
Subjt:  NPNLRDQKKKFESTIHAIWECKVAQKVWLDCIPQILMLFHLRRDDWVCSDYWSWCCGNLTEEDLGKAALLIWCIWDYRNRNLNIQITKNTDCTHFYNHIS

Query:  RIFQENQRISETHLENSRQRRSEN-HQNHEI-------WVPPRPCRWKFNSDASWNDDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEG
        ++  +   I+    + + +R+S++ H    I       W PP    WK N+DA+W  D    G GW++RD  G +I  G +  + E  ++ LE  A+ EG
Subjt:  RIFQENQRISETHLENSRQRRSEN-HQNHEI-------WVPPRPCRWKFNSDASWNDDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEG

Query:  LSCLFGRFGDNLDLEVESDCLGLVRCLNK
        L  +  R      + +ESD L  +  L++
Subjt:  LSCLFGRFGDNLDLEVESDCLGLVRCLNK

A0A6J1DSV1 uncharacterized protein LOC111023608 (e value: 8.2e-17; % identity: 27.88)
Query:  EEDLGKAALLIWCIWDYRNRNLNIQITKNTDCTHFYNHISRIFQENQRISETHLENSRQR------RSENHQNHEI----------WVPPRPCRWKFNSD
        EE+  ++ ++ W IW+ RN+++   +   T             ++ Q + + ++ NS  R      +S N   H I          W PP    WK N+D
Subjt:  EEDLGKAALLIWCIWDYRNRNLNIQITKNTDCTHFYNHISRIFQENQRISETHLENSRQR------RSENHQNHEI----------WVPPRPCRWKFNSD

Query:  ASWNDDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLDLE------VESDCLGLVRCLNKDCEDLSEIKNLVDAILD
        A+W  D   GG GW++RD  G +I A  +  + E  ++ LE  A+ EGL  +       +  E      +ESD L  +  L++ C+D +EI  L++ I  
Subjt:  ASWNDDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLDLE------VESDCLGLVRCLNKDCEDLSEIKNLVDAILD

Query:  LAPKLGVLSFKHCPRIQNGVAHSIAR
        +   + ++S +H  R  N VAH +AR
Subjt:  LAPKLGVLSFKHCPRIQNGVAHSIAR

A0A803L9N8 Uncharacterized protein (e value: 2.5e-18; % identity: 28.08)
Query:  ESTIHAIWECKVAQKVWLDCIPQILMLFHLRRDDWVCSDYWSWCCGNL----TEEDLGKAALLIWCIWDYRNRNL--NIQITKNTDCTHFYNHISRIFQE
        +S +H +  C   Q +W       +    L    W   +++ WC   +     ++      + IW IW+ RN+ L   +++     C+   N        
Subjt:  ESTIHAIWECKVAQKVWLDCIPQILMLFHLRRDDWVCSDYWSWCCGNL----TEEDLGKAALLIWCIWDYRNRNL--NIQITKNTDCTHFYNHISRIFQE

Query:  NQRISETHLENSRQRRSENHQNHEIWVPPRPCRWKFNSDASWNDDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLD
        N R   T +E S +          +W+PP   R+K N+DAS + D + GG G VVRD+ G ++    +S  G   +S +EA A+  G+  +F        
Subjt:  NQRISETHLENSRQRRSENHQNHEIWVPPRPCRWKFNSDASWNDDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLD

Query:  LEVESDCLGLVRCLNKDCEDLSEIKNLVDAILDLAPKLGVLSFKHCPRIQNGVAHSIARM
        LEV SDCL ++  LN    + S  + +V  IL  A     + F  CPR+ N VAHS+A +
Subjt:  LEVESDCLGLVRCLNKDCEDLSEIKNLVDAILDLAPKLGVLSFKHCPRIQNGVAHSIARM

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G52990.1 thioredoxin family protein (e value: 9.3e-05; % identity: 25.78)
Query:  CRWKFNSDASWNDDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLDLEVESDCLGLVRCLNKDCEDLSEIKNLVDAI
        CR K N DAS ++   + G GW++R+S G+++  G+   +G       E  A+   +      FG    +  E D   + R +N    D   +K+ +D I
Subjt:  CRWKFNSDASWNDDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLDLEVESDCLGLVRCLNKDCEDLSEIKNLVDAI

Query:  LDLAPKLGVLSFKHCPRIQNGVAHSIAR
            P      F    R QN  A ++ +
Subjt:  LDLAPKLGVLSFKHCPRIQNGVAHSIAR

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 1.4e-08; % identity: 23.35)
Query:  LIWCIWDYRNRNLNIQITKNTDCTHFYNHISRIFQENQRISETHLENSRQ--RRSENHQNHEIWVPPRPCRWKFNSDASWNDDLKIGGFGWVVRDSGGSL
        L+W IW   N      +  N   T F   +     + +   +  + N +Q   R+ +   +  W PP   + K N DAS ++   + G GW++R+S G++
Subjt:  LIWCIWDYRNRNLNIQITKNTDCTHFYNHISRIFQENQRISETHLENSRQ--RRSENHQNHEIWVPPRPCRWKFNSDASWNDDLKIGGFGWVVRDSGGSL

Query:  ICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLDLEVESDCLGLVRCLNKDCEDLSEIKNLVDAILDLAPKLGVLSFKHCPRIQNGVAHSIAR
        I  G+   +G       E   +   +   +G FG +  +  E D   + R +N    +   +++ +D I    P    + F    R QNG A  +A+
Subjt:  ICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLDLEVESDCLGLVRCLNKDCEDLSEIKNLVDAILDLAPKLGVLSFKHCPRIQNGVAHSIAR


Sequences
CDS sequence
ATGAAACCTAACCCAAACCTTAGGGACCAAAAGAAGAAATTTGAATCGACCATTCATGCTATCTGGGAGTGTAAGGTGGCTCAGAAGGTCTGGCTTGATTGCATTCCTCA
GATTCTTATGCTTTTTCATTTGCGCAGGGATGATTGGGTTTGCTCGGATTATTGGAGCTGGTGCTGTGGCAACTTAACCGAAGAGGATTTGGGGAAAGCTGCTTTGTTAA
TATGGTGTATTTGGGATTATAGGAATCGAAATCTCAACATCCAGATCACAAAGAATACAGATTGCACACATTTTTACAACCATATCTCAAGAATTTTTCAGGAAAATCAA
AGAATTTCTGAGACTCACCTGGAAAATTCTCGCCAACGTAGATCTGAGAACCACCAGAATCACGAGATCTGGGTCCCTCCTCGCCCTTGCAGATGGAAATTTAATTCAGA
TGCTTCCTGGAATGATGATTTGAAGATCGGCGGTTTCGGTTGGGTGGTGCGTGATTCCGGCGGATCTTTGATCTGTGCGGGCCTGCAATCTTCAAAAGGGGAGTGGCCTG
TTAGTGTTCTCGAAGCCAAAGCAATGTGGGAAGGGTTGTCTTGCCTATTTGGAAGATTTGGAGATAACCTGGATTTAGAAGTTGAATCTGATTGCCTGGGTCTCGTCCGC
TGCTTGAACAAAGATTGTGAAGACCTGTCTGAAATCAAGAATTTGGTTGATGCCATTTTAGATCTGGCTCCCAAATTAGGAGTGCTATCTTTCAAGCACTGCCCTAGAAT
TCAGAATGGGGTGGCTCACTCCATTGCCCGAATGGGTGCTTTTTGTAATTCTAATTTTGATTTTGTTGCTGAGGATCAGAGGCTTCTTCCTTGCTGGAAGGTCAGCACCT
TTTTTGGTCTTCTGATCCTCCTCCCTGGCTATCATCCTTAA
mRNA sequence
ATGAAACCTAACCCAAACCTTAGGGACCAAAAGAAGAAATTTGAATCGACCATTCATGCTATCTGGGAGTGTAAGGTGGCTCAGAAGGTCTGGCTTGATTGCATTCCTCA
GATTCTTATGCTTTTTCATTTGCGCAGGGATGATTGGGTTTGCTCGGATTATTGGAGCTGGTGCTGTGGCAACTTAACCGAAGAGGATTTGGGGAAAGCTGCTTTGTTAA
TATGGTGTATTTGGGATTATAGGAATCGAAATCTCAACATCCAGATCACAAAGAATACAGATTGCACACATTTTTACAACCATATCTCAAGAATTTTTCAGGAAAATCAA
AGAATTTCTGAGACTCACCTGGAAAATTCTCGCCAACGTAGATCTGAGAACCACCAGAATCACGAGATCTGGGTCCCTCCTCGCCCTTGCAGATGGAAATTTAATTCAGA
TGCTTCCTGGAATGATGATTTGAAGATCGGCGGTTTCGGTTGGGTGGTGCGTGATTCCGGCGGATCTTTGATCTGTGCGGGCCTGCAATCTTCAAAAGGGGAGTGGCCTG
TTAGTGTTCTCGAAGCCAAAGCAATGTGGGAAGGGTTGTCTTGCCTATTTGGAAGATTTGGAGATAACCTGGATTTAGAAGTTGAATCTGATTGCCTGGGTCTCGTCCGC
TGCTTGAACAAAGATTGTGAAGACCTGTCTGAAATCAAGAATTTGGTTGATGCCATTTTAGATCTGGCTCCCAAATTAGGAGTGCTATCTTTCAAGCACTGCCCTAGAAT
TCAGAATGGGGTGGCTCACTCCATTGCCCGAATGGGTGCTTTTTGTAATTCTAATTTTGATTTTGTTGCTGAGGATCAGAGGCTTCTTCCTTGCTGGAAGGTCAGCACCT
TTTTTGGTCTTCTGATCCTCCTCCCTGGCTATCATCCTTAA
Protein sequence
MKPNPNLRDQKKKFESTIHAIWECKVAQKVWLDCIPQILMLFHLRRDDWVCSDYWSWCCGNLTEEDLGKAALLIWCIWDYRNRNLNIQITKNTDCTHFYNHISRIFQENQ
RISETHLENSRQRRSENHQNHEIWVPPRPCRWKFNSDASWNDDLKIGGFGWVVRDSGGSLICAGLQSSKGEWPVSVLEAKAMWEGLSCLFGRFGDNLDLEVESDCLGLVR
CLNKDCEDLSEIKNLVDAILDLAPKLGVLSFKHCPRIQNGVAHSIARMGAFCNSNFDFVAEDQRLLPCWKVSTFFGLLILLPGYHP
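
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code and reading frame 1; the function name `translate` and the variables `cds_start` and `cds_end` are illustrative and not part of CuGenDB. Only the first 51 and last 18 nucleotides of the CDS are pasted in to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the record uses the standard nuclear code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 51 nt and last 18 nt of the Lag0011137 CDS shown above.
cds_start = "ATGAAACCTAACCCAAACCTTAGGGACCAAAAGAAGAAATTTGAATCGACC"
cds_end = "CCTGGCTATCATCCTTAA"

print(translate(cds_start))  # MKPNPNLRDQKKKFEST
print(translate(cds_end))    # PGYHP
```

Both fragments agree with the start and end of the listed protein sequence. Pasting the full CDS into `translate` would allow the whole protein to be compared the same way.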