; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012070 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012070
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr1:37053504..37058855
RNA-Seq ExpressionLag0012070
SyntenyLag0012070
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004497 - monooxygenase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588936.1 hypothetical protein SDJN03_17501, partial [Cucurbita argyrosperma subsp. sororia]3.9e-4286.44Show/hide
Query:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV
        DGFVYKRKRRRLDP E VA RSSVAQAADL+AEENRRRERRR TLLKVRAKYQREIEQWEVLSNNLRAMEE TRKL+E YRRE EEGTA  L AS LN V
Subjt:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV

Query:  QEKELSCASMVEELLSQI
        QEKELSCASMVE+LLSQ+
Subjt:  QEKELSCASMVEELLSQI

XP_022928186.1 uncharacterized protein LOC111435083 [Cucurbita moschata]6.7e-4286.44Show/hide
Query:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV
        DGFVYKRKRRRLDP E VA RSSVAQAADL+AEENRRRERRR TLLKVRAKYQREIEQWEVLSNNLRA EE TRKLQE YRRE EEGTA  L AS LN V
Subjt:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV

Query:  QEKELSCASMVEELLSQI
        QEKELSCASMVE+LLSQ+
Subjt:  QEKELSCASMVEELLSQI

XP_022989565.1 uncharacterized protein LOC111486625 [Cucurbita maxima]9.3e-4487.29Show/hide
Query:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV
        DGFVYKRKRRRLDPTE VA RSSVAQAADL+AEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEE TRKLQE YRRE EEGTAS L AS LN++
Subjt:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV

Query:  QEKELSCASMVEELLSQI
         EKELSC+SMVE+LLSQ+
Subjt:  QEKELSCASMVEELLSQI

XP_023530399.1 uncharacterized protein LOC111792986 [Cucurbita pepo subsp. pepo]2.2e-4084.75Show/hide
Query:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV
        DGFVYKRKRRRLDP E VA RSSVAQAADL+AEENRRRERRR TLLKVRAKYQREIEQWEVLSNNLRAMEE TRKL+E  RRE EEG A  L AS LN V
Subjt:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV

Query:  QEKELSCASMVEELLSQI
        QEKELSCASMVE+LLSQ+
Subjt:  QEKELSCASMVEELLSQI

XP_038904940.1 uncharacterized protein LOC120091144 [Benincasa hispida]4.8e-4082.2Show/hide
Query:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV
        DGFVYKRKRRRLDP E VA RSSVAQA DL+AEENRRRE RRKTLLKVRAKYQRE+EQWEVLS NLR MEE TRKLQE YRR+ EEGTASLL AS L VV
Subjt:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV

Query:  QEKELSCASMVEELLSQI
        +EKE+SCASMV++LLSQ+
Subjt:  QEKELSCASMVEELLSQI

TrEMBL top hitse value%identityAlignment
A0A6J1DE60 uncharacterized protein LOC1110199622.6e-3981.36Show/hide
Query:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV
        DGFVYKRKRRRLDP E VA RSSVAQAAD++AEENRRRERRRKTLLKVRAKYQREIEQWEVLS NLRAMEE  R+LQE YRRE EEGT  L  AS L+VV
Subjt:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV

Query:  QEKELSCASMVEELLSQI
        Q++ELS ASMVE+LLSQ+
Subjt:  QEKELSCASMVEELLSQI

A0A6J1EJL6 uncharacterized protein LOC1114350833.2e-4286.44Show/hide
Query:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV
        DGFVYKRKRRRLDP E VA RSSVAQAADL+AEENRRRERRR TLLKVRAKYQREIEQWEVLSNNLRA EE TRKLQE YRRE EEGTA  L AS LN V
Subjt:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV

Query:  QEKELSCASMVEELLSQI
        QEKELSCASMVE+LLSQ+
Subjt:  QEKELSCASMVEELLSQI

A0A6J1F3L3 uncharacterized protein LOC1114420711.2e-3982.2Show/hide
Query:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV
        DGFVYKRKRRRLDP E VA RSSVAQA DL+AEENRRRERRRKTLLKVRAKY+REIEQWEVLS+NL+AMEE  RKLQE YR+E E G AS L ASLL VV
Subjt:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV

Query:  QEKELSCASMVEELLSQI
        Q KELSCASMVE+LLSQ+
Subjt:  QEKELSCASMVEELLSQI

A0A6J1IZN1 uncharacterized protein LOC1114813971.5e-3982.2Show/hide
Query:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV
        DGFVYKRKRRRLDP E VA RSSVAQAADL+AEENRRRERRRKTLLKVRAKY++EIEQWEVLS+NLRAMEE  RKLQE YR+E E G AS L ASLL  V
Subjt:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV

Query:  QEKELSCASMVEELLSQI
        Q KELSCASMVE+LLSQ+
Subjt:  QEKELSCASMVEELLSQI

A0A6J1JMQ6 uncharacterized protein LOC1114866254.5e-4487.29Show/hide
Query:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV
        DGFVYKRKRRRLDPTE VA RSSVAQAADL+AEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEE TRKLQE YRRE EEGTAS L AS LN++
Subjt:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVV

Query:  QEKELSCASMVEELLSQI
         EKELSC+SMVE+LLSQ+
Subjt:  QEKELSCASMVEELLSQI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G27520.1 unknown protein2.0e-0740.96Show/hide
Query:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRRE
        DGFVY RK+R     +   T        D   EE  RR R+++ L+K++ KYQ EI+QWE+LSN+  AM+E   + Q   R E
Subjt:  DGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQWEVLSNNLRAMEEMTRKLQELYRRE

AT4G29090.1 Ribonuclease H-like superfamily protein5.9e-0426.19Show/hide
Query:  PWFISLIRRWECRGSKIYWRESPFVPRLGPSTIRRPTDLLWCCWREMVGTEFEDFCG--DVLGPCKCGEGGDNEVWFPSVFPNYKLNTDIAVNKELYLSS
        PW   L R W+ R +++ +R   F  +     +RR  D L   WR  + TE E  CG    +    CG       W P      K NTD   N++     
Subjt:  PWFISLIRRWECRGSKIYWRESPFVPRLGPSTIRRPTDLLWCCWREMVGTEFEDFCG--DVLGPCKCGEGGDNEVWFPSVFPNYKLNTDIAVNKELYLSS

Query:  IGTMIRNERGEVMLTLMKLIRYVLDVDVLEVMAIREGLAVVVEAGFSQMEVESDSARVVRRIRSESNECSELGLIVEDIKSLAGVLSFCSFSWCRRSRNL
        IG ++RNE+GEV     + +  +  V   E+ A+R  +  +    ++ +  ESDS +V+  I +       L   ++D++ L    +   F +  R  N 
Subjt:  IGTMIRNERGEVMLTLMKLIRYVLDVDVLEVMAIREGLAVVVEAGFSQMEVESDSARVVRRIRSESNECSELGLIVEDIKSLAGVLSFCSFSWCRRSRNL

Query:  VAHEIAKAAL
        +A  +A+ +L
Subjt:  VAHEIAKAAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCAGATTAACATGCTTTCTTCCTCATAGACGGCGCCAACTTTTGATGGTCAAATTTGGTCAGAGGAACTCACCTGACCTTCACGTGCCACCAACGGTAAATTTGTT
GGAGAGGGGGGAAGATGTGGCCTGCAAGACGGTAAACCTACACACCGGTGTGGTGCTAGCCACACCGGCTTCGATGCTTAAGTCAGAAAACGGAAGTAGGAGGGCTAGAG
AGCGTTTCGGGACAAACCAGGTGAAACCGGGGCGGCTAGAGGCTACAAGGACCGAATGGAGTCGGAGGGACTCGGCTAGAGGCCAAAGGCCGAGGCCGAGCATGGGGTCG
GGCCAAAAGCCCGACCCCTTCGGTCTCGGCCCGACCCACTGGCCCAAGAGGGAGAGGGAGGAGGCTTTTCCCTCCCTTTCCGGCGCCGGCGACGGGAGGAGGGAGGCCCT
CCTCCCTCACGGCTTCAAATCTATAAGTTTACAAGCGAAAAGGGATGGCTTTGTCTACAAGCGCAAGAGGCGTCGGCTGGACCCGACAGAAGGCGTTGCAACTCGTTCGT
CGGTGGCTCAGGCGGCGGACCTTCAGGCGGAGGAGAATCGGCGGCGAGAGCGCAGGAGGAAGACGTTGTTGAAGGTTAGAGCGAAGTATCAGAGAGAGATTGAGCAGTGG
GAGGTTTTGTCGAACAACTTGCGGGCGATGGAGGAGATGACTCGGAAGCTGCAGGAACTGTACAGACGGGAAGAGGAAGAAGGAACCGCGTCGCTTCTGGGCGCTTCCTT
GTTGAACGTGGTTCAGGAGAAGGAGTTGTCCTGCGCGTCGATGGTGGAGGAACTTCTCTCTCAGATCTGGACGGGGAGGGAGGAAGGCCTCCCTCCTCCTCTGTTTCGCG
CCGGAAACGGAGGAGGAGACAGTCTCCTCCTCTGTTTCGCGCCGGAAATGGAGGAGTCGAAGAATATCTGGAAGATGAACACTGATGCTACCTGGAATGAGAAGAATGAC
ACATTGGAGGAATTGGGTGGGCTATCCATGACTCCACTGGCTCCCCGACTAGTTTTACGCGTTGGAGGTGATCAACGTTCTACGAAAGACTTCTTTGAGATGAAGAATTT
TACGGACGTCATATCGCCCTTTTGTGCCGACTTGGGAGATGTGAAGTTTCGCCACTGCCCTAAGTATCATAACACAGTAACCCACTGTATAGCTCGTTTGTTTCATCGGA
TTTTGGCTAGAATTAGAGATCCTCCTTTTTGCGGGAGGATGGGCGCGACTGTGTTCCCCTGGTTTATTTCCCTCATTAGGAGGTGGGAGTGTAGAGGGTCGAAGATTTAT
TGGAGAGAGTCTCCTTTTGTCCCTCGGCTTGGGCCTTCGACTATAAGGAGACCGACAGATTTGTTGTGGTGCTGCTGGAGGGAGATGGTGGGGACTGAATTTGAGGATTT
TTGTGGTGATGTGTTGGGGCCGTGCAAGTGTGGTGAGGGGGGTGATAATGAAGTTTGGTTTCCCTCTGTTTTCCCAAATTACAAGCTCAATACTGATATAGCAGTCAACA
AAGAACTCTATTTGAGTAGTATTGGGACAATGATCAGAAATGAAAGAGGGGAAGTGATGCTTACCTTGATGAAGCTGATCAGATATGTGCTTGATGTCGATGTTTTGGAG
GTGATGGCTATTCGCGAAGGGCTGGCAGTAGTTGTTGAGGCCGGCTTCTCACAGATGGAGGTGGAGTCGGATTCGGCTAGAGTGGTGAGAAGGATCAGATCTGAGTCTAA
TGAGTGTTCGGAGTTGGGTTTAATAGTAGAAGATATCAAGAGTTTAGCCGGTGTGTTGAGCTTTTGCTCGTTTAGCTGGTGTCGGCGGTCGAGGAACCTGGTGGCGCACG
AGATAGCGAAAGCAGCGTTGAGGTCGGAACTGGAAGGGGTGTGGCTGGAGGAGTTGCCGACGGATGCTGGGAATTCCTTCGTTGCAGAGCTGAGGCCTTTCGATTTGGGG
GCTAGGGATTTCTGTAATGATTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGCAGATTAACATGCTTTCTTCCTCATAGACGGCGCCAACTTTTGATGGTCAAATTTGGTCAGAGGAACTCACCTGACCTTCACGTGCCACCAACGGTAAATTTGTT
GGAGAGGGGGGAAGATGTGGCCTGCAAGACGGTAAACCTACACACCGGTGTGGTGCTAGCCACACCGGCTTCGATGCTTAAGTCAGAAAACGGAAGTAGGAGGGCTAGAG
AGCGTTTCGGGACAAACCAGGTGAAACCGGGGCGGCTAGAGGCTACAAGGACCGAATGGAGTCGGAGGGACTCGGCTAGAGGCCAAAGGCCGAGGCCGAGCATGGGGTCG
GGCCAAAAGCCCGACCCCTTCGGTCTCGGCCCGACCCACTGGCCCAAGAGGGAGAGGGAGGAGGCTTTTCCCTCCCTTTCCGGCGCCGGCGACGGGAGGAGGGAGGCCCT
CCTCCCTCACGGCTTCAAATCTATAAGTTTACAAGCGAAAAGGGATGGCTTTGTCTACAAGCGCAAGAGGCGTCGGCTGGACCCGACAGAAGGCGTTGCAACTCGTTCGT
CGGTGGCTCAGGCGGCGGACCTTCAGGCGGAGGAGAATCGGCGGCGAGAGCGCAGGAGGAAGACGTTGTTGAAGGTTAGAGCGAAGTATCAGAGAGAGATTGAGCAGTGG
GAGGTTTTGTCGAACAACTTGCGGGCGATGGAGGAGATGACTCGGAAGCTGCAGGAACTGTACAGACGGGAAGAGGAAGAAGGAACCGCGTCGCTTCTGGGCGCTTCCTT
GTTGAACGTGGTTCAGGAGAAGGAGTTGTCCTGCGCGTCGATGGTGGAGGAACTTCTCTCTCAGATCTGGACGGGGAGGGAGGAAGGCCTCCCTCCTCCTCTGTTTCGCG
CCGGAAACGGAGGAGGAGACAGTCTCCTCCTCTGTTTCGCGCCGGAAATGGAGGAGTCGAAGAATATCTGGAAGATGAACACTGATGCTACCTGGAATGAGAAGAATGAC
ACATTGGAGGAATTGGGTGGGCTATCCATGACTCCACTGGCTCCCCGACTAGTTTTACGCGTTGGAGGTGATCAACGTTCTACGAAAGACTTCTTTGAGATGAAGAATTT
TACGGACGTCATATCGCCCTTTTGTGCCGACTTGGGAGATGTGAAGTTTCGCCACTGCCCTAAGTATCATAACACAGTAACCCACTGTATAGCTCGTTTGTTTCATCGGA
TTTTGGCTAGAATTAGAGATCCTCCTTTTTGCGGGAGGATGGGCGCGACTGTGTTCCCCTGGTTTATTTCCCTCATTAGGAGGTGGGAGTGTAGAGGGTCGAAGATTTAT
TGGAGAGAGTCTCCTTTTGTCCCTCGGCTTGGGCCTTCGACTATAAGGAGACCGACAGATTTGTTGTGGTGCTGCTGGAGGGAGATGGTGGGGACTGAATTTGAGGATTT
TTGTGGTGATGTGTTGGGGCCGTGCAAGTGTGGTGAGGGGGGTGATAATGAAGTTTGGTTTCCCTCTGTTTTCCCAAATTACAAGCTCAATACTGATATAGCAGTCAACA
AAGAACTCTATTTGAGTAGTATTGGGACAATGATCAGAAATGAAAGAGGGGAAGTGATGCTTACCTTGATGAAGCTGATCAGATATGTGCTTGATGTCGATGTTTTGGAG
GTGATGGCTATTCGCGAAGGGCTGGCAGTAGTTGTTGAGGCCGGCTTCTCACAGATGGAGGTGGAGTCGGATTCGGCTAGAGTGGTGAGAAGGATCAGATCTGAGTCTAA
TGAGTGTTCGGAGTTGGGTTTAATAGTAGAAGATATCAAGAGTTTAGCCGGTGTGTTGAGCTTTTGCTCGTTTAGCTGGTGTCGGCGGTCGAGGAACCTGGTGGCGCACG
AGATAGCGAAAGCAGCGTTGAGGTCGGAACTGGAAGGGGTGTGGCTGGAGGAGTTGCCGACGGATGCTGGGAATTCCTTCGTTGCAGAGCTGAGGCCTTTCGATTTGGGG
GCTAGGGATTTCTGTAATGATTCTTAG
Protein sequenceShow/hide protein sequence
MCRLTCFLPHRRRQLLMVKFGQRNSPDLHVPPTVNLLERGEDVACKTVNLHTGVVLATPASMLKSENGSRRARERFGTNQVKPGRLEATRTEWSRRDSARGQRPRPSMGS
GQKPDPFGLGPTHWPKREREEAFPSLSGAGDGRREALLPHGFKSISLQAKRDGFVYKRKRRRLDPTEGVATRSSVAQAADLQAEENRRRERRRKTLLKVRAKYQREIEQW
EVLSNNLRAMEEMTRKLQELYRREEEEGTASLLGASLLNVVQEKELSCASMVEELLSQIWTGREEGLPPPLFRAGNGGGDSLLLCFAPEMEESKNIWKMNTDATWNEKND
TLEELGGLSMTPLAPRLVLRVGGDQRSTKDFFEMKNFTDVISPFCADLGDVKFRHCPKYHNTVTHCIARLFHRILARIRDPPFCGRMGATVFPWFISLIRRWECRGSKIY
WRESPFVPRLGPSTIRRPTDLLWCCWREMVGTEFEDFCGDVLGPCKCGEGGDNEVWFPSVFPNYKLNTDIAVNKELYLSSIGTMIRNERGEVMLTLMKLIRYVLDVDVLE
VMAIREGLAVVVEAGFSQMEVESDSARVVRRIRSESNECSELGLIVEDIKSLAGVLSFCSFSWCRRSRNLVAHEIAKAALRSELEGVWLEELPTDAGNSFVAELRPFDLG
ARDFCNDS