CuGenDBv2

Lag0005862 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005862
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr6:32432292..32432720
RNA-Seq Expression: Lag0005862
Synteny: Lag0005862
Gene Ontology terms:
  GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
  GO:0004190 - aspartic-type endopeptidase activity (molecular function)
  GO:0004519 - endonuclease activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR021109 - Aspartic peptidase domain superfamily
  IPR043502 - DNA/RNA polymerase superfamily
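
The genome location in this record reads as a contig name followed by a start..end span. The following is a minimal sketch for splitting it and computing the span length, assuming the coordinates are 1-based and inclusive (the record does not state the convention). `parse_location` and `span_length` are hypothetical helpers, not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into chromosome, start, and end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr6:32432292..32432720")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the span is 429 bp, which is consistent with a 142-residue protein plus a stop codon (143 codons).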


Homology
GenBank top hits (e value, % identity, alignment)
XP_022146110.1 uncharacterized protein LOC111015405 [Momordica charantia] | e value: 1.7e-51 | % identity: 71.43
Query:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL
        M DFDVILGMDWLA N A I+C+++EV  +LPSG+ F FKG     P+ VSALKA+RL+Q+GAW YLA+VVD+SK  PS+DSV +V+ FPDVFP DLPGL
Subjt:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL

Query:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE
        PP RE+DFCIDL PGTAP+SKAPYRMAPAELKELK QLEE
Subjt:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE

XP_022156880.1 uncharacterized protein LOC111023714 [Momordica charantia] | e value: 6.4e-51 | % identity: 71.43
Query:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL
        M DFDVILGMDWLA N A I+C+++EV  +LPSGQ F FK    G P+ VSALKA+RL+Q+GAW YLASVVD+S   P++DSV +VK FPDVFPEDL GL
Subjt:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL

Query:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE
        PP RE+DFCIDLLPGTAP+SKAPYRM+PAEL+ELK QLEE
Subjt:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE

XP_022156985.1 uncharacterized protein LOC111023814 [Momordica charantia] | e value: 2.7e-49 | % identity: 70.71
Query:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL
        M DFD+ILGMDWLA N A+I+  ++EV  +LPSGQ F FKG     PKVVSALKA++L+QHGAW YL SVVD SK +PS+DSV +   FPDVFPEDLPGL
Subjt:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL

Query:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE
        PP RE+DF IDL PGT P+S+APYRMAPAELKELKVQLEE
Subjt:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE

XP_022156992.1 uncharacterized protein LOC111023821 [Momordica charantia] | e value: 1.2e-49 | % identity: 72.14
Query:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL
        M DFDVILGMDWLA N A I C+KKE   +LPS Q F FKG K   P+VVSALKA   +Q GAWAYLASVVD  K  PS+++V +V  F DVFPEDLPGL
Subjt:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL

Query:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE
        PP+REVDFCI+LLPGTAP+SKAPYRMAPAELKELK+QLEE
Subjt:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia] | e value: 2.7e-49 | % identity: 70.71
Query:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL
        M DFDVILGMDWLA N A I+C+K+EV  +LPSG+ F FKG   G P+ VSALKA+RL+ +GAW YLASVVD+S   PS+DS  +VK F DVFPEDL GL
Subjt:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL

Query:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE
        PP RE+DFCIDLLPGTAP+SKAP RMAP ELKELK QLEE
Subjt:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE
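
The % identity values can be related to the alignment midline, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch. The following is a minimal sketch, assuming identity is computed as identical columns divided by alignment length (the page does not document its formula). `percent_identity` is a hypothetical helper, and the short strings are the first ten columns of the first hit.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent of alignment columns whose midline character is a letter.

    ASSUMPTION: letters in the midline mark identical residues; '+' and
    spaces mark non-identical columns.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


query = "MHDFDVILGM"
midline = "M DFDVILGM"
print(percent_identity(midline, len(query)))
```

By the same arithmetic, a reported 71.43 over the 140 aligned columns shown here would correspond to 100 identical positions (100 / 140). That count is inferred, not stated in the record.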

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CWD0 uncharacterized protein LOC111015405 | e value: 8.1e-52 | % identity: 71.43
Query:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL
        M DFDVILGMDWLA N A I+C+++EV  +LPSG+ F FKG     P+ VSALKA+RL+Q+GAW YLA+VVD+SK  PS+DSV +V+ FPDVFP DLPGL
Subjt:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL

Query:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE
        PP RE+DFCIDL PGTAP+SKAPYRMAPAELKELK QLEE
Subjt:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE

A0A6J1DRK1 uncharacterized protein LOC111023714 | e value: 3.1e-51 | % identity: 71.43
Query:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL
        M DFDVILGMDWLA N A I+C+++EV  +LPSGQ F FK    G P+ VSALKA+RL+Q+GAW YLASVVD+S   P++DSV +VK FPDVFPEDL GL
Subjt:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL

Query:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE
        PP RE+DFCIDLLPGTAP+SKAPYRM+PAEL+ELK QLEE
Subjt:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE

A0A6J1DRW8 uncharacterized protein LOC111023814 | e value: 1.3e-49 | % identity: 70.71
Query:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL
        M DFD+ILGMDWLA N A+I+  ++EV  +LPSGQ F FKG     PKVVSALKA++L+QHGAW YL SVVD SK +PS+DSV +   FPDVFPEDLPGL
Subjt:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL

Query:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE
        PP RE+DF IDL PGT P+S+APYRMAPAELKELKVQLEE
Subjt:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE

A0A6J1DTE5 uncharacterized protein LOC111023821 | e value: 5.8e-50 | % identity: 72.14
Query:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL
        M DFDVILGMDWLA N A I C+KKE   +LPS Q F FKG K   P+VVSALKA   +Q GAWAYLASVVD  K  PS+++V +V  F DVFPEDLPGL
Subjt:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL

Query:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE
        PP+REVDFCI+LLPGTAP+SKAPYRMAPAELKELK+QLEE
Subjt:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE

A0A6J1DYU5 uncharacterized protein LOC111025517 | e value: 1.3e-49 | % identity: 70.71
Query:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL
        M DFDVILGMDWLA N A I+C+K+EV  +LPSG+ F FKG   G P+ VSALKA+RL+ +GAW YLASVVD+S   PS+DS  +VK F DVFPEDL GL
Subjt:  MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGL

Query:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE
        PP RE+DFCIDLLPGTAP+SKAP RMAP ELKELK QLEE
Subjt:  PPAREVDFCIDLLPGTAPLSKAPYRMAPAELKELKVQLEE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCACGATTTTGATGTGATTCTAGGCATGGACTGGTTGGCTGGTAATAACGCCACCATTCATTGTGCTAAGAAGGAAGTGCATCTCAAGTTGCCTTCGGGCCAAGGGTT
CAAGTTTAAAGGAGCGAAGAACGGAAGTCCTAAGGTAGTTTCTGCCTTGAAAGCCAAGCGTCTTATGCAGCATGGAGCTTGGGCGTATTTGGCAAGTGTGGTGGATGTTT
CTAAAGAATCGCCTAGTTTGGACTCCGTCCCCATCGTGAAAGGGTTCCCAGATGTCTTTCCAGAAGATCTTCCTGGGCTACCTCCAGCTCGAGAAGTGGACTTCTGTATA
GACCTTCTTCCTGGGACAGCACCGTTGTCTAAGGCACCTTATAGGATGGCACCAGCTGAGCTCAAGGAGCTCAAAGTGCAGCTTGAGGAGGAACCATAG
mRNA sequence
ATGCACGATTTTGATGTGATTCTAGGCATGGACTGGTTGGCTGGTAATAACGCCACCATTCATTGTGCTAAGAAGGAAGTGCATCTCAAGTTGCCTTCGGGCCAAGGGTT
CAAGTTTAAAGGAGCGAAGAACGGAAGTCCTAAGGTAGTTTCTGCCTTGAAAGCCAAGCGTCTTATGCAGCATGGAGCTTGGGCGTATTTGGCAAGTGTGGTGGATGTTT
CTAAAGAATCGCCTAGTTTGGACTCCGTCCCCATCGTGAAAGGGTTCCCAGATGTCTTTCCAGAAGATCTTCCTGGGCTACCTCCAGCTCGAGAAGTGGACTTCTGTATA
GACCTTCTTCCTGGGACAGCACCGTTGTCTAAGGCACCTTATAGGATGGCACCAGCTGAGCTCAAGGAGCTCAAAGTGCAGCTTGAGGAGGAACCATAG
Protein sequence
MHDFDVILGMDWLAGNNATIHCAKKEVHLKLPSGQGFKFKGAKNGSPKVVSALKAKRLMQHGAWAYLASVVDVSKESPSLDSVPIVKGFPDVFPEDLPGLPPAREVDFCI
DLLPGTAPLSKAPYRMAPAELKELKVQLEEEP
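
The protein sequence can be checked against the CDS by translating codon by codon. The following is a minimal sketch in plain Python, assuming the standard nuclear genetic code applies. `translate` is a hypothetical helper, and only the first 30 nt and the last 12 nt of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGCACGATTTTGATGTGATTCTAGGCATG"))  # first 10 codons of the CDS
print(translate("GAGGAACCATAG"))                    # last 4 codons of the CDS
```

The two calls print `MHDFDVILGM` and `EEP*`, matching the start and end of the protein sequence listed above.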