CuGenDBv2

Lag0005756 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005756
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr6:28473818..28474762
RNA-Seq Expression: Lag0005756
Synteny: Lag0005756
Gene Ontology terms:
    GO:0006508 - proteolysis (biological process)
    GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
    IPR001969 - Aspartic peptidase, active site
    IPR005162 - Retrotransposon gag domain
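The genome location field follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below parses that string and computes the span length. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

chrom, start, end = parse_location("chr6:28473818..28474762")
print(chrom, span_length(start, end))  # chr6 945
```

Under that assumption the gene spans 945 bp on chr6.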


Homology
GenBank top hits (e value, % identity, alignment)
XP_022156067.1 uncharacterized protein LOC111023035 [Momordica charantia] (e value: 9.8e-33; % identity: 44.49)
Query:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL-------------EVGSTSGVKRKPPPTQARPPQKAPRQ
        G+++V QY+RKFTELSRF    + T + KI +FI GLR EI+G + +   TT+AAA+  AL              +GS+SGVKRK     +       + 
Subjt:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL-------------EVGSTSGVKRKPPPTQARPPQKAPRQ

Query:  QPQKQALAIPRCNVCNKQHA----------------GHYARWCP-NRDETQARRPALQTAQAPRQGGKQKARVFTL-KDEIAEDDVVVTGTILVLRIPTF
          Q+Q  A P C  C K HA                GH+AR CP     TQA      TA A  QGG Q+ARVF L + ++   + VVTGTILVL +P +
Subjt:  QPQKQALAIPRCNVCNKQHA----------------GHYARWCP-NRDETQARRPALQTAQAPRQGGKQKARVFTL-KDEIAEDDVVVTGTILVLRIPTF

Query:  VLVDSGSSHSFISSAFVDQADLKLEPLGFVLSVSPP
         L DSGSSHSFI+S FV  ADL+LE LGF+LSVS P
Subjt:  VLVDSGSSHSFISSAFVDQADLKLEPLGFVLSVSPP

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia] (e value: 1.8e-42; % identity: 49.15)
Query:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL--------------EVGSTSGVKRKPPPTQARPPQKAPR
        GT+SV QYERKFTELSRFA +L+ T   KIKRF+KGLR+ IRG +D+  PTT+A A+ GAL              EVGS+SGVKRK P T A    +AP+
Subjt:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL--------------EVGSTSGVKRKPPPTQARPPQKAPR

Query:  QQPQKQALAIPRCNVCNKQHA----------------GHYARWCPNRDETQARRPALQTAQAPRQGGKQKARVFTL-KDEIAEDDVVVTGTILVLRIPTF
        +Q Q Q +  P C  C K+H                 GH+AR CP       R           QG  Q+ARVF L + E A+ + VVTGT+LV  +P +
Subjt:  QQPQKQALAIPRCNVCNKQHA----------------GHYARWCPNRDETQARRPALQTAQAPRQGGKQKARVFTL-KDEIAEDDVVVTGTILVLRIPTF

Query:  VLVDSGSSHSFISSAFVDQADLKLEPLGFVLSVSPP
        VL DSGSSH+FISS FV QA L+LEPLGF+LSVS P
Subjt:  VLVDSGSSHSFISSAFVDQADLKLEPLGFVLSVSPP

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia] (e value: 9.8e-33; % identity: 44.92)
Query:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL-------------EVGSTSGVKRKPPPTQARPPQKAPRQ
        G+++V QYERKFTELSRF +  V T + KI +FI GLR EI+G + +  PTT+AAA+  AL              +GS SGVKRK     A    +  + 
Subjt:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL-------------EVGSTSGVKRKPPPTQARPPQKAPRQ

Query:  QPQKQALAIPRCNVCNKQHA----------------GHYARWC-PNRDETQARRPALQTAQAPRQGGKQKARVFTL-KDEIAEDDVVVTGTILVLRIPTF
          Q+Q  A P C  C K HA                GH+ R C      TQA      TA A  QGG Q ARVF L + ++   + VVTGTIL+L IP +
Subjt:  QPQKQALAIPRCNVCNKQHA----------------GHYARWC-PNRDETQARRPALQTAQAPRQGGKQKARVFTL-KDEIAEDDVVVTGTILVLRIPTF

Query:  VLVDSGSSHSFISSAFVDQADLKLEPLGFVLSVSPP
         L DSGSSHSFI+S FV  ADL+LE  GF LSVS P
Subjt:  VLVDSGSSHSFISSAFVDQADLKLEPLGFVLSVSPP

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia] (e value: 9.8e-33; % identity: 43.64)
Query:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL-------------EVGSTSGVKRKPPPTQARPPQKAPRQ
        G+++V +YERKFTELSRF    + T + KI +FI GLR EI+G + +  PTT+AAA+  AL              +GS+SGVKRK     +  P +  + 
Subjt:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL-------------EVGSTSGVKRKPPPTQARPPQKAPRQ

Query:  QPQKQALAIPRCNVCNKQHA----------------GHYARWCP-NRDETQARRPALQTAQAPRQGGKQKARVFTL-KDEIAEDDVVVTGTILVLRIPTF
          Q+Q  A P C  C K HA                GH+AR CP     TQA    +  A A  QGG  +ARVF L + ++   + VVT T+LVL +P +
Subjt:  QPQKQALAIPRCNVCNKQHA----------------GHYARWCP-NRDETQARRPALQTAQAPRQGGKQKARVFTL-KDEIAEDDVVVTGTILVLRIPTF

Query:  VLVDSGSSHSFISSAFVDQADLKLEPLGFVLSVSPP
         L DSGSSHSFI+S FV  ADL+LE LGF+LSVS P
Subjt:  VLVDSGSSHSFISSAFVDQADLKLEPLGFVLSVSPP

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia] (e value: 4.9e-32; % identity: 45.13)
Query:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL--------------EVGSTSGVKRKPPPTQARPPQKAPR
        GT++V QYERKFTELS FA +L+ T   KIKRF+KGLR+ IRG +D+  P T+A A+ G L              EVGS+SGVKRK  P  A  P +AP+
Subjt:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL--------------EVGSTSGVKRKPPPTQARPPQKAPR

Query:  QQPQKQALAIPRCNVCNKQHAGHYARWCPN-------RDETQARRPALQTAQAPRQGGKQKARVFTLKDEIAEDDVVVTGTILVLRIPTFVLVDSGSSHS
        +  Q+Q L  P C  C K+ AG    W  N       R+   AR  ++  A   R G +    V T             GT LV  +P +VL D GSSH+
Subjt:  QQPQKQALAIPRCNVCNKQHAGHYARWCPN-------RDETQARRPALQTAQAPRQGGKQKARVFTLKDEIAEDDVVVTGTILVLRIPTFVLVDSGSSHS

Query:  FISSAFVDQADLKLEPLGFVLSVSPP
        FIS+AFV QA L+LEPLGF+LSVS P
Subjt:  FISSAFVDQADLKLEPLGFVLSVSPP

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DQB9 Reverse transcriptase (e value: 4.8e-33; % identity: 44.92)
Query:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL-------------EVGSTSGVKRKPPPTQARPPQKAPRQ
        G+++V QYERKFTELSRF +  V T + KI +FI GLR EI+G + +  PTT+AAA+  AL              +GS SGVKRK     A    +  + 
Subjt:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL-------------EVGSTSGVKRKPPPTQARPPQKAPRQ

Query:  QPQKQALAIPRCNVCNKQHA----------------GHYARWC-PNRDETQARRPALQTAQAPRQGGKQKARVFTL-KDEIAEDDVVVTGTILVLRIPTF
          Q+Q  A P C  C K HA                GH+ R C      TQA      TA A  QGG Q ARVF L + ++   + VVTGTIL+L IP +
Subjt:  QPQKQALAIPRCNVCNKQHA----------------GHYARWC-PNRDETQARRPALQTAQAPRQGGKQKARVFTL-KDEIAEDDVVVTGTILVLRIPTF

Query:  VLVDSGSSHSFISSAFVDQADLKLEPLGFVLSVSPP
         L DSGSSHSFI+S FV  ADL+LE  GF LSVS P
Subjt:  VLVDSGSSHSFISSAFVDQADLKLEPLGFVLSVSPP

A0A6J1DR22 uncharacterized protein LOC111023035 (e value: 4.8e-33; % identity: 44.49)
Query:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL-------------EVGSTSGVKRKPPPTQARPPQKAPRQ
        G+++V QY+RKFTELSRF    + T + KI +FI GLR EI+G + +   TT+AAA+  AL              +GS+SGVKRK     +       + 
Subjt:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL-------------EVGSTSGVKRKPPPTQARPPQKAPRQ

Query:  QPQKQALAIPRCNVCNKQHA----------------GHYARWCP-NRDETQARRPALQTAQAPRQGGKQKARVFTL-KDEIAEDDVVVTGTILVLRIPTF
          Q+Q  A P C  C K HA                GH+AR CP     TQA      TA A  QGG Q+ARVF L + ++   + VVTGTILVL +P +
Subjt:  QPQKQALAIPRCNVCNKQHA----------------GHYARWCP-NRDETQARRPALQTAQAPRQGGKQKARVFTL-KDEIAEDDVVVTGTILVLRIPTF

Query:  VLVDSGSSHSFISSAFVDQADLKLEPLGFVLSVSPP
         L DSGSSHSFI+S FV  ADL+LE LGF+LSVS P
Subjt:  VLVDSGSSHSFISSAFVDQADLKLEPLGFVLSVSPP

A0A6J1DUM2 uncharacterized protein LOC111023247 (e value: 8.6e-43; % identity: 49.15)
Query:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL--------------EVGSTSGVKRKPPPTQARPPQKAPR
        GT+SV QYERKFTELSRFA +L+ T   KIKRF+KGLR+ IRG +D+  PTT+A A+ GAL              EVGS+SGVKRK P T A    +AP+
Subjt:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL--------------EVGSTSGVKRKPPPTQARPPQKAPR

Query:  QQPQKQALAIPRCNVCNKQHA----------------GHYARWCPNRDETQARRPALQTAQAPRQGGKQKARVFTL-KDEIAEDDVVVTGTILVLRIPTF
        +Q Q Q +  P C  C K+H                 GH+AR CP       R           QG  Q+ARVF L + E A+ + VVTGT+LV  +P +
Subjt:  QQPQKQALAIPRCNVCNKQHA----------------GHYARWCPNRDETQARRPALQTAQAPRQGGKQKARVFTL-KDEIAEDDVVVTGTILVLRIPTF

Query:  VLVDSGSSHSFISSAFVDQADLKLEPLGFVLSVSPP
        VL DSGSSH+FISS FV QA L+LEPLGF+LSVS P
Subjt:  VLVDSGSSHSFISSAFVDQADLKLEPLGFVLSVSPP

A0A6J1DWP4 uncharacterized protein LOC111025215 (e value: 4.8e-33; % identity: 43.64)
Query:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL-------------EVGSTSGVKRKPPPTQARPPQKAPRQ
        G+++V +YERKFTELSRF    + T + KI +FI GLR EI+G + +  PTT+AAA+  AL              +GS+SGVKRK     +  P +  + 
Subjt:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL-------------EVGSTSGVKRKPPPTQARPPQKAPRQ

Query:  QPQKQALAIPRCNVCNKQHA----------------GHYARWCP-NRDETQARRPALQTAQAPRQGGKQKARVFTL-KDEIAEDDVVVTGTILVLRIPTF
          Q+Q  A P C  C K HA                GH+AR CP     TQA    +  A A  QGG  +ARVF L + ++   + VVT T+LVL +P +
Subjt:  QPQKQALAIPRCNVCNKQHA----------------GHYARWCP-NRDETQARRPALQTAQAPRQGGKQKARVFTL-KDEIAEDDVVVTGTILVLRIPTF

Query:  VLVDSGSSHSFISSAFVDQADLKLEPLGFVLSVSPP
         L DSGSSHSFI+S FV  ADL+LE LGF+LSVS P
Subjt:  VLVDSGSSHSFISSAFVDQADLKLEPLGFVLSVSPP

A0A6J1DYU5 uncharacterized protein LOC111025517 (e value: 2.4e-32; % identity: 45.13)
Query:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL--------------EVGSTSGVKRKPPPTQARPPQKAPR
        GT++V QYERKFTELS FA +L+ T   KIKRF+KGLR+ IRG +D+  P T+A A+ G L              EVGS+SGVKRK  P  A  P +AP+
Subjt:  GTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGAL--------------EVGSTSGVKRKPPPTQARPPQKAPR

Query:  QQPQKQALAIPRCNVCNKQHAGHYARWCPN-------RDETQARRPALQTAQAPRQGGKQKARVFTLKDEIAEDDVVVTGTILVLRIPTFVLVDSGSSHS
        +  Q+Q L  P C  C K+ AG    W  N       R+   AR  ++  A   R G +    V T             GT LV  +P +VL D GSSH+
Subjt:  QQPQKQALAIPRCNVCNKQHAGHYARWCPN-------RDETQARRPALQTAQAPRQGGKQKARVFTLKDEIAEDDVVVTGTILVLRIPTFVLVDSGSSHS

Query:  FISSAFVDQADLKLEPLGFVLSVSPP
        FIS+AFV QA L+LEPLGF+LSVS P
Subjt:  FISSAFVDQADLKLEPLGFVLSVSPP
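In the alignment rows above, the unlabeled middle line follows the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below tallies those three cases for one row. The function name `midline_stats` is hypothetical, and the % identity reported for each hit is computed over the full alignment, so a single row will not necessarily reproduce it.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one BLAST midline row."""
    identities = sum(ch.isalpha() for ch in midline)  # letters = identical residues
    conservative = midline.count("+")                 # '+' = conservative substitution
    return identities, identities + conservative, len(midline)

# Last row of the first GenBank hit, with the 8-character label column removed.
query = "VLVDSGSSHSFISSAFVDQADLKLEPLGFVLSVSPP"
midline = " L DSGSSHSFI+S FV  ADL+LE LGF+LSVS P"
assert len(query) == len(midline)  # midline must line up column-for-column

identities, positives, columns = midline_stats(midline)
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
```

Leading and trailing spaces in the midline are significant, so the row has to be copied without trimming.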

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAAGCCCGCTATTGGTGGAATTCAGTATCAGCCGCTGAGGATCACGCGAATGTATCGATTTCGTGGGATAGGGACCATGTCTGTGATCCAGTACGAGCGGAAATTCAC
TGAGCTCTCGCGTTTTGCTTCCGATCTAGTCAGCACGCCGGAGCGGAAGATCAAGAGGTTCATCAAGGGCCTCAGAGAAGAAATTCGGGGTACAATTGACATGAGTGCGC
CTACGACCTTTGCAGCAGCCCTCCATGGGGCATTGGAAGTCGGTTCGACTTCCGGGGTTAAGCGAAAGCCCCCTCCAACCCAAGCGAGGCCACCTCAGAAGGCTCCTCGC
CAACAACCTCAGAAGCAGGCTCTGGCAATTCCTCGTTGCAATGTGTGCAACAAGCAACATGCTGGACATTACGCCAGGTGGTGTCCCAACAGAGACGAGACCCAAGCAAG
AAGACCAGCCTTGCAGACCGCCCAGGCTCCTAGACAAGGCGGTAAGCAAAAAGCTCGTGTTTTCACCCTCAAGGATGAAATTGCAGAGGATGATGTTGTGGTAACAGGAA
CAATCCTTGTTTTAAGAATCCCTACTTTTGTGTTAGTTGACTCGGGGTCGAGTCACTCTTTTATCTCATCGGCATTTGTCGATCAAGCTGATCTGAAGTTAGAGCCGCTA
GGATTCGTTCTATCAGTGTCCCCCCCCCTTCTGGATCCGTAA
mRNA sequence
ATGAAGCCCGCTATTGGTGGAATTCAGTATCAGCCGCTGAGGATCACGCGAATGTATCGATTTCGTGGGATAGGGACCATGTCTGTGATCCAGTACGAGCGGAAATTCAC
TGAGCTCTCGCGTTTTGCTTCCGATCTAGTCAGCACGCCGGAGCGGAAGATCAAGAGGTTCATCAAGGGCCTCAGAGAAGAAATTCGGGGTACAATTGACATGAGTGCGC
CTACGACCTTTGCAGCAGCCCTCCATGGGGCATTGGAAGTCGGTTCGACTTCCGGGGTTAAGCGAAAGCCCCCTCCAACCCAAGCGAGGCCACCTCAGAAGGCTCCTCGC
CAACAACCTCAGAAGCAGGCTCTGGCAATTCCTCGTTGCAATGTGTGCAACAAGCAACATGCTGGACATTACGCCAGGTGGTGTCCCAACAGAGACGAGACCCAAGCAAG
AAGACCAGCCTTGCAGACCGCCCAGGCTCCTAGACAAGGCGGTAAGCAAAAAGCTCGTGTTTTCACCCTCAAGGATGAAATTGCAGAGGATGATGTTGTGGTAACAGGAA
CAATCCTTGTTTTAAGAATCCCTACTTTTGTGTTAGTTGACTCGGGGTCGAGTCACTCTTTTATCTCATCGGCATTTGTCGATCAAGCTGATCTGAAGTTAGAGCCGCTA
GGATTCGTTCTATCAGTGTCCCCCCCCCTTCTGGATCCGTAA
Protein sequence
MKPAIGGIQYQPLRITRMYRFRGIGTMSVIQYERKFTELSRFASDLVSTPERKIKRFIKGLREEIRGTIDMSAPTTFAAALHGALEVGSTSGVKRKPPPTQARPPQKAPR
QQPQKQALAIPRCNVCNKQHAGHYARWCPNRDETQARRPALQTAQAPRQGGKQKARVFTLKDEIAEDDVVVTGTILVLRIPTFVLVDSGSSHSFISSAFVDQADLKLEPL
GFVLSVSPPLLDP
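The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, the snippet below translates a nucleotide string with the standard genetic code (NCBI translation table 1); that table is an assumption for illustration, since the page does not say which one was used. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the wrapped listing
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 27 nucleotides of the CDS listed above.
print(translate("ATGAAGCCCGCTATTGGTGGAATTCAG"))  # MKPAIGGIQ
```

Passing the full CDS, with its line breaks, to `translate` should give the listed protein if the standard table applies.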