CuGenDBv2

Lag0001401 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0001401
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Gag-protease polyprotein
Genome location: chr4:30844867..30847160
RNA-Seq Expression: Lag0001401
Synteny: Lag0001401
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits (e value, % identity, alignment)
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia] (e value: 1.4e-21; % identity: 34.39)
Query:  QHVGANEAPARGKDVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRRENSLTWSGFKGLVQ
        Q +   +A A   + QF R F +  PP+F+G  E +  VEEW  ELEAL+ + G   Q +V+ A FMLRG A  WW+VV   E+     +TW+  K L+ 
Subjt:  QHVGANEAPARGKDVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRRENSLTWSGFKGLVQ

Query:  DQFGRRFLGVDVDLEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG-----------------------------LSQEV
        D +  +   +  + E EF+ L QRT+ V QY  +  E S     L+ TEA +I RF+ GL   I+G                               Q+V
Subjt:  DQFGRRFLGVDVDLEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG-----------------------------LSQEV

Query:  GTSSGAKRKHEELMPA-PSQT
        G SSG KRK   +  + PS+T
Subjt:  GTSSGAKRKHEELMPA-PSQT

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia] (e value: 6.3e-22; % identity: 41.57)
Query:  QHVGANEAPARGKDVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRRENSLTWSGFKGLVQ
        QH+   ++ AR     F + F +  PP+FDG  E + AVEEW  ELEAL+ + G + Q +V+ A FMLRG A  WW+ V   E+     + W+ FK L+ 
Subjt:  QHVGANEAPARGKDVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRRENSLTWSGFKGLVQ

Query:  DQFGRRFLGVDVDLEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG
        D +      V    EAEF+ LVQ T++V QY  +  ELS     L+ TEA +I RF+ GLR  IRG
Subjt:  DQFGRRFLGVDVDLEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG

XP_022156330.1 uncharacterized protein LOC111023250 [Momordica charantia] (e value: 7.0e-21; % identity: 41.18)
Query:  DVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRRENSLTWSGFKGLVQDQFGRRFLGVDVD
        + QF + F +  PP+FDG  + +   EEW  ELEAL+ + G + Q +V+ A FMLRG A  WW+ V   E+     +TW+ FK L+ D +      V   
Subjt:  DVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRRENSLTWSGFKGLVQDQFGRRFLGVDVD

Query:  LEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG
         EAEF+   Q T+TV QY  +  ELS     L+ TEA +I RF+ GLR  IRG
Subjt:  LEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia] (e value: 4.1e-21; % identity: 39.33)
Query:  AIQNNLQHV-GANEAPAR-----GKDVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRREN
        A+ NN   V GA   P R       + QF + F +  PP+F G  E +   EEW  ELEAL+ + G + Q +V+ A FMLR  A  WW+ V   E+    
Subjt:  AIQNNLQHV-GANEAPAR-----GKDVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRREN

Query:  SLTWSGFKGLVQDQFGRRFLGVDVDLEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG
         + W+ FK L+ D + R    V+   E EF+ LVQ T+TV QY  +  ELS     L+ TEA +I RF+ GL   IRG
Subjt:  SLTWSGFKGLVQDQFGRRFLGVDVDLEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG

XP_022158637.1 uncharacterized protein LOC111025088 [Momordica charantia] (e value: 2.0e-20; % identity: 38.76)
Query:  AIQNNLQHVGANEA-PAR-----GKDVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRREN
        A+ NN   VG  +A P R       + QF + F +  PP+FDG  E + A E W  ELEAL+ + G + Q +V+   FMLRG A  WW+ +   E+    
Subjt:  AIQNNLQHVGANEA-PAR-----GKDVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRREN

Query:  SLTWSGFKGLVQDQFGRRFLGVDVDLEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG
         + W+ FK L+ D +      V    EAEF+ L Q T+TV QY  +  ELS      + TEA +I RF+ GLR  IRG
Subjt:  SLTWSGFKGLVQDQFGRRFLGVDVDLEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DCW8 uncharacterized protein LOC111019603 (e value: 6.8e-22; % identity: 34.39)
Query:  QHVGANEAPARGKDVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRRENSLTWSGFKGLVQ
        Q +   +A A   + QF R F +  PP+F+G  E +  VEEW  ELEAL+ + G   Q +V+ A FMLRG A  WW+VV   E+     +TW+  K L+ 
Subjt:  QHVGANEAPARGKDVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRRENSLTWSGFKGLVQ

Query:  DQFGRRFLGVDVDLEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG-----------------------------LSQEV
        D +  +   +  + E EF+ L QRT+ V QY  +  E S     L+ TEA +I RF+ GL   I+G                               Q+V
Subjt:  DQFGRRFLGVDVDLEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG-----------------------------LSQEV

Query:  GTSSGAKRKHEELMPA-PSQT
        G SSG KRK   +  + PS+T
Subjt:  GTSSGAKRKHEELMPA-PSQT

A0A6J1DL73 uncharacterized protein LOC111022144 (e value: 2.2e-20; % identity: 34.4)
Query:  HVGANEAPARGKDVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRRENSLTWSGFKGLVQD
        HV  +EA        F + F +  PP+FDG  E + A EEW  ELEA + + G + Q +V+ A FMLRG A  WW+ +   E+    ++ W+ FK L+ D
Subjt:  HVGANEAPARGKDVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRRENSLTWSGFKGLVQD

Query:  QFGRRFLGVDVDL-EAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG-----------------------------LSQEV
         +   +L    D+ EAEF+ LVQ T++V QY  +  ELS     L+   A +I RF+ GL   IRG                                EV
Subjt:  QFGRRFLGVDVDL-EAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG-----------------------------LSQEV

Query:  GTSSGAKRKHEELMPAPS
        G+SSG KRK       PS
Subjt:  GTSSGAKRKHEELMPAPS

A0A6J1DQ01 uncharacterized protein LOC111023250 (e value: 3.4e-21; % identity: 41.18)
Query:  DVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRRENSLTWSGFKGLVQDQFGRRFLGVDVD
        + QF + F +  PP+FDG  + +   EEW  ELEAL+ + G + Q +V+ A FMLRG A  WW+ V   E+     +TW+ FK L+ D +      V   
Subjt:  DVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRRENSLTWSGFKGLVQDQFGRRFLGVDVD

Query:  LEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG
         EAEF+   Q T+TV QY  +  ELS     L+ TEA +I RF+ GLR  IRG
Subjt:  LEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG

A0A6J1DUM2 uncharacterized protein LOC111023247 (e value: 3.1e-22; % identity: 41.57)
Query:  QHVGANEAPARGKDVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRRENSLTWSGFKGLVQ
        QH+   ++ AR     F + F +  PP+FDG  E + AVEEW  ELEAL+ + G + Q +V+ A FMLRG A  WW+ V   E+     + W+ FK L+ 
Subjt:  QHVGANEAPARGKDVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRRENSLTWSGFKGLVQ

Query:  DQFGRRFLGVDVDLEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG
        D +      V    EAEF+ LVQ T++V QY  +  ELS     L+ TEA +I RF+ GLR  IRG
Subjt:  DQFGRRFLGVDVDLEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG

A0A6J1DXQ7 uncharacterized protein LOC111025088 (e value: 9.8e-21; % identity: 38.76)
Query:  AIQNNLQHVGANEA-PAR-----GKDVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRREN
        A+ NN   VG  +A P R       + QF + F +  PP+FDG  E + A E W  ELEAL+ + G + Q +V+   FMLRG A  WW+ +   E+    
Subjt:  AIQNNLQHVGANEA-PAR-----GKDVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRREN

Query:  SLTWSGFKGLVQDQFGRRFLGVDVDLEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG
         + W+ FK L+ D +      V    EAEF+ L Q T+TV QY  +  ELS      + TEA +I RF+ GLR  IRG
Subjt:  SLTWSGFKGLVQDQFGRRFLGVDVDLEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRG

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGATTGGCAATGCAATCCAGAATAACTTGCAGCACGTCGGTGCAAATGAAGCCCCTGCTCGTGGCAAGGATGTGCAATTTTTCCGAAGCTTCATAAAGGTTAAGCCTCC
TTCGTTCGATGGTCACCCTGAAAGTTCTCAAGCGGTAGAAGAATGGACTGTAGAGTTGGAAGCGCTATTTCAACATCCCGGAGCTGACGCTCAACAACGAGTCCAAAGAG
CTGCCTTTATGCTCAGAGGCTATGCTCGCACTTGGTGGAATGTAGTGGGCCAGTTTGAGAATCGTCGAGAGAATTCCCTTACATGGTCAGGATTTAAGGGTCTTGTGCAA
GACCAATTTGGTCGACGTTTTCTCGGCGTAGATGTGGATCTAGAAGCGGAGTTCGTCTTGCTCGTTCAAAGGACCATGACCGTGACACAATATGTTAGCAGGTCCGAGGA
GCTGTCTTGCCGTGTCCCTGCATTGGTTGCCACTGAGGCAAGCCGGATCAACCGGTTCATTAATGGCCTGCGTGCAGAAATCAGAGGGTTGTCCCAGGAAGTGGGCACGT
CGTCTGGTGCCAAGAGGAAGCACGAAGAGTTAATGCCTGCACCTAGTCAGACGGTCAGAAGATCTCTGCCAGGGACCAACTCATTATTAGCCTTGCTGCTGGGTGGTTTT
CGTGGATCAAGTCTCGAGGTTTTTGGACTTGCTCGTGACGATGCTCTTAACGGTGTTGGTTTCTCAAACCAAGAGTCGGAGCATTTCCCTCACGCATTAGTGGTTGAAAC
TTACCAGCTAAGTGGGTTTAAGTTGGTGAAGTTGCTGATGAAGCTTCTTGCAGGTTCGTCGCTTGATAAAAAGGGGAAGCTTAACCTGGGTTCATTGGACCATGTGAGAA
CTTGGAGCGCATCTAGCCTGTTGCTTACAGGTATCAAGGATGATCTGGACCTTGGTGGCGAGAAGGAGAACTATGTGGAGGATCTTTAG
mRNA sequence
ATGATTGGCAATGCAATCCAGAATAACTTGCAGCACGTCGGTGCAAATGAAGCCCCTGCTCGTGGCAAGGATGTGCAATTTTTCCGAAGCTTCATAAAGGTTAAGCCTCC
TTCGTTCGATGGTCACCCTGAAAGTTCTCAAGCGGTAGAAGAATGGACTGTAGAGTTGGAAGCGCTATTTCAACATCCCGGAGCTGACGCTCAACAACGAGTCCAAAGAG
CTGCCTTTATGCTCAGAGGCTATGCTCGCACTTGGTGGAATGTAGTGGGCCAGTTTGAGAATCGTCGAGAGAATTCCCTTACATGGTCAGGATTTAAGGGTCTTGTGCAA
GACCAATTTGGTCGACGTTTTCTCGGCGTAGATGTGGATCTAGAAGCGGAGTTCGTCTTGCTCGTTCAAAGGACCATGACCGTGACACAATATGTTAGCAGGTCCGAGGA
GCTGTCTTGCCGTGTCCCTGCATTGGTTGCCACTGAGGCAAGCCGGATCAACCGGTTCATTAATGGCCTGCGTGCAGAAATCAGAGGGTTGTCCCAGGAAGTGGGCACGT
CGTCTGGTGCCAAGAGGAAGCACGAAGAGTTAATGCCTGCACCTAGTCAGACGGTCAGAAGATCTCTGCCAGGGACCAACTCATTATTAGCCTTGCTGCTGGGTGGTTTT
CGTGGATCAAGTCTCGAGGTTTTTGGACTTGCTCGTGACGATGCTCTTAACGGTGTTGGTTTCTCAAACCAAGAGTCGGAGCATTTCCCTCACGCATTAGTGGTTGAAAC
TTACCAGCTAAGTGGGTTTAAGTTGGTGAAGTTGCTGATGAAGCTTCTTGCAGGTTCGTCGCTTGATAAAAAGGGGAAGCTTAACCTGGGTTCATTGGACCATGTGAGAA
CTTGGAGCGCATCTAGCCTGTTGCTTACAGGTATCAAGGATGATCTGGACCTTGGTGGCGAGAAGGAGAACTATGTGGAGGATCTTTAG
Protein sequence
MIGNAIQNNLQHVGANEAPARGKDVQFFRSFIKVKPPSFDGHPESSQAVEEWTVELEALFQHPGADAQQRVQRAAFMLRGYARTWWNVVGQFENRRENSLTWSGFKGLVQ
DQFGRRFLGVDVDLEAEFVLLVQRTMTVTQYVSRSEELSCRVPALVATEASRINRFINGLRAEIRGLSQEVGTSSGAKRKHEELMPAPSQTVRRSLPGTNSLLALLLGGF
RGSSLEVFGLARDDALNGVGFSNQESEHFPHALVVETYQLSGFKLVKLLMKLLAGSSLDKKGKLNLGSLDHVRTWSASSLLLTGIKDDLDLGGEKENYVEDL