; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022080 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022080
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr7:17672813..17683925
RNA-Seq ExpressionLag0022080
SyntenyLag0022080
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155341.1 uncharacterized protein LOC111022474 [Momordica charantia]8.6e-1936.02Show/hide
Query:  RTKAEAGGHVTPEVRRGESSCPPPAPP----------VLAAEALQAMLGNAI-LNNVQHVGANEAPAHGEEVQFIKSFVKAKPPSFDGRSEGSEAVVEWT
        R +  A  +V P V RG +   P   P           L AEALQ +L NA      Q      A    EEVQFI+ F +  PP F+G SE   A  EW 
Subjt:  RTKAEAGGHVTPEVRRGESSCPPPAPP----------VLAAEALQAMLGNAI-LNNVQHVGANEAPAHGEEVQFIKSFVKAKPPSFDGRSEGSEAVVEWT

Query:  GALEAIFQFLEANAQQRVQGATFMLKGYARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVEAEFVSLVQGILSVEQY
          LEA++ +L  + + +V+GA FML+G A  WW++V   ++    P++W+ FK L+ +++       E  AEF+ L QG L+V QY
Subjt:  GALEAIFQFLEANAQQRVQGATFMLKGYARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVEAEFVSLVQGILSVEQY

XP_022156172.1 uncharacterized protein LOC111023126 [Momordica charantia]1.1e-1834.54Show/hide
Query:  AEAGGHVTPEVRRGESSCPPPAPPV-LAAEALQAMLGNAI-LNNVQHVGANEAPAHGEEVQFIKSFVKAKPPSFDGRSEGSEAVVEWTGALEAIFQFLEA
        A  GG V P  +      P   P V L AEALQ +L NA      Q      A    EEVQFI+ F +  P  F+G SE   A  EW   LEA+  +L  
Subjt:  AEAGGHVTPEVRRGESSCPPPAPPV-LAAEALQAMLGNAI-LNNVQHVGANEAPAHGEEVQFIKSFVKAKPPSFDGRSEGSEAVVEWTGALEAIFQFLEA

Query:  NAQQRVQGATFMLKGYARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVEAEFVSLVQGILSVEQYVGGSKSCPVVSQSWLPPRKL
        + + +V+GA FML+G    WWK+V   ++    P++W+ FK L+ +++       E  AEF+ L QG L+V QY             ++P ++L
Subjt:  NAQQRVQGATFMLKGYARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVEAEFVSLVQGILSVEQYVGGSKSCPVVSQSWLPPRKL

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]1.9e-1844.83Show/hide
Query:  EVQFIKSFVKAKPPSFDGRSEGSEAVVEWTGALEAIFQFLEANAQQRVQGATFMLKGYARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVE
        E +FIK F +  PP+FDG SE + AV EW   LEA++ +L    Q +V+GA FML+G A  WW +V   ++    PI W+ FK L+ D++         E
Subjt:  EVQFIKSFVKAKPPSFDGRSEGSEAVVEWTGALEAIFQFLEANAQQRVQGATFMLKGYARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVE

Query:  AEFVSLVQGILSVEQY
        AEF+ LVQG LSV QY
Subjt:  AEFVSLVQGILSVEQY

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia]3.9e-1938.36Show/hide
Query:  PPAPPVLAAEALQAMLGNAILNNVQHVGA--NEAPAH----GEEVQFIKSFVKAKPPSFDGRSEGSEAVVEWTGALEAIFQFLEANAQQRVQGATFMLKG
        PP PP  A E        A++NN   VG    + P H      E QFIK F +  PP+F G SE +    EW   LEA++ +L    Q +V+GA FML+ 
Subjt:  PPAPPVLAAEALQAMLGNAILNNVQHVGA--NEAPAH----GEEVQFIKSFVKAKPPSFDGRSEGSEAVVEWTGALEAIFQFLEANAQQRVQGATFMLKG

Query:  YARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVEAEFVSLVQGILSVEQY
         A  WW +V  T++    P+ W+ FK L+ DH+         E EF+ LVQG L+V QY
Subjt:  YARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVEAEFVSLVQGILSVEQY

XP_022158749.1 uncharacterized protein LOC111025213 [Momordica charantia]6.6e-1935Show/hide
Query:  GCGSGRTKAEAGGHVTPEVRRGESSCPPPAPPVLAAEALQAMLGNAILNNVQHVGANEAPAHGEEVQFIKSFVKAKPPSFDGRSEGSEAVVEWTGALEAI
        G G+G+   E     T      +    PP PP  A E     +  A+  +              E +FIK F +  PP+FDG SE + A  EW   LEA+
Subjt:  GCGSGRTKAEAGGHVTPEVRRGESSCPPPAPPVLAAEALQAMLGNAILNNVQHVGANEAPAHGEEVQFIKSFVKAKPPSFDGRSEGSEAVVEWTGALEAI

Query:  FQFLEANAQQRVQGATFMLKGYARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVEAEFVSLVQGILSVEQY
        + +L    Q +V+GA FML+G A  WW +V   ++    PI W+ FK L+ D++         EAEF+ LVQG LSV QY
Subjt:  FQFLEANAQQRVQGATFMLKGYARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVEAEFVSLVQGILSVEQY

TrEMBL top hitse value%identityAlignment
A0A6J1DRB3 uncharacterized protein LOC1110231265.4e-1934.54Show/hide
Query:  AEAGGHVTPEVRRGESSCPPPAPPV-LAAEALQAMLGNAI-LNNVQHVGANEAPAHGEEVQFIKSFVKAKPPSFDGRSEGSEAVVEWTGALEAIFQFLEA
        A  GG V P  +      P   P V L AEALQ +L NA      Q      A    EEVQFI+ F +  P  F+G SE   A  EW   LEA+  +L  
Subjt:  AEAGGHVTPEVRRGESSCPPPAPPV-LAAEALQAMLGNAI-LNNVQHVGANEAPAHGEEVQFIKSFVKAKPPSFDGRSEGSEAVVEWTGALEAIFQFLEA

Query:  NAQQRVQGATFMLKGYARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVEAEFVSLVQGILSVEQYVGGSKSCPVVSQSWLPPRKL
        + + +V+GA FML+G    WWK+V   ++    P++W+ FK L+ +++       E  AEF+ L QG L+V QY             ++P ++L
Subjt:  NAQQRVQGATFMLKGYARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVEAEFVSLVQGILSVEQYVGGSKSCPVVSQSWLPPRKL

A0A6J1DRF5 uncharacterized protein LOC1110224744.2e-1936.02Show/hide
Query:  RTKAEAGGHVTPEVRRGESSCPPPAPP----------VLAAEALQAMLGNAI-LNNVQHVGANEAPAHGEEVQFIKSFVKAKPPSFDGRSEGSEAVVEWT
        R +  A  +V P V RG +   P   P           L AEALQ +L NA      Q      A    EEVQFI+ F +  PP F+G SE   A  EW 
Subjt:  RTKAEAGGHVTPEVRRGESSCPPPAPP----------VLAAEALQAMLGNAI-LNNVQHVGANEAPAHGEEVQFIKSFVKAKPPSFDGRSEGSEAVVEWT

Query:  GALEAIFQFLEANAQQRVQGATFMLKGYARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVEAEFVSLVQGILSVEQY
          LEA++ +L  + + +V+GA FML+G A  WW++V   ++    P++W+ FK L+ +++       E  AEF+ L QG L+V QY
Subjt:  GALEAIFQFLEANAQQRVQGATFMLKGYARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVEAEFVSLVQGILSVEQY

A0A6J1DUM2 uncharacterized protein LOC1110232479.3e-1944.83Show/hide
Query:  EVQFIKSFVKAKPPSFDGRSEGSEAVVEWTGALEAIFQFLEANAQQRVQGATFMLKGYARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVE
        E +FIK F +  PP+FDG SE + AV EW   LEA++ +L    Q +V+GA FML+G A  WW +V   ++    PI W+ FK L+ D++         E
Subjt:  EVQFIKSFVKAKPPSFDGRSEGSEAVVEWTGALEAIFQFLEANAQQRVQGATFMLKGYARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVE

Query:  AEFVSLVQGILSVEQY
        AEF+ LVQG LSV QY
Subjt:  AEFVSLVQGILSVEQY

A0A6J1DVA0 uncharacterized protein LOC1110234241.9e-1938.36Show/hide
Query:  PPAPPVLAAEALQAMLGNAILNNVQHVGA--NEAPAH----GEEVQFIKSFVKAKPPSFDGRSEGSEAVVEWTGALEAIFQFLEANAQQRVQGATFMLKG
        PP PP  A E        A++NN   VG    + P H      E QFIK F +  PP+F G SE +    EW   LEA++ +L    Q +V+GA FML+ 
Subjt:  PPAPPVLAAEALQAMLGNAILNNVQHVGA--NEAPAH----GEEVQFIKSFVKAKPPSFDGRSEGSEAVVEWTGALEAIFQFLEANAQQRVQGATFMLKG

Query:  YARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVEAEFVSLVQGILSVEQY
         A  WW +V  T++    P+ W+ FK L+ DH+         E EF+ LVQG L+V QY
Subjt:  YARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVEAEFVSLVQGILSVEQY

A0A6J1E0B4 uncharacterized protein LOC1110252133.2e-1935Show/hide
Query:  GCGSGRTKAEAGGHVTPEVRRGESSCPPPAPPVLAAEALQAMLGNAILNNVQHVGANEAPAHGEEVQFIKSFVKAKPPSFDGRSEGSEAVVEWTGALEAI
        G G+G+   E     T      +    PP PP  A E     +  A+  +              E +FIK F +  PP+FDG SE + A  EW   LEA+
Subjt:  GCGSGRTKAEAGGHVTPEVRRGESSCPPPAPPVLAAEALQAMLGNAILNNVQHVGANEAPAHGEEVQFIKSFVKAKPPSFDGRSEGSEAVVEWTGALEAI

Query:  FQFLEANAQQRVQGATFMLKGYARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVEAEFVSLVQGILSVEQY
        + +L    Q +V+GA FML+G A  WW +V   ++    PI W+ FK L+ D++         EAEF+ LVQG LSV QY
Subjt:  FQFLEANAQQRVQGATFMLKGYARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVEAEFVSLVQGILSVEQY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTCCCCCTACTGGCGCCGACGACGAAATAAGAAGACAAAGACGTCGAGCAGCGTGGGACTGGGTGGCCAAACGAAGAAGAAGAGTCAAATGATTGGGTGGCTTAG
ATCCGTTGGGGCTGCTGCGGGGACTGGCCGTTTTCGCGCCGCCGCCGCTCAGCTCGCGTCGCCGCCGGTCATCGCGCCGCCGCCCTCTGCAGTTGCCGCCGCCTGTCGTG
TAGTCGCGCCGTCGCCGGTTCGTGCAGCGCCGCCGTCTCCGTACTCGTCGCGCATATCTCTCTCACTCTGCGCGTCTCTCTCTCTCCTCCGTCGCAGCTCGAGTGTCGCC
GCCACTACCCTTGCCGCCGTCGCCAGCCCCGGTCTCTCTGCTTCTCCGCGTGTTCTTCGTCGTGGATCTCGCGCGGACAGCAGCCCAGAGTCTTTGTTTCTCGCGTTTTC
GTCGCTGTCCAGCAGCGTCATTGGGCGTATCCGGCATCAATTAGCCGCCGTAAAAGTGTTCGATAAGGTTCGAAACACTTCAGCTTGGATACCCATTGCCCAAGGCCTAG
TAAGTGGGTTTTGGCTTTTCAAGCTTTTCTTAAGCCTGTTGGATATCTGTGGTTGCATAAGCATGATGCTTGCTTGTGGTTGTGAAAGCATGTTGGATGCGTGTGTAAGG
CATGTTGATTGCGTTGTTGATGTTTGTGGTTTGGTGAAAAAAATGGGTTCAGGCATTTTACGCCGTTATGCTGCCGAAATTTTCGGTACATCCGGTTTAAGTGGTTCAGT
TCCATATTGGTATCAGAGCGAAACCTCTCCAGTAGGATGTGGTTCGGGACGAACCAAGGCGGAAGCTGGTGGGCATGTGACGCCCGAGGTTAGGAGGGGCGAATCCTCAT
GTCCTCCCCCAGCGCCTCCTGTGCTGGCAGCAGAGGCATTGCAGGCGATGCTTGGCAATGCAATCCTGAACAACGTACAGCACGTCGGTGCTAACGAAGCCCCTGCTCAT
GGCGAAGAGGTGCAGTTTATCAAGAGTTTCGTGAAGGCGAAGCCTCCTTCATTTGATGGACGCTCGGAAGGTTCTGAAGCAGTTGTAGAATGGACTGGCGCGTTGGAAGC
GATATTTCAATTTCTTGAAGCTAATGCCCAGCAACGGGTCCAAGGAGCCACCTTTATGCTTAAGGGTTACGCTCGCACTTGGTGGAAGGCAGTGGGTCAAACCAAGAATC
GCCCGGAGAACCCCATTTCCTGGTCAGGGTTCAAGGGTCTGGTGCAAGACCATTTTGGCTGCCGTTTTGCTGGAGTTGAGGTAGAAGCGGAATTTGTCTCTCTGGTTCAA
GGGATTTTGTCCGTAGAGCAATACGTCGGAGGTTCGAAGAGTTGTCCTGTCGTGTCCCAGAGTTGGTTGCCACCGAGGAAATTAGGATCAACCGATTCGTTAACGGGCTC
CGCGCAGAAATTCGAGATAGAGCAGTCCCAAGAGGTTGGCACGTCATCTGGTGCCAAGAAGAAGCACGAAGAGGAAGCGTTTGTGCCTAGTCAGAAGGTTAGAAGATCTC
CATCAGGATCTCGCAAGTGCGCTGATGAGTTCTGGCCCTGTGTCACCGATGAGGAGCTCAAGGCAGAGTACCCAGAACTTTACGATGACGATGACTCTGATGATGAGGAA
AGCTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCTCCCCCTACTGGCGCCGACGACGAAATAAGAAGACAAAGACGTCGAGCAGCGTGGGACTGGGTGGCCAAACGAAGAAGAAGAGTCAAATGATTGGGTGGCTTAG
ATCCGTTGGGGCTGCTGCGGGGACTGGCCGTTTTCGCGCCGCCGCCGCTCAGCTCGCGTCGCCGCCGGTCATCGCGCCGCCGCCCTCTGCAGTTGCCGCCGCCTGTCGTG
TAGTCGCGCCGTCGCCGGTTCGTGCAGCGCCGCCGTCTCCGTACTCGTCGCGCATATCTCTCTCACTCTGCGCGTCTCTCTCTCTCCTCCGTCGCAGCTCGAGTGTCGCC
GCCACTACCCTTGCCGCCGTCGCCAGCCCCGGTCTCTCTGCTTCTCCGCGTGTTCTTCGTCGTGGATCTCGCGCGGACAGCAGCCCAGAGTCTTTGTTTCTCGCGTTTTC
GTCGCTGTCCAGCAGCGTCATTGGGCGTATCCGGCATCAATTAGCCGCCGTAAAAGTGTTCGATAAGGTTCGAAACACTTCAGCTTGGATACCCATTGCCCAAGGCCTAG
TAAGTGGGTTTTGGCTTTTCAAGCTTTTCTTAAGCCTGTTGGATATCTGTGGTTGCATAAGCATGATGCTTGCTTGTGGTTGTGAAAGCATGTTGGATGCGTGTGTAAGG
CATGTTGATTGCGTTGTTGATGTTTGTGGTTTGGTGAAAAAAATGGGTTCAGGCATTTTACGCCGTTATGCTGCCGAAATTTTCGGTACATCCGGTTTAAGTGGTTCAGT
TCCATATTGGTATCAGAGCGAAACCTCTCCAGTAGGATGTGGTTCGGGACGAACCAAGGCGGAAGCTGGTGGGCATGTGACGCCCGAGGTTAGGAGGGGCGAATCCTCAT
GTCCTCCCCCAGCGCCTCCTGTGCTGGCAGCAGAGGCATTGCAGGCGATGCTTGGCAATGCAATCCTGAACAACGTACAGCACGTCGGTGCTAACGAAGCCCCTGCTCAT
GGCGAAGAGGTGCAGTTTATCAAGAGTTTCGTGAAGGCGAAGCCTCCTTCATTTGATGGACGCTCGGAAGGTTCTGAAGCAGTTGTAGAATGGACTGGCGCGTTGGAAGC
GATATTTCAATTTCTTGAAGCTAATGCCCAGCAACGGGTCCAAGGAGCCACCTTTATGCTTAAGGGTTACGCTCGCACTTGGTGGAAGGCAGTGGGTCAAACCAAGAATC
GCCCGGAGAACCCCATTTCCTGGTCAGGGTTCAAGGGTCTGGTGCAAGACCATTTTGGCTGCCGTTTTGCTGGAGTTGAGGTAGAAGCGGAATTTGTCTCTCTGGTTCAA
GGGATTTTGTCCGTAGAGCAATACGTCGGAGGTTCGAAGAGTTGTCCTGTCGTGTCCCAGAGTTGGTTGCCACCGAGGAAATTAGGATCAACCGATTCGTTAACGGGCTC
CGCGCAGAAATTCGAGATAGAGCAGTCCCAAGAGGTTGGCACGTCATCTGGTGCCAAGAAGAAGCACGAAGAGGAAGCGTTTGTGCCTAGTCAGAAGGTTAGAAGATCTC
CATCAGGATCTCGCAAGTGCGCTGATGAGTTCTGGCCCTGTGTCACCGATGAGGAGCTCAAGGCAGAGTACCCAGAACTTTACGATGACGATGACTCTGATGATGAGGAA
AGCTCCTAA
Protein sequenceShow/hide protein sequence
MSSPYWRRRRNKKTKTSSSVGLGGQTKKKSQMIGWLRSVGAAAGTGRFRAAAAQLASPPVIAPPPSAVAAACRVVAPSPVRAAPPSPYSSRISLSLCASLSLLRRSSSVA
ATTLAAVASPGLSASPRVLRRGSRADSSPESLFLAFSSLSSSVIGRIRHQLAAVKVFDKVRNTSAWIPIAQGLVSGFWLFKLFLSLLDICGCISMMLACGCESMLDACVR
HVDCVVDVCGLVKKMGSGILRRYAAEIFGTSGLSGSVPYWYQSETSPVGCGSGRTKAEAGGHVTPEVRRGESSCPPPAPPVLAAEALQAMLGNAILNNVQHVGANEAPAH
GEEVQFIKSFVKAKPPSFDGRSEGSEAVVEWTGALEAIFQFLEANAQQRVQGATFMLKGYARTWWKAVGQTKNRPENPISWSGFKGLVQDHFGCRFAGVEVEAEFVSLVQ
GILSVEQYVGGSKSCPVVSQSWLPPRKLGSTDSLTGSAQKFEIEQSQEVGTSSGAKKKHEEEAFVPSQKVRRSPSGSRKCADEFWPCVTDEELKAEYPELYDDDDSDDEE
SS