; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021171 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021171
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUBN2_3 domain-containing protein
Genome locationchr7:5232664..5236149
RNA-Seq ExpressionLag0021171
SyntenyLag0021171
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032807.1 UBN2_3 domain-containing protein [Cucumis melo var. makuwa]3.1e-4367.39Show/hide
Query:  NSQSPYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWSRNNLIVISWILNSVSKAISSSIIFTDSAEAIWIDLK
        +S SPY L+H+DTSNL+LV+EL+T++NYV WSRSM++A++I NKLGFID  I KP GELLP W  NN +VI+WILNS SK I SSI+ T SA A WIDL+
Subjt:  NSQSPYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWSRNNLIVISWILNSVSKAISSSIIFTDSAEAIWIDLK

Query:  ERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLKGL
        + F+KRNGPRIF LK GL+TLKQ QESVTMYFA +K L
Subjt:  ERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLKGL

XP_008457013.1 PREDICTED: uncharacterized protein LOC103496792 [Cucumis melo]3.7e-4468.12Show/hide
Query:  NSQSPYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWSRNNLIVISWILNSVSKAISSSIIFTDSAEAIWIDLK
        +S SPY L+H+DTSNL+LV+EL+T++NYV WSRSM++A++I NKLGFID  I KP GELLP W  NN +VI+WILNS SK I SSI+FT SA A WIDL+
Subjt:  NSQSPYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWSRNNLIVISWILNSVSKAISSSIIFTDSAEAIWIDLK

Query:  ERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLKGL
        + F+KRNGPRIF LK GL+TLKQ QESVTMYFA +K L
Subjt:  ERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLKGL

XP_022154608.1 uncharacterized protein LOC111021831 [Momordica charantia]1.2e-4561.54Show/hide
Query:  PSFFFSMMDSTVPVQGASSSLSAENSQS-------PYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWSRNNLI
        PS     +D   PV   SSS ++ +S S       PYYLHHTD + LVLV++ LT +NY +WSRSML+AL+I NKLGFIDGSI +P GELLPAW  NN +
Subjt:  PSFFFSMMDSTVPVQGASSSLSAENSQS-------PYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWSRNNLI

Query:  VISWILNSVSKAISSSIIFTDSAEAIWIDLKERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLKGL
        VI+WILNSVSK ISSSI+F++SA  IWIDLKERF+K NGPRIF LKR LA L Q Q+SV++YF KLK +
Subjt:  VISWILNSVSKAISSSIIFTDSAEAIWIDLKERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLKGL

XP_022156861.1 uncharacterized protein LOC111023702 [Momordica charantia]8.2e-4459.2Show/hide
Query:  LAAHLLSDSFLPSFFFSMMDSTVPVQGASSSLSAENS-QSPYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWS
        L + LL D  LP    S   S+ PV  +SS+LS + S  +PYYLHHTD + LV V++LLT DNY +WSRSM++ L++ NKL FIDG IP+P+G+LLPAW 
Subjt:  LAAHLLSDSFLPSFFFSMMDSTVPVQGASSSLSAENS-QSPYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWS

Query:  RNNLIVISWILNSVSKAISSSIIFTDSAEAIWIDLKERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLKGL
         NN IVI+WILNSVSK IS+SI+F++SA  IWIDL ERF+K N P I+ LKR LATL Q Q+SV+ YF KLK L
Subjt:  RNNLIVISWILNSVSKAISSSIIFTDSAEAIWIDLKERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLKGL

XP_038895765.1 uncharacterized protein LOC120083929 [Benincasa hispida]4.2e-4869.29Show/hide
Query:  LSAENSQSPYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWSRNNLIVISWILNSVSKAISSSIIFTDSAEAIW
        LS  +  +PY LHH+DTSNLVLVSELLT+DNYV+WSRSM++ L I NKLGFIDGS+P+PTG+LL  W  NN +V+SWIL SVSK+ISSSI+FT+SA+AIW
Subjt:  LSAENSQSPYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWSRNNLIVISWILNSVSKAISSSIIFTDSAEAIW

Query:  IDLKERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLK
        +DL++ F++RNGPRIFHLKR L++LKQ Q+SVTMYF K+K
Subjt:  IDLKERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLK

TrEMBL top hitse value%identityAlignment
A0A1S3C5T4 uncharacterized protein LOC1034967921.8e-4468.12Show/hide
Query:  NSQSPYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWSRNNLIVISWILNSVSKAISSSIIFTDSAEAIWIDLK
        +S SPY L+H+DTSNL+LV+EL+T++NYV WSRSM++A++I NKLGFID  I KP GELLP W  NN +VI+WILNS SK I SSI+FT SA A WIDL+
Subjt:  NSQSPYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWSRNNLIVISWILNSVSKAISSSIIFTDSAEAIWIDLK

Query:  ERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLKGL
        + F+KRNGPRIF LK GL+TLKQ QESVTMYFA +K L
Subjt:  ERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLKGL

A0A5A7SU21 UBN2_3 domain-containing protein1.5e-4367.39Show/hide
Query:  NSQSPYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWSRNNLIVISWILNSVSKAISSSIIFTDSAEAIWIDLK
        +S SPY L+H+DTSNL+LV+EL+T++NYV WSRSM++A++I NKLGFID  I KP GELLP W  NN +VI+WILNS SK I SSI+ T SA A WIDL+
Subjt:  NSQSPYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWSRNNLIVISWILNSVSKAISSSIIFTDSAEAIWIDLK

Query:  ERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLKGL
        + F+KRNGPRIF LK GL+TLKQ QESVTMYFA +K L
Subjt:  ERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLKGL

A0A6J1DKR8 uncharacterized protein LOC1110218315.6e-4661.54Show/hide
Query:  PSFFFSMMDSTVPVQGASSSLSAENSQS-------PYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWSRNNLI
        PS     +D   PV   SSS ++ +S S       PYYLHHTD + LVLV++ LT +NY +WSRSML+AL+I NKLGFIDGSI +P GELLPAW  NN +
Subjt:  PSFFFSMMDSTVPVQGASSSLSAENSQS-------PYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWSRNNLI

Query:  VISWILNSVSKAISSSIIFTDSAEAIWIDLKERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLKGL
        VI+WILNSVSK ISSSI+F++SA  IWIDLKERF+K NGPRIF LKR LA L Q Q+SV++YF KLK +
Subjt:  VISWILNSVSKAISSSIIFTDSAEAIWIDLKERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLKGL

A0A6J1DLQ9 uncharacterized protein LOC1110221172.2e-4267.18Show/hide
Query:  LHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWSRNNLIVISWILNSVSKAISSSIIFTDSAEAIWIDLKERFKKRN
        +HH DTSNLVLVS+ LTN NYV+WSRSM +AL+I NKLGFI+GS+PKP G+LLP W RN  +VI+W LNSVSK IS+S+IFT+S   IW+DLK+RF+ +N
Subjt:  LHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWSRNNLIVISWILNSVSKAISSSIIFTDSAEAIWIDLKERFKKRN

Query:  GPRIFHLKRGLATLKQVQESVTMYFAKLKGL
        GP+IF L+R LATL Q Q SVTMY+ KLK L
Subjt:  GPRIFHLKRGLATLKQVQESVTMYFAKLKGL

A0A6J1DW89 uncharacterized protein LOC1110237024.0e-4459.2Show/hide
Query:  LAAHLLSDSFLPSFFFSMMDSTVPVQGASSSLSAENS-QSPYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWS
        L + LL D  LP    S   S+ PV  +SS+LS + S  +PYYLHHTD + LV V++LLT DNY +WSRSM++ L++ NKL FIDG IP+P+G+LLPAW 
Subjt:  LAAHLLSDSFLPSFFFSMMDSTVPVQGASSSLSAENS-QSPYYLHHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWS

Query:  RNNLIVISWILNSVSKAISSSIIFTDSAEAIWIDLKERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLKGL
         NN IVI+WILNSVSK IS+SI+F++SA  IWIDL ERF+K N P I+ LKR LATL Q Q+SV+ YF KLK L
Subjt:  RNNLIVISWILNSVSKAISSSIIFTDSAEAIWIDLKERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKLKGL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).2.2e-1836.23Show/hide
Query:  SPYYL----HHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPT--GELLPAWSRNNLIVISWILNSVSKAISSSIIFTDSAEAIWI
        SPYYL    HH    ++  +S+    DNYV W       L +  K GFIDG++PKP     L   W + N +V+ W++NS++  +  S+++ ++A  +W 
Subjt:  SPYYL----HHTDTSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPT--GELLPAWSRNNLIVISWILNSVSKAISSSIIFTDSAEAIWI

Query:  DLKERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKL
        DL+  F      +I+ L+R LATL+Q  +SV  YF KL
Subjt:  DLKERFKKRNGPRIFHLKRGLATLKQVQESVTMYFAKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAAGGAAATCAGAAGCCGCGGAATCAGTGTTCATTTCGACGGAGGCGTTAGGTCAGTTAAGGTACCAACGGTTTCGTTCAGCTGCCGTACACCGGCAGTTATGCA
TTTTCCGATAGAAGTTTTAGGTTTTCTCATCTGCATTTTCAATTTTCTCCTCTGCCCCTCTCTCTCTTCAACGACGCAAACCAAGAACGACACGACTCCTCCCTCTCAAG
TTTTTGCATTTTTGGCATCTCCTCCGACGAGTGCAGCGATCGACGACAACTTTGGCAGTGAGCGTGGATCCGTGAAGCAAGTCGTTGACGACTCCAGCCCCTCCGGTAAA
GCTGCAGCACCTGCAAACGTGCGGACATTAGCCTCCATTCACGATTTCCAGCGATTTTGCAACGTGAGGTTGGGAGTTCCGACGTCTTCTTCTGATTCGATCAAAGAAAG
TGACCCGACCCGATCGGTTCGAGTCAAGTCGGGTCCGTCCGTCCGTCCGTACAACCGACATCTGTCGAGCAAAGCCCTTCTTGCTGCTCATCTTCTCTCTGATTCTTTCC
TCCCTTCCTTCTTTTTCTCAATGATGGACTCTACGGTTCCAGTTCAAGGTGCATCTTCTTCCCTTTCCGCTGAAAACTCTCAAAGTCCGTATTATCTCCATCACACCGAC
ACCTCCAATCTTGTTCTTGTATCGGAATTGCTCACTAATGACAATTATGTCACATGGAGTAGATCAATGCTGATGGCGTTGGCGATCATAAACAAATTGGGATTTATTGA
TGGAAGCATCCCAAAACCCACTGGAGAATTGCTTCCAGCCTGGTCTCGAAACAATCTTATTGTCATATCATGGATTCTGAATTCTGTGTCCAAAGCGATATCTTCGAGCA
TCATCTTTACTGATTCCGCCGAAGCTATCTGGATTGATTTAAAAGAACGTTTTAAAAAACGCAACGGTCCAAGAATTTTCCATCTCAAACGAGGATTGGCTACTTTAAAG
CAAGTACAAGAATCAGTGACCATGTATTTTGCAAAGCTTAAAGGTCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGAAGGAAATCAGAAGCCGCGGAATCAGTGTTCATTTCGACGGAGGCGTTAGGTCAGTTAAGGTACCAACGGTTTCGTTCAGCTGCCGTACACCGGCAGTTATGCA
TTTTCCGATAGAAGTTTTAGGTTTTCTCATCTGCATTTTCAATTTTCTCCTCTGCCCCTCTCTCTCTTCAACGACGCAAACCAAGAACGACACGACTCCTCCCTCTCAAG
TTTTTGCATTTTTGGCATCTCCTCCGACGAGTGCAGCGATCGACGACAACTTTGGCAGTGAGCGTGGATCCGTGAAGCAAGTCGTTGACGACTCCAGCCCCTCCGGTAAA
GCTGCAGCACCTGCAAACGTGCGGACATTAGCCTCCATTCACGATTTCCAGCGATTTTGCAACGTGAGGTTGGGAGTTCCGACGTCTTCTTCTGATTCGATCAAAGAAAG
TGACCCGACCCGATCGGTTCGAGTCAAGTCGGGTCCGTCCGTCCGTCCGTACAACCGACATCTGTCGAGCAAAGCCCTTCTTGCTGCTCATCTTCTCTCTGATTCTTTCC
TCCCTTCCTTCTTTTTCTCAATGATGGACTCTACGGTTCCAGTTCAAGGTGCATCTTCTTCCCTTTCCGCTGAAAACTCTCAAAGTCCGTATTATCTCCATCACACCGAC
ACCTCCAATCTTGTTCTTGTATCGGAATTGCTCACTAATGACAATTATGTCACATGGAGTAGATCAATGCTGATGGCGTTGGCGATCATAAACAAATTGGGATTTATTGA
TGGAAGCATCCCAAAACCCACTGGAGAATTGCTTCCAGCCTGGTCTCGAAACAATCTTATTGTCATATCATGGATTCTGAATTCTGTGTCCAAAGCGATATCTTCGAGCA
TCATCTTTACTGATTCCGCCGAAGCTATCTGGATTGATTTAAAAGAACGTTTTAAAAAACGCAACGGTCCAAGAATTTTCCATCTCAAACGAGGATTGGCTACTTTAAAG
CAAGTACAAGAATCAGTGACCATGTATTTTGCAAAGCTTAAAGGTCTTTAG
Protein sequenceShow/hide protein sequence
MLKEIRSRGISVHFDGGVRSVKVPTVSFSCRTPAVMHFPIEVLGFLICIFNFLLCPSLSSTTQTKNDTTPPSQVFAFLASPPTSAAIDDNFGSERGSVKQVVDDSSPSGK
AAAPANVRTLASIHDFQRFCNVRLGVPTSSSDSIKESDPTRSVRVKSGPSVRPYNRHLSSKALLAAHLLSDSFLPSFFFSMMDSTVPVQGASSSLSAENSQSPYYLHHTD
TSNLVLVSELLTNDNYVTWSRSMLMALAIINKLGFIDGSIPKPTGELLPAWSRNNLIVISWILNSVSKAISSSIIFTDSAEAIWIDLKERFKKRNGPRIFHLKRGLATLK
QVQESVTMYFAKLKGL