CuGenDBv2

Lag0035297 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035297
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: UBN2_3 domain-containing protein
Genome location: chr3:18299114..18300631
RNA-Seq Expression: Lag0035297
Synteny: Lag0035297
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
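
The genome location field above combines a sequence name with a start..end range. As a minimal sketch, the snippet below splits such a string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` is a hypothetical helper name, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a string such as 'chr3:18299114..18300631' into (name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("chr3:18299114..18300631")
# Assumption: both ends are 1-based and inclusive, hence the +1.
span = end - start + 1
print(seqid, start, end, span)  # chr3 18299114 18300631 1518
```

Under that assumption the gene region covers 1518 bp on chr3.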


Homology
GenBank top hits | e value | % identity | Alignment
KAA0032807.1 UBN2_3 domain-containing protein [Cucumis melo var. makuwa] | 1.6e-39 | 56.58
Query:  NSRSPYFLHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSKSISSSILCTDSAQTIWLDLK
        +S SPY L+HSDTSNL                          NKLGFID  I KP+G+LLPLW  NNN+VIAWILNS SK I SSIL T SA+  W+DL+
Subjt:  NSRSPYFLHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSKSISSSILCTDSAQTIWLDLK

Query:  DRFAKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTCG
        D F K+NGPRIF LK  +STL+Q+Q++VTMY+A +KSLWDEY ++ PGCTCG
Subjt:  DRFAKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTCG

XP_008457013.1 PREDICTED: uncharacterized protein LOC103496792 [Cucumis melo] | 2.1e-39 | 56.58
Query:  NSRSPYFLHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSKSISSSILCTDSAQTIWLDLK
        +S SPY L+HSDTSNL                          NKLGFID  I KP+G+LLPLW  NNN+VIAWILNS SK I SSIL T SA+  W+DL+
Subjt:  NSRSPYFLHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSKSISSSILCTDSAQTIWLDLK

Query:  DRFAKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTCG
        D F K+NGPRIF LK  +STL+Q+Q++VTMY+A +KSLWDEY ++ PGCTCG
Subjt:  DRFAKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTCG

XP_022154608.1 uncharacterized protein LOC111021831 [Momordica charantia] | 8.3e-36 | 48.54
Query:  MDTLTSANPSASSIDGSEAQNSRSPYFLHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSK
        +D  +S N S+SS+ G    +  +PY+LHH+D + L                          NKLGFIDGSI +P G+LLP W  NN++VIAWILNSVSK
Subjt:  MDTLTSANPSASSIDGSEAQNSRSPYFLHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSK

Query:  SISSSILCTDSAQTIWLDLKDRFAKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTC
         ISSSIL ++SA+ IW+DLK+RF K NGPRIF LK++++ L Q QQ+V++Y+ KLK++WDE + +RP C+C
Subjt:  SISSSILCTDSAQTIWLDLKDRFAKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTC

XP_022154973.1 uncharacterized protein LOC111022117 [Momordica charantia] | 4.0e-38 | 52.2
Query:  LHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSKSISSSILCTDSAQTIWLDLKDRFAKKN
        +HH+DTSNL                          NKLGFI+GS+ KP+G LLP+W RN ++VIAW LNSVSK IS+S++ T+S   IWLDLKDRF  +N
Subjt:  LHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSKSISSSILCTDSAQTIWLDLKDRFAKKN

Query:  GPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTCGA-SCWLDDLLDRLV
        GP+IF L+++++TL Q+Q +VTMYY KLK+LWDEYV++RPGCTCG+ SC    L+++ V
Subjt:  GPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTCGA-SCWLDDLLDRLV

XP_038895765.1 uncharacterized protein LOC120083929 [Benincasa hispida] | 2.2e-41 | 58.39
Query:  SPYFLHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSKSISSSILCTDSAQTIWLDLKDRF
        +PY LHHSDTSNL                          NKLGFIDGS+ +P+G LL LW  NNN+V++WIL SVSKSISSSIL T+SAQ IWLDL+D F
Subjt:  SPYFLHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSKSISSSILCTDSAQTIWLDLKDRF

Query:  AKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTCG
         ++NGPRIFHLK+E+S+L+Q+Q +VTMY+ K+KS  DEYV++RPGCTCG
Subjt:  AKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTCG

TrEMBL top hits | e value | % identity | Alignment
A0A1S3C5T4 uncharacterized protein LOC103496792 | 1.0e-39 | 56.58
Query:  NSRSPYFLHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSKSISSSILCTDSAQTIWLDLK
        +S SPY L+HSDTSNL                          NKLGFID  I KP+G+LLPLW  NNN+VIAWILNS SK I SSIL T SA+  W+DL+
Subjt:  NSRSPYFLHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSKSISSSILCTDSAQTIWLDLK

Query:  DRFAKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTCG
        D F K+NGPRIF LK  +STL+Q+Q++VTMY+A +KSLWDEY ++ PGCTCG
Subjt:  DRFAKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTCG

A0A5A7SU21 UBN2_3 domain-containing protein | 7.8e-40 | 56.58
Query:  NSRSPYFLHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSKSISSSILCTDSAQTIWLDLK
        +S SPY L+HSDTSNL                          NKLGFID  I KP+G+LLPLW  NNN+VIAWILNS SK I SSIL T SA+  W+DL+
Subjt:  NSRSPYFLHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSKSISSSILCTDSAQTIWLDLK

Query:  DRFAKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTCG
        D F K+NGPRIF LK  +STL+Q+Q++VTMY+A +KSLWDEY ++ PGCTCG
Subjt:  DRFAKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTCG

A0A6J1DKR8 uncharacterized protein LOC111021831 | 4.0e-36 | 48.54
Query:  MDTLTSANPSASSIDGSEAQNSRSPYFLHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSK
        +D  +S N S+SS+ G    +  +PY+LHH+D + L                          NKLGFIDGSI +P G+LLP W  NN++VIAWILNSVSK
Subjt:  MDTLTSANPSASSIDGSEAQNSRSPYFLHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSK

Query:  SISSSILCTDSAQTIWLDLKDRFAKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTC
         ISSSIL ++SA+ IW+DLK+RF K NGPRIF LK++++ L Q QQ+V++Y+ KLK++WDE + +RP C+C
Subjt:  SISSSILCTDSAQTIWLDLKDRFAKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTC

A0A6J1DLQ9 uncharacterized protein LOC111022117 | 1.9e-38 | 52.2
Query:  LHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSKSISSSILCTDSAQTIWLDLKDRFAKKN
        +HH+DTSNL                          NKLGFI+GS+ KP+G LLP+W RN ++VIAW LNSVSK IS+S++ T+S   IWLDLKDRF  +N
Subjt:  LHHSDTSNL--------------------------NKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSKSISSSILCTDSAQTIWLDLKDRFAKKN

Query:  GPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTCGA-SCWLDDLLDRLV
        GP+IF L+++++TL Q+Q +VTMYY KLK+LWDEYV++RPGCTCG+ SC    L+++ V
Subjt:  GPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTCGA-SCWLDDLLDRLV

A0A7J0FKC9 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein | 4.1e-33 | 48.19
Query:  SASSIDGSEAQNSRSPYFLHHSDTSNL--------------------------NKLGFIDGSIVKPSG---QLLPLWARNNNIVIAWILNSVSKSISSSI
        S SSI+     +  SPYFLHHSD   L                          NKLGFIDGSI KP G    LL  W RNNN+VI+WILNSVSK IS+SI
Subjt:  SASSIDGSEAQNSRSPYFLHHSDTSNL--------------------------NKLGFIDGSIVKPSG---QLLPLWARNNNIVIAWILNSVSKSISSSI

Query:  LCTDSAQTIWLDLKDRFAKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTCG
        + + SA  IW+DLKDRF + NGPRIF L++E+    Q+Q  V++Y+ KLK++W+E   +RP C+CG
Subjt:  LCTDSAQTIWLDLKDRFAKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTCG

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | 1.2e-13 | 32.74
Query:  KLGFIDGSIVKPS--GQLLPLWARNNNIVIAWILNSVSKSISSSILCTDSAQTIWLDLKDRFAKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDE
        K GFIDG++ KP     L   W + N +V+ W++NS++  +  S++  ++A  +W DL+  F      +I+ L++ ++TLRQ   +V  Y+ KL  +W E
Subjt:  KLGFIDGSIVKPS--GQLLPLWARNNNIVIAWILNSVSKSISSSILCTDSAQTIWLDLKDRFAKKNGPRIFHLKQEMSTLRQEQQAVTMYYAKLKSLWDE

Query:  YVTFR--PGCTCG
           +   P C CG
Subjt:  YVTFR--PGCTCG


Sequences
CDS sequence
ATGGATACACTGACTTCAGCTAATCCCTCTGCTTCCAGCATTGATGGATCAGAGGCACAGAACTCTCGGAGTCCTTATTTCCTACACCACAGTGATACTTCGAATCTGAA
CAAATTAGGTTTCATAGATGGATCAATAGTTAAACCTAGTGGACAGCTCTTGCCGCTTTGGGCGAGGAATAATAATATTGTGATAGCATGGATCTTGAATTCAGTATCGA
AAAGTATATCATCGAGCATCCTGTGCACCGATTCAGCCCAGACAATCTGGCTTGATCTTAAGGATCGGTTTGCCAAGAAGAATGGACCTCGAATCTTCCACCTCAAGCAA
GAAATGAGCACTCTCCGCCAAGAACAACAAGCGGTAACCATGTACTATGCCAAACTCAAGAGCCTCTGGGATGAATATGTGACTTTTCGACCTGGATGTACTTGTGGTGC
CTCCTGTTGGCTTGATGATCTGTTGGATCGGCTTGTTAGCTTGTTGATCCTCTTGGATGATCGGCCTGCTGACTTGTTTATCTTCTTGATCGACTCTCTGACTTCTAATG
ACTCTGGACTCTGGCAGAATTCTCTGGACTATGAGTTCTCTGATCTGAATCTCCGAATCTGA
mRNA sequence
ATGGATACACTGACTTCAGCTAATCCCTCTGCTTCCAGCATTGATGGATCAGAGGCACAGAACTCTCGGAGTCCTTATTTCCTACACCACAGTGATACTTCGAATCTGAA
CAAATTAGGTTTCATAGATGGATCAATAGTTAAACCTAGTGGACAGCTCTTGCCGCTTTGGGCGAGGAATAATAATATTGTGATAGCATGGATCTTGAATTCAGTATCGA
AAAGTATATCATCGAGCATCCTGTGCACCGATTCAGCCCAGACAATCTGGCTTGATCTTAAGGATCGGTTTGCCAAGAAGAATGGACCTCGAATCTTCCACCTCAAGCAA
GAAATGAGCACTCTCCGCCAAGAACAACAAGCGGTAACCATGTACTATGCCAAACTCAAGAGCCTCTGGGATGAATATGTGACTTTTCGACCTGGATGTACTTGTGGTGC
CTCCTGTTGGCTTGATGATCTGTTGGATCGGCTTGTTAGCTTGTTGATCCTCTTGGATGATCGGCCTGCTGACTTGTTTATCTTCTTGATCGACTCTCTGACTTCTAATG
ACTCTGGACTCTGGCAGAATTCTCTGGACTATGAGTTCTCTGATCTGAATCTCCGAATCTGA
Protein sequence
MDTLTSANPSASSIDGSEAQNSRSPYFLHHSDTSNLNKLGFIDGSIVKPSGQLLPLWARNNNIVIAWILNSVSKSISSSILCTDSAQTIWLDLKDRFAKKNGPRIFHLKQ
EMSTLRQEQQAVTMYYAKLKSLWDEYVTFRPGCTCGASCWLDDLLDRLVSLLILLDDRPADLFIFLIDSLTSNDSGLWQNSLDYEFSDLNLRI
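
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1). It uses only the first 27 and last 12 nucleotides of the CDS listed above, and `translate` is a hypothetical helper name, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGGATACACTGACTTCAGCTAATCCC"  # first 27 nt of the CDS above
cds_end = "CTCCGAATCTGA"                   # last 12 nt of the CDS above

print(translate(cds_start))  # MDTLTSANP
print(translate(cds_end))    # LRI
```

Both fragments agree with the start (MDTLTSANP) and end (LRI) of the protein sequence above. The same function could be applied to the full CDS after joining its lines.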