CuGenDBv2

Lag0019889 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0019889
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr5:46446909..46447496
RNA-Seq Expression: Lag0019889
Synteny: Lag0019889
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits
XP_022148562.1 uncharacterized protein LOC111017196 [Momordica charantia]  e value: 1.6e-36  % identity: 71.43
Query:  DGEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTCGRCS
        D E L+SW ICN+VV AWILN+LSKEI ASV F DS R++WLDLQ+R+QR+NRPRIFQLRR++S L Q+QLSV+AYF KLKTLW EL +YRP+C+CGRC+
Subjt:  DGEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTCGRCS

Query:  CGGCK
        CGG K
Subjt:  CGGCK

XP_022152756.1 uncharacterized protein LOC111020399 [Momordica charantia]  e value: 3.0e-35  % identity: 69.23
Query:  GEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTCGRCSC
        G++LHSW ICN+VV +WILN+LSKEI AS+ F DS R++WLDL++R++++NRPRIFQLRR++SNL Q+QLSV+AYF  LKTLW EL SY PSCT GRCSC
Subjt:  GEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTCGRCSC

Query:  GGCK
        GG K
Subjt:  GGCK

XP_038874906.1 uncharacterized protein LOC120067409 [Benincasa hispida]  e value: 8.8e-35  % identity: 70.19
Query:  GEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTCGRCSC
        G++ +SW ICNSVV  WI NALSK+I ASVNF DSTR++WLDLQQRYQ K  P IFQ  RE+SNL Q+QLSV AYF KLKT WNEL SY+P C+CGRC+C
Subjt:  GEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTCGRCSC

Query:  GGCK
        GG K
Subjt:  GGCK

XP_038887168.1 uncharacterized protein LOC120077355 [Benincasa hispida]  e value: 1.6e-36  % identity: 72.12
Query:  GEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTCGRCSC
        GE+L SW ICN VV AWILN+LSKEI AS+NF DS +++W+DLQ+RYQR+NRPR+FQL REISNL Q Q SVT Y+AKLKTLWNEL SYRPSC+CG+C+C
Subjt:  GEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTCGRCSC

Query:  GGCK
         G K
Subjt:  GGCK

XP_038904477.1 uncharacterized protein LOC120090845 [Benincasa hispida]  e value: 1.4e-37  % identity: 71.15
Query:  GEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTCGRCSC
        GE+L SW ICN +V  WILN+LSKEI AS+NF DS +++W+DLQ+RYQRKNRPR+FQLRRE SNL+Q Q S+T Y+AKLKTLWNEL SYRPSC+CG+C+C
Subjt:  GEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTCGRCSC

Query:  GGCK
        GG K
Subjt:  GGCK

TrEMBL top hits
A0A2Z7D4G6 Uncharacterized protein  e value: 3.2e-30  % identity: 58.72
Query:  PNTTDGEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTC
        P+++D  +L+SW   N++V +WILN++SKEI AS+ F +S   +WLDL+ R+Q+ N PRIFQLRRE+ NLAQEQLSV+ YF KLK LW+EL+++RP+CTC
Subjt:  PNTTDGEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTC

Query:  GRCSCGGCK
        G+C+CGG K
Subjt:  GRCSCGGCK

A0A5J5BIH5 Uncharacterized protein  e value: 4.9e-31  % identity: 57.8
Query:  PNTTDGEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTC
        P  TD ++++SW   N++V +WILN++SKEI AS+ F  S R++WLDL+ R+Q++N PRIFQL+RE+ NL QEQ SV+ YF KLKT+W EL++YRP+C+C
Subjt:  PNTTDGEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTC

Query:  GRCSCGGCK
        G+CSCGG K
Subjt:  GRCSCGGCK

A0A6J1D5E3 uncharacterized protein LOC111017196  e value: 7.8e-37  % identity: 71.43
Query:  DGEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTCGRCS
        D E L+SW ICN+VV AWILN+LSKEI ASV F DS R++WLDLQ+R+QR+NRPRIFQLRR++S L Q+QLSV+AYF KLKTLW EL +YRP+C+CGRC+
Subjt:  DGEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTCGRCS

Query:  CGGCK
        CGG K
Subjt:  CGGCK

A0A6J1DIP8 uncharacterized protein LOC111020399  e value: 1.5e-35  % identity: 69.23
Query:  GEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTCGRCSC
        G++LHSW ICN+VV +WILN+LSKEI AS+ F DS R++WLDL++R++++NRPRIFQLRR++SNL Q+QLSV+AYF  LKTLW EL SY PSCT GRCSC
Subjt:  GEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTCGRCSC

Query:  GGCK
        GG K
Subjt:  GGCK

A0A6J1DNP7 uncharacterized protein LOC111022065  e value: 5.6e-35  % identity: 69.81
Query:  TDGEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTCGRC
        TDG  LHSW ICN+VV +WI N+LSK+I ASV F DS  ++WLDL++R+QR+NRPRIFQLRRE+SNL Q+QLSVTAYF +LKTLW+EL  YRP+C+CGRC
Subjt:  TDGEMLHSWKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTCGRC

Query:  SCGGCK
        S GG K
Subjt:  SCGGCK

SwissProt top hits
No hits found
Arabidopsis top hits
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).  e value: 1.8e-14  % identity: 37.5
Query:  WKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYR--PSCTCGRCSC
        W+ CN++V  W++N+++ ++L SV + ++   MW DL++ +      +I+QLRR ++ L Q   SV  YF KL  +W EL+ Y   P C CG C+C
Subjt:  WKICNSVVKAWILNALSKEILASVNFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYR--PSCTCGRCSC


Sequences
CDS sequence
ATGAAACTGTTCAAGCTTCAAACCCAAATTCCTCTTCTTCAGCAAATCAACCTACCAACCCTACTGTCATCGATCAGTATGCCAATCTTACTTCCTTCATCATTCGGATG
GAACCAATTTGGTTCTTGTTTCCAAATTTCTCACCGAATCCAATTACTCCTCTTGGTATCACGCCATGCTTATCGGACTCACCGTGAAGAACAAGGTTGGCTTTGTCGAT
GGAACCCTAACACAACTGACGGTGAGATGCTCCACTCATGGAAGATTTGCAACAGTGTTGTCAAGGCTTGGATTCTCAACGCCTTATCCAAGGAGATACTCGCAAGCGTT
AATTTCTTTGATTCGACTAGAGATATGTGGCTCGACCTGCAACAACGCTATCAGAGGAAGAATCGTCCTCGAATTTTTCAATTACGGCGGGAAATTTCCAATCTGGCGCA
AGAACAGTTGTCTGTGACTGCGTATTTCGCCAAGTTAAAGACTTTGTGGAATGAACTTACCTCGTATAGACCTTCTTGTACTTGCGGTCGTTGTTCTTGTGGAGGTTGCA
AGATTTGGTTCAATACTTCCAAACTGAACACGTTATGA
mRNA sequence
ATGAAACTGTTCAAGCTTCAAACCCAAATTCCTCTTCTTCAGCAAATCAACCTACCAACCCTACTGTCATCGATCAGTATGCCAATCTTACTTCCTTCATCATTCGGATG
GAACCAATTTGGTTCTTGTTTCCAAATTTCTCACCGAATCCAATTACTCCTCTTGGTATCACGCCATGCTTATCGGACTCACCGTGAAGAACAAGGTTGGCTTTGTCGAT
GGAACCCTAACACAACTGACGGTGAGATGCTCCACTCATGGAAGATTTGCAACAGTGTTGTCAAGGCTTGGATTCTCAACGCCTTATCCAAGGAGATACTCGCAAGCGTT
AATTTCTTTGATTCGACTAGAGATATGTGGCTCGACCTGCAACAACGCTATCAGAGGAAGAATCGTCCTCGAATTTTTCAATTACGGCGGGAAATTTCCAATCTGGCGCA
AGAACAGTTGTCTGTGACTGCGTATTTCGCCAAGTTAAAGACTTTGTGGAATGAACTTACCTCGTATAGACCTTCTTGTACTTGCGGTCGTTGTTCTTGTGGAGGTTGCA
AGATTTGGTTCAATACTTCCAAACTGAACACGTTATGA
Protein sequence
MKLFKLQTQIPLLQQINLPTLLSSISMPILLPSSFGWNQFGSCFQISHRIQLLLLVSRHAYRTHREEQGWLCRWNPNTTDGEMLHSWKICNSVVKAWILNALSKEILASV
NFFDSTRDMWLDLQQRYQRKNRPRIFQLRREISNLAQEQLSVTAYFAKLKTLWNELTSYRPSCTCGRCSCGGCKIWFNTSKLNTL
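
The genome location and the sequences in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the location string uses 1-based, inclusive coordinates (an assumption, not stated on the page), and the helper names `parse_location` and `span_length` are hypothetical. Under that assumption the span works out to 588 nt, which would correspond to 195 codons plus a stop codon, in line with the CDS and protein lengths listed above.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr5:46446909..46447496")
length = span_length(start, end)

# For a single-exon gene whose mRNA equals its CDS, the span should be
# a whole number of codons; the last codon is the stop codon.
codons = length // 3
print(chrom, length, codons - 1)
```

If the database actually reports 0-based or half-open coordinates, `span_length` would need adjusting accordingly.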