; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032051 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032051
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr11:23334858..23336265
RNA-Seq ExpressionLag0032051
SyntenyLag0032051
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147761.1 uncharacterized protein LOC111016619 [Momordica charantia]3.6e-1839.05Show/hide
Query:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER
        EA RCR F+ TL G  R WF ++  KSI SFKELARAFVTQF G W+R +P   LLT+KQ   ESLKDY+                           DE+
Subjt:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER

Query:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLRSKQEE--RKAEVGKSSPRP--------MAEADQGLGRL
        L+ S G+    T+ E  +RAQ Y+S  +L+ SK++   + A+      RP           +D G GRL
Subjt:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLRSKQEE--RKAEVGKSSPRP--------MAEADQGLGRL

XP_022150035.1 uncharacterized protein LOC111018307 [Momordica charantia]2.2e-1537.82Show/hide
Query:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER
        EA +CR F+ TL G AR WF ++   SI SFK LA+AFVTQF+G  SR +P   LLT+KQ   ESL DY+                           DE 
Subjt:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER

Query:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLRSKQEERKAEVGKSSPRPMAEADQGLGRLGVVRNQKPGFT-VEGRDRSPNWRGLPQGVRGERQ
        L  S G+  P T+ E ++RAQ+Y+SA +   SK+E      GK S       DQ   R G     KP ++  E RDRS   +  P+ ++  RQ
Subjt:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLRSKQEERKAEVGKSSPRPMAEADQGLGRLGVVRNQKPGFT-VEGRDRSPNWRGLPQGVRGERQ

XP_022158344.1 uncharacterized protein LOC111024851 [Momordica charantia]4.0e-1738.6Show/hide
Query:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER
        EA RCR F+ TL G AR WF ++   SI SFKELA AFVTQF+G   + KP   LLT+KQ   ESLK+Y+                           DER
Subjt:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER

Query:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLR------------SKQEERKAEVGKSSPRPMAEADQGLGRL
        L+ S G+  P T++E ++RAQ+Y+SA +L+             S + ER+ E  K        +D G GRL
Subjt:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLR------------SKQEERKAEVGKSSPRPMAEADQGLGRL

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]1.5e-1642.96Show/hide
Query:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER
        EA RCR F+ TL G AR WF ++   SI SFK LARAFVTQF+G   R +P   LLT+KQ   ESL+DY+                           DE 
Subjt:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER

Query:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLRSKQE
        L  S G+  P T+ E ++RAQRY+SA +   SK+E
Subjt:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLRSKQE

XP_030970370.1 uncharacterized protein LOC115990710 [Quercus lobata]2.2e-1535.76Show/hide
Query:  CRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYI----------NNLDERLL--------------NS
        CRAF  TL GLAR WFSKIPP S+GSF+EL++ FV  F+G    ++   NLLT++QG  ESL+ +I          + +D++LL              + 
Subjt:  CRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYI----------NNLDERLL--------------NS

Query:  IGESQPRTYVEFMTRAQRYISAEKLLRSKQEERKAEVGKSSPRPMAEADQG
        + E +P+T VE +  AQ +++ E  + +K+ +R   +G     P   ++QG
Subjt:  IGESQPRTYVEFMTRAQRYISAEKLLRSKQEERKAEVGKSSPRPMAEADQG

TrEMBL top hitse value%identityAlignment
A0A6J1D3B7 uncharacterized protein LOC1110166191.7e-1839.05Show/hide
Query:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER
        EA RCR F+ TL G  R WF ++  KSI SFKELARAFVTQF G W+R +P   LLT+KQ   ESLKDY+                           DE+
Subjt:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER

Query:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLRSKQEE--RKAEVGKSSPRP--------MAEADQGLGRL
        L+ S G+    T+ E  +RAQ Y+S  +L+ SK++   + A+      RP           +D G GRL
Subjt:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLRSKQEE--RKAEVGKSSPRP--------MAEADQGLGRL

A0A6J1D7D2 uncharacterized protein LOC1110183071.1e-1537.82Show/hide
Query:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER
        EA +CR F+ TL G AR WF ++   SI SFK LA+AFVTQF+G  SR +P   LLT+KQ   ESL DY+                           DE 
Subjt:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER

Query:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLRSKQEERKAEVGKSSPRPMAEADQGLGRLGVVRNQKPGFT-VEGRDRSPNWRGLPQGVRGERQ
        L  S G+  P T+ E ++RAQ+Y+SA +   SK+E      GK S       DQ   R G     KP ++  E RDRS   +  P+ ++  RQ
Subjt:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLRSKQEERKAEVGKSSPRPMAEADQGLGRLGVVRNQKPGFT-VEGRDRSPNWRGLPQGVRGERQ

A0A6J1DWY0 uncharacterized protein LOC1110252937.3e-1742.96Show/hide
Query:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER
        EA RCR F+ TL G AR WF ++   SI SFK LARAFVTQF+G   R +P   LLT+KQ   ESL+DY+                           DE 
Subjt:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER

Query:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLRSKQE
        L  S G+  P T+ E ++RAQRY+SA +   SK+E
Subjt:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLRSKQE

A0A6J1DZ49 uncharacterized protein LOC1110248511.9e-1738.6Show/hide
Query:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER
        EA RCR F+ TL G AR WF ++   SI SFKELA AFVTQF+G   + KP   LLT+KQ   ESLK+Y+                           DER
Subjt:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER

Query:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLR------------SKQEERKAEVGKSSPRPMAEADQGLGRL
        L+ S G+  P T++E ++RAQ+Y+SA +L+             S + ER+ E  K        +D G GRL
Subjt:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLR------------SKQEERKAEVGKSSPRPMAEADQGLGRL

A0A6J1E1E7 uncharacterized protein LOC1110255488.9e-1540.74Show/hide
Query:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER
        +A RCR F+ TL G AR WF ++   SI SFK LARAF+TQF+G   R +P   LLT+KQ   ESL DY+                           DE 
Subjt:  EATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAWSRQKPQINLLTVKQGPRESLKDYINNL------------------------DER

Query:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLRSKQE
        L  S  +  P T+ E ++RAQRY+SA +   SK+E
Subjt:  LLNSIGESQPRTYVEFMTRAQRYISAEKLLRSKQE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCACTCCTCGGCCAATGGTCGAGGCCGAGCAGAAGCCCAATTGCAAAAAGGCCTGGAGAAGTTCGTCGCTAAAGACAGCGTCAGGTACGTACACAGGGAATAATGA
CAGGAAAAAGTCGGAGGCTCGGGCAAAGTTCGAGGCCGAGCAGGGCCAAAAGTGGCGAGAGCTATCCAAATGGCTAAAAGGCGAAGCGACAAGGTGCCGAGCCTTCGCGT
TAACACTCATAGGGTTGGCAAGGCAATGGTTTAGCAAGATCCCACCGAAGTCAATCGGTTCATTTAAAGAATTGGCCCGAGCGTTTGTTACGCAATTCCTCGGGGCCTGG
AGCCGACAAAAGCCTCAGATCAACTTGCTGACAGTAAAGCAGGGGCCCCGAGAAAGCTTGAAGGATTATATTAACAATTTAGATGAAAGATTGCTCAACTCGATCGGTGA
GAGCCAGCCACGGACATACGTGGAATTCATGACCCGAGCACAAAGGTACATAAGCGCCGAGAAACTGCTGAGGTCCAAACAAGAAGAGAGAAAGGCCGAAGTCGGCAAGA
GCAGTCCTCGGCCAATGGCTGAGGCCGACCAGGGACTAGGGAGACTCGGGGTCGTGCGGAACCAAAAGCCAGGTTTTACAGTTGAGGGACGAGATAGAAGCCCTAATTGG
AGAGGGTTACCTCAAGGAGTTCGTGGGGAACGACAGAAGCAAGAGGCCACTGCCAGCAGATCAAGGAATATCGACAGGAAGGTTCAAGCAACGTCGGCCTCGGGAGATGG
CCGAGGCCGAGCCTTTGAGGGTCAAGCAATCCCCTTCCAATGGAACATTGATTTCCTTCTCTTATGTCCTTTAGTTTCCAGTTTCTTGAAACCTTGTTCGACTTTCTTTC
ATTTCAATAGCATATTACTCAGGAATGTAATTGATTCATTTGAAATAAATTACTCTGAATTTGTTTCAAAAGTTTTGACAGCGCTTGCAGAGAGACCCGGGAATCTCCCT
AGGCCTCCCAAGCTTTTCCCCAGAAGAGTGTACACACCCCTGGGAAGACCACAACACTTGACCTATTATCGTAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCACTCCTCGGCCAATGGTCGAGGCCGAGCAGAAGCCCAATTGCAAAAAGGCCTGGAGAAGTTCGTCGCTAAAGACAGCGTCAGGTACGTACACAGGGAATAATGA
CAGGAAAAAGTCGGAGGCTCGGGCAAAGTTCGAGGCCGAGCAGGGCCAAAAGTGGCGAGAGCTATCCAAATGGCTAAAAGGCGAAGCGACAAGGTGCCGAGCCTTCGCGT
TAACACTCATAGGGTTGGCAAGGCAATGGTTTAGCAAGATCCCACCGAAGTCAATCGGTTCATTTAAAGAATTGGCCCGAGCGTTTGTTACGCAATTCCTCGGGGCCTGG
AGCCGACAAAAGCCTCAGATCAACTTGCTGACAGTAAAGCAGGGGCCCCGAGAAAGCTTGAAGGATTATATTAACAATTTAGATGAAAGATTGCTCAACTCGATCGGTGA
GAGCCAGCCACGGACATACGTGGAATTCATGACCCGAGCACAAAGGTACATAAGCGCCGAGAAACTGCTGAGGTCCAAACAAGAAGAGAGAAAGGCCGAAGTCGGCAAGA
GCAGTCCTCGGCCAATGGCTGAGGCCGACCAGGGACTAGGGAGACTCGGGGTCGTGCGGAACCAAAAGCCAGGTTTTACAGTTGAGGGACGAGATAGAAGCCCTAATTGG
AGAGGGTTACCTCAAGGAGTTCGTGGGGAACGACAGAAGCAAGAGGCCACTGCCAGCAGATCAAGGAATATCGACAGGAAGGTTCAAGCAACGTCGGCCTCGGGAGATGG
CCGAGGCCGAGCCTTTGAGGGTCAAGCAATCCCCTTCCAATGGAACATTGATTTCCTTCTCTTATGTCCTTTAGTTTCCAGTTTCTTGAAACCTTGTTCGACTTTCTTTC
ATTTCAATAGCATATTACTCAGGAATGTAATTGATTCATTTGAAATAAATTACTCTGAATTTGTTTCAAAAGTTTTGACAGCGCTTGCAGAGAGACCCGGGAATCTCCCT
AGGCCTCCCAAGCTTTTCCCCAGAAGAGTGTACACACCCCTGGGAAGACCACAACACTTGACCTATTATCGTAGCTGA
Protein sequenceShow/hide protein sequence
MSTPRPMVEAEQKPNCKKAWRSSSLKTASGTYTGNNDRKKSEARAKFEAEQGQKWRELSKWLKGEATRCRAFALTLIGLARQWFSKIPPKSIGSFKELARAFVTQFLGAW
SRQKPQINLLTVKQGPRESLKDYINNLDERLLNSIGESQPRTYVEFMTRAQRYISAEKLLRSKQEERKAEVGKSSPRPMAEADQGLGRLGVVRNQKPGFTVEGRDRSPNW
RGLPQGVRGERQKQEATASRSRNIDRKVQATSASGDGRGRAFEGQAIPFQWNIDFLLLCPLVSSFLKPCSTFFHFNSILLRNVIDSFEINYSEFVSKVLTALAERPGNLP
RPPKLFPRRVYTPLGRPQHLTYYRS