CuGenDBv2

Lag0000381 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0000381
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr4:5476776..5477774
RNA-Seq Expression: Lag0000381
Synteny: Lag0000381
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits | e value | % identity | Alignment
XP_022147761.1 uncharacterized protein LOC111016619 [Momordica charantia] | 7.8e-13 | 37.41
Query:  DIIEAKMDFHGASEATKCRAFALTLTGMARQWFSKLPWRSIESFKEFVWAFVTQFLGVRDRHKPQINLLTIKQRRRESL---------------------
        D+     D +G +EA +CR F+ TLTG  R WF +L  +SI SFKE   AFVTQF G  +R +P   LLTIKQ+  ESL                     
Subjt:  DIIEAKMDFHGASEATKCRAFALTLTGMARQWFSKLPWRSIESFKEFVWAFVTQFLGVRDRHKPQINLLTIKQRRRESL---------------------

Query:  ---------QDEKLLNSIGESQLRTYVEFMTRAQRYISAKELLKLKQ
                 +DE+L+ S G+    T+ E  +RAQ Y+S  EL+  K+
Subjt:  ---------QDEKLLNSIGESQLRTYVEFMTRAQRYISAKELLKLKQ

XP_022150035.1 uncharacterized protein LOC111018307 [Momordica charantia] | 5.4e-14 | 41.13
Query:  MDFHGASEATKCRAFALTLTGMARQWFSKLPWRSIESFKEFVWAFVTQFLGVRDRHKPQINLLTIKQRRRESL---------------------------
        MD +G SEA KCR F+ TL+G AR WF +L   SI SFK    AFVTQF+G R R +P   LLTIKQR  ESL                           
Subjt:  MDFHGASEATKCRAFALTLTGMARQWFSKLPWRSIESFKEFVWAFVTQFLGVRDRHKPQINLLTIKQRRRESL---------------------------

Query:  ---QDEKLLNSIGESQLRTYVEFMTRAQRYISAKELLKLKQ
           +DE L  S G+    T+ E ++RAQ+Y+SA E    K+
Subjt:  ---QDEKLLNSIGESQLRTYVEFMTRAQRYISAKELLKLKQ

XP_022158344.1 uncharacterized protein LOC111024851 [Momordica charantia] | 4.1e-14 | 40
Query:  DFHGASEATKCRAFALTLTGMARQWFSKLPWRSIESFKEFVWAFVTQFLGVRDRHKPQINLLTIKQRRRESL----------------------------
        D +   EA +CR F+ TLTG AR WF +L   SI SFKE   AFVTQF+G R + KP   LLTIKQ+  ESL                            
Subjt:  DFHGASEATKCRAFALTLTGMARQWFSKLPWRSIESFKEFVWAFVTQFLGVRDRHKPQINLLTIKQRRRESL----------------------------

Query:  --QDEKLLNSIGESQLRTYVEFMTRAQRYISAKELLKLKQ
          +DE+L+ S G+    T++E ++RAQ+Y+SA EL+ L +
Subjt:  --QDEKLLNSIGESQLRTYVEFMTRAQRYISAKELLKLKQ

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia] | 2.3e-12 | 30.43
Query:  RDPKKGKEAMMEEVRDS-ESVTSRMS-NPNLENDLTLKDPDPSDKRTRRSLSSKPTPDTYVRVSFREKNRGRSGIKGRTRWRGSGQEFLKWRKEEDNHYD
        RDPKKGK     +  +S  SV S++    N      + DP  + K+ +      P P           N G+S    R     S  +     + E +   
Subjt:  RDPKKGKEAMMEEVRDS-ESVTSRMS-NPNLENDLTLKDPDPSDKRTRRSLSSKPTPDTYVRVSFREKNRGRSGIKGRTRWRGSGQEFLKWRKEEDNHYD

Query:  YLRKTYNEDLEDLIGYIYLPFTYDIIEAK-----------------------------MDFHGASEATKCRAFALTLTGMARQWFSKLPWRSIESFKEFV
        +  K    DLE+L+     PFT +I+  K                             MD +G SEA +CR F+ TL G AR WF +L   SI SFK   
Subjt:  YLRKTYNEDLEDLIGYIYLPFTYDIIEAK-----------------------------MDFHGASEATKCRAFALTLTGMARQWFSKLPWRSIESFKEFV

Query:  WAFVTQFLGVRDRHKPQINLLTIKQRRRESL------------------------------QDEKLLNSIGESQLRTYVEFMTRAQRYISAKELLKLKQ
         AFVTQF+G R R +P   LLTIKQR  ESL                              +DE L  S G+    T+ E ++RAQRY+SA E    K+
Subjt:  WAFVTQFLGVRDRHKPQINLLTIKQRRRESL------------------------------QDEKLLNSIGESQLRTYVEFMTRAQRYISAKELLKLKQ

XP_022159109.1 uncharacterized protein LOC111025548 [Momordica charantia] | 1.7e-12 | 32.37
Query:  KEEDNHYDYLRKTYNEDLEDLIGYIYLPFTYDIIEAK-----------------------------MDFHGASEATKCRAFALTLTGMARQWFSKLPWRS
        + E +   + +K    DLE+L+G    PFT +I+  K                             MD +G S+A +CR F+ TL G AR WF +L   S
Subjt:  KEEDNHYDYLRKTYNEDLEDLIGYIYLPFTYDIIEAK-----------------------------MDFHGASEATKCRAFALTLTGMARQWFSKLPWRS

Query:  IESFKEFVWAFVTQFLGVRDRHKPQINLLTIKQRRRESL------------------------------QDEKLLNSIGESQLRTYVEFMTRAQRYISAK
        I SFK    AF+TQF+G R R +P   LLTIKQR  ESL                              +DE L  S  +    T+ E ++RAQRY+SA 
Subjt:  IESFKEFVWAFVTQFLGVRDRHKPQINLLTIKQRRRESL------------------------------QDEKLLNSIGESQLRTYVEFMTRAQRYISAK

Query:  ELLKLKQ
        E    K+
Subjt:  ELLKLKQ

TrEMBL top hits | e value | % identity | Alignment
A0A6J1D3B7 uncharacterized protein LOC111016619 | 3.8e-13 | 37.41
Query:  DIIEAKMDFHGASEATKCRAFALTLTGMARQWFSKLPWRSIESFKEFVWAFVTQFLGVRDRHKPQINLLTIKQRRRESL---------------------
        D+     D +G +EA +CR F+ TLTG  R WF +L  +SI SFKE   AFVTQF G  +R +P   LLTIKQ+  ESL                     
Subjt:  DIIEAKMDFHGASEATKCRAFALTLTGMARQWFSKLPWRSIESFKEFVWAFVTQFLGVRDRHKPQINLLTIKQRRRESL---------------------

Query:  ---------QDEKLLNSIGESQLRTYVEFMTRAQRYISAKELLKLKQ
                 +DE+L+ S G+    T+ E  +RAQ Y+S  EL+  K+
Subjt:  ---------QDEKLLNSIGESQLRTYVEFMTRAQRYISAKELLKLKQ

A0A6J1D7D2 uncharacterized protein LOC111018307 | 2.6e-14 | 41.13
Query:  MDFHGASEATKCRAFALTLTGMARQWFSKLPWRSIESFKEFVWAFVTQFLGVRDRHKPQINLLTIKQRRRESL---------------------------
        MD +G SEA KCR F+ TL+G AR WF +L   SI SFK    AFVTQF+G R R +P   LLTIKQR  ESL                           
Subjt:  MDFHGASEATKCRAFALTLTGMARQWFSKLPWRSIESFKEFVWAFVTQFLGVRDRHKPQINLLTIKQRRRESL---------------------------

Query:  ---QDEKLLNSIGESQLRTYVEFMTRAQRYISAKELLKLKQ
           +DE L  S G+    T+ E ++RAQ+Y+SA E    K+
Subjt:  ---QDEKLLNSIGESQLRTYVEFMTRAQRYISAKELLKLKQ

A0A6J1DWY0 uncharacterized protein LOC111025293 | 1.1e-12 | 30.43
Query:  RDPKKGKEAMMEEVRDS-ESVTSRMS-NPNLENDLTLKDPDPSDKRTRRSLSSKPTPDTYVRVSFREKNRGRSGIKGRTRWRGSGQEFLKWRKEEDNHYD
        RDPKKGK     +  +S  SV S++    N      + DP  + K+ +      P P           N G+S    R     S  +     + E +   
Subjt:  RDPKKGKEAMMEEVRDS-ESVTSRMS-NPNLENDLTLKDPDPSDKRTRRSLSSKPTPDTYVRVSFREKNRGRSGIKGRTRWRGSGQEFLKWRKEEDNHYD

Query:  YLRKTYNEDLEDLIGYIYLPFTYDIIEAK-----------------------------MDFHGASEATKCRAFALTLTGMARQWFSKLPWRSIESFKEFV
        +  K    DLE+L+     PFT +I+  K                             MD +G SEA +CR F+ TL G AR WF +L   SI SFK   
Subjt:  YLRKTYNEDLEDLIGYIYLPFTYDIIEAK-----------------------------MDFHGASEATKCRAFALTLTGMARQWFSKLPWRSIESFKEFV

Query:  WAFVTQFLGVRDRHKPQINLLTIKQRRRESL------------------------------QDEKLLNSIGESQLRTYVEFMTRAQRYISAKELLKLKQ
         AFVTQF+G R R +P   LLTIKQR  ESL                              +DE L  S G+    T+ E ++RAQRY+SA E    K+
Subjt:  WAFVTQFLGVRDRHKPQINLLTIKQRRRESL------------------------------QDEKLLNSIGESQLRTYVEFMTRAQRYISAKELLKLKQ

A0A6J1DZ49 uncharacterized protein LOC111024851 | 2.0e-14 | 40
Query:  DFHGASEATKCRAFALTLTGMARQWFSKLPWRSIESFKEFVWAFVTQFLGVRDRHKPQINLLTIKQRRRESL----------------------------
        D +   EA +CR F+ TLTG AR WF +L   SI SFKE   AFVTQF+G R + KP   LLTIKQ+  ESL                            
Subjt:  DFHGASEATKCRAFALTLTGMARQWFSKLPWRSIESFKEFVWAFVTQFLGVRDRHKPQINLLTIKQRRRESL----------------------------

Query:  --QDEKLLNSIGESQLRTYVEFMTRAQRYISAKELLKLKQ
          +DE+L+ S G+    T++E ++RAQ+Y+SA EL+ L +
Subjt:  --QDEKLLNSIGESQLRTYVEFMTRAQRYISAKELLKLKQ

A0A6J1E1E7 uncharacterized protein LOC111025548 | 8.4e-13 | 32.37
Query:  KEEDNHYDYLRKTYNEDLEDLIGYIYLPFTYDIIEAK-----------------------------MDFHGASEATKCRAFALTLTGMARQWFSKLPWRS
        + E +   + +K    DLE+L+G    PFT +I+  K                             MD +G S+A +CR F+ TL G AR WF +L   S
Subjt:  KEEDNHYDYLRKTYNEDLEDLIGYIYLPFTYDIIEAK-----------------------------MDFHGASEATKCRAFALTLTGMARQWFSKLPWRS

Query:  IESFKEFVWAFVTQFLGVRDRHKPQINLLTIKQRRRESL------------------------------QDEKLLNSIGESQLRTYVEFMTRAQRYISAK
        I SFK    AF+TQF+G R R +P   LLTIKQR  ESL                              +DE L  S  +    T+ E ++RAQRY+SA 
Subjt:  IESFKEFVWAFVTQFLGVRDRHKPQINLLTIKQRRRESL------------------------------QDEKLLNSIGESQLRTYVEFMTRAQRYISAK

Query:  ELLKLKQ
        E    K+
Subjt:  ELLKLKQ

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAACAAGATGAGGCATAATTTGGCTGAAATCTTGAGCATTTTGAAGAAACCCGATCAAACTCTGAACAACTTGGAAGATCAACCACCTCGAGATCCCAAGAAAGGGAA
GGAAGCGATGATGGAGGAAGTAAGAGATTCAGAAAGTGTCACGAGCAGAATGTCGAACCCAAATCTGGAAAACGACTTGACTCTCAAAGACCCTGACCCAAGCGACAAAA
GAACTCGCAGAAGCTTGTCTTCAAAACCGACACCAGACACGTATGTTAGGGTCAGTTTCAGGGAAAAAAATAGGGGTCGGAGTGGGATCAAAGGCCGAACAAGGTGGAGG
GGATCGGGGCAGGAGTTTTTGAAGTGGCGGAAAGAGGAAGATAACCACTACGACTACCTGAGGAAGACATATAATGAGGATCTAGAAGACTTGATAGGGTACATATATCT
GCCCTTCACATATGACATTATCGAAGCAAAGATGGATTTCCATGGGGCCTCTGAGGCAACGAAGTGTCGGGCCTTCGCCCTCACCCTAACAGGGATGGCCAGGCAATGGT
TTAGCAAGCTGCCTTGGAGGTCCATCGAATCGTTCAAAGAGTTCGTGTGGGCCTTTGTCACGCAATTTCTAGGAGTCAGGGATCGACACAAGCCACAAATAAACCTACTA
ACCATCAAACAGAGACGAAGAGAAAGTTTGCAGGACGAAAAACTACTGAACTCAATTGGTGAGAGTCAACTGCGCACCTATGTAGAATTCATGACTAGGGCGCAAAGATA
CATAAGTGCTAAAGAGTTGCTAAAGTTGAAGCAGACGAAAAGGGAAGCCTAG
mRNA sequence
ATGAACAAGATGAGGCATAATTTGGCTGAAATCTTGAGCATTTTGAAGAAACCCGATCAAACTCTGAACAACTTGGAAGATCAACCACCTCGAGATCCCAAGAAAGGGAA
GGAAGCGATGATGGAGGAAGTAAGAGATTCAGAAAGTGTCACGAGCAGAATGTCGAACCCAAATCTGGAAAACGACTTGACTCTCAAAGACCCTGACCCAAGCGACAAAA
GAACTCGCAGAAGCTTGTCTTCAAAACCGACACCAGACACGTATGTTAGGGTCAGTTTCAGGGAAAAAAATAGGGGTCGGAGTGGGATCAAAGGCCGAACAAGGTGGAGG
GGATCGGGGCAGGAGTTTTTGAAGTGGCGGAAAGAGGAAGATAACCACTACGACTACCTGAGGAAGACATATAATGAGGATCTAGAAGACTTGATAGGGTACATATATCT
GCCCTTCACATATGACATTATCGAAGCAAAGATGGATTTCCATGGGGCCTCTGAGGCAACGAAGTGTCGGGCCTTCGCCCTCACCCTAACAGGGATGGCCAGGCAATGGT
TTAGCAAGCTGCCTTGGAGGTCCATCGAATCGTTCAAAGAGTTCGTGTGGGCCTTTGTCACGCAATTTCTAGGAGTCAGGGATCGACACAAGCCACAAATAAACCTACTA
ACCATCAAACAGAGACGAAGAGAAAGTTTGCAGGACGAAAAACTACTGAACTCAATTGGTGAGAGTCAACTGCGCACCTATGTAGAATTCATGACTAGGGCGCAAAGATA
CATAAGTGCTAAAGAGTTGCTAAAGTTGAAGCAGACGAAAAGGGAAGCCTAG
Protein sequence
MNKMRHNLAEILSILKKPDQTLNNLEDQPPRDPKKGKEAMMEEVRDSESVTSRMSNPNLENDLTLKDPDPSDKRTRRSLSSKPTPDTYVRVSFREKNRGRSGIKGRTRWR
GSGQEFLKWRKEEDNHYDYLRKTYNEDLEDLIGYIYLPFTYDIIEAKMDFHGASEATKCRAFALTLTGMARQWFSKLPWRSIESFKEFVWAFVTQFLGVRDRHKPQINLL
TIKQRRRESLQDEKLLNSIGESQLRTYVEFMTRAQRYISAKELLKLKQTKREA
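
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal Python sketch, assuming the standard nuclear genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1).
# Codons are enumerated in T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted exactly as it is wrapped on the page. Codons containing
    characters other than T, C, A, G are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First ten codons of the Lag0000381 CDS, copied from the record above.
    print(translate("ATGAACAAGATGAGGCATAATTTGGCTGAA"))
```

Run on the first ten codons of the CDS, this prints `MNKMRHNLAE`, which matches the start of the protein sequence. Passing the full CDS text (line breaks included) should give a string that can be compared directly with the protein sequence.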