; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022928 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022928
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr7:41118300..41119253
RNA-Seq ExpressionLag0022928
SyntenyLag0022928
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142767.1 uncharacterized protein LOC111012805 [Momordica charantia]8.1e-2336.96Show/hide
Query:  ESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSKQE--------ERESEAGK------------
        ESL DY+ RFN E LQVEG  D  +L A I G++DE L  S G+  P T+ E ++RAQRY+SA E   SK+E        +RE    K            
Subjt:  ESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSKQE--------ERESEAGK------------

Query:  ----SPRPMTEADQRPGSLGSCGTKSRDTNLMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFV
             P+   +       L     + ++  L+K PE++ +   +R++ +YC+FH D+GH T++C  L++E+E LI  GYLKE+V
Subjt:  ----SPRPMTEADQRPGSLGSCGTKSRDTNLMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFV

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]5.1e-2538.04Show/hide
Query:  ESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSKQE--------ERESEAGK------------
        ESL+DY+ RFN E LQVEG  D  +L A +SG++DE L  S G+  P T+ E ++RAQRY+SA E   SK+E        +RE    K            
Subjt:  ESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSKQE--------ERESEAGK------------

Query:  ----SPRPMTEADQRPGSLGSCGTKSRDTNLMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFV
             PR   +       +     + +D  L+K PE++++   +R++ +YC+FH D+GH T++C  L++E+E LIR GYLKE+V
Subjt:  ----SPRPMTEADQRPGSLGSCGTKSRDTNLMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFV

XP_024024428.1 uncharacterized protein LOC112092448 [Morus notabilis]7.4e-2440.35Show/hide
Query:  GAHESLKDYINRFNNEVLQVEGYDDGFALTAVISGL-QDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSKQEERESEAGKSPRPMTEADQRPGSLG
        G +ESLKD+I RF  +V   EG  D  AL   +S + +D      +G   P  Y+EF+TRAQ +I+AEE  ++ + +  + A +  R  +  +      G
Subjt:  GAHESLKDYINRFNNEVLQVEGYDDGFALTAVISGL-QDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSKQEERESEAGKSPRPMTEADQRPGSLG

Query:  SCGTKSRDTNL--MKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFVGNGRS
            ++  ++L  ++ P  L+SDP RRNQNKYC FHG+ GHTT EC  LRDE+E LIREG L E+  + R+
Subjt:  SCGTKSRDTNL--MKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFVGNGRS

XP_024041095.1 uncharacterized protein LOC112098853 [Citrus clementina]1.3e-2537.5Show/hide
Query:  ESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSK-QEERESEAGKS------PRPMTEADQ---
        E+L+DYI R+NNE+ QV+GYDDG AL+ ++ GL+  +L  S+ +  P +Y E + RA++Y +AEE  +++ QE+ ES  GK        R +   DQ   
Subjt:  ESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSK-QEERESEAGKS------PRPMTEADQ---

Query:  ----RPGS----------LGSCGT--------------KSRDTNLMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFVGN
            RPG           L S  T              + R+  L + P  ++++P RRN NKYC FH D+GH T EC +L+++IE+L+R+G L+E+V N
Subjt:  ----RPGS----------LGSCGT--------------KSRDTNLMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFVGN

XP_024047974.1 uncharacterized protein LOC112101548 [Citrus clementina]2.8e-2336.18Show/hide
Query:  ESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSKQEERESEAGKSPRPMTE-ADQRPG------
        ESL++YI R+N E  QV+GYDDG AL+ ++ GLQ  RL  S+ ++ P TY E ++RA++Y +AEE  +SK+   + E+ K+ +   +  D RPG      
Subjt:  ESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSKQEERESEAGKSPRPMTE-ADQRPG------

Query:  ---------SLGSCGTKSR-----------------------DTNLMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFV
                  L S   +SR                       ++ L K P  L+SD  RRNQ KYC F+ D GH T EC  L+++IE+L+R+  L+ +V
Subjt:  ---------SLGSCGTKSR-----------------------DTNLMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFV

TrEMBL top hitse value%identityAlignment
A0A2N9FCQ3 Uncharacterized protein3.3e-2233.96Show/hide
Query:  EPIEASDQFANSKAGAHESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSKQEE--RESEAGKS
        +P+E +    N K    E+L+ Y+ RFN E L V+G DD   LTA ISGLQ    L S+ +  P T  E M  AQR+++ EE L ++ +   ++ +    
Subjt:  EPIEASDQFANSKAGAHESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSKQEE--RESEAGKS

Query:  PRPMTEADQRPGSLGSCGTKSRDTN-----------------------------LMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIR
         RP    D RP +  +   K  D N                              +K P KL +DPD+R ++KYC FH D+GH T +C  L+ +IE LI+
Subjt:  PRPMTEADQRPGSLGSCGTKSRDTN-----------------------------LMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIR

Query:  EGYLKEFVGNGR
        +G L+ FV  G+
Subjt:  EGYLKEFVGNGR

A0A2N9FY65 Uncharacterized protein2.0e-2238.86Show/hide
Query:  VCYAIPW---GPEPIEASDQFANSKAGAHESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSKQ
        +C A P    GP  I       +SK G  E+L+ Y+ RFN E L V+G DD   LTA ISGLQ    L S+ +  P T  E M  AQR+++ EE L++  
Subjt:  VCYAIPW---GPEPIEASDQFANSKAGAHESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSKQ

Query:  EERESEAGKSPRPMTEADQRPGSLGSCGTKSRDTNLMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFVGNGR
          R+  +GK  R    AD           + R+   +K P KL ++PD+R ++KYC FH D+GH T +C  L+ +IE LI++G L+ FV  G+
Subjt:  EERESEAGKSPRPMTEADQRPGSLGSCGTKSRDTNLMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFVGNGR

A0A6J1CNT2 uncharacterized protein LOC1110128053.9e-2336.96Show/hide
Query:  ESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSKQE--------ERESEAGK------------
        ESL DY+ RFN E LQVEG  D  +L A I G++DE L  S G+  P T+ E ++RAQRY+SA E   SK+E        +RE    K            
Subjt:  ESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSKQE--------ERESEAGK------------

Query:  ----SPRPMTEADQRPGSLGSCGTKSRDTNLMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFV
             P+   +       L     + ++  L+K PE++ +   +R++ +YC+FH D+GH T++C  L++E+E LI  GYLKE+V
Subjt:  ----SPRPMTEADQRPGSLGSCGTKSRDTNLMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFV

A0A6J1DWY0 uncharacterized protein LOC1110252932.5e-2538.04Show/hide
Query:  ESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSKQE--------ERESEAGK------------
        ESL+DY+ RFN E LQVEG  D  +L A +SG++DE L  S G+  P T+ E ++RAQRY+SA E   SK+E        +RE    K            
Subjt:  ESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSKQE--------ERESEAGK------------

Query:  ----SPRPMTEADQRPGSLGSCGTKSRDTNLMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFV
             PR   +       +     + +D  L+K PE++++   +R++ +YC+FH D+GH T++C  L++E+E LIR GYLKE+V
Subjt:  ----SPRPMTEADQRPGSLGSCGTKSRDTNLMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFV

A0A6J1E1B3 uncharacterized protein LOC1110255212.6e-2234.31Show/hide
Query:  ESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSK--QEERESEAGKSPRPMTEADQRPGSLGSC
        E+LK+Y+ RF  E L+V    D  A+   ++GL DE L   +GE  P T+ E + +A++ I  +ELLR+K  + E++ + G++ +   +AD +    GS 
Subjt:  ESLKDYINRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSK--QEERESEAGKSPRPMTEADQRPGSLGSC

Query:  GTKSRD-----------------------------TN--------LMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFVG
         + S+                              TN        L+K P+KLR DP++RN++KYC FH D+GH T  C +L+ +IE LI+ GY K+FVG
Subjt:  GTKSRD-----------------------------TN--------LMKCPEKLRSDPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFVG

Query:  NGRS
          RS
Subjt:  NGRS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGTCAGATCCGACATGAGCACTCCTCGGCCGATGGCCGAGGCCAAGCAGAAGCCCAATCACAAAAAGGTCCGCAAAAGTTCGCCGCTAAAGACAGCGTCAGGTAT
GTACACAGGGAATAATGACAGGAAAAAGTCAGAGGCTCGGGCAAAGTCCGAGGCCGAGCAGGGCCAAAAGGGGCGAGAGCTATCCAAATGGCTAAAAGACGAAGCGACAA
GAATTGGCCCGAGCGTTTGTTACGCAATTCCTTGGGGCCCGGAGCCGATAGAAGCCTCAGACCAATTTGCTAACAGTAAAGCAGGGGCCCATGAAAGCTTGAAGGATTAT
ATTAACAGATTTAATAACGAAGTTTTGCAGGTAGAAGGCTATGACGATGGATTTGCCTTGACTGCTGTGATTTCAGGTTTGCAAGATGAAAGACTACTCAACTCGATCGG
TGAGAGCCAGCCACGGACATACGAGGAATTCATGACCCGAGCACAAAGGTACATAAGCGCCGAGGAACTACTGAGGTCCAAACAAGAAGAGAGAGAGTCCGAAGCCGGAA
AGAGTCCTCGACCAATGACCGAGGCCGACCAGAGACCAGGAAGCCTCGGGTCGTGCGGAACCAAAAGCCGGGATACAAACCTAATGAAGTGCCCAGAAAAATTGAGGTCG
GACCCAGATAGGAGGAATCAGAACAAGTACTGCATGTTCCACGGTGACTACGGTCACACTACCCGGGAGTGCATACAGTTGAGGGACGAGATAGAAGCCCTAATTCGAGA
GGGTTACCTCAAGGAGTTTGTGGGGAATGGCAGAAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGTCAGATCCGACATGAGCACTCCTCGGCCGATGGCCGAGGCCAAGCAGAAGCCCAATCACAAAAAGGTCCGCAAAAGTTCGCCGCTAAAGACAGCGTCAGGTAT
GTACACAGGGAATAATGACAGGAAAAAGTCAGAGGCTCGGGCAAAGTCCGAGGCCGAGCAGGGCCAAAAGGGGCGAGAGCTATCCAAATGGCTAAAAGACGAAGCGACAA
GAATTGGCCCGAGCGTTTGTTACGCAATTCCTTGGGGCCCGGAGCCGATAGAAGCCTCAGACCAATTTGCTAACAGTAAAGCAGGGGCCCATGAAAGCTTGAAGGATTAT
ATTAACAGATTTAATAACGAAGTTTTGCAGGTAGAAGGCTATGACGATGGATTTGCCTTGACTGCTGTGATTTCAGGTTTGCAAGATGAAAGACTACTCAACTCGATCGG
TGAGAGCCAGCCACGGACATACGAGGAATTCATGACCCGAGCACAAAGGTACATAAGCGCCGAGGAACTACTGAGGTCCAAACAAGAAGAGAGAGAGTCCGAAGCCGGAA
AGAGTCCTCGACCAATGACCGAGGCCGACCAGAGACCAGGAAGCCTCGGGTCGTGCGGAACCAAAAGCCGGGATACAAACCTAATGAAGTGCCCAGAAAAATTGAGGTCG
GACCCAGATAGGAGGAATCAGAACAAGTACTGCATGTTCCACGGTGACTACGGTCACACTACCCGGGAGTGCATACAGTTGAGGGACGAGATAGAAGCCCTAATTCGAGA
GGGTTACCTCAAGGAGTTTGTGGGGAATGGCAGAAGCTAG
Protein sequenceShow/hide protein sequence
MEVRSDMSTPRPMAEAKQKPNHKKVRKSSPLKTASGMYTGNNDRKKSEARAKSEAEQGQKGRELSKWLKDEATRIGPSVCYAIPWGPEPIEASDQFANSKAGAHESLKDY
INRFNNEVLQVEGYDDGFALTAVISGLQDERLLNSIGESQPRTYEEFMTRAQRYISAEELLRSKQEERESEAGKSPRPMTEADQRPGSLGSCGTKSRDTNLMKCPEKLRS
DPDRRNQNKYCMFHGDYGHTTRECIQLRDEIEALIREGYLKEFVGNGRS