; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg004646 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg004646
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationscaffold9:8431361..8432573
RNA-Seq ExpressionSpg004646
SyntenySpg004646
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_030490806.1 uncharacterized protein LOC115707099 [Cannabis sativa]3.9e-2936.12Show/hide
Query:  QLLDIDPGIERTFRRRRKEQRRKKTKQQELSAQEPLEEASCIQEFFMEQPGVDPQVDPQNRGIEQNGGRIPPTPLVPPMPQNENPIYIAKYRNRAIRDYA
        +L  IDP IE TFR+RRKEQ+ KK                               VD    G E  G                NPI +A  R RAIR+YA
Subjt:  QLLDIDPGIERTFRRRRKEQRRKKTKQQELSAQEPLEEASCIQEFFMEQPGVDPQVDPQNRGIEQNGGRIPPTPLVPPMPQNENPIYIAKYRNRAIRDYA

Query:  LPVFQTLNLGILDPSVEAPQFELK---------------------------SGDAKAWLDSFPPNSITSWNDLVEKF----FPSNKNAKFKAEIISFRQS
         P+F  LN GI+ P ++AP FELK                              A+AWL++ PP+S+T+WN+L EKF    FP  +NAKF++EI+SF+Q 
Subjt:  LPVFQTLNLGILDPSVEAPQFELK---------------------------SGDAKAWLDSFPPNSITSWNDLVEKF----FPSNKNAKFKAEIISFRQS

Query:  YNEPLDAAWERFQRLVRKCPHHGLPACIIIEHFYAPMTEVEAPMIGEDAPTMGAEPSPAERET
         +E    AWERF+ L+RKC HHG+P CI +E F   +      ++  DA   GA  S +  ET
Subjt:  YNEPLDAAWERFQRLVRKCPHHGLPACIIIEHFYAPMTEVEAPMIGEDAPTMGAEPSPAERET

XP_030502440.1 uncharacterized protein LOC115717596 [Cannabis sativa]4.6e-3040.88Show/hide
Query:  NPIYIAKYRNRAIRDYALPVFQTLNLGILDPSVEAPQFELK---------------------------SGDAKAWLDSFPPNSITSWNDLVEKF----FP
        NPI +A  R RAIR+YA P+F  LN GI+ P ++AP FELK                              A+AWL++ PPNS+T+WNDL EKF    FP
Subjt:  NPIYIAKYRNRAIRDYALPVFQTLNLGILDPSVEAPQFELK---------------------------SGDAKAWLDSFPPNSITSWNDLVEKF----FP

Query:  SNKNAKFKAEIISFRQSYNEPLDAAWERFQRLVRKCPHHGLPACIIIEHFYAPMTEVEAPMIGEDAPTMGAEPSPAERETS
          +NAKF++EI+SF+Q  +E    AWERF+ L+RKCPHHG+P CI +E FY  +      ++   A       +P  R+ +
Subjt:  SNKNAKFKAEIISFRQSYNEPLDAAWERFQRLVRKCPHHGLPACIIIEHFYAPMTEVEAPMIGEDAPTMGAEPSPAERETS

XP_030503898.1 uncharacterized protein LOC115719117 [Cannabis sativa]8.7e-2938.14Show/hide
Query:  QLLDIDPGIERTFRRRRKEQRRKKTKQQELSAQEPLEEASCIQEFFMEQPGVDPQVDPQNRGIEQNGGRIPPTPLVPPMPQNENPIYIAKYRNRAIRDYA
        +L  IDP IERTFR+RRKEQ+ KK                                     G E  G                NPI +A  R RAIR+YA
Subjt:  QLLDIDPGIERTFRRRRKEQRRKKTKQQELSAQEPLEEASCIQEFFMEQPGVDPQVDPQNRGIEQNGGRIPPTPLVPPMPQNENPIYIAKYRNRAIRDYA

Query:  LPVFQTLNLGILDPSVEAPQFELKSGD-AKAWLDSFPPNSITSWNDLVEKF----FPSNKNAKFKAEIISFRQSYNEPLDAAWERFQRLVRKCPHHGLPA
        LP+F  LN  + + ++    F     D A+AWL++ PP+S+T+WNDL EKF    FP  +NAKF++EI+SF+Q  +E    AWERF+ L+RKCPHHG+P 
Subjt:  LPVFQTLNLGILDPSVEAPQFELKSGD-AKAWLDSFPPNSITSWNDLVEKF----FPSNKNAKFKAEIISFRQSYNEPLDAAWERFQRLVRKCPHHGLPA

Query:  CIIIEHFYAPMTEVEAPMIGEDAPTMGAEPSPAERE
        CI +E FY  +      ++  DA   GA  S +  E
Subjt:  CIIIEHFYAPMTEVEAPMIGEDAPTMGAEPSPAERE

XP_030508936.1 uncharacterized protein LOC115723589 [Cannabis sativa]1.1e-2834.87Show/hide
Query:  QLLDIDPGIERTFRRRRKEQRRKKTKQQELSAQEPLEEASCIQEFFMEQPGVDPQVDPQNRGIEQNGGRIPPTPLVPPMPQNENPIYIAKYRNRAIRDYA
        +L  IDP IERTFR+RRKEQ+ KK                C      E  GV  +                            NPI +A  R RAIR+YA
Subjt:  QLLDIDPGIERTFRRRRKEQRRKKTKQQELSAQEPLEEASCIQEFFMEQPGVDPQVDPQNRGIEQNGGRIPPTPLVPPMPQNENPIYIAKYRNRAIRDYA

Query:  LPVFQTLNLGILDPSVEAPQFELK------------------------------------------------------SGDAKAWLDSFPPNSITSWNDL
         P+F  LN GI+ P ++AP FELK                                                         A+AWL++ PP+S+T+WNDL
Subjt:  LPVFQTLNLGILDPSVEAPQFELK------------------------------------------------------SGDAKAWLDSFPPNSITSWNDL

Query:  VEKF----FPSNKNAKFKAEIISFRQSYNEPLDAAWERFQRLVRKCPHHGLPACIIIEHFY
         EKF    FP  +NAKF++EI+SF+Q  +E    AWERF+ L+RKCPHHG+P CI +E FY
Subjt:  VEKF----FPSNKNAKFKAEIISFRQSYNEPLDAAWERFQRLVRKCPHHGLPACIIIEHFY

XP_030509259.1 uncharacterized protein LOC115723937 [Cannabis sativa]2.4e-3136.64Show/hide
Query:  QLLDIDPGIERTFRRRRKEQRRKKTKQQELSAQEPLEEASCIQEFFMEQPGVDPQVDPQNRGIEQNGGRIPPTPLVPPMPQNENPIYIAKYRNRAIRDYA
        +L  IDP IERTFR+RRKEQ+ KK                                     G E  G                NPI +A  R RAIR+YA
Subjt:  QLLDIDPGIERTFRRRRKEQRRKKTKQQELSAQEPLEEASCIQEFFMEQPGVDPQVDPQNRGIEQNGGRIPPTPLVPPMPQNENPIYIAKYRNRAIRDYA

Query:  LPVFQTLNLGILDPSVEAPQFELK---------------------------SGDAKAWLDSFPPNSITSWNDLVEKF----FPSNKNAKFKAEIISFRQS
         P+F  LN GI+ P ++AP FELK                              A+AWL++ PP+S+T+WNDL EKF    FP  +NAKF++EI+SF+Q 
Subjt:  LPVFQTLNLGILDPSVEAPQFELK---------------------------SGDAKAWLDSFPPNSITSWNDLVEKF----FPSNKNAKFKAEIISFRQS

Query:  YNEPLDAAWERFQRLVRKCPHHGLPACIIIEHFYAPMTEVEAPMIGEDAPTMGAEPSPAERE
         +E    AWERF+ L+RKCPHHG+P CI +E FY  +      ++  DA   GA  S +  E
Subjt:  YNEPLDAAWERFQRLVRKCPHHGLPACIIIEHFYAPMTEVEAPMIGEDAPTMGAEPSPAERE

TrEMBL top hitse value%identityAlignment
A0A6J1CXK1 uncharacterized protein LOC1110151934.2e-2144.62Show/hide
Query:  MPQNENPIYIAKYRNRAIRDYALPVFQTLNLGILDPSVEAPQFELK-SGDAKAWLDSFPPNSITSWNDLVEKF----FPSNKNAKFKAEIISFRQSYNEP
        +P++E+P    K   +   D+ LP       GI D ++    F    SG A AWL++FPP+S   W  +V+KF    FP  KNA  + EIISFRQ  NE 
Subjt:  MPQNENPIYIAKYRNRAIRDYALPVFQTLNLGILDPSVEAPQFELK-SGDAKAWLDSFPPNSITSWNDLVEKF----FPSNKNAKFKAEIISFRQSYNEP

Query:  LDAAWERFQRLVRKCPHHGLPACIIIEHFY
        +D AWERF+ L+R CP+ G+PAC+ IEHFY
Subjt:  LDAAWERFQRLVRKCPHHGLPACIIIEHFY

A0A6J1DSZ5 uncharacterized protein LOC1110241073.8e-2234.27Show/hide
Query:  NPIYIAKYRNRAIRDYALPVFQTLNLGILDPSVEAPQFELK------------------------------------------------------SGDAK
        NPI++A  ++RA+RDYA  + + LN  +++P     QFE K                                                      SG A 
Subjt:  NPIYIAKYRNRAIRDYALPVFQTLNLGILDPSVEAPQFELK------------------------------------------------------SGDAK

Query:  AWLDSFPPNSITSWNDLVEKF----FPSNKNAKFKAEIISFRQSYNEPLDAAWERFQRLVRKCPHHGLPACIIIEHFY
        AWL++FP  +IT+W+D+V+KF    FP  +NA  + EIISFRQ  NE ++ AWE F+ L+R CP+ G+PAC+ IEHF+
Subjt:  AWLDSFPPNSITSWNDLVEKF----FPSNKNAKFKAEIISFRQSYNEPLDAAWERFQRLVRKCPHHGLPACIIIEHFY

A0A6J1DW02 uncharacterized protein LOC1110248978.5e-2230.94Show/hide
Query:  LLDIDPGIERTFRRRRKEQRRKKTKQQELSAQ---EPLEEASCIQEFFMEQPGVDPQVDPQNRGIEQNGGRIPPTPLVPPMPQNENPIYIAKYRNRAIRD
        LL +DP IERT R+ RKEQR +K  + +   +    P  E         + P  DP   P   G  ++  R              N I +A  R+ A+R+
Subjt:  LLDIDPGIERTFRRRRKEQRRKKTKQQELSAQ---EPLEEASCIQEFFMEQPGVDPQVDPQNRGIEQNGGRIPPTPLVPPMPQNENPIYIAKYRNRAIRD

Query:  YALPVFQTLNLGILDPSVEAPQFELK------------------------------------------------------SGDAKAWLDSFPPNSITSWN
        YA   FQ  + GI++P      FELK                                                         A+  L++FP  SIT+W 
Subjt:  YALPVFQTLNLGILDPSVEAPQFELK------------------------------------------------------SGDAKAWLDSFPPNSITSWN

Query:  DLVE----KFFPSNKNAKFKAEIISFRQSYNEPLDAAWERFQRLVRKCPHHGLPACIIIEHFYAPMTEVEAPMIGEDA
         LVE    KFFP  ++A  + EIISFRQ   EP+  AWERF+ L+RKC +HGLPAC  IEHF+  +      M+   A
Subjt:  DLVE----KFFPSNKNAKFKAEIISFRQSYNEPLDAAWERFQRLVRKCPHHGLPACIIIEHFYAPMTEVEAPMIGEDA

A0A6J1DY39 uncharacterized protein LOC1110256534.2e-2133.71Show/hide
Query:  NPIYIAKYRNRAIRDYALPVFQTLNLGILDPSVEAPQFELK------------------------------------------------------SGDAK
        NPI++A  R+RA+RDYA  + + LN  +++      +FE K                                                      SG A 
Subjt:  NPIYIAKYRNRAIRDYALPVFQTLNLGILDPSVEAPQFELK------------------------------------------------------SGDAK

Query:  AWLDSFPPNSITSWNDLVEKF----FPSNKNAKFKAEIISFRQSYNEPLDAAWERFQRLVRKCPHHGLPACIIIEHFY
        AWL++FP ++IT+W+D+V+KF    FP  +NA  + EIISFRQ  NE ++ AWERF+ L+  CP+ G+PAC+ IEHF+
Subjt:  AWLDSFPPNSITSWNDLVEKF----FPSNKNAKFKAEIISFRQSYNEPLDAAWERFQRLVRKCPHHGLPACIIIEHFY

U5CUI2 Retrotrans_gag domain-containing protein8.8e-2737.86Show/hide
Query:  NPIYIAKYRNRAIRDYALPVFQTLNLGILDPSVEAPQFELK-------------SG-----------------------------------------DAK
        NPI +A  R RAIR+YA P+F  LN GI+ P ++APQFELK             SG                                          A+
Subjt:  NPIYIAKYRNRAIRDYALPVFQTLNLGILDPSVEAPQFELK-------------SG-----------------------------------------DAK

Query:  AWLDSFPPNSITSWNDLVEKF----FPSNKNAKFKAEIISFRQSYNEPLDAAWERFQRLVRKCPHHGLPACIIIEHFYAPMTEVEAPMIGEDAPTMGAEP
        +WL++ PP+S+T+WNDL EKF    FP  +NAKF++EI+SF+Q  +E    AWERF+ L+RKCPHHG+P CI +E FY  +      ++  DA   GA  
Subjt:  AWLDSFPPNSITSWNDLVEKF----FPSNKNAKFKAEIISFRQSYNEPLDAAWERFQRLVRKCPHHGLPACIIIEHFYAPMTEVEAPMIGEDAPTMGAEP

Query:  SPAERE
        S +  E
Subjt:  SPAERE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAAGGGGTGTCAACTCCTTGATATAGATCCTGGGATAGAGAGAACCTTTCGTCGACGTCGTAAAGAGCAAAGACGAAAGAAGACGAAACAACAAGAGTTGAGCGC
ACAGGAACCTCTTGAAGAAGCTTCTTGTATACAAGAATTTTTTATGGAACAACCTGGAGTCGATCCTCAAGTTGACCCACAAAATCGTGGAATTGAGCAGAATGGTGGGA
GAATACCTCCAACACCTCTAGTTCCACCGATGCCACAAAATGAAAATCCAATCTATATTGCAAAATACCGCAATAGGGCAATAAGGGATTATGCCTTACCTGTTTTCCAA
ACTCTGAATCTAGGAATTCTTGATCCTTCGGTAGAAGCTCCACAATTTGAGCTGAAATCAGGAGATGCTAAAGCTTGGTTGGATTCATTCCCTCCAAACTCCATCACGTC
TTGGAATGACTTGGTAGAGAAGTTTTTCCCGTCCAATAAAAATGCTAAGTTTAAAGCTGAAATCATTTCCTTCAGACAATCATATAATGAACCTTTAGATGCAGCTTGGG
AGAGATTCCAAAGGTTGGTTAGGAAGTGTCCTCATCATGGATTGCCTGCTTGTATCATTATTGAGCATTTTTATGCTCCCATGACCGAGGTCGAGGCTCCCATGATTGGG
GAGGATGCCCCGACAATGGGAGCTGAACCTTCCCCAGCAGAGAGAGAGACCTCGTGGATGGATCCACTCATTGACTGTTTGGGAAAAGGTGAGCTGCCAGCAGAGAGAAT
GGAAGCTCAGAAGCTGCAGTGGCGAGCATCACATTTTGTGCTAAAGGAAGGAAAGCTATACAAGAGAAGTTACTCGATGACGTTGCTGAGGTGCCTTCTGAAGGATTGGA
ATTCCCTAATTCCGTTAGTGGAAGCGAATTGGGACCGTTCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACAAGGGGTGTCAACTCCTTGATATAGATCCTGGGATAGAGAGAACCTTTCGTCGACGTCGTAAAGAGCAAAGACGAAAGAAGACGAAACAACAAGAGTTGAGCGC
ACAGGAACCTCTTGAAGAAGCTTCTTGTATACAAGAATTTTTTATGGAACAACCTGGAGTCGATCCTCAAGTTGACCCACAAAATCGTGGAATTGAGCAGAATGGTGGGA
GAATACCTCCAACACCTCTAGTTCCACCGATGCCACAAAATGAAAATCCAATCTATATTGCAAAATACCGCAATAGGGCAATAAGGGATTATGCCTTACCTGTTTTCCAA
ACTCTGAATCTAGGAATTCTTGATCCTTCGGTAGAAGCTCCACAATTTGAGCTGAAATCAGGAGATGCTAAAGCTTGGTTGGATTCATTCCCTCCAAACTCCATCACGTC
TTGGAATGACTTGGTAGAGAAGTTTTTCCCGTCCAATAAAAATGCTAAGTTTAAAGCTGAAATCATTTCCTTCAGACAATCATATAATGAACCTTTAGATGCAGCTTGGG
AGAGATTCCAAAGGTTGGTTAGGAAGTGTCCTCATCATGGATTGCCTGCTTGTATCATTATTGAGCATTTTTATGCTCCCATGACCGAGGTCGAGGCTCCCATGATTGGG
GAGGATGCCCCGACAATGGGAGCTGAACCTTCCCCAGCAGAGAGAGAGACCTCGTGGATGGATCCACTCATTGACTGTTTGGGAAAAGGTGAGCTGCCAGCAGAGAGAAT
GGAAGCTCAGAAGCTGCAGTGGCGAGCATCACATTTTGTGCTAAAGGAAGGAAAGCTATACAAGAGAAGTTACTCGATGACGTTGCTGAGGTGCCTTCTGAAGGATTGGA
ATTCCCTAATTCCGTTAGTGGAAGCGAATTGGGACCGTTCGTAG
Protein sequenceShow/hide protein sequence
MNKGCQLLDIDPGIERTFRRRRKEQRRKKTKQQELSAQEPLEEASCIQEFFMEQPGVDPQVDPQNRGIEQNGGRIPPTPLVPPMPQNENPIYIAKYRNRAIRDYALPVFQ
TLNLGILDPSVEAPQFELKSGDAKAWLDSFPPNSITSWNDLVEKFFPSNKNAKFKAEIISFRQSYNEPLDAAWERFQRLVRKCPHHGLPACIIIEHFYAPMTEVEAPMIG
EDAPTMGAEPSPAERETSWMDPLIDCLGKGELPAERMEAQKLQWRASHFVLKEGKLYKRSYSMTLLRCLLKDWNSLIPLVEANWDRS