; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg030702 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg030702
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationscaffold11:25227340..25232055
RNA-Seq ExpressionSpg030702
SyntenySpg030702
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]6.4e-4745.93Show/hide
Query:  ARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLTKITKQE---------------
        A  E  E  +  +  KGE  TS  +  I+KDE  +N+PVLRYVPLSRRKKGESPF E PK LKV D+EI+KESFTTPLTKI KQE               
Subjt:  ARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLTKITKQE---------------

Query:  ----------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTNHITVEEVDDSKEKESVDQRTS
                                                KKLL++G+S+P SRKGLGYK  EP+ IT++GK KV D NHIT+EE D++  KE  +QR S
Subjt:  ----------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTNHITVEEVDDSKEKESVDQRTS

Query:  VFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTRPSIFRRLNM
        VF RIR  VAR +VF+RLS+ E E E  Q   S  R S+FRRL+    +E+ST      TRPS F RL +
Subjt:  VFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTRPSIFRRLNM

XP_031737045.1 uncharacterized protein LOC116402134 [Cucumis sativus]1.4e-4645.56Show/hide
Query:  ARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLTKITKQE---------------
        A  E  E  +  +  KGE  TS  +  I+KDE  +N+PVLRYVPLSRRKKGESPF E PK LKV D+EI+KESFTTPLTKI KQE               
Subjt:  ARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLTKITKQE---------------

Query:  ----------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTNHITVEEVDDSKEKESVDQRTS
                                                KKLL++G+S+P SRKGLGYK  EP+ IT++GK KV D NHIT+EE D++  KE  +QR S
Subjt:  ----------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTNHITVEEVDDSKEKESVDQRTS

Query:  VFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTRPSIFRRLNM
        VF RIR  VAR +VF+RLS+ E E E  Q   +  R S+FRRL+    +E+ST      TRPS F RL +
Subjt:  VFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTRPSIFRRLNM

XP_031737045.1 uncharacterized protein LOC116402134 [Cucumis sativus]1.4e-0142.25Show/hide
Query:  PKDTLNPHVIQGETLETVTCHIVDVVEDDDVPSSSSGMVIGLGDLSSFSIKYLLSLPLEAKSVLIDALMES
        PK  L  H  Q E  E V CH ++  E++ +P  S        DLS F+++ LLSLP E K++LIDAL+ S
Subjt:  PKDTLNPHVIQGETLETVTCHIVDVVEDDDVPSSSSGMVIGLGDLSSFSIKYLLSLPLEAKSVLIDALMES

XP_031737045.1 uncharacterized protein LOC116402134 [Cucumis sativus]1.4e-4645.56Show/hide
Query:  ARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLTKITKQE---------------
        A  E  E  +  +  KGE  TS  +  I+KDE  +N+PVLRYVPLSRRKKGESPF E PK LKV D+EI+KESFTTPLTKI KQE               
Subjt:  ARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLTKITKQE---------------

Query:  ----------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTNHITVEEVDDSKEKESVDQRTS
                                                KKLL++G+S+P SRKGLGYK  EP+ IT++GK KV D NHIT+EE D++  KE  +QR S
Subjt:  ----------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTNHITVEEVDDSKEKESVDQRTS

Query:  VFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTRPSIFRRLNM
        VF RIR  VAR +VF+RLS+ E E E  Q   +  R S+FRRL+    +E+ST      TRPS F RL +
Subjt:  VFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTRPSIFRRLNM

XP_031737372.1 uncharacterized protein LOC116402244 [Cucumis sativus]1.4e-4645.56Show/hide
Query:  ARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLTKITKQE---------------
        A  E  E  +  +  KGE  TS  +  I+KDE  +N+PVLRYVPLSRRKKGESPF E PK LKV D+EI+KESFTTPLTKI KQE               
Subjt:  ARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLTKITKQE---------------

Query:  ----------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTNHITVEEVDDSKEKESVDQRTS
                                                KKLL++G+S+P SRKGLGYK  EP+ IT++GK KV D NHIT+EE D++  KE  +QR S
Subjt:  ----------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTNHITVEEVDDSKEKESVDQRTS

Query:  VFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTRPSIFRRLNM
        VF RIR  VAR +VF+RLS+ E E E  Q   +  R S+FRRL+    +E+ST      TRPS F RL +
Subjt:  VFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTRPSIFRRLNM

XP_031740568.1 uncharacterized protein LOC116403508 [Cucumis sativus]1.4e-4645.56Show/hide
Query:  ARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLTKITKQE---------------
        A  E  E  +  +  KGE  TS  +  I+KDE  +N+PVLRYVPLSRRKKGESPF E PK LKV D+EI+KESFTTPLTKI KQE               
Subjt:  ARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLTKITKQE---------------

Query:  ----------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTNHITVEEVDDSKEKESVDQRTS
                                                KKLL++G+S+P SRKGLGYK  EP+ IT++GK KV D NHIT+EE D++  KE  +QR S
Subjt:  ----------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTNHITVEEVDDSKEKESVDQRTS

Query:  VFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTRPSIFRRLNM
        VF RIR  VAR +VF+RLS+ E E E  Q   +  R S+FRRL+    +E+ST      TRPS F RL +
Subjt:  VFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTRPSIFRRLNM

TrEMBL top hitse value%identityAlignment
A0A5A7TJZ7 Retrotransposon gag protein2.9e-4546.46Show/hide
Query:  GETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLTKITKQE-------------------------------
        G+ STS  +S I+ DEK SN P+LRYVPLSR KKGESPF + P+ LKV D+E+LKESFTTP TKITKQE                               
Subjt:  GETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLTKITKQE-------------------------------

Query:  ------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTNHITVEEVDDSKEKESVDQRTSVFRRIRAPVARALVFQ
                                KKLL++G+++P SRKGLGYK  EP+ ITR+GK KV D NHITV+EVD  KEKE   QRTS F RI   VAR  VF+
Subjt:  ------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTNHITVEEVDDSKEKESVDQRTSVFRRIRAPVARALVFQ

Query:  RLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTRPSIFRRLNM
        RLS+ E E +  Q T++  R S F+RL+M + +E         TRPS F RL+M
Subjt:  RLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTRPSIFRRLNM

A0A5A7UD46 Uncharacterized protein4.2e-4437.22Show/hide
Query:  GSEDTKDTIRISKLIEESSKDKVAVKDNPLFESVTPTSEHPKDTLNPHVIQGETLETVTCHI--------VDVVEDDDVPSSSSGMVIGLGDLSSFSIKY
        GS+     IR+  +I +       +KD+ LF  +   + + K  L    I G  + T T H         V  VE D  P S +         + F +K 
Subjt:  GSEDTKDTIRISKLIEESSKDKVAVKDNPLFESVTPTSEHPKDTLNPHVIQGETLETVTCHI--------VDVVEDDDVPSSSSGMVIGLGDLSSFSIKY

Query:  LLSLPLEAKSVLIDALMESDGPQADARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFT
          SL + +  VL+    ++   ++ A KE  + +      K E ST+  +S I+ DEK SN P+LRYVPLSRRKKGESPF E P+ LKV ++E+LKESFT
Subjt:  LLSLPLEAKSVLIDALMESDGPQADARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFT

Query:  TPLTKITKQE-------------------------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKV
        TPLTKITKQE                                                       KKLL++G+ +P SRKGLGYK  EP+ ITR+GK KV
Subjt:  TPLTKITKQE-------------------------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKV

Query:  ADTNHITVEEVDDSKEKESVDQRTSVFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTRPSIFRRLNM
         D+NHIT++E D  +EKE   QRTS F RI   VARA VF++LS+ E E +  Q T++  R S F+RL++   EE     T   T+PS F RL++
Subjt:  ADTNHITVEEVDDSKEKESVDQRTSVFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTRPSIFRRLNM

A0A5A7V356 Uncharacterized protein3.5e-4345.21Show/hide
Query:  ARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLTKITKQE---------------
        A KE  + +     EK E STS  +S I+ DEK SN P+L YVPLSRRKKGESPF E  + LKV D+E+LKESFTTPLTKITKQE               
Subjt:  ARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLTKITKQE---------------

Query:  ----------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTNHITVEEVDDSKEKESVDQRTS
                                                KKLL++G+++P SRKGLGYKL EP+ ITR+GK K+ D+NHI V+EVD  +EKE   QRTS
Subjt:  ----------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTNHITVEEVDDSKEKESVDQRTS

Query:  VFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTR
         F RI   VARA VF+RLSV E E +  Q T++  R S F RLS+   +   T   P + R
Subjt:  VFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTR

A0A5D3BY54 Ty3-gypsy retrotransposon protein2.7e-4345.49Show/hide
Query:  EAKSVLIDALMESDGPQAD--ARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLT
        E  SV +  + + D  Q    A +E+ +        K E STS  +S IV DEK SN P+LRYVPLSRRKKGESPF E P+ LKV D+E+LKESFTTPLT
Subjt:  EAKSVLIDALMESDGPQAD--ARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLT

Query:  KITKQE-------------------------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTN
        KITKQE                                                       KKLL++G+++P SRKGLGYK  EP+ ITR+GK KV D+N
Subjt:  KITKQE-------------------------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTN

Query:  HITVEEVDDSKEKESVDQRTSVFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSM
        HITV+EVD  +EKE   QRTS F R+   VARA VF+RLS+ E E +  Q T+S  R S F+RL+M
Subjt:  HITVEEVDDSKEKESVDQRTSVFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSM

A0A5D3C0W6 Ty3-gypsy retrotransposon protein6.4e-4544.68Show/hide
Query:  EAKSVLIDALMESDGPQAD--ARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLT
        E  SV +  +   D  Q    A KE    +     EK E STS  +S I+ DEK SN  +LRYVPLSRRKKGESPF E P+ LKV D+E+LKESFTTPLT
Subjt:  EAKSVLIDALMESDGPQAD--ARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLT

Query:  KITKQE-------------------------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTN
        KITKQE                                                       KKLL++G+++P SRKGLGYKL EP+ ITR+GK K+ D+N
Subjt:  KITKQE-------------------------------------------------------KKLLKDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTN

Query:  HITVEEVDDSKEKESVDQRTSVFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTR
        HITV+EVD  KEKE   QRTS F RI   VARA VF+RLSV E E +  Q T++  R S F RLS+   +   T   P + R
Subjt:  HITVEEVDDSKEKESVDQRTSVFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTFSTPDVTR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGGCCTAAGAACAATCAGATTGCAAGGAAGACGATCGGAAGCGAAGATACAAAAGACACCATTAGGATCTCAAAATTGATTGAAGAATCCTCTAAGGACAAGGT
TGCAGTCAAAGACAACCCATTGTTCGAATCTGTCACTCCAACATCTGAGCACCCAAAGGACACACTAAATCCTCATGTGATTCAGGGAGAGACACTAGAAACTGTCACGT
GTCACATTGTGGACGTGGTGGAAGATGATGATGTTCCTTCTAGCTCATCGGGAATGGTGATAGGTCTAGGAGACTTATCTTCTTTTAGTATAAAATACCTATTGTCACTC
CCTTTGGAGGCTAAAAGTGTTCTTATTGATGCGTTGATGGAGTCTGATGGGCCACAAGCTGATGCAAGAAAGGAAGCTGTTGAAGGTGTGAAGGCATCCGACCTGGAAAA
GGGTGAAACATCTACAAGCCTTGTGGAGTCTAAGATTGTAAAGGATGAGAAATTTTCAAATTCACCTGTCCTACGATACGTCCCTTTATCTCGACGTAAAAAGGGTGAAT
CACCCTTCACAGAATGCCCGAAAAACTTGAAGGTCGATGATCTTGAAATTCTAAAGGAAAGTTTCACTACACCTCTTACAAAGATTACAAAGCAAGAGAAGAAACTTCTA
AAGGATGGTTATAGTCTGCCTACATCGAGAAAAGGACTTGGATATAAGTTGCTTGAGCCGGTTCACATAACAAGAAGAGGGAAGGTGAAAGTGGCAGACACAAATCATAT
AACAGTAGAGGAGGTTGATGACTCAAAAGAAAAAGAGAGCGTCGACCAACGAACTTCTGTTTTTAGGCGCATTAGGGCACCAGTTGCTCGTGCTTTAGTATTTCAGAGAT
TAAGTGTGAATGAAACGGAAGAAGAAAGCGCACAACCTACCAATAGCTCCACTCGACCTTCAATTTTTCGAAGGTTAAGTATGCCGAATGGGGAAGAAGATAGTACATTT
TCAACTCCGGATGTCACTCGACCTTCAATTTTTCGAAGGTTAAATATGCCTATTGGGGAAGAAGAGAGTACATTTTTAACTTCGGATGTCACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGGCCTAAGAACAATCAGATTGCAAGGAAGACGATCGGAAGCGAAGATACAAAAGACACCATTAGGATCTCAAAATTGATTGAAGAATCCTCTAAGGACAAGGT
TGCAGTCAAAGACAACCCATTGTTCGAATCTGTCACTCCAACATCTGAGCACCCAAAGGACACACTAAATCCTCATGTGATTCAGGGAGAGACACTAGAAACTGTCACGT
GTCACATTGTGGACGTGGTGGAAGATGATGATGTTCCTTCTAGCTCATCGGGAATGGTGATAGGTCTAGGAGACTTATCTTCTTTTAGTATAAAATACCTATTGTCACTC
CCTTTGGAGGCTAAAAGTGTTCTTATTGATGCGTTGATGGAGTCTGATGGGCCACAAGCTGATGCAAGAAAGGAAGCTGTTGAAGGTGTGAAGGCATCCGACCTGGAAAA
GGGTGAAACATCTACAAGCCTTGTGGAGTCTAAGATTGTAAAGGATGAGAAATTTTCAAATTCACCTGTCCTACGATACGTCCCTTTATCTCGACGTAAAAAGGGTGAAT
CACCCTTCACAGAATGCCCGAAAAACTTGAAGGTCGATGATCTTGAAATTCTAAAGGAAAGTTTCACTACACCTCTTACAAAGATTACAAAGCAAGAGAAGAAACTTCTA
AAGGATGGTTATAGTCTGCCTACATCGAGAAAAGGACTTGGATATAAGTTGCTTGAGCCGGTTCACATAACAAGAAGAGGGAAGGTGAAAGTGGCAGACACAAATCATAT
AACAGTAGAGGAGGTTGATGACTCAAAAGAAAAAGAGAGCGTCGACCAACGAACTTCTGTTTTTAGGCGCATTAGGGCACCAGTTGCTCGTGCTTTAGTATTTCAGAGAT
TAAGTGTGAATGAAACGGAAGAAGAAAGCGCACAACCTACCAATAGCTCCACTCGACCTTCAATTTTTCGAAGGTTAAGTATGCCGAATGGGGAAGAAGATAGTACATTT
TCAACTCCGGATGTCACTCGACCTTCAATTTTTCGAAGGTTAAATATGCCTATTGGGGAAGAAGAGAGTACATTTTTAACTTCGGATGTCACTTGA
Protein sequenceShow/hide protein sequence
MEGPKNNQIARKTIGSEDTKDTIRISKLIEESSKDKVAVKDNPLFESVTPTSEHPKDTLNPHVIQGETLETVTCHIVDVVEDDDVPSSSSGMVIGLGDLSSFSIKYLLSL
PLEAKSVLIDALMESDGPQADARKEAVEGVKASDLEKGETSTSLVESKIVKDEKFSNSPVLRYVPLSRRKKGESPFTECPKNLKVDDLEILKESFTTPLTKITKQEKKLL
KDGYSLPTSRKGLGYKLLEPVHITRRGKVKVADTNHITVEEVDDSKEKESVDQRTSVFRRIRAPVARALVFQRLSVNETEEESAQPTNSSTRPSIFRRLSMPNGEEDSTF
STPDVTRPSIFRRLNMPIGEEESTFLTSDVT