; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002092 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002092
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrotransposon gag protein
Genome locationscaffold2:27585544..27590581
RNA-Seq ExpressionSpg002092
SyntenySpg002092
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]9.0e-3350.51Show/hide
Query:  STKSEDRRLSAIHK---FKQRSKEFSQPRQPVTAKELFSKTF---HKKEKENFATSYCIDL----------EEVDNSKKGEQRTSVFDRIKPSNTRPSVF
        S K + RR   + K    K + ++  QPRQ +T  E F ++F   H +E     T +   +          EEVDNS + +QRTSVFDRIKP  TR SVF
Subjt:  STKSEDRRLSAIHK---FKQRSKEFSQPRQPVTAKELFSKTF---HKKEKENFATSYCIDL----------EEVDNSKKGEQRTSVFDRIKPSNTRPSVF

Query:  QRMSMATTEEENQCMVSTSARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL
        QR+SMAT EEENQC  ST AR SAF+RLS+ST +K + STS FDRLK+ +DQ +R+M +LK K F+E + D+K+ S V SRMKRK SV INTEG L
Subjt:  QRMSMATTEEENQCMVSTSARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL

KAA0050734.1 gag protease polyprotein [Cucumis melo var. makuwa]1.7e-3150.56Show/hide
Query:  KQRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDL-------EEVDNSKKGEQRTSVFDRIKPSNTRPSVFQRMSMATTEEENQCMVST
        K + ++F QPR+ +T  E   ++F +   E         T+  +++       EEVDNS + +QRTS+FDRIKP  TR  VFQR+SMAT EEENQC  ST
Subjt:  KQRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDL-------EEVDNSKKGEQRTSVFDRIKPSNTRPSVFQRMSMATTEEENQCMVST

Query:  SARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL
         AR SAF+RLS+ST +K + STS FDRLK+ +DQ +R+M +LK K F+E + D+K+ S V SRMKRK SV INTEG L
Subjt:  SARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL

KAA0050736.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.3e-3147.96Show/hide
Query:  STKSEDRRLSAIHK---FKQRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDL-------EEVDNSKKGEQRTSVFDRIKPSNTRPSVF
        S K + RR   + K    K++ ++F QPR+ +T  E F ++F +   E         T+  +++       EEVDNS + +QRTSVFDRIKP  TR SVF
Subjt:  STKSEDRRLSAIHK---FKQRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDL-------EEVDNSKKGEQRTSVFDRIKPSNTRPSVF

Query:  QRMSMATTEEENQCMVSTSARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL
        QR+SMAT EEENQC +ST  R SAF+RLS+S  +K + STS FDRLK+ +DQ +R+M +LK K F+E + D+K+ S V SR+KRK S+ INTEG L
Subjt:  QRMSMATTEEENQCMVSTSARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL

KAA0055462.1 retrotransposon gag protein [Cucumis melo var. makuwa]5.8e-3248.98Show/hide
Query:  STKSEDRRLSAIHK---FKQRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDL-------EEVDNSKKGEQRTSVFDRIKPSNTRPSVF
        S K + RR   + K    K + ++F QPR+ +T  E  S++F +   E         T+  +++       EEVDNS + +QRTSVFDRIKP  TR SVF
Subjt:  STKSEDRRLSAIHK---FKQRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDL-------EEVDNSKKGEQRTSVFDRIKPSNTRPSVF

Query:  QRMSMATTEEENQCMVSTSARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL
        QR+SMAT EE+NQC  ST AR SAF+RLS+ST +K + STS FDRLK+ +DQ +R+M +LK K F+E + D+K+ + V SRMKRK SV INTEG L
Subjt:  QRMSMATTEEENQCMVSTSARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL

TYK08944.1 retrotransposon gag protein [Cucumis melo var. makuwa]5.8e-3248.98Show/hide
Query:  STKSEDRRLSAIHK---FKQRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDL-------EEVDNSKKGEQRTSVFDRIKPSNTRPSVF
        S K + RR   + K    K + ++F QPR+ +T  E  S++F +   E         T+  +++       EEVDNS + +QRTSVFDRIKP  TR SVF
Subjt:  STKSEDRRLSAIHK---FKQRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDL-------EEVDNSKKGEQRTSVFDRIKPSNTRPSVF

Query:  QRMSMATTEEENQCMVSTSARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL
        QR+SMAT EE+NQC  ST AR SAF+RLS+ST +K + STS FDRLK+ +DQ +R+M +LK K F+E + D+K+ + V SRMKRK SV INTEG L
Subjt:  QRMSMATTEEENQCMVSTSARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL

TrEMBL top hitse value%identityAlignment
A0A5A7TQ06 Retrotransposon gag protein4.4e-3350.51Show/hide
Query:  STKSEDRRLSAIHK---FKQRSKEFSQPRQPVTAKELFSKTF---HKKEKENFATSYCIDL----------EEVDNSKKGEQRTSVFDRIKPSNTRPSVF
        S K + RR   + K    K + ++  QPRQ +T  E F ++F   H +E     T +   +          EEVDNS + +QRTSVFDRIKP  TR SVF
Subjt:  STKSEDRRLSAIHK---FKQRSKEFSQPRQPVTAKELFSKTF---HKKEKENFATSYCIDL----------EEVDNSKKGEQRTSVFDRIKPSNTRPSVF

Query:  QRMSMATTEEENQCMVSTSARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL
        QR+SMAT EEENQC  ST AR SAF+RLS+ST +K + STS FDRLK+ +DQ +R+M +LK K F+E + D+K+ S V SRMKRK SV INTEG L
Subjt:  QRMSMATTEEENQCMVSTSARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL

A0A5A7U974 Retrotransposon gag protein6.3e-3247.96Show/hide
Query:  STKSEDRRLSAIHK---FKQRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDL-------EEVDNSKKGEQRTSVFDRIKPSNTRPSVF
        S K + RR   + K    K++ ++F QPR+ +T  E F ++F +   E         T+  +++       EEVDNS + +QRTSVFDRIKP  TR SVF
Subjt:  STKSEDRRLSAIHK---FKQRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDL-------EEVDNSKKGEQRTSVFDRIKPSNTRPSVF

Query:  QRMSMATTEEENQCMVSTSARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL
        QR+SMAT EEENQC +ST  R SAF+RLS+S  +K + STS FDRLK+ +DQ +R+M +LK K F+E + D+K+ S V SR+KRK S+ INTEG L
Subjt:  QRMSMATTEEENQCMVSTSARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL

A0A5A7UI09 Retrotransposon gag protein2.8e-3248.98Show/hide
Query:  STKSEDRRLSAIHK---FKQRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDL-------EEVDNSKKGEQRTSVFDRIKPSNTRPSVF
        S K + RR   + K    K + ++F QPR+ +T  E  S++F +   E         T+  +++       EEVDNS + +QRTSVFDRIKP  TR SVF
Subjt:  STKSEDRRLSAIHK---FKQRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDL-------EEVDNSKKGEQRTSVFDRIKPSNTRPSVF

Query:  QRMSMATTEEENQCMVSTSARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL
        QR+SMAT EE+NQC  ST AR SAF+RLS+ST +K + STS FDRLK+ +DQ +R+M +LK K F+E + D+K+ + V SRMKRK SV INTEG L
Subjt:  QRMSMATTEEENQCMVSTSARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL

A0A5D3BBF9 Gag protease polyprotein8.2e-3250.56Show/hide
Query:  KQRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDL-------EEVDNSKKGEQRTSVFDRIKPSNTRPSVFQRMSMATTEEENQCMVST
        K + ++F QPR+ +T  E   ++F +   E         T+  +++       EEVDNS + +QRTS+FDRIKP  TR  VFQR+SMAT EEENQC  ST
Subjt:  KQRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDL-------EEVDNSKKGEQRTSVFDRIKPSNTRPSVFQRMSMATTEEENQCMVST

Query:  SARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL
         AR SAF+RLS+ST +K + STS FDRLK+ +DQ +R+M +LK K F+E + D+K+ S V SRMKRK SV INTEG L
Subjt:  SARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL

A0A5D3CCI8 Retrotransposon gag protein2.8e-3248.98Show/hide
Query:  STKSEDRRLSAIHK---FKQRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDL-------EEVDNSKKGEQRTSVFDRIKPSNTRPSVF
        S K + RR   + K    K + ++F QPR+ +T  E  S++F +   E         T+  +++       EEVDNS + +QRTSVFDRIKP  TR SVF
Subjt:  STKSEDRRLSAIHK---FKQRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDL-------EEVDNSKKGEQRTSVFDRIKPSNTRPSVF

Query:  QRMSMATTEEENQCMVSTSARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL
        QR+SMAT EE+NQC  ST AR SAF+RLS+ST +K + STS FDRLK+ +DQ +R+M +LK K F+E + D+K+ + V SRMKRK SV INTEG L
Subjt:  QRMSMATTEEENQCMVSTSARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPRRKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTGTGCTAGGGCGAGAGGTCTTCTTGGGTCTCCGTTGAGGGGGCTTTGCGTACCTCTGTTCACCCTTGAACCAAAAGGAAGTCCAGGAGCTGATTGGAACATGGG
AATGGAGAACGAGACTTGGTGGAGTCTCTCTGAATTCTCTAGTTCTACAAGAGAGGTTCAGAAATTCTACACTCTCACAAAGACAAGAGTTCGGAATTTTAAAGCTGTCA
AGCAGAACCAGAGAATTCAGAGAGACTCCACCAAGTCTGAAGACCGAAGACTCTCTGCAATTCATAAGTTCAAGCAGAGAAGTAAAGAGTTTTCTCAACCTCGACAACCG
GTAACTGCGAAGGAACTCTTCTCCAAAACTTTTCACAAAAAGGAAAAAGAAAACTTTGCAACTTCCTACTGCATCGACTTAGAAGAAGTTGACAATTCCAAGAAGGGTGA
ACAAAGGACTTCCGTCTTCGATCGCATTAAGCCTTCAAATACTCGACCTTCGGTATTCCAAAGAATGAGTATGGCCACGACAGAAGAAGAAAATCAATGTATGGTGTCCA
CCTCCGCTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACAATGAGGAAAAGTCAGTCTTCAACATCTGTCTTTGATCGCCTCAAAGTAGCAGACGATCAACCTAGA
AGAAAGATGAACAACTTGAAGGTGAAACTTTTCAATGAAGTAAGCAGTGACGAGAAACTTCAAAGTATTGTCCTGTCACGTATGAAAAGGAAGTTCTCTGTTCTCATAAA
TACAGAAGGTTACTTGAAGTTCGAAGGTTCTCACGTGCTTCGCTGGAGTTCCTTCTCTCCGAGTTCGAAGGTTCTCAGCGCTTCGTTGGAGTTCCTTCTCTCCATGTTTG
AAGGTTCTCACGTGTTTCGCTGCAATTCCTTCTCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCTCTCCAAATTCAAAGGTTCCATGCGCTTTGCTGT
AGTTCCTTATCTCCAAGTTCGAAGTTTGAAGGTTCTCCGCTGCACGCTCCTACGTTGCTCCTTCTCCAAGTTCGAAGGTTTGCACATGTTGTTATGTTGCTTCACTGTTG
TTCCTTCTCCAAGTTTGAAGGTTTTCACATGTTGTGTTGTTGCGCGCTGCTTCCTTCTCTCTCCAAGTTCGAAGGTTCTGACGCTTCGCTGTTCGTTCTCCAAGTTCGAA
GGTTCTCACGTTGCTCTGCTTCCTTCTCCAAGTTCGAAGGTTCTCATGCTACGCTAGGCTGCGCTGTTGCGCTACTTTCTTCTTTAAGTTCGAAGGTTCCCACATTGCGT
TGTTATGCTGCTTCCTTCTCCAAGTTCGAAGGTTCTGACGTTGCGCTGCTTCCTTCACCAAGTTCGAAGGTTCTCACGTCGCGCTATTTCGCTGTTCTTTCTCCACGTTG
A
mRNA sequenceShow/hide mRNA sequence
ATGAGTTGTGCTAGGGCGAGAGGTCTTCTTGGGTCTCCGTTGAGGGGGCTTTGCGTACCTCTGTTCACCCTTGAACCAAAAGGAAGTCCAGGAGCTGATTGGAACATGGG
AATGGAGAACGAGACTTGGTGGAGTCTCTCTGAATTCTCTAGTTCTACAAGAGAGGTTCAGAAATTCTACACTCTCACAAAGACAAGAGTTCGGAATTTTAAAGCTGTCA
AGCAGAACCAGAGAATTCAGAGAGACTCCACCAAGTCTGAAGACCGAAGACTCTCTGCAATTCATAAGTTCAAGCAGAGAAGTAAAGAGTTTTCTCAACCTCGACAACCG
GTAACTGCGAAGGAACTCTTCTCCAAAACTTTTCACAAAAAGGAAAAAGAAAACTTTGCAACTTCCTACTGCATCGACTTAGAAGAAGTTGACAATTCCAAGAAGGGTGA
ACAAAGGACTTCCGTCTTCGATCGCATTAAGCCTTCAAATACTCGACCTTCGGTATTCCAAAGAATGAGTATGGCCACGACAGAAGAAGAAAATCAATGTATGGTGTCCA
CCTCCGCTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACAATGAGGAAAAGTCAGTCTTCAACATCTGTCTTTGATCGCCTCAAAGTAGCAGACGATCAACCTAGA
AGAAAGATGAACAACTTGAAGGTGAAACTTTTCAATGAAGTAAGCAGTGACGAGAAACTTCAAAGTATTGTCCTGTCACGTATGAAAAGGAAGTTCTCTGTTCTCATAAA
TACAGAAGGTTACTTGAAGTTCGAAGGTTCTCACGTGCTTCGCTGGAGTTCCTTCTCTCCGAGTTCGAAGGTTCTCAGCGCTTCGTTGGAGTTCCTTCTCTCCATGTTTG
AAGGTTCTCACGTGTTTCGCTGCAATTCCTTCTCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCTCTCCAAATTCAAAGGTTCCATGCGCTTTGCTGT
AGTTCCTTATCTCCAAGTTCGAAGTTTGAAGGTTCTCCGCTGCACGCTCCTACGTTGCTCCTTCTCCAAGTTCGAAGGTTTGCACATGTTGTTATGTTGCTTCACTGTTG
TTCCTTCTCCAAGTTTGAAGGTTTTCACATGTTGTGTTGTTGCGCGCTGCTTCCTTCTCTCTCCAAGTTCGAAGGTTCTGACGCTTCGCTGTTCGTTCTCCAAGTTCGAA
GGTTCTCACGTTGCTCTGCTTCCTTCTCCAAGTTCGAAGGTTCTCATGCTACGCTAGGCTGCGCTGTTGCGCTACTTTCTTCTTTAAGTTCGAAGGTTCCCACATTGCGT
TGTTATGCTGCTTCCTTCTCCAAGTTCGAAGGTTCTGACGTTGCGCTGCTTCCTTCACCAAGTTCGAAGGTTCTCACGTCGCGCTATTTCGCTGTTCTTTCTCCACGTTG
A
Protein sequenceShow/hide protein sequence
MSCARARGLLGSPLRGLCVPLFTLEPKGSPGADWNMGMENETWWSLSEFSSSTREVQKFYTLTKTRVRNFKAVKQNQRIQRDSTKSEDRRLSAIHKFKQRSKEFSQPRQP
VTAKELFSKTFHKKEKENFATSYCIDLEEVDNSKKGEQRTSVFDRIKPSNTRPSVFQRMSMATTEEENQCMVSTSARPSAFQRLSVSTMRKSQSSTSVFDRLKVADDQPR
RKMNNLKVKLFNEVSSDEKLQSIVLSRMKRKFSVLINTEGYLKFEGSHVLRWSSFSPSSKVLSASLEFLLSMFEGSHVFRCNSFSPSSKVLTRFAAVPSLQIQRFHALCC
SSLSPSSKFEGSPLHAPTLLLLQVRRFAHVVMLLHCCSFSKFEGFHMLCCCALLPSLSKFEGSDASLFVLQVRRFSRCSASFSKFEGSHATLGCAVALLSSLSSKVPTLR
CYAASFSKFEGSDVALLPSPSSKVLTSRYFAVLSPR