CuGenDBv2

Spg004804 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg004804
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: CACTA en-spm transposon protein
Genome location: scaffold5:19592262..19595349
RNA-Seq Expression: Spg004804
Synteny: Spg004804
Gene Ontology terms:
    GO:0003677 - DNA binding (molecular function)
    GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domains:
    IPR004252 - Probable transposase, Ptta/En/Spm, plant
    IPR007125 - Histone H2A/H2B/H3
    IPR009072 - Histone-fold
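
The genome location listed above uses a `scaffold:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say), the span can be parsed and its length computed as follows. `parse_location` and `span_length` are hypothetical helper names used for illustration, not part of CuGenDB.

```python
import re

# Hypothetical helper pattern: "<sequence id>:<start>..<end>"
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location):
    """Split a 'scaffold:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start is greater than end")
    return match["seqid"], start, end


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold5:19592262..19595349")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene model spans 3088 bp on scaffold5; with a half-open convention the figure would be one base shorter.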


Homology
GenBank top hits (e value, % identity, alignment)
KAA0032748.1 hypothetical protein E6C27_scaffold853G00910 [Cucumis melo var. makuwa] (e value: 3.2e-25; % identity: 45.36)
Query:  IRDYLEHEMSAFHRDFRCSLHTTYKKYDSPIEARKNRDKRVARDS-------DWNRLCDRWETEEFKR----------RSESNSKARATL-PFTHRGGTA
        ++DYLEHEMS  +RDFRCSLH +YKKYDSP +ARK+R KRVA          D N    R + ++  R          + E    +   L   TH     
Subjt:  IRDYLEHEMSAFHRDFRCSLHTTYKKYDSPIEARKNRDKRVARDS-------DWNRLCDRWETEEFKR----------RSESNSKARATL-PFTHRGGTA

Query:  TFL-RLKQKKDEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQKSKATEAQVAQLQSIVES
         ++   K+K DEMV LKT SSQ+G E L ++EICEQVLG R  HVKG G G RP+N  N+   +  +K   AQ+A  QSIVES
Subjt:  TFL-RLKQKKDEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQKSKATEAQVAQLQSIVES

KAA0046978.1 formin-like protein 4 isoform X2 [Cucumis melo var. makuwa] (e value: 1.9e-22; % identity: 61.11)
Query:  KQKKDEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQKSKATEAQVAQLQSIVESQQATIEEILRRLASGEGSSSNVE
        K+K DEMV LKT SSQ+G E L +++IC++VLG R GHVKG GWG RP+NA+N+   +   KA  AQ+A LQSIVESQQATIE ILRRL+  E ++SN+E
Subjt:  KQKKDEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQKSKATEAQVAQLQSIVESQQATIEEILRRLASGEGSSSNVE

Query:  RTPEQSNN
        R+ EQSN+
Subjt:  RTPEQSNN

KAE8651942.1 hypothetical protein Csa_006405 [Cucumis sativus] (e value: 2.3e-44; % identity: 51.64)
Query:  AIHAKRVTIMPKDIQLA--RRIRDYLEHEMSAFHRDFRCSLHTTYKKYDSPIEARKNRDKRVARDSDWNRLCDRWETEEFKRRSESNSKARATLPFTHRG
        +I  + V  +  DIQ      ++DYL+HEMS  +RDF CSLH +YKK DSP EARK+ DKRVA+DSDW RLCDRWE E FK RSE+N+KAR+ LPFTHRG
Subjt:  AIHAKRVTIMPKDIQLA--RRIRDYLEHEMSAFHRDFRCSLHTTYKKYDSPIEARKNRDKRVARDSDWNRLCDRWETEEFKRRSESNSKARATLPFTHRG

Query:  GTATFLRLKQKK--------------------------------DEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQK
        GT TFLR KQK                                 DEMV LKT  SQ+G + L +KEICE+VL  R G VK  GWG RP+NA+N+ A  Q 
Subjt:  GTATFLRLKQKK--------------------------------DEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQK

Query:  SKATEAQVAQLQS
        +K   AQ+A LQS
Subjt:  SKATEAQVAQLQS

TYJ98779.1 hypothetical protein E5676_scaffold156G00880 [Cucumis melo var. makuwa] (e value: 3.2e-25; % identity: 45.36)
Query:  IRDYLEHEMSAFHRDFRCSLHTTYKKYDSPIEARKNRDKRVARDS-------DWNRLCDRWETEEFKR----------RSESNSKARATL-PFTHRGGTA
        ++DYLEHEMS  +RDFRCSLH +YKKYDSP +ARK+R KRVA          D N    R + ++  R          + E    +   L   TH     
Subjt:  IRDYLEHEMSAFHRDFRCSLHTTYKKYDSPIEARKNRDKRVARDS-------DWNRLCDRWETEEFKR----------RSESNSKARATL-PFTHRGGTA

Query:  TFL-RLKQKKDEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQKSKATEAQVAQLQSIVES
         ++   K+K DEMV LKT SSQ+G E L ++EICEQVLG R  HVKG G G RP+N  N+   +  +K   AQ+A  QSIVES
Subjt:  TFL-RLKQKKDEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQKSKATEAQVAQLQSIVES

TYK04806.1 formin-like protein 4 isoform X2 [Cucumis melo var. makuwa] (e value: 1.9e-22; % identity: 61.11)
Query:  KQKKDEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQKSKATEAQVAQLQSIVESQQATIEEILRRLASGEGSSSNVE
        K+K DEMV LKT SSQ+G E L +++IC++VLG R GHVKG GWG RP+NA+N+   +   KA  AQ+A LQSIVESQQATIE ILRRL+  E ++SN+E
Subjt:  KQKKDEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQKSKATEAQVAQLQSIVESQQATIEEILRRLASGEGSSSNVE

Query:  RTPEQSNN
        R+ EQSN+
Subjt:  RTPEQSNN

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LM17 Uncharacterized protein (e value: 7.8e-30; % identity: 56.06)
Query:  VFPLFLEQIATEATCFLAAHINRAIHAKRVTIMPKDIQLARRIRDYLEHEMSAFHRDFRCSLHTTYKKYDSPIEARKNRDKRVARDSDWNRLCDRWETEE
        + P+ +E    +  C  ++  N  I  +   I+  D  L   ++DYL+HEMS  +RDF CSLH +YKK DSP EARK+ DKRVA+DSDW RLCDRWE E 
Subjt:  VFPLFLEQIATEATCFLAAHINRAIHAKRVTIMPKDIQLARRIRDYLEHEMSAFHRDFRCSLHTTYKKYDSPIEARKNRDKRVARDSDWNRLCDRWETEE

Query:  FKRRSESNSKARATLPFTHRGGTATFLRLKQK
        FK RSE+N+KAR+ LPFTHRGGT TFLR KQK
Subjt:  FKRRSESNSKARATLPFTHRGGTATFLRLKQK

A0A5A7SQC8 DUF4218 domain-containing protein (e value: 1.5e-25; % identity: 45.36)
Query:  IRDYLEHEMSAFHRDFRCSLHTTYKKYDSPIEARKNRDKRVARDS-------DWNRLCDRWETEEFKR----------RSESNSKARATL-PFTHRGGTA
        ++DYLEHEMS  +RDFRCSLH +YKKYDSP +ARK+R KRVA          D N    R + ++  R          + E    +   L   TH     
Subjt:  IRDYLEHEMSAFHRDFRCSLHTTYKKYDSPIEARKNRDKRVARDS-------DWNRLCDRWETEEFKR----------RSESNSKARATL-PFTHRGGTA

Query:  TFL-RLKQKKDEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQKSKATEAQVAQLQSIVES
         ++   K+K DEMV LKT SSQ+G E L ++EICEQVLG R  HVKG G G RP+N  N+   +  +K   AQ+A  QSIVES
Subjt:  TFL-RLKQKKDEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQKSKATEAQVAQLQSIVES

A0A5A7TVR8 Formin-like protein 4 isoform X2 (e value: 9.3e-23; % identity: 61.11)
Query:  KQKKDEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQKSKATEAQVAQLQSIVESQQATIEEILRRLASGEGSSSNVE
        K+K DEMV LKT SSQ+G E L +++IC++VLG R GHVKG GWG RP+NA+N+   +   KA  AQ+A LQSIVESQQATIE ILRRL+  E ++SN+E
Subjt:  KQKKDEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQKSKATEAQVAQLQSIVESQQATIEEILRRLASGEGSSSNVE

Query:  RTPEQSNN
        R+ EQSN+
Subjt:  RTPEQSNN

A0A5D3BIG6 DUF4218 domain-containing protein (e value: 1.5e-25; % identity: 45.36)
Query:  IRDYLEHEMSAFHRDFRCSLHTTYKKYDSPIEARKNRDKRVARDS-------DWNRLCDRWETEEFKR----------RSESNSKARATL-PFTHRGGTA
        ++DYLEHEMS  +RDFRCSLH +YKKYDSP +ARK+R KRVA          D N    R + ++  R          + E    +   L   TH     
Subjt:  IRDYLEHEMSAFHRDFRCSLHTTYKKYDSPIEARKNRDKRVARDS-------DWNRLCDRWETEEFKR----------RSESNSKARATL-PFTHRGGTA

Query:  TFL-RLKQKKDEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQKSKATEAQVAQLQSIVES
         ++   K+K DEMV LKT SSQ+G E L ++EICEQVLG R  HVKG G G RP+N  N+   +  +K   AQ+A  QSIVES
Subjt:  TFL-RLKQKKDEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQKSKATEAQVAQLQSIVES

A0A5D3C2Z5 Formin-like protein 4 isoform X2 (e value: 9.3e-23; % identity: 61.11)
Query:  KQKKDEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQKSKATEAQVAQLQSIVESQQATIEEILRRLASGEGSSSNVE
        K+K DEMV LKT SSQ+G E L +++IC++VLG R GHVKG GWG RP+NA+N+   +   KA  AQ+A LQSIVESQQATIE ILRRL+  E ++SN+E
Subjt:  KQKKDEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQKSKATEAQVAQLQSIVESQQATIEEILRRLASGEGSSSNVE

Query:  RTPEQSNN
        R+ EQSN+
Subjt:  RTPEQSNN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G13370.1 Histone superfamily protein (e value: 7.3e-04; % identity: 33.61)
Query:  KRPT--LAVKSAVESGEPRRGICRRINRVSPWNTAAESSSRVRRSRILSSSPSVFPLFLEQIATE------------------ATCFLAAHINR----AI
        K PT  LA K+A +S  P  G  ++ +R  P   A     + ++S  L +    F   + +IA +                  A  +L          AI
Subjt:  KRPT--LAVKSAVESGEPRRGICRRINRVSPWNTAAESSSRVRRSRILSSSPSVFPLFLEQIATE------------------ATCFLAAHINR----AI

Query:  HAKRVTIMPKDIQLARRIR
        HAKRVTIMPKD+QLARRIR
Subjt:  HAKRVTIMPKDIQLARRIR

AT1G19890.1 male-gamete-specific histone H3 (e value: 1.1e-04; % identity: 33.62)
Query:  RPTLAVKSAVESGEPRRGICRRINRVSPWNTAAESSSRVRRSRILSSSPSVFPLFLEQIATE------------------ATCFLAAHINR----AIHAK
        R  LA K+A ++  P RG  +R +R  P   A     + ++S  L      F   + +IA +                  A  +L          AIHAK
Subjt:  RPTLAVKSAVESGEPRRGICRRINRVSPWNTAAESSSRVRRSRILSSSPSVFPLFLEQIATE------------------ATCFLAAHINR----AIHAK

Query:  RVTIMPKDIQLARRIR
        RVTIM KDIQLARRIR
Subjt:  RVTIMPKDIQLARRIR


Sequences
CDS sequence
ATGAGTGTTGTTTGTTCAACGAAGTACTACATCAACACCGCCAACAATGACCATCCACCGCACCCCGCCTACATCTTACAACTGGAAAGAATCCAACAGCGAAATCATCA
AATCGATGATTCCATTGATTTCCCGTGGCCGATGGAGTATCGTGATATGGCTTGGAGTATCTCGAAGAACACAAGTCTCAGCTCCCTCGATTTTTTTCCCTCAAACACTC
CTCTCCCTCGATTTCGACTGAAGCGCCCCACCCTTGCCGTCAAATCTGCCGTCGAATCAGGGGAGCCTCGTCGTGGAATCTGTCGTCGAATCAATCGAGTCTCGCCGTGG
AATACTGCAGCCGAAAGTTCATCTCGAGTTCGCCGAAGTCGAATCCTGTCGAGTTCGCCGTCCGTTTTCCCTCTCTTCCTCGAGCAAATCGCCACCGAAGCTACCTGTTT
TCTCGCCGCCCATATTAATCGTGCTATTCATGCTAAGAGAGTTACTATTATGCCGAAGGATATCCAACTCGCTAGGAGGATTAGAGATTATCTAGAACATGAGATGAGTG
CATTTCACAGGGACTTTCGTTGCTCATTACACACTACTTACAAAAAATATGATTCTCCAATCGAAGCACGAAAAAATCGAGACAAACGAGTGGCACGAGATTCAGATTGG
AACCGTTTATGTGATCGATGGGAAACAGAAGAATTTAAGCGTCGGTCTGAGTCAAATTCTAAGGCTCGAGCCACTCTTCCTTTTACTCATAGAGGTGGCACTGCGACATT
TCTACGCCTTAAGCAGAAAAAAGACGAAATGGTGGCATTAAAAACTGCTTCATCACAAGACGGCGGTGAAACTCTTCGGGACAAAGAGATATGTGAACAAGTACTAGGTA
CGCGACCTGGGCATGTGAAGGGCCTTGGTTGGGGACATAGACCAAGAAATGCTAAGAATGATCATGCAAGTTACCAAAAATCTAAAGCGACGGAGGCTCAAGTTGCTCAA
TTGCAAAGTATAGTTGAATCACAACAAGCGACAATTGAAGAAATTTTAAGAAGGTTGGCAAGTGGAGAAGGTTCATCGTCGAATGTGGAAAGGACTCCAGAACAGTCAAA
TAATGAAAATTAA
mRNA sequence
ATGAGTGTTGTTTGTTCAACGAAGTACTACATCAACACCGCCAACAATGACCATCCACCGCACCCCGCCTACATCTTACAACTGGAAAGAATCCAACAGCGAAATCATCA
AATCGATGATTCCATTGATTTCCCGTGGCCGATGGAGTATCGTGATATGGCTTGGAGTATCTCGAAGAACACAAGTCTCAGCTCCCTCGATTTTTTTCCCTCAAACACTC
CTCTCCCTCGATTTCGACTGAAGCGCCCCACCCTTGCCGTCAAATCTGCCGTCGAATCAGGGGAGCCTCGTCGTGGAATCTGTCGTCGAATCAATCGAGTCTCGCCGTGG
AATACTGCAGCCGAAAGTTCATCTCGAGTTCGCCGAAGTCGAATCCTGTCGAGTTCGCCGTCCGTTTTCCCTCTCTTCCTCGAGCAAATCGCCACCGAAGCTACCTGTTT
TCTCGCCGCCCATATTAATCGTGCTATTCATGCTAAGAGAGTTACTATTATGCCGAAGGATATCCAACTCGCTAGGAGGATTAGAGATTATCTAGAACATGAGATGAGTG
CATTTCACAGGGACTTTCGTTGCTCATTACACACTACTTACAAAAAATATGATTCTCCAATCGAAGCACGAAAAAATCGAGACAAACGAGTGGCACGAGATTCAGATTGG
AACCGTTTATGTGATCGATGGGAAACAGAAGAATTTAAGCGTCGGTCTGAGTCAAATTCTAAGGCTCGAGCCACTCTTCCTTTTACTCATAGAGGTGGCACTGCGACATT
TCTACGCCTTAAGCAGAAAAAAGACGAAATGGTGGCATTAAAAACTGCTTCATCACAAGACGGCGGTGAAACTCTTCGGGACAAAGAGATATGTGAACAAGTACTAGGTA
CGCGACCTGGGCATGTGAAGGGCCTTGGTTGGGGACATAGACCAAGAAATGCTAAGAATGATCATGCAAGTTACCAAAAATCTAAAGCGACGGAGGCTCAAGTTGCTCAA
TTGCAAAGTATAGTTGAATCACAACAAGCGACAATTGAAGAAATTTTAAGAAGGTTGGCAAGTGGAGAAGGTTCATCGTCGAATGTGGAAAGGACTCCAGAACAGTCAAA
TAATGAAAATTAA
Protein sequence
MSVVCSTKYYINTANNDHPPHPAYILQLERIQQRNHQIDDSIDFPWPMEYRDMAWSISKNTSLSSLDFFPSNTPLPRFRLKRPTLAVKSAVESGEPRRGICRRINRVSPW
NTAAESSSRVRRSRILSSSPSVFPLFLEQIATEATCFLAAHINRAIHAKRVTIMPKDIQLARRIRDYLEHEMSAFHRDFRCSLHTTYKKYDSPIEARKNRDKRVARDSDW
NRLCDRWETEEFKRRSESNSKARATLPFTHRGGTATFLRLKQKKDEMVALKTASSQDGGETLRDKEICEQVLGTRPGHVKGLGWGHRPRNAKNDHASYQKSKATEAQVAQ
LQSIVESQQATIEEILRRLASGEGSSSNVERTPEQSNNEN
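
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and that the CDS is in frame from its first base; neither is stated in the record. `translate` is an illustrative helper, not a CuGenDB function, and only the first 30 bases of the CDS are used here.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order so they line up with the amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence, reading from the first base.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    With to_stop=True, translation ends at the first stop codon.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS sequence listed in this record.
print(translate("ATGAGTGTTGTTTGTTCAACGAAGTACTAC"))  # prints MSVVCSTKYY
```

The ten residues produced match the start of the listed protein sequence. Passing the full CDS with line breaks left in also works, since `translate` strips whitespace before reading codons.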