; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg010284 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg010284
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCACTA en-spm transposon protein
Genome locationscaffold8:19270864..19278822
RNA-Seq ExpressionSpg010284
SyntenySpg010284
Gene Ontology termsGO:0016560 - protein import into peroxisome matrix, docking (biological process)
GO:0005777 - peroxisome (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR015931 - Aconitase/3-isopropylmalate dehydratase large subunit, alpha/beta/alpha, subdomain 1/3
IPR035463 - Peroxin 13


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044973.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]3.3e-1933.78Show/hide
Query:  HVFLNVAWSFTALCCRNFRYIRFREVILVAYESRVSSSVARLEVREPGALQEDPKWKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYF
        H  L     F A C R+F+     E ++  +++     V R++      +  +    N   + KQ     P+NH +GSKSFL  Q               
Subjt:  HVFLNVAWSFTALCCRNFRYIRFREVILVAYESRVSSSVARLEVREPGALQEDPKWKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYF

Query:  FLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALQQAPTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSS
                 +EL + R G+++  ++LF +T  + G  +VSQAA +AH+QM+ LQ  PTP+G+QP   D+IC+ VLG R    KGLGWGPKPK+R+  S+S
Subjt:  FLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALQQAPTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSS

Query:  DAT-----SLRREHELSAVLSE
         ++     S ++E EL A L E
Subjt:  DAT-----SLRREHELSAVLSE

KAA0055457.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]5.6e-1939.52Show/hide
Query:  WKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYFFLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALQQ
        ++     NK    K P+NH +GSKSFL  Q                        +EL + R G+ +  ++LF +T  + G  +VSQAA +AH+QM+ LQ 
Subjt:  WKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYFFLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALQQ

Query:  APTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSSDAT-----SLRREHELSAVLSE
         PTP+G+QP   D+IC+ VLG R D  KGLGWGPKPK+R+  S+S ++     S ++E EL A L E
Subjt:  APTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSSDAT-----SLRREHELSAVLSE

KAA0066000.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]4.3e-1938.92Show/hide
Query:  WKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYFFLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALQQ
        ++     NK    K P+NH +GSKSFL  Q + A                          R G+++  ++LF +T  + G  +VSQAA + HDQM+ LQ 
Subjt:  WKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYFFLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALQQ

Query:  APTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSSDAT-----SLRREHELSAVLSE
         PTP+G+QP   D+IC+ VLG R    KGLGWGPKPK+R+ TS+S ++     S  +E EL A L E
Subjt:  APTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSSDAT-----SLRREHELSAVLSE

TYK11183.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]3.3e-1940.48Show/hide
Query:  WKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYFFLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALQQ
        ++   + NK    K P+NH +GSKSFL  Q                        +EL K R+GE++  ++LF KT  + G  +VSQA  +AH+QM+ LQ 
Subjt:  WKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYFFLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALQQ

Query:  APTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSSDAT-----SLRREHELSAVLSET
         PT  G+QP   D+IC+ VLG R    KGLGWGPKPK+R+ TS+S ++     S ++E EL A L+ET
Subjt:  APTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSSDAT-----SLRREHELSAVLSET

XP_022156286.1 uncharacterized protein LOC111023212 [Momordica charantia]5.4e-2241.18Show/hide
Query:  EDPKWKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYFFLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMV
        E P+WK + +KNK+N AKLPFNH AGSKSFL +Q +                          K +EG ++GP+DLF ++ Y E  G V+  A +A++ M 
Subjt:  EDPKWKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYFFLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMV

Query:  ALQQAPTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSSDATS
         L +APT +G +P    + C  VLG R D +KGLG+GP+P   K  SSS+ TS
Subjt:  ALQQAPTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSSDATS

TrEMBL top hitse value%identityAlignment
A0A5A7TN85 CACTA en-spm transposon protein1.6e-1933.78Show/hide
Query:  HVFLNVAWSFTALCCRNFRYIRFREVILVAYESRVSSSVARLEVREPGALQEDPKWKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYF
        H  L     F A C R+F+     E ++  +++     V R++      +  +    N   + KQ     P+NH +GSKSFL  Q               
Subjt:  HVFLNVAWSFTALCCRNFRYIRFREVILVAYESRVSSSVARLEVREPGALQEDPKWKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYF

Query:  FLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALQQAPTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSS
                 +EL + R G+++  ++LF +T  + G  +VSQAA +AH+QM+ LQ  PTP+G+QP   D+IC+ VLG R    KGLGWGPKPK+R+  S+S
Subjt:  FLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALQQAPTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSS

Query:  DAT-----SLRREHELSAVLSE
         ++     S ++E EL A L E
Subjt:  DAT-----SLRREHELSAVLSE

A0A5A7ULK1 CACTA en-spm transposon protein2.7e-1939.52Show/hide
Query:  WKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYFFLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALQQ
        ++     NK    K P+NH +GSKSFL  Q                        +EL + R G+ +  ++LF +T  + G  +VSQAA +AH+QM+ LQ 
Subjt:  WKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYFFLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALQQ

Query:  APTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSSDAT-----SLRREHELSAVLSE
         PTP+G+QP   D+IC+ VLG R D  KGLGWGPKPK+R+  S+S ++     S ++E EL A L E
Subjt:  APTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSSDAT-----SLRREHELSAVLSE

A0A5D3CH15 CACTA en-spm transposon protein1.6e-1940.48Show/hide
Query:  WKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYFFLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALQQ
        ++   + NK    K P+NH +GSKSFL  Q                        +EL K R+GE++  ++LF KT  + G  +VSQA  +AH+QM+ LQ 
Subjt:  WKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYFFLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALQQ

Query:  APTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSSDAT-----SLRREHELSAVLSET
         PT  G+QP   D+IC+ VLG R    KGLGWGPKPK+R+ TS+S ++     S ++E EL A L+ET
Subjt:  APTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSSDAT-----SLRREHELSAVLSET

A0A5D3DZR1 CACTA en-spm transposon protein2.1e-1938.92Show/hide
Query:  WKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYFFLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALQQ
        ++     NK    K P+NH +GSKSFL  Q + A                          R G+++  ++LF +T  + G  +VSQAA + HDQM+ LQ 
Subjt:  WKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYFFLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVALQQ

Query:  APTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSSDAT-----SLRREHELSAVLSE
         PTP+G+QP   D+IC+ VLG R    KGLGWGPKPK+R+ TS+S ++     S  +E EL A L E
Subjt:  APTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSSDAT-----SLRREHELSAVLSE

A0A6J1DUH3 uncharacterized protein LOC1110232122.6e-2241.18Show/hide
Query:  EDPKWKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYFFLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMV
        E P+WK + +KNK+N AKLPFNH AGSKSFL +Q +                          K +EG ++GP+DLF ++ Y E  G V+  A +A++ M 
Subjt:  EDPKWKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYFFLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMV

Query:  ALQQAPTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSSDATS
         L +APT +G +P    + C  VLG R D +KGLG+GP+P   K  SSS+ TS
Subjt:  ALQQAPTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSSDATS

SwissProt top hitse value%identityAlignment
Q9SRR0 Peroxisomal membrane protein 137.9e-0867.44Show/hide
Query:  MHGVVNFFGRISILIDQNTQAFHMFMTALLQL-DRYVRIHGKI
        M G VNFFGR+++LIDQNTQAFHMFM+ALLQL DR   ++G++
Subjt:  MHGVVNFFGRISILIDQNTQAFHMFMTALLQL-DRYVRIHGKI

Arabidopsis top hitse value%identityAlignment
AT3G07560.1 peroxin 135.6e-0967.44Show/hide
Query:  MHGVVNFFGRISILIDQNTQAFHMFMTALLQL-DRYVRIHGKI
        M G VNFFGR+++LIDQNTQAFHMFM+ALLQL DR   ++G++
Subjt:  MHGVVNFFGRISILIDQNTQAFHMFMTALLQL-DRYVRIHGKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCAAACCCTCGGCGTCGAGACATTGGCAACAGTGTTGAGACGCCAAAAACGTGCCTGCACCCAGTTTCGTCCAAAGTGTCTTGACATTGTCACCTGCTTCTATAA
ATTGTTAGCGAACTACCTTGCTTCACCTCCTTTGGTGGTAGCTTATGCCTTAGCTGGCACGATGCATGGTGTTGTGAACTTCTTTGGTCGAATATCTATTCTCATTGACC
AAAACACACAGGCATTCCATATGTTCATGACTGCACTACTTCAGCTGGATAGATACGTTCGCATCCATGGGAAGATACCTATCGAGATCACCGACGAGTTGAGAAAGCCG
GTGTGTGACAATGCAGTAAAATTCAGAGGTACCACTGGTAAAATTGTCAGAGAATCATTTTCCGTACGTTATGCAAAGTCAAAAGTTGTACCGAAGGACGCAAGGGATCA
TCTTAAACATTGTCTTTTGTGCTCCTCTAATGAACAACCTGTTTATGGTCCAACCAGTAAACAGAAAGTCCCTCTCGAGCCAGTGAGAGGGCGAGATCCCTTTGATACCC
CCGCCCGCATGTCTCCTACATGGACGCCTTGGATCAATACGTTTGTATCGAATACAAAGTGTTACCAGGATAAGATGGGAGGTGTAACACCCCGTGCACATTTGGCAGCC
CCCTTTGCCGTTTGCTTTAGCGGCCGCGTGTCCCCTCCAGCCGAAGTCTCGCGCCGTCGCAGCCTAGCCCTTTGCAGTCGAGGAACGCCGCCGTGGGTTGCCGCCGCTCG
AGCTCCAGCGCCGTCGTCGCCGTGTTCATCGTACCGCCCCTTCGTCGCTCGGATCTCCCTCTCTCTGCGTCAATCTTGCGCGGACAGCAGCTCGGAGTCTCCTTTCCTCG
CGTTTTCGCCTCTTTCCAGCAGCGTCATTGGGCGTTCCCGGCGTCATTTAGCGATTTCGGTTTTTAAATCATGGTTTAACTGGAAGCTCGTTTTGGAGCAAGTCTGTGCA
GTTCCAGCTAGCGTTGGTATTAAAAGCATTTCATGTTATGCTGTTTACAATCGTTCTGAATTGTTTGAGATGAGCTCGTTTTGTTCATCGCTTGTGGCTAGCTTGTGTCA
CGGGAAGTGTAGCATAAGTCTAGTGGTAGCGTTGCATGACGCCCTGGCGCATAATGCATGCAAGTGGTTTTGCGTAGCATGGCGCGAAATGCACGTTTTTCTGAATGTTG
CCTGGAGTTTTACGGCGTTATGCTGCCGAAATTTTCGATACATTCGGTTTAGAGAAGTTATCCTAGTTGCTTATGAGTCTAGAGTGAGTAGTAGCGTTGCTAGGCTAGAG
GTTAGAGAACCTGGGGCGTTACAGGAGGACCCCAAGTGGAAGAACATGGGTGAGAAGAATAAACAGAATCATGCCAAACTTCCTTTCAACCACTATGCTGGGTCAAAATC
ATTTCTTCACATACAACAGAAGAATGCAAGTTTAAAAATAATCTTAAGGGTAAGCTATTTTTTTCTAATATTACTAATATGTGCGAAATTTGAACTACAGAAAACTAGAG
AAGGTGAGAATTTGGGTCCTATTGACTTATTTCATAAGACTAGATACAAGGAAGGAAAGGGGTGGGTTAGTCAAGCCGCCAATGAAGCTCATGATCAAATGGTGGCATTG
CAACAGGCACCCACCCCAGATGGGACCCAACCGCCTAAACCAGATGACATATGCGAAGCTGTTCTGGGTAGTCGATCGGATCAAATTAAAGGGCTTGGTTGGGGACCAAA
GCCGAAGTCGAGAAAGTGTACCTCCTCATCCGATGCAACTTCATTAAGACGAGAGCATGAACTAAGTGCAGTCTTGAGTGAGACCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATCAAACCCTCGGCGTCGAGACATTGGCAACAGTGTTGAGACGCCAAAAACGTGCCTGCACCCAGTTTCGTCCAAAGTGTCTTGACATTGTCACCTGCTTCTATAA
ATTGTTAGCGAACTACCTTGCTTCACCTCCTTTGGTGGTAGCTTATGCCTTAGCTGGCACGATGCATGGTGTTGTGAACTTCTTTGGTCGAATATCTATTCTCATTGACC
AAAACACACAGGCATTCCATATGTTCATGACTGCACTACTTCAGCTGGATAGATACGTTCGCATCCATGGGAAGATACCTATCGAGATCACCGACGAGTTGAGAAAGCCG
GTGTGTGACAATGCAGTAAAATTCAGAGGTACCACTGGTAAAATTGTCAGAGAATCATTTTCCGTACGTTATGCAAAGTCAAAAGTTGTACCGAAGGACGCAAGGGATCA
TCTTAAACATTGTCTTTTGTGCTCCTCTAATGAACAACCTGTTTATGGTCCAACCAGTAAACAGAAAGTCCCTCTCGAGCCAGTGAGAGGGCGAGATCCCTTTGATACCC
CCGCCCGCATGTCTCCTACATGGACGCCTTGGATCAATACGTTTGTATCGAATACAAAGTGTTACCAGGATAAGATGGGAGGTGTAACACCCCGTGCACATTTGGCAGCC
CCCTTTGCCGTTTGCTTTAGCGGCCGCGTGTCCCCTCCAGCCGAAGTCTCGCGCCGTCGCAGCCTAGCCCTTTGCAGTCGAGGAACGCCGCCGTGGGTTGCCGCCGCTCG
AGCTCCAGCGCCGTCGTCGCCGTGTTCATCGTACCGCCCCTTCGTCGCTCGGATCTCCCTCTCTCTGCGTCAATCTTGCGCGGACAGCAGCTCGGAGTCTCCTTTCCTCG
CGTTTTCGCCTCTTTCCAGCAGCGTCATTGGGCGTTCCCGGCGTCATTTAGCGATTTCGGTTTTTAAATCATGGTTTAACTGGAAGCTCGTTTTGGAGCAAGTCTGTGCA
GTTCCAGCTAGCGTTGGTATTAAAAGCATTTCATGTTATGCTGTTTACAATCGTTCTGAATTGTTTGAGATGAGCTCGTTTTGTTCATCGCTTGTGGCTAGCTTGTGTCA
CGGGAAGTGTAGCATAAGTCTAGTGGTAGCGTTGCATGACGCCCTGGCGCATAATGCATGCAAGTGGTTTTGCGTAGCATGGCGCGAAATGCACGTTTTTCTGAATGTTG
CCTGGAGTTTTACGGCGTTATGCTGCCGAAATTTTCGATACATTCGGTTTAGAGAAGTTATCCTAGTTGCTTATGAGTCTAGAGTGAGTAGTAGCGTTGCTAGGCTAGAG
GTTAGAGAACCTGGGGCGTTACAGGAGGACCCCAAGTGGAAGAACATGGGTGAGAAGAATAAACAGAATCATGCCAAACTTCCTTTCAACCACTATGCTGGGTCAAAATC
ATTTCTTCACATACAACAGAAGAATGCAAGTTTAAAAATAATCTTAAGGGTAAGCTATTTTTTTCTAATATTACTAATATGTGCGAAATTTGAACTACAGAAAACTAGAG
AAGGTGAGAATTTGGGTCCTATTGACTTATTTCATAAGACTAGATACAAGGAAGGAAAGGGGTGGGTTAGTCAAGCCGCCAATGAAGCTCATGATCAAATGGTGGCATTG
CAACAGGCACCCACCCCAGATGGGACCCAACCGCCTAAACCAGATGACATATGCGAAGCTGTTCTGGGTAGTCGATCGGATCAAATTAAAGGGCTTGGTTGGGGACCAAA
GCCGAAGTCGAGAAAGTGTACCTCCTCATCCGATGCAACTTCATTAAGACGAGAGCATGAACTAAGTGCAGTCTTGAGTGAGACCTAA
Protein sequenceShow/hide protein sequence
MDQTLGVETLATVLRRQKRACTQFRPKCLDIVTCFYKLLANYLASPPLVVAYALAGTMHGVVNFFGRISILIDQNTQAFHMFMTALLQLDRYVRIHGKIPIEITDELRKP
VCDNAVKFRGTTGKIVRESFSVRYAKSKVVPKDARDHLKHCLLCSSNEQPVYGPTSKQKVPLEPVRGRDPFDTPARMSPTWTPWINTFVSNTKCYQDKMGGVTPRAHLAA
PFAVCFSGRVSPPAEVSRRRSLALCSRGTPPWVAAARAPAPSSPCSSYRPFVARISLSLRQSCADSSSESPFLAFSPLSSSVIGRSRRHLAISVFKSWFNWKLVLEQVCA
VPASVGIKSISCYAVYNRSELFEMSSFCSSLVASLCHGKCSISLVVALHDALAHNACKWFCVAWREMHVFLNVAWSFTALCCRNFRYIRFREVILVAYESRVSSSVARLE
VREPGALQEDPKWKNMGEKNKQNHAKLPFNHYAGSKSFLHIQQKNASLKIILRVSYFFLILLICAKFELQKTREGENLGPIDLFHKTRYKEGKGWVSQAANEAHDQMVAL
QQAPTPDGTQPPKPDDICEAVLGSRSDQIKGLGWGPKPKSRKCTSSSDATSLRREHELSAVLSET