; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS012874 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS012874
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProfilin domain-containing protein
Genome locationscaffold63:4155012..4156949
RNA-Seq ExpressionMS012874
SyntenyMS012874
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0005856 - cytoskeleton (cellular component)
GO:0003779 - actin binding (molecular function)
InterPro domainsIPR005455 - Profilin
IPR036140 - Profilin superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577112.1 hypothetical protein SDJN03_24686, partial [Cucurbita argyrosperma subsp. sororia]3.6e-7393.2Show/hide
Query:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDWAF  KAWDKWAS S+GTFGQPLKAA+LINYDPTGPSRLLSTIAEQEGI +NPIELNQFIDFIKRDKPQ ESFSIGSNQYIMTSVHENWFCARCMNTI
Subjt:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        KHAGEGVVIVQT AFILVAMYDGSIAAASRAMA+ADQLSWQLGRKNL
Subjt:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

XP_008451862.1 PREDICTED: uncharacterized protein LOC103493024 [Cucumis melo]6.8e-7290.48Show/hide
Query:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDWAF HKAWDKWAS S+GTFGQPLKAA+LINYDPTGPSRLLSTIAEQEGI +NPIELNQF+DFIKRDKPQ ESFSIG NQYIMTSVHENWFCARCMNTI
Subjt:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        KHAGEGV+IVQT AFIL+AMYDGSIAAASRAMAAADQLSW L RKNL
Subjt:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

XP_011653239.1 uncharacterized protein LOC105435190 [Cucumis sativus]2.0e-7190.41Show/hide
Query:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDWAF HKAWDKWAS S+GTFGQPLKAA+LINYDPTGPSRLLSTIAEQEGI +NPIELNQF+DFIKRDKPQ ESFSIG NQYIMTSVHENWFCARCMNTI
Subjt:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKN
        KHAGEGV+IVQT AFIL+AMYDGSIAAASRAMAAADQLSW L RKN
Subjt:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKN

XP_022136632.1 uncharacterized protein LOC111008291 [Momordica charantia]5.4e-7798.64Show/hide
Query:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDWAFAHKAWDKWASAS+GTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGI INPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
Subjt:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
Subjt:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

XP_038878184.1 uncharacterized protein LOC120070329 [Benincasa hispida]3.6e-7391.84Show/hide
Query:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        M+WAF HK WDKWAS S+GTFGQPLKAA+LINYDPTGPSRLLSTIAEQEGI +NPIELNQF+DFIKRDKPQ ESFSIGSNQYIMTSVHENWFCARCMNTI
Subjt:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        KHAGEGV+IVQT AFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
Subjt:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

TrEMBL top hitse value%identityAlignment
A0A1Q3AMV2 Profilin domain-containing protein6.4e-6072.79Show/hide
Query:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDW F H+AW+KWAS++VG++G+PLKAA+LINYDPTGPSRLLSTIAEQEGI  NPIEL++FIDFI R+KPQTESF +G NQYI+TS+HE+WFCARC+NT 
Subjt:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        K AGEG +++QT AF+LV +YDGSI +ASRAM AADQ +WQL R+NL
Subjt:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

A0A1S3BTL7 uncharacterized protein LOC1034930243.3e-7290.48Show/hide
Query:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDWAF HKAWDKWAS S+GTFGQPLKAA+LINYDPTGPSRLLSTIAEQEGI +NPIELNQF+DFIKRDKPQ ESFSIG NQYIMTSVHENWFCARCMNTI
Subjt:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        KHAGEGV+IVQT AFIL+AMYDGSIAAASRAMAAADQLSW L RKNL
Subjt:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

A0A5N5HDY6 Uncharacterized protein8.4e-6072.79Show/hide
Query:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDW F HKAWDKWAS ++ +  +PLKAA+LINYDPTGPSRLLSTIAE+EGI +NPIEL+QF+DFIK DK QTESF IG+NQY++TS+H+NWFCARCMNT 
Subjt:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        KH+GEG +++QTAAF+LV +YDGSI +ASRAMAA DQ +WQL RKN+
Subjt:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

A0A6J1C603 uncharacterized protein LOC1110082912.6e-7798.64Show/hide
Query:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDWAFAHKAWDKWASAS+GTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGI INPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
Subjt:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
Subjt:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

A0A6P9E5M5 uncharacterized protein LOC1089800731.1e-5975.51Show/hide
Query:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDWAF HK WDKWAS +VG+ G+PLKAA+LINYDPTGPSRLLSTIAEQEGI  NPIEL+Q +DFIK +K QTESF IG+NQY++TS+HENWFCARCMNT 
Subjt:  MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        K AGEG +I+QTAAF+LVA+YDGSI  ASRAM A DQ +WQL R+NL
Subjt:  KHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19400.1 Profilin family protein1.1e-5165.31Show/hide
Query:  MDWAFAHKAWDKW-ASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNT
        MD AF  +AWDKW  + +VG+ G PLKAA+LINYDPTGPSRLLSTIA+QEGI I P++L QFIDF++     TE+F +GSNQYI+TS+HENWF ARC+NT
Subjt:  MDWAFAHKAWDKW-ASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNT

Query:  IKHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKN
         + AGEG +++QTA ++LVA+YDGSI +AS+AMAAADQ + QL RKN
Subjt:  IKHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKN

AT4G19410.2 Pectinacetylesterase family protein1.1e-5165.31Show/hide
Query:  MDWAFAHKAWDKW-ASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNT
        MD AF  +AWDKW  + +VG+ G PLKAA+LINYDPTGPSRLLSTIA+QEGI I P++L QFIDF++     TE+F +GSNQYI+TS+HENWF ARC+NT
Subjt:  MDWAFAHKAWDKW-ASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNT

Query:  IKHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKN
         + AGEG +++QTA ++LVA+YDGSI +AS+AMAAADQ + QL RKN
Subjt:  IKHAGEGVVIVQTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTGGGCATTTGCCCATAAAGCTTGGGACAAGTGGGCTTCCGCCAGCGTCGGCACCTTCGGTCAGCCATTGAAAGCTGCTATGTTGATTAATTATGATCCAACCGG
ACCTTCCCGCTTGCTTTCAACCATTGCAGAACAAGAAGGAATCTTCATCAATCCCATAGAACTGAATCAGTTTATCGATTTCATCAAACGTGACAAACCCCAAACAGAGA
GTTTCAGCATTGGTTCAAATCAGTACATAATGACATCGGTTCATGAGAATTGGTTTTGTGCAAGGTGCATGAACACTATAAAGCATGCTGGTGAAGGTGTTGTTATTGTG
CAAACAGCAGCATTTATCTTGGTTGCTATGTATGATGGTTCCATTGCAGCAGCGTCTCGCGCTATGGCAGCAGCTGATCAGTTATCTTGGCAATTAGGCAGGAAAAATCT
T
mRNA sequenceShow/hide mRNA sequence
ATGGATTGGGCATTTGCCCATAAAGCTTGGGACAAGTGGGCTTCCGCCAGCGTCGGCACCTTCGGTCAGCCATTGAAAGCTGCTATGTTGATTAATTATGATCCAACCGG
ACCTTCCCGCTTGCTTTCAACCATTGCAGAACAAGAAGGAATCTTCATCAATCCCATAGAACTGAATCAGTTTATCGATTTCATCAAACGTGACAAACCCCAAACAGAGA
GTTTCAGCATTGGTTCAAATCAGTACATAATGACATCGGTTCATGAGAATTGGTTTTGTGCAAGGTGCATGAACACTATAAAGCATGCTGGTGAAGGTGTTGTTATTGTG
CAAACAGCAGCATTTATCTTGGTTGCTATGTATGATGGTTCCATTGCAGCAGCGTCTCGCGCTATGGCAGCAGCTGATCAGTTATCTTGGCAATTAGGCAGGAAAAATCT
T
Protein sequenceShow/hide protein sequence
MDWAFAHKAWDKWASASVGTFGQPLKAAMLINYDPTGPSRLLSTIAEQEGIFINPIELNQFIDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTIKHAGEGVVIV
QTAAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL