; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018872 (gene) of Snake gourd v1 genome

Gene IDTan0018872
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProfilin domain-containing protein
Genome locationLG01:10104782..10109525
RNA-Seq ExpressionTan0018872
SyntenyTan0018872
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0005856 - cytoskeleton (cellular component)
GO:0003779 - actin binding (molecular function)
InterPro domainsIPR005455 - Profilin
IPR036140 - Profilin superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577112.1 hypothetical protein SDJN03_24686, partial [Cucurbita argyrosperma subsp. sororia]3.8e-7595.92Show/hide
Query:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDWAFV KAWDKWASGSIGTFGQPLK+ALLINYDPTGPSRLLSTIAEQEGILLNPIEL+QF+DFIKRDKPQ ESFSIGSNQYIMTSVHENWFCARCMNTI
Subjt:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMA+ADQLSWQLGRKNL
Subjt:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

XP_008451862.1 PREDICTED: uncharacterized protein LOC103493024 [Cucumis melo]4.2e-7494.56Show/hide
Query:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDWAFVHKAWDKWASGSIGTFGQPLK+ALLINYDPTGPSRLLSTIAEQEGILLNPIEL+QFVDFIKRDKPQ ESFSIG NQYIMTSVHENWFCARCMNTI
Subjt:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        KHAGEGV+IVQTTAFIL+AMYDGSIAAASRAMAAADQLSW L RKNL
Subjt:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

XP_011653239.1 uncharacterized protein LOC105435190 [Cucumis sativus]1.2e-7394.52Show/hide
Query:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDWAFVHKAWDKWASGSIGTFGQPLK+ALLINYDPTGPSRLLSTIAEQEGILLNPIEL+QFVDFIKRDKPQ ESFSIG NQYIMTSVHENWFCARCMNTI
Subjt:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKN
        KHAGEGV+IVQTTAFIL+AMYDGSIAAASRAMAAADQLSW L RKN
Subjt:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKN

XP_022136632.1 uncharacterized protein LOC111008291 [Momordica charantia]8.5e-7594.56Show/hide
Query:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDWAF HKAWDKWAS SIGTFGQPLK+A+LINYDPTGPSRLLSTIAEQEGIL+NPIEL+QF+DFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
Subjt:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        KHAGEGVVIVQT AFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
Subjt:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

XP_038878184.1 uncharacterized protein LOC120070329 [Benincasa hispida]2.2e-7595.92Show/hide
Query:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        M+WAFVHK WDKWASGSIGTFGQPLK+ALLINYDPTGPSRLLSTIAEQEGILLNPIEL+QFVDFIKRDKPQ ESFSIGSNQYIMTSVHENWFCARCMNTI
Subjt:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        KHAGEGV+IVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
Subjt:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

TrEMBL top hitse value%identityAlignment
A0A0B2RAC5 Uncharacterized protein4.9e-6077.55Show/hide
Query:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDWAFVHK WDKWAS +IG  G PLK+ALLINYDPTGPSRLLSTIAEQEG+  NPIELS FVDFIK++K QTE F IGSNQY++TS+HENWF ARC+NT 
Subjt:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        K AGEG +++QT A+ILVAMY+GSI  ASRAMAAADQL+WQLGRKNL
Subjt:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

A0A1Q3AMV2 Profilin domain-containing protein3.4e-6173.47Show/hide
Query:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDW FVH+AW+KWAS ++G++G+PLK+ALLINYDPTGPSRLLSTIAEQEGI  NPIELS+F+DFI R+KPQTESF +G NQYI+TS+HE+WFCARC+NT 
Subjt:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        K AGEG +++QTTAF+LV +YDGSI +ASRAM AADQ +WQL R+NL
Subjt:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

A0A1S3BTL7 uncharacterized protein LOC1034930242.1e-7494.56Show/hide
Query:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDWAFVHKAWDKWASGSIGTFGQPLK+ALLINYDPTGPSRLLSTIAEQEGILLNPIEL+QFVDFIKRDKPQ ESFSIG NQYIMTSVHENWFCARCMNTI
Subjt:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        KHAGEGV+IVQTTAFIL+AMYDGSIAAASRAMAAADQLSW L RKNL
Subjt:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

A0A6J1C603 uncharacterized protein LOC1110082914.1e-7594.56Show/hide
Query:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDWAF HKAWDKWAS SIGTFGQPLK+A+LINYDPTGPSRLLSTIAEQEGIL+NPIEL+QF+DFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
Subjt:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        KHAGEGVVIVQT AFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
Subjt:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

A0A6P9E5M5 uncharacterized protein LOC1089800731.7e-6076.19Show/hide
Query:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI
        MDWAFVHK WDKWAS ++G+ G+PLK+ALLINYDPTGPSRLLSTIAEQEGI  NPIELSQ VDFIK +K QTESF IG+NQY++TS+HENWFCARCMNT 
Subjt:  MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTI

Query:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL
        K AGEG +I+QT AF+LVA+YDGSI  ASRAM A DQ +WQL R+NL
Subjt:  KHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19400.1 Profilin family protein8.3e-5263.27Show/hide
Query:  MDWAFVHKAWDKW-ASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNT
        MD AFV +AWDKW  +G++G+ G PLK+A+LINYDPTGPSRLLSTIA+QEGI + P++L QF+DF++     TE+F +GSNQYI+TS+HENWF ARC+NT
Subjt:  MDWAFVHKAWDKW-ASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNT

Query:  IKHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKN
         + AGEG +++QT  ++LVA+YDGSI +AS+AMAAADQ + QL RKN
Subjt:  IKHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKN

AT4G19410.2 Pectinacetylesterase family protein8.3e-5263.27Show/hide
Query:  MDWAFVHKAWDKW-ASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNT
        MD AFV +AWDKW  +G++G+ G PLK+A+LINYDPTGPSRLLSTIA+QEGI + P++L QF+DF++     TE+F +GSNQYI+TS+HENWF ARC+NT
Subjt:  MDWAFVHKAWDKW-ASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNT

Query:  IKHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKN
         + AGEG +++QT  ++LVA+YDGSI +AS+AMAAADQ + QL RKN
Subjt:  IKHAGEGVVIVQTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTGGGCATTCGTCCACAAAGCTTGGGACAAATGGGCTTCCGGCAGCATTGGCACCTTTGGTCAGCCATTGAAATCTGCTCTGTTGATTAATTATGATCCAACTGG
ACCTTCCCGCTTGCTTTCGACCATTGCAGAACAAGAAGGAATATTGCTTAATCCCATAGAACTGAGTCAGTTTGTCGATTTCATCAAACGTGACAAACCCCAGACAGAGA
GTTTCAGCATTGGTTCAAATCAATACATAATGACATCAGTTCATGAGAATTGGTTTTGTGCAAGGTGTATGAACACTATAAAGCATGCTGGTGAAGGTGTTGTTATTGTG
CAAACAACAGCGTTTATTTTGGTTGCTATGTATGATGGCTCCATTGCAGCAGCATCTCGTGCCATGGCCGCAGCCGATCAGTTATCTTGGCAATTAGGGAGGAAAAATCT
TTAG
mRNA sequenceShow/hide mRNA sequence
CTTATATATAACGAAATTTTTGTTTGAAAAAAAAAAACAGGATTTCCCTCACTATTTGGTAGAGCTCACTGTCACTGGTTCATTTCCATGGAAGCTTAGACGCCCAGTAG
GTTTCGCTTAAATGGATTGGGCATTCGTCCACAAAGCTTGGGACAAATGGGCTTCCGGCAGCATTGGCACCTTTGGTCAGCCATTGAAATCTGCTCTGTTGATTAATTAT
GATCCAACTGGACCTTCCCGCTTGCTTTCGACCATTGCAGAACAAGAAGGAATATTGCTTAATCCCATAGAACTGAGTCAGTTTGTCGATTTCATCAAACGTGACAAACC
CCAGACAGAGAGTTTCAGCATTGGTTCAAATCAATACATAATGACATCAGTTCATGAGAATTGGTTTTGTGCAAGGTGTATGAACACTATAAAGCATGCTGGTGAAGGTG
TTGTTATTGTGCAAACAACAGCGTTTATTTTGGTTGCTATGTATGATGGCTCCATTGCAGCAGCATCTCGTGCCATGGCCGCAGCCGATCAGTTATCTTGGCAATTAGGG
AGGAAAAATCTTTAGTTGCCTTGAAGTAAAAACTTTATATGAATGACAAATCTATCTGATAGAACAAATGACAAATCCTTCACCCAGTATCTAACTGATGTAGTTGCAGT
TGTTTCTTCCATGTTTTATTACAATCACCTACTTATCACTAGAGTTCTTCCTATCATGAATTTGACCGATGAAGAAAATGCAAATATTTCTTCCTTAAGCAACTTTCGAC
AACAGTTTTGAGTCTCAACTGTTACAAAGCTATAGAACTTCTATCTAATATCAATTTTCAAATGTTATACCTGCTCTAAGTTGTACTAAGTTGTAATGTGGATTAATGTT
TTTTTACTAGCCTAAGCATAACTCAACTAGTTAAGCATCTCTCAAGTAATGTGATTCAGTTCAAAATATATTCAGCATATAATATATATATATATTTCTTATTTGGGCCT
TGAATTTTGTTGAATGTGATTCAGTTCTTTTTTTCTTTTTCTAGCATTTATATTATATAATATTTTGTTCTATATGACATGTACTATGCTGAGAAGGAAGGTCAAAAGAA
ATTAGGTTGAACTTCCCTTCTAAAGGGTGCTTTTGTTGCGGTGTCTTAACTGATTATAGGAAGCCTTGCTGCTGAAGATAACTTTTTTAGCAACATTATTTTATTATAAG
TTTGCTCTTTGTTCTGTACAATAAACTATCTCTTGTAAGAAGAAAAGTACCGTTGTAGTTGATCGTTTAGAAAATTTAATTTTAGCAATT
Protein sequenceShow/hide protein sequence
MDWAFVHKAWDKWASGSIGTFGQPLKSALLINYDPTGPSRLLSTIAEQEGILLNPIELSQFVDFIKRDKPQTESFSIGSNQYIMTSVHENWFCARCMNTIKHAGEGVVIV
QTTAFILVAMYDGSIAAASRAMAAADQLSWQLGRKNL