CuGenDBv2

Tan0016400 (gene) of Snake gourd v1 genome

Gene ID: Tan0016400
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Plant protein 1589 of unknown function
Genome location: LG11:6553786..6555067
RNA-Seq Expression: Tan0016400
Synteny: Tan0016400
Gene Ontology terms:
GO:0032502 - developmental process (biological process)
GO:0016740 - transferase activity (molecular function)
InterPro domains: IPR006476 - Conserved hypothetical protein CHP01589, plant


Homology
GenBank top hits (e value, % identity, alignment)
XP_004150990.1 uncharacterized protein LOC101208991 isoform X1 [Cucumis sativus] | e value: 2.9e-39 | % identity: 89.47
Query:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN
        MADSS+SY+HMVQHLIEKCLIFHMTKEECMDALSKHA+I PIITSTVW ELEKENKEFFEAY QSH+N DRMSEEETSQMIQ+MISDS KGDPKN
Subjt:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN

XP_008463517.1 PREDICTED: uncharacterized protein LOC103501658 [Cucumis melo] | e value: 3.7e-39 | % identity: 89.47
Query:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN
        MADSS+SYIHMVQHLIEKCLIFHMTKEECM+ALSKHA+I PIITSTVW+ELEKENKEFFEAY QSH+N DRMSEEETSQMIQ+MISDS KGDPKN
Subjt:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN

XP_022133966.1 uncharacterized protein LOC111006373 isoform X1 [Momordica charantia] | e value: 2.0e-40 | % identity: 92.63
Query:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN
        MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANI PIITSTVW+ELEKENKEFFEAY QS SN DRMSEEETSQMIQ+MISDS+KGDPKN
Subjt:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN

XP_022921441.1 uncharacterized protein LOC111429717 [Cucurbita moschata] | e value: 2.1e-37 | % identity: 86.32
Query:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN
        MADS ASYIHMVQHLIEKCLIF MTKEECMDALSKHANI+P+ITSTVWTELEKENKEFFE Y QS  + DRMSEEETSQMIQ+MIS+S+KGDPKN
Subjt:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN

XP_038890611.1 uncharacterized protein LOC120080119 [Benincasa hispida] | e value: 9.8e-40 | % identity: 89.47
Query:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN
        MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHA+I PIITSTVW+ELEKENKEFFEAY Q+H+N DRMSEE+TSQMIQ+MISDS+KGDPKN
Subjt:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KU49 Uncharacterized protein | e value: 1.4e-39 | % identity: 89.47
Query:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN
        MADSS+SY+HMVQHLIEKCLIFHMTKEECMDALSKHA+I PIITSTVW ELEKENKEFFEAY QSH+N DRMSEEETSQMIQ+MISDS KGDPKN
Subjt:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN

A0A1S3CJG8 uncharacterized protein LOC103501658 | e value: 1.8e-39 | % identity: 89.47
Query:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN
        MADSS+SYIHMVQHLIEKCLIFHMTKEECM+ALSKHA+I PIITSTVW+ELEKENKEFFEAY QSH+N DRMSEEETSQMIQ+MISDS KGDPKN
Subjt:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN

A0A6J1BWN4 uncharacterized protein LOC111006373 isoform X1 | e value: 9.6e-41 | % identity: 92.63
Query:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN
        MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANI PIITSTVW+ELEKENKEFFEAY QS SN DRMSEEETSQMIQ+MISDS+KGDPKN
Subjt:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN

A0A6J1E1E4 uncharacterized protein LOC111429717 | e value: 9.9e-38 | % identity: 86.32
Query:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN
        MADS ASYIHMVQHLIEKCLIF MTKEECMDALSKHANI+P+ITSTVWTELEKENKEFFE Y QS  + DRMSEEETSQMIQ+MIS+S+KGDPKN
Subjt:  MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN

A0A6J1JTG8 uncharacterized protein LOC111487343 | e value: 8.4e-37 | % identity: 86.46
Query:  MAD-SSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN
        MAD SSASYIHMVQHLIEKCLIF MTKEECMDALSKHANI+P+ITSTVWTELEKENKEFFEAY ++ SN D MSEEET+QMIQRMI DS+KGDPKN
Subjt:  MAD-SSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G10657.2 Plant protein 1589 of unknown function | e value: 3.1e-15 | % identity: 57.14
Query:  SYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAY
        SYI MVQH+IE+C++  MT++EC+ AL  HA+I P++T TVW  L++ENK+FFE Y
Subjt:  SYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAY

AT1G10657.3 Plant protein 1589 of unknown function | e value: 1.2e-14 | % identity: 48.65
Query:  SYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQM
        SYI MVQH+IE+C++  MT++EC+ AL  HA+I P++T TVW  L++ENK+FFE Y    S    +S+    QM
Subjt:  SYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQM

AT3G28990.1 Plant protein 1589 of unknown function | e value: 1.9e-25 | % identity: 66.27
Query:  SSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMIS
        S A YIH+VQH+IE CL F+M+KEECM+ALS++ANI PIITSTVW EL KENK+FFE Y+Q     + MSEEET+Q+IQ +IS
Subjt:  SSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMIS

AT3G55240.1 Plant protein 1589 of unknown function | e value: 6.0e-35 | % identity: 78.02
Query:  MAD-SSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTK
        MAD SSASYIHMVQH+IEKCLIFHM+KEEC++ALSKHANITP+ITSTVW ELEKENKEFF+AY++  S  ++MSEEET+QMIQ++ISDS+K
Subjt:  MAD-SSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTK

AT5G02580.1 Plant protein 1589 of unknown function | e value: 1.4e-23 | % identity: 61.63
Query:  SSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDST
        S ASYIH+V HLIE+C++F+M KEECMDAL KHANI PIITSTVW EL KENKEFFEAY++    +   +E+ET++ I+ ++S +T
Subjt:  SSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDST
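
The % identity figures listed for the hits can be cross-checked against the middle line of each alignment. The sketch below assumes the usual BLAST-style convention, in which a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap, and it assumes the middle line has been trimmed to the same columns as the Query line. The helper name `percent_identity` is hypothetical and is not part of CuGenDBv2.

```python
def percent_identity(midline: str) -> float:
    """Return the share of alignment columns whose midline character is a letter.

    Assumption: letters mark identical residues; '+' and ' ' mark
    non-identical columns (conservative substitutions, mismatches, gaps).
    """
    if not midline:
        raise ValueError("midline must not be empty")
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(midline)


# Middle line of the first GenBank hit (XP_004150990.1), copied from the record.
MIDLINE = (
    "MADSS+SY+HMVQHLIEKCLIFHMTKEECMDALSKHA+I PIITSTVW "
    "ELEKENKEFFEAY QSH+N DRMSEEETSQMIQ+MISDS KGDPKN"
)

print(f"{percent_identity(MIDLINE):.2f}")  # 85 identical columns out of 95
```

For this hit the middle line has 95 columns, 85 of which are letters, which agrees with the listed 89.47.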


Sequences
CDS sequence
ATGGCAGATTCTTCTGCTTCATACATACATATGGTGCAGCACCTGATCGAGAAGTGTCTGATCTTCCACATGACAAAAGAAGAGTGCATGGACGCCCTCTCCAAACATGC
AAATATTACACCTATCATCACGTCAACAGTGTGGACCGAATTAGAGAAGGAAAACAAGGAATTCTTTGAGGCATATAAACAATCTCATAGCAACATGGACAGAATGTCTG
AGGAAGAGACAAGCCAAATGATCCAAAGGATGATCTCAGATTCCACCAAAGGGGACCCTAAAAACTAA
mRNA sequence
ATGGCAGATTCTTCTGCTTCATACATACATATGGTGCAGCACCTGATCGAGAAGTGTCTGATCTTCCACATGACAAAAGAAGAGTGCATGGACGCCCTCTCCAAACATGC
AAATATTACACCTATCATCACGTCAACAGTGTGGACCGAATTAGAGAAGGAAAACAAGGAATTCTTTGAGGCATATAAACAATCTCATAGCAACATGGACAGAATGTCTG
AGGAAGAGACAAGCCAAATGATCCAAAGGATGATCTCAGATTCCACCAAAGGGGACCCTAAAAACTAA
Protein sequence
MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN
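
The relationship between the CDS and the protein sequence can be checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code and reads codons from the first base until the first stop codon. The names `CODON_TABLE` and `translate` are illustrative and do not come from the database.

```python
# Standard genetic code, built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# CDS and protein copied from the record above.
CDS = (
    "ATGGCAGATTCTTCTGCTTCATACATACATATGGTGCAGCACCTGATCGAGAAGTGTCTGATCTTCCACATGACAAAAGAAGAGTGCATGGACGCCCTCTCCAAACATGC"
    "AAATATTACACCTATCATCACGTCAACAGTGTGGACCGAATTAGAGAAGGAAAACAAGGAATTCTTTGAGGCATATAAACAATCTCATAGCAACATGGACAGAATGTCTG"
    "AGGAAGAGACAAGCCAAATGATCCAAAGGATGATCTCAGATTCCACCAAAGGGGACCCTAAAAACTAA"
)
PROTEIN = (
    "MADSSASYIHMVQHLIEKCLIFHMTKEECMDALSKHANITPIITSTVWTELEKENKEFFEAYKQSHSNMDRMSEEETSQMIQRMISDSTKGDPKN"
)

print(translate(CDS) == PROTEIN)
```

The CDS is 288 nucleotides long, that is, 95 codons plus a terminal TAA stop codon, which is consistent with the 95-residue protein sequence.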