; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004116 (gene) of Snake gourd v1 genome

Gene IDTan0004116
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHomeobox prospero protein
Genome locationLG08:74481794..74482920
RNA-Seq ExpressionTan0004116
SyntenyTan0004116
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004154327.1 uncharacterized protein LOC101221969 isoform X1 [Cucumis sativus]1.1e-8394.44Show/hide
Query:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
        M RKRFQEAK G+QL+KSMRAERFLKKVGLGREDRYFWKQ+GKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
Subjt:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF

Query:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE
        + KGGMIGTAIGPKG+LDFDKDSYNYQKELQN KLEQEAQKLWFRMRNEVISELQEKGYDVE
Subjt:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE

XP_008457407.1 PREDICTED: uncharacterized protein LOC103497102 [Cucumis melo]1.4e-8191.98Show/hide
Query:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
        M RKRFQEAK G+QL+ SMR ERFL+KVGLGREDRYFWKQIGKALLCTYT+IG AWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
Subjt:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF

Query:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE
        + KGGMIGTAIGPKG+LDFDKDSYNYQKELQN KLEQEAQKLWFRMRNEVISELQEKGYDVE
Subjt:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE

XP_022965021.1 uncharacterized protein LOC111464959 [Cucurbita moschata]3.9e-8191.98Show/hide
Query:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
        M RKRFQEAK GLQLVKSMR E+FLKKVGLGRED YFWKQIGKALLCTYTLIG AWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAME+F
Subjt:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF

Query:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE
        +AKGGMIGTAIGPKGM+DFDKDSYNY+KEL+  KLEQEAQKLWFRMRNEVISELQEKGYDVE
Subjt:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE

XP_022998684.1 uncharacterized protein LOC111493271 [Cucurbita maxima]2.3e-8193.21Show/hide
Query:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
        M RKRFQEAK GLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALL +YTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
Subjt:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF

Query:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE
        +AKGGMIGTAIGPKG++DFDKDS+NYQKEL+N KLEQEAQKLWFRMRNEVISELQ KGYDVE
Subjt:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE

XP_038895441.1 uncharacterized protein LOC120083675 [Benincasa hispida]5.9e-8596.3Show/hide
Query:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
        M RKRFQEAK G+QLVKS+RAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAME+F
Subjt:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF

Query:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE
        +AKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE
Subjt:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE

TrEMBL top hitse value%identityAlignment
A0A0A0LY68 Uncharacterized protein5.4e-8494.44Show/hide
Query:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
        M RKRFQEAK G+QL+KSMRAERFLKKVGLGREDRYFWKQ+GKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
Subjt:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF

Query:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE
        + KGGMIGTAIGPKG+LDFDKDSYNYQKELQN KLEQEAQKLWFRMRNEVISELQEKGYDVE
Subjt:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE

A0A1S3C5F1 uncharacterized protein LOC1034971026.6e-8291.98Show/hide
Query:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
        M RKRFQEAK G+QL+ SMR ERFL+KVGLGREDRYFWKQIGKALLCTYT+IG AWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
Subjt:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF

Query:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE
        + KGGMIGTAIGPKG+LDFDKDSYNYQKELQN KLEQEAQKLWFRMRNEVISELQEKGYDVE
Subjt:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE

A0A5A7SMD1 Uncharacterized protein6.6e-8291.98Show/hide
Query:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
        M RKRFQEAK G+QL+ SMR ERFL+KVGLGREDRYFWKQIGKALLCTYT+IG AWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
Subjt:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF

Query:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE
        + KGGMIGTAIGPKG+LDFDKDSYNYQKELQN KLEQEAQKLWFRMRNEVISELQEKGYDVE
Subjt:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE

A0A6J1HJ79 uncharacterized protein LOC1114649591.9e-8191.98Show/hide
Query:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
        M RKRFQEAK GLQLVKSMR E+FLKKVGLGRED YFWKQIGKALLCTYTLIG AWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAME+F
Subjt:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF

Query:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE
        +AKGGMIGTAIGPKGM+DFDKDSYNY+KEL+  KLEQEAQKLWFRMRNEVISELQEKGYDVE
Subjt:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE

A0A6J1KD63 uncharacterized protein LOC1114932711.1e-8193.21Show/hide
Query:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
        M RKRFQEAK GLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALL +YTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
Subjt:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF

Query:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE
        +AKGGMIGTAIGPKG++DFDKDS+NYQKEL+N KLEQEAQKLWFRMRNEVISELQ KGYDVE
Subjt:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G58920.1 unknown protein8.2e-6971.6Show/hide
Query:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF
        M RK+FQ+AK G++ +KSM A  +LKKVGLGR+D +FWKQ+GKALLCTYT+ G+AW+YNETSP GWWTLKPR K E++LAHLYERREFPYPGD EAMEDF
Subjt:  MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDF

Query:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE
        VAKGGMIGTAIGPKG+++ + ++ NYQKE++ KK ++EAQKLW RMRNEVI+ELQEKG+++E
Subjt:  VAKGGMIGTAIGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTCGTAAACGCTTTCAAGAAGCTAAAATGGGTCTGCAACTGGTGAAATCGATGAGAGCCGAGAGGTTTTTGAAGAAAGTTGGATTGGGAAGAGAGGATCGTTACTT
CTGGAAGCAAATTGGCAAGGCTCTGTTGTGTACGTATACTCTGATCGGCGTTGCATGGCTTTACAATGAAACATCACCATTTGGTTGGTGGACGCTGAAGCCACGGTCGA
AGGCAGAGAAAGATTTGGCTCACCTGTATGAGCGGCGGGAGTTTCCATATCCAGGTGATGAAGAAGCTATGGAGGATTTCGTTGCCAAGGGGGGAATGATCGGAACTGCA
ATTGGTCCGAAGGGGATGCTTGATTTTGATAAGGATTCTTACAATTATCAGAAAGAATTACAGAACAAGAAGCTCGAGCAAGAGGCCCAGAAGCTATGGTTCAGGATGAG
GAACGAGGTTATTTCAGAGCTTCAGGAGAAGGGCTACGACGTTGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTCGTAAACGCTTTCAAGAAGCTAAAATGGGTCTGCAACTGGTGAAATCGATGAGAGCCGAGAGGTTTTTGAAGAAAGTTGGATTGGGAAGAGAGGATCGTTACTT
CTGGAAGCAAATTGGCAAGGCTCTGTTGTGTACGTATACTCTGATCGGCGTTGCATGGCTTTACAATGAAACATCACCATTTGGTTGGTGGACGCTGAAGCCACGGTCGA
AGGCAGAGAAAGATTTGGCTCACCTGTATGAGCGGCGGGAGTTTCCATATCCAGGTGATGAAGAAGCTATGGAGGATTTCGTTGCCAAGGGGGGAATGATCGGAACTGCA
ATTGGTCCGAAGGGGATGCTTGATTTTGATAAGGATTCTTACAATTATCAGAAAGAATTACAGAACAAGAAGCTCGAGCAAGAGGCCCAGAAGCTATGGTTCAGGATGAG
GAACGAGGTTATTTCAGAGCTTCAGGAGAAGGGCTACGACGTTGAGTGA
Protein sequenceShow/hide protein sequence
MTRKRFQEAKMGLQLVKSMRAERFLKKVGLGREDRYFWKQIGKALLCTYTLIGVAWLYNETSPFGWWTLKPRSKAEKDLAHLYERREFPYPGDEEAMEDFVAKGGMIGTA
IGPKGMLDFDKDSYNYQKELQNKKLEQEAQKLWFRMRNEVISELQEKGYDVE