CuGenDBv2

Tan0000834 (gene) of Snake gourd v1 genome

Gene ID: Tan0000834
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Embryo sac development arrest protein
Genome location: LG10:5333579..5334935
RNA-Seq Expression: Tan0000834
Synteny: Tan0000834
Gene Ontology terms: NA
InterPro domains: NA
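
The genome location field can be split into its parts programmatically. The following is a minimal sketch, assuming the `seqid:start..end` layout shown above and assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("LG10:5333579..5334935")

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(seqid, start, end, span)
```

Under that coordinate assumption, the gene spans 1357 bases on LG10.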


Homology
GenBank top hits (e value, % identity, alignment)
KAG6597349.1 hypothetical protein SDJN03_10529, partial [Cucurbita argyrosperma subsp. sororia] | e value: 8.0e-53 | % identity: 87.5
Query:  MSFHFHSRGLLSAGPSRKRKELSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEHQIQSYA
        MSFH HSRGLLSAGPSRKRKE   P+A KAGE P++S RLLAGYMAYEFLTKGTILGRKFDPDRAEASPFP A AQ+KKPK+EAAPPEIIKKEHQIQSYA
Subjt:  MSFHFHSRGLLSAGPSRKRKELSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEHQIQSYA

Query:  DVATILKTDGAQISGIVNPTQLARWLNK
        DVATILKTD AQISGIVNPTQLARWL K
Subjt:  DVATILKTDGAQISGIVNPTQLARWLNK

KGN56835.1 hypothetical protein Csa_009887 [Cucumis sativus] | e value: 7.0e-49 | % identity: 80
Query:  MSFHFHSRGLLSAGPSRKRKELSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAA--QWKKPKSEAAPPEIIKKEHQIQS
        MSFHFHSRGLLSAGPS+KRKE S P+ASKAGEP VSS RLLAGYMAYEFLTKGT+ GRKFDP R EA+P   +AA  QWKKPKS+AAPPEI+KKEHQIQS
Subjt:  MSFHFHSRGLLSAGPSRKRKELSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAA--QWKKPKSEAAPPEIIKKEHQIQS

Query:  YADVATILKTDGAQISGIVNPTQLARWLNK
        YA+VA ILKT G+ ISGIVNPTQL RWL K
Subjt:  YADVATILKTDGAQISGIVNPTQLARWLNK

XP_008438490.1 PREDICTED: uncharacterized protein LOC103483571 [Cucumis melo] | e value: 2.1e-45 | % identity: 77.1
Query:  MSFHFHSRGLLSAGPSRKRKELSAPA-ASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEH--QIQ
        MSFHFHSRGLLSAGPS+KRKE S PA  +KAGEP V S RLLAGYMAYEFLTKGT+ GRKFDP R EA+P   A +QWKKPK EAAPPEI+KKEH  QIQ
Subjt:  MSFHFHSRGLLSAGPSRKRKELSAPA-ASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEH--QIQ

Query:  SYADVATILKTDGAQISGIVNPTQLARWLNK
        SYA+VA ILKT G+ ISGIVNPTQL RWL K
Subjt:  SYADVATILKTDGAQISGIVNPTQLARWLNK

XP_022924296.1 uncharacterized protein LOC111431827 [Cucurbita moschata] | e value: 5.6e-46 | % identity: 80.47
Query:  MSFHFHSRGLLSAGPSRKRKELSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEHQIQSYA
        MSFHFHSRGLL AGPSRKRKE   PAA KA EPPVSS RLLAGYMAYEFLTKGTILGR+F+P RAEA   P+AAAQWKK KSEAAPPEI+KKEHQIQSY 
Subjt:  MSFHFHSRGLLSAGPSRKRKELSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEHQIQSYA

Query:  DVATILKTDGAQISGIVNPTQLARWLNK
        +VATILKT G  I  IVNPTQL RWL K
Subjt:  DVATILKTDGAQISGIVNPTQLARWLNK

XP_023539410.1 uncharacterized protein LOC111800062 [Cucurbita pepo subsp. pepo] | e value: 3.0e-52 | % identity: 88.37
Query:  MSFHFHSRGLLSAGPSRKRKELSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPT-AAAQWKKPKSEAAPPEIIKKEHQIQSY
        MSFH HSRGLLSAGPSRKRKE   P+A KAGE PV+S RLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPT AAAQ+KKPK+EAAPPEIIKKEHQIQSY
Subjt:  MSFHFHSRGLLSAGPSRKRKELSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPT-AAAQWKKPKSEAAPPEIIKKEHQIQSY

Query:  ADVATILKTDGAQISGIVNPTQLARWLNK
         DVATILKTD AQISGIVNPTQLARWL+K
Subjt:  ADVATILKTDGAQISGIVNPTQLARWLNK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L4C1 Uncharacterized protein | e value: 3.4e-49 | % identity: 80
Query:  MSFHFHSRGLLSAGPSRKRKELSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAA--QWKKPKSEAAPPEIIKKEHQIQS
        MSFHFHSRGLLSAGPS+KRKE S P+ASKAGEP VSS RLLAGYMAYEFLTKGT+ GRKFDP R EA+P   +AA  QWKKPKS+AAPPEI+KKEHQIQS
Subjt:  MSFHFHSRGLLSAGPSRKRKELSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAA--QWKKPKSEAAPPEIIKKEHQIQS

Query:  YADVATILKTDGAQISGIVNPTQLARWLNK
        YA+VA ILKT G+ ISGIVNPTQL RWL K
Subjt:  YADVATILKTDGAQISGIVNPTQLARWLNK

A0A1S3AWH5 uncharacterized protein LOC103483571 | e value: 1.0e-45 | % identity: 77.1
Query:  MSFHFHSRGLLSAGPSRKRKELSAPA-ASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEH--QIQ
        MSFHFHSRGLLSAGPS+KRKE S PA  +KAGEP V S RLLAGYMAYEFLTKGT+ GRKFDP R EA+P   A +QWKKPK EAAPPEI+KKEH  QIQ
Subjt:  MSFHFHSRGLLSAGPSRKRKELSAPA-ASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEH--QIQ

Query:  SYADVATILKTDGAQISGIVNPTQLARWLNK
        SYA+VA ILKT G+ ISGIVNPTQL RWL K
Subjt:  SYADVATILKTDGAQISGIVNPTQLARWLNK

A0A5A7U1L2 Uncharacterized protein | e value: 1.0e-45 | % identity: 77.1
Query:  MSFHFHSRGLLSAGPSRKRKELSAPA-ASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEH--QIQ
        MSFHFHSRGLLSAGPS+KRKE S PA  +KAGEP V S RLLAGYMAYEFLTKGT+ GRKFDP R EA+P   A +QWKKPK EAAPPEI+KKEH  QIQ
Subjt:  MSFHFHSRGLLSAGPSRKRKELSAPA-ASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEH--QIQ

Query:  SYADVATILKTDGAQISGIVNPTQLARWLNK
        SYA+VA ILKT G+ ISGIVNPTQL RWL K
Subjt:  SYADVATILKTDGAQISGIVNPTQLARWLNK

A0A6J1E960 uncharacterized protein LOC111431827 | e value: 2.7e-46 | % identity: 80.47
Query:  MSFHFHSRGLLSAGPSRKRKELSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEHQIQSYA
        MSFHFHSRGLL AGPSRKRKE   PAA KA EPPVSS RLLAGYMAYEFLTKGTILGR+F+P RAEA   P+AAAQWKK KSEAAPPEI+KKEHQIQSY 
Subjt:  MSFHFHSRGLLSAGPSRKRKELSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEHQIQSYA

Query:  DVATILKTDGAQISGIVNPTQLARWLNK
        +VATILKT G  I  IVNPTQL RWL K
Subjt:  DVATILKTDGAQISGIVNPTQLARWLNK

A0A6J1IWN8 uncharacterized protein LOC111479265 | e value: 5.1e-45 | % identity: 78.91
Query:  MSFHFHSRGLLSAGPSRKRKELSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEHQIQSYA
        MSFHFH RGLL  GPSRKRKE   PAA KA EPPVSS RLLAGYMAYEFLTKGTILGR+F+P +AEA   P+AAAQWKK KSEAAPPEI+KKEHQIQSY 
Subjt:  MSFHFHSRGLLSAGPSRKRKELSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEHQIQSYA

Query:  DVATILKTDGAQISGIVNPTQLARWLNK
        +VATILKT GA I  IVNPTQL RWL K
Subjt:  DVATILKTDGAQISGIVNPTQLARWLNK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G04000.1 unknown protein | e value: 5.1e-13 | % identity: 37.14
Query:  LSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEHQIQSYADVATILKTDGAQISGIVNPTQ
        +S  +++ A EP  S+  +LAGY+++E+LT+GT+ G +++  RA+A       +   +P  E  P          + Y +VA +L++DGAQ+ GIVNP Q
Subjt:  LSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEHQIQSYADVATILKTDGAQISGIVNPTQ

Query:  LARWL
        LAR+L
Subjt:  LARWL

AT3G23440.1 embryo sac development arrest 6 | e value: 4.6e-14 | % identity: 41.03
Query:  GPSRKRKELSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAE----ASPFPTAAAQWKKPKSEAAPPEIIKKEHQIQSYADVATILKTD
        G SRKRK+  +     A     +   LLAGYMA+E+LT GT+LGRK     AE     SP P  + + KK +               QSY++VA++ KTD
Subjt:  GPSRKRKELSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAE----ASPFPTAAAQWKKPKSEAAPPEIIKKEHQIQSYADVATILKTD

Query:  GAQISGIVNPTQLARWL
        G  + G+VNPTQLA+W+
Subjt:  GAQISGIVNPTQLARWL

AT5G44060.1 unknown protein | e value: 1.9e-15 | % identity: 42
Query:  ASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEHQIQSYADVATILKTDGAQISGIVNPTQLARWL
        A+ A   PV S +LLAGY+A+EFL  GT+ G  ++P +A+A P  T +     P+      +I   +H+ + Y +VA IL+ DG  + GIVNP+QLAR+L
Subjt:  ASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEHQIQSYADVATILKTDGAQISGIVNPTQLARWL
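
The % identity values listed with each hit relate to the middle line of each alignment block, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a rough sketch only, the function below counts identical columns in a midline fragment; `percent_identity` is a hypothetical helper, and the fragment is just the first 21 columns of the first GenBank alignment, so the result is not expected to match the full-length value reported by the database.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style midline.

    Letters count as identical columns; '+' and spaces do not.
    aligned_length is the number of alignment columns, gaps included.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# First 21 alignment columns of the KAG6597349.1 hit (illustrative fragment).
query_part = "MSFHFHSRGLLSAGPSRKRKE"
midline_part = "MSFH HSRGLLSAGPSRKRKE"

pid = percent_identity(midline_part, len(query_part))
print(f"{pid:.2f}")
```

Because leading and internal whitespace in the midline is significant, this only works on text where the original column alignment has been preserved.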


Sequences
CDS sequence
ATGAGTTTCCATTTCCACAGCCGTGGGCTGTTATCGGCTGGGCCGTCGAGAAAGCGGAAGGAGCTGTCGGCGCCGGCGGCTTCTAAAGCCGGTGAGCCGCCGGTTTCGTC
GACCCGGCTTCTGGCCGGGTACATGGCTTATGAGTTTCTGACCAAAGGAACCATCTTGGGCCGGAAGTTCGACCCGGATCGAGCCGAAGCAAGCCCATTTCCCACCGCCG
CCGCTCAGTGGAAGAAGCCGAAGTCGGAAGCCGCGCCGCCGGAGATCATCAAGAAAGAGCATCAGATTCAGAGCTACGCCGACGTGGCGACCATTTTGAAGACGGATGGG
GCCCAGATATCCGGCATCGTCAACCCGACCCAACTGGCTCGGTGGCTTAACAAATGA
mRNA sequence
CACTCTCTCTGTTCTCTCTCTCTCTCTCTCTCTCTCTCTGTTCTCCTTCAAATTCTTTTTTGTAATCTAAACGGATTTTTTCTTGGTCCGCCGTCATCTCCGCCGTTCGA
TGGGCGGCGCCACTGGTTAACCGCCGTCCTTACGCCGATCCAACCAGTACCCTCAGGTTCCATCCGCCACCATCTGGATCCGATTTTACTAAAAACATTGACCCCTTTCT
TCCCTGTTTTGAGTTTTCATTCCCAATTATAAAAAAAGAAAATTGTTTTTTTTTTTCCTTTCCCTTTTTTAAAAATTTAGTCGCTTTCAATCCTTTTCCTTTTCGCCTTA
ACTTACGTTATCCCCATTTTTAAACATCTTCTTTATTTTTACCATTGCTCTCTAGCACTCATTCCTTTTAACCCTTTCGTCAAATTGTTCCTGTAAAGAGAAAAAAAAAA
ACACAAAATCTGGTTGGACGATTTTACCCCTGAGAATATCAACTTTCTACGGTTCTGCCACCGAGGAATCCGAGTGTTTGTTGGGGCCCGGCTTACGTGGCATTTAACAA
AGGGTGCCGAGAAAAAAAGCCAGATGAGTTTCCATTTCCACAGCCGTGGGCTGTTATCGGCTGGGCCGTCGAGAAAGCGGAAGGAGCTGTCGGCGCCGGCGGCTTCTAAA
GCCGGTGAGCCGCCGGTTTCGTCGACCCGGCTTCTGGCCGGGTACATGGCTTATGAGTTTCTGACCAAAGGAACCATCTTGGGCCGGAAGTTCGACCCGGATCGAGCCGA
AGCAAGCCCATTTCCCACCGCCGCCGCTCAGTGGAAGAAGCCGAAGTCGGAAGCCGCGCCGCCGGAGATCATCAAGAAAGAGCATCAGATTCAGAGCTACGCCGACGTGG
CGACCATTTTGAAGACGGATGGGGCCCAGATATCCGGCATCGTCAACCCGACCCAACTGGCTCGGTGGCTTAACAAATGACCGAGCCGAACCGAACCAAAACGTCTGCCA
CATGGCGAAATCCTATTGGTCAGCGCTATCTTTCCTTTTTCTCCACGTCATATTTGACCGAGTCTTCGAGCTCACTGAGCATTAGAAAACACACACAGAATCTCACTGAA
GCTCCCTTCTATGTTTCTCCGATTGCCGACGGCCGGAAAAAGAATCAGAGGGCGGTGGTAGAATAGTTCTGTTGGGTTTTTTCTTGTTTTTTTTTTGGTGTTTTTGTTGA
GAATCAATGAGGATATATATCATATGTTTATAATTATGAGTTGTGCTGTAATTATATAATACAAATCTTTCTTTCTCTTTCTGTATATTTTTTTTTAATGTGGAAATGTG
AAGGTTATTTGCATGGATGACAAGAGGCAAGTGGCGG
Protein sequence
MSFHFHSRGLLSAGPSRKRKELSAPAASKAGEPPVSSTRLLAGYMAYEFLTKGTILGRKFDPDRAEASPFPTAAAQWKKPKSEAAPPEIIKKEHQIQSYADVATILKTDG
AQISGIVNPTQLARWLNK
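
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is a minimal, dependency-free illustration; `translate` is a hypothetical helper, the use of the standard nuclear code is an assumption, and only the first 21 and last 12 bases of the CDS are used here to keep the example short (the full CDS string can be substituted).

```python
from itertools import product

# Standard genetic code, built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGAGTTTCCATTTCCACAGC"  # first 21 bases of the CDS above
cds_end = "CTTAACAAATGA"             # last 12 bases, including the stop codon

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first seven and last three residues of the protein sequence listed above.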