CuGenDBv2

Tan0015549 (gene) of Snake gourd v1 genome

Gene ID: Tan0015549
Organism: Trichosanthes anguina (Snake gourd v1)
Description: BEST Arabidopsis thaliana protein match is: NHL domain-containing protein.
Genome location: LG01:103659715..103660138
RNA-Seq Expression: Tan0015549
Synteny: Tan0015549
Gene Ontology terms: NA
InterPro domains: NA
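
As a quick consistency check on the genome location, the coordinate string can be turned into a locus length. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive (the record does not state its convention); `parse_location` is a hypothetical helper written for this illustration, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("LG01:103659715..103660138")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
locus_length = end - start + 1
print(chrom, locus_length)
```

Under that assumption the locus spans 424 bp, which is consistent with the length of the mRNA sequence listed in this record and suggests a single-exon gene model.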


Homology
GenBank top hits | e value | % identity | Alignment
KAG6608120.1 hypothetical protein SDJN03_01462, partial [Cucurbita argyrosperma subsp. sororia] | 5.2e-27 | 77.53
Query:  MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDDGLKDA-EEGRYRGFSARYASASKPLAEKK
        MEKSEG KS E EL+SCWGRLKLKL  +K+EGNN CIG+TKPLNGGFRYDALSYAQNFD+GL D  EE R RGFSARY SASKPL + K
Subjt:  MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDDGLKDA-EEGRYRGFSARYASASKPLAEKK

KAG7024589.1 hypothetical protein SDJN02_13407, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.5e-10 | 53.93
Query:  MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDDGLKDA-EEGRYRGFSARYASASKPLAEKK
        ME     K K KE  S W  LK K+S  KKEG       +K  +G F YDA+SYAQNFDDGL +A +EG  R FSARYA ASKP  +KK
Subjt:  MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDDGLKDA-EEGRYRGFSARYASASKPLAEKK

KGN60281.1 hypothetical protein Csa_002688 [Cucumis sativus] | 5.6e-13 | 53.47
Query:  MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGN-------NL-CI-GETKPLN----GGFRYDALSYAQNFDDGLKDAE-EGRYRGFSARYASASKPLAE
        M+K+E  K+KE E +SCW RL++K+   KKEGN       N+ C+ GET  LN    G F+YDALSYA+NFD+GL +A+ EG +R FSARYA  SKP A+
Subjt:  MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGN-------NL-CI-GETKPLN----GGFRYDALSYAQNFDDGLKDAE-EGRYRGFSARYASASKPLAE

Query:  K
        K
Subjt:  K

OMO50828.1 hypothetical protein CCACVL1_30223 [Corchorus capsularis] | 4.0e-11 | 52.04
Query:  MEKSEGHKSKEKE-LLSCWGRLKLKLSDIKKEGNNLCIGETKPLN-------GGFRYDALSYAQNFDDGL--KDAEEGRYRGFSARYASAS-KPLAEK
        ME+S   K+  +E LLSCWGRLKLKL   K+   NL    T P         GGFRYD LSYAQNFDDG    D E   YRGFS+RYA+ S + +A+K
Subjt:  MEKSEGHKSKEKE-LLSCWGRLKLKLSDIKKEGNNLCIGETKPLN-------GGFRYDALSYAQNFDDGL--KDAEEGRYRGFSARYASAS-KPLAEK

XP_007137662.1 hypothetical protein PHAVU_009G145200g [Phaseolus vulgaris] | 3.1e-11 | 54.32
Query:  MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDDGLKDAEEGRYRGFSARYASAS
        MEKS   +SK    LSCWG LK+KL   K+  +N      KP+ GGF+YD LSYAQNFD+GL + +E  +RGFSARYA+ S
Subjt:  MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDDGLKDAEEGRYRGFSARYASAS

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LJR5 Uncharacterized protein | 2.7e-13 | 53.47
Query:  MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGN-------NL-CI-GETKPLN----GGFRYDALSYAQNFDDGLKDAE-EGRYRGFSARYASASKPLAE
        M+K+E  K+KE E +SCW RL++K+   KKEGN       N+ C+ GET  LN    G F+YDALSYA+NFD+GL +A+ EG +R FSARYA  SKP A+
Subjt:  MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGN-------NL-CI-GETKPLN----GGFRYDALSYAQNFDDGLKDAE-EGRYRGFSARYASASKPLAE

Query:  K
        K
Subjt:  K

A0A0B2RTQ8 Uncharacterized protein | 6.2e-10 | 55.42
Query:  MEKSEGHKSKEK-ELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDDG-LKDAEEGRYRGFSARYASAS
        MEKS   +SK     LSCWGRLKLKL   K+  +       KP+ GGF YD LSYAQNFD+G ++D EE  +RGFSARYA+ S
Subjt:  MEKSEGHKSKEK-ELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDDG-LKDAEEGRYRGFSARYASAS

A0A1R3FYB4 Uncharacterized protein | 1.9e-11 | 52.04
Query:  MEKSEGHKSKEKE-LLSCWGRLKLKLSDIKKEGNNLCIGETKPLN-------GGFRYDALSYAQNFDDGL--KDAEEGRYRGFSARYASAS-KPLAEK
        ME+S   K+  +E LLSCWGRLKLKL   K+   NL    T P         GGFRYD LSYAQNFDDG    D E   YRGFS+RYA+ S + +A+K
Subjt:  MEKSEGHKSKEKE-LLSCWGRLKLKLSDIKKEGNNLCIGETKPLN-------GGFRYDALSYAQNFDDGL--KDAEEGRYRGFSARYASAS-KPLAEK

A0A1R3GHI4 Uncharacterized protein | 9.6e-11 | 51.02
Query:  MEKSEGHKSKEKE-LLSCWGRLKLKLSDIKKEGNNLCIGETKPLN-------GGFRYDALSYAQNFDDGL--KDAEEGRYRGFSARYASAS-KPLAEK
        ME+S   K+  +E  LSCWGRLKLKL   K+   NL    T P         GGFRYD LSYAQNFDDG    D E   YRGFS+RYA+ S + +A+K
Subjt:  MEKSEGHKSKEKE-LLSCWGRLKLKLSDIKKEGNNLCIGETKPLN-------GGFRYDALSYAQNFDDGL--KDAEEGRYRGFSARYASAS-KPLAEK

V7AZM1 Uncharacterized protein | 1.5e-11 | 54.32
Query:  MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDDGLKDAEEGRYRGFSARYASAS
        MEKS   +SK    LSCWG LK+KL   K+  +N      KP+ GGF+YD LSYAQNFD+GL + +E  +RGFSARYA+ S
Subjt:  MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDDGLKDAEEGRYRGFSARYASAS

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1) | 3.0e-04 | 47.83
Query:  GGFRYDALSYAQNFDDGLKDA---EEGRYRGFSARYASASKPLAEK
        G FRYD LSY+ NFDDG +     +E  YR +S R+A+ S P++ K
Subjt:  GGFRYDALSYAQNFDDGLKDA---EEGRYRGFSARYASASKPLAEK

AT5G14890.1 NHL domain-containing protein | 7.8e-05 | 37.68
Query:  IKKEGNNLCI------GETKPLNGGFRYDALSYAQNFDDGLKDA---EEGRYRGFSARYASASKPLAEK
        I++ G N C       G  +P +  FRYD+ SY+ NFDDG +     +E  YR +S R+A+ S P++ K
Subjt:  IKKEGNNLCI------GETKPLNGGFRYDALSYAQNFDDGLKDA---EEGRYRGFSARYASASKPLAEK
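
The % identity values in the hit tables can be recovered from the alignment rows themselves. The sketch below, in Python, assumes the usual BLAST-style middle row, where a letter marks an identical residue pair, `+` marks a conservative substitution, and a space marks a mismatch or gap. `percent_identity` is a hypothetical helper written for this illustration, and the two strings are the query and middle rows of the KAG6608120.1 alignment copied from this record.

```python
def percent_identity(aligned_query, midline):
    """Percent identity = identical columns / alignment length * 100."""
    # Only letters in the middle row count as identities; '+' and spaces do not.
    identical = sum(1 for ch in midline if ch.isalpha())
    # The aligned query row (gaps included) gives the alignment length.
    return 100.0 * identical / len(aligned_query)

# Query and middle rows of the KAG6608120.1 alignment.
query = "MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDDGLKDA-EEGRYRGFSARYASASKPLAEKK"
midline = "MEKSEG KS E EL+SCWGRLKLKL  +K+EGNN CIG+TKPLNGGFRYDALSYAQNFD+GL D  EE R RGFSARY SASKPL + K"

print(f"{percent_identity(query, midline):.2f}")
```

For this hit the middle row carries 69 identical positions over an alignment of 89 columns, which reproduces the listed value of 77.53.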


Sequences
CDS sequence
ATGGAGAAATCGGAGGGGCACAAAAGCAAAGAAAAAGAGTTATTGTCGTGTTGGGGGCGTTTGAAGTTGAAGCTTTCGGATATAAAAAAAGAGGGCAATAATCTGTGTAT
TGGAGAGACAAAGCCATTGAATGGCGGATTCAGATACGACGCCTTAAGCTACGCTCAGAACTTCGATGACGGATTGAAGGATGCTGAAGAAGGACGTTATCGAGGTTTTT
CTGCTAGATATGCTTCTGCTTCCAAACCGCTTGCTGAGAAAAAGTAA
mRNA sequence
ATGGAGAAATCGGAGGGGCACAAAAGCAAAGAAAAAGAGTTATTGTCGTGTTGGGGGCGTTTGAAGTTGAAGCTTTCGGATATAAAAAAAGAGGGCAATAATCTGTGTAT
TGGAGAGACAAAGCCATTGAATGGCGGATTCAGATACGACGCCTTAAGCTACGCTCAGAACTTCGATGACGGATTGAAGGATGCTGAAGAAGGACGTTATCGAGGTTTTT
CTGCTAGATATGCTTCTGCTTCCAAACCGCTTGCTGAGAAAAAGTAAAATAGAAAGATTCCGAAATTTCGCATATGGTAATTTAATCTAATGGGCTTCTTTATTATATAT
ATATGTATCCTCTCTTCTTCTTCTTTTTACAGATAGATTTATATAGGACTTGACAATATGGTATATATAATATAATATAATATACAGAAAGAAG
Protein sequence
MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDDGLKDAEEGRYRGFSARYASASKPLAEKK
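
The CDS and protein sequences above can be checked against each other by translating the CDS. This is a minimal sketch in Python, assuming the standard nuclear genetic code applies; `translate` is a hypothetical helper written for this illustration, and `CDS` and `PROTEIN` are copied from this record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# CDS and protein sequence of Tan0015549, as listed in this record.
CDS = (
    "ATGGAGAAATCGGAGGGGCACAAAAGCAAAGAAAAAGAGTTATTGTCGTGTTGGGGGCGTTTGAAGTTGAAGCTTTCGGATATAAAAAAAGAGGGCAATAATCTGTGTAT"
    "TGGAGAGACAAAGCCATTGAATGGCGGATTCAGATACGACGCCTTAAGCTACGCTCAGAACTTCGATGACGGATTGAAGGATGCTGAAGAAGGACGTTATCGAGGTTTTT"
    "CTGCTAGATATGCTTCTGCTTCCAAACCGCTTGCTGAGAAAAAGTAA"
)
PROTEIN = "MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDDGLKDAEEGRYRGFSARYASASKPLAEKK"

print(translate(CDS) == PROTEIN)
```

The comparison prints `True` when the translated CDS matches the listed protein. The mRNA sequence begins with the same bases as the CDS, so the same helper can be applied to it; translation stops at the first in-frame stop codon, leaving the 3' UTR untranslated.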