; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000406 (gene) of Snake gourd v1 genome

Gene IDTan0000406
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionEncodes a protein involved in salt tolerance, names SIS (Salt Induced Serine rich).
Genome locationLG03:65013697..65017320
RNA-Seq ExpressionTan0000406
SyntenyTan0000406
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595356.1 hypothetical protein SDJN03_11909, partial [Cucurbita argyrosperma subsp. sororia]4.2e-5981.46Show/hide
Query:  MEGKKHVGSSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQRAQ
        M+G  HV SSSSL T+LFGSK +S SS +GIFGSIF+PSSKVLG DSLLS+TKE ER S N+PWIPNA+DQ  TANQRQKES+E  NKD SSIYQ+QRAQ
Subjt:  MEGKKHVGSSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQRAQ

Query:  PCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG
        PCQF SSIYYGGQDVYAHPQNSHNSGVNS FKKDGGEDDSGSASRGNWWQG
Subjt:  PCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG

KAG7027363.1 hypothetical protein SDJN02_11375 [Cucurbita argyrosperma subsp. argyrosperma]2.2e-6082.78Show/hide
Query:  MEGKKHVGSSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQRAQ
        M+G  HV SSSSLTT+LFGSK +S SS +GIFGSIF+PSSKVLG DSLLS+TKE ER S N+PWIPNA+DQ  TANQRQKES+EM NKD SSIYQ+QRAQ
Subjt:  MEGKKHVGSSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQRAQ

Query:  PCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG
        PCQF SSIYYGGQDVYAHPQNSHNSGVNS FKKDGGEDDSGSASRGNWWQG
Subjt:  PCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG

XP_022931917.1 uncharacterized protein LOC111438192 [Cucurbita moschata]1.1e-5982.12Show/hide
Query:  MEGKKHVGSSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQRAQ
        M+G  HV SSSSLTT+LFGSK +S SS +GIFGSIF+PSSKVLG DSLLS+TKE ER S N+PWIPNA+DQ  TANQRQKES+E  NKD SSIYQ+QRAQ
Subjt:  MEGKKHVGSSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQRAQ

Query:  PCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG
        PCQF SSIYYGGQDVYAHPQNSHNSGVNS FKKDGGEDDSGSASRGNWWQG
Subjt:  PCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG

XP_022966597.1 uncharacterized protein LOC111466230 [Cucurbita maxima]8.4e-6081.46Show/hide
Query:  MEGKKHVGSSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQRAQ
        M+G  HV SSSSLTT+LFGSK +S SS +GIFGSIF+PSSKVLG DSLLS+TKE ER S N+PWIPNA+DQ +TANQRQKES+E  NKD  SIYQ+QRAQ
Subjt:  MEGKKHVGSSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQRAQ

Query:  PCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG
        PCQF SSIYYGGQDVYAHPQNSHNSGVNS FKKDGGEDDSGSASRGNWWQG
Subjt:  PCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG

XP_038882075.1 uncharacterized protein LOC120073353 isoform X1 [Benincasa hispida]9.9e-6183.77Show/hide
Query:  MEGKKHVG---SSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQ
        MEGKKHVG   SSSSLTT+LFGSKE+S SS TGIFGSIFAPSSKVLGRDSLLSQTKEGERDS NEPWIPNAE Q  TAN  QKES EM NKDMSSIYQ+Q
Subjt:  MEGKKHVG---SSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQ

Query:  RAQPCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG
        RAQPC  +SSIYYGGQDVY HPQNS+NS VNS +KK+GGEDDSGSASRGNWWQG
Subjt:  RAQPCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG

TrEMBL top hitse value%identityAlignment
A0A1S3BK03 uncharacterized protein LOC103490707 isoform X21.4e-5779.87Show/hide
Query:  MEGKKHVG---SSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQ
        MEGKKHVG   SSSSLTT+LFGS E+S SS TGIFGSIFAPSSKVLGR+SLLSQTKE ER+S NEPW PNAE Q   AN  QKESQEM NKDMSSIYQ+Q
Subjt:  MEGKKHVG---SSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQ

Query:  RAQPCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG
         AQPC  +SSIYYGGQDVY HPQNS+NSG NS +KK+GGEDDSGSASRGNWWQG
Subjt:  RAQPCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG

A0A5D3DIY4 Uncharacterized protein1.4e-5779.87Show/hide
Query:  MEGKKHVG---SSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQ
        MEGKKHVG   SSSSLTT+LFGS E+S SS TGIFGSIFAPSSKVLGR+SLLSQTKE ER+S NEPW PNAE Q   AN  QKESQEM NKDMSSIYQ+Q
Subjt:  MEGKKHVG---SSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQ

Query:  RAQPCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG
         AQPC  +SSIYYGGQDVY HPQNS+NSG NS +KK+GGEDDSGSASRGNWWQG
Subjt:  RAQPCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG

A0A6J1DDV5 uncharacterized protein LOC1110194803.4e-5982.12Show/hide
Query:  MEGKKHVGSSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQRAQ
        MEGKK +GSSSSLT +LFGSKE+S SS TGIFGSIFAPSSKVLG +SLLSQ KEGERDS NEPWIPN E +   AN RQKESQEM NKD+SSIYQEQRAQ
Subjt:  MEGKKHVGSSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQRAQ

Query:  PCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG
        PC  +SSIYYGGQDVY+  QNSHNSGVNS FKKDGGEDDSGSASRGNWWQG
Subjt:  PCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG

A0A6J1EV60 uncharacterized protein LOC1114381925.3e-6082.12Show/hide
Query:  MEGKKHVGSSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQRAQ
        M+G  HV SSSSLTT+LFGSK +S SS +GIFGSIF+PSSKVLG DSLLS+TKE ER S N+PWIPNA+DQ  TANQRQKES+E  NKD SSIYQ+QRAQ
Subjt:  MEGKKHVGSSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQRAQ

Query:  PCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG
        PCQF SSIYYGGQDVYAHPQNSHNSGVNS FKKDGGEDDSGSASRGNWWQG
Subjt:  PCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG

A0A6J1HNF5 uncharacterized protein LOC1114662304.1e-6081.46Show/hide
Query:  MEGKKHVGSSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQRAQ
        M+G  HV SSSSLTT+LFGSK +S SS +GIFGSIF+PSSKVLG DSLLS+TKE ER S N+PWIPNA+DQ +TANQRQKES+E  NKD  SIYQ+QRAQ
Subjt:  MEGKKHVGSSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQRAQ

Query:  PCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG
        PCQF SSIYYGGQDVYAHPQNSHNSGVNS FKKDGGEDDSGSASRGNWWQG
Subjt:  PCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39855.2 unknown protein2.2e-1341.4Show/hide
Query:  MEGKKHV-GSSSSLTTEL---FGSK--ESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIY
        M+ KK V GSSSS ++ L   FG +   S  SS TG+F SIF P S V       +Q     R+ A +    N E    T N+R + S+   NK+  S  
Subjt:  MEGKKHV-GSSSSLTTEL---FGSK--ESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIY

Query:  QEQRAQPCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG
         E+   PC  +SSIYYGGQD Y+    + ++     +KKDG E DS SASRGNWW+G
Subjt:  QEQRAQPCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG

AT3G55646.1 unknown protein1.4e-1238.78Show/hide
Query:  SSSSLTT--ELFGSKESSCSSN--TGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQRAQPCQF
        SSSSL++   +FG + SS SS+  TG+F SIF P S     D L  Q     +    +   PNA  +G  +N+++K+          S Y E+   PC  
Subjt:  SSSSLTT--ELFGSKESSCSSN--TGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQRAQPCQF

Query:  ASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG
        +SS+YYGGQ+ Y    +S  +  +  +KKDG E DS  ASRGNWW+G
Subjt:  ASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG

AT5G02020.1 Encodes a protein involved in salt tolerance, names SIS (Salt Induced Serine rich).1.5e-2751.9Show/hide
Query:  MEGKKHVG------SSSSLTTELFGSKES-SCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSI
        MEG+K         SSSSLT+ELFGS+E+ S  S++GI GSIF P SKVLGR+S+  +T  G        W       G   + R +E QE      S  
Subjt:  MEGKKHVG------SSSSLTTELFGSKES-SCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSI

Query:  YQEQRAQPCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG
         Q+QR QPC  +SSIYYGG DVY  PQNS ++  N   KKDGGEDDSGSASRGNWWQG
Subjt:  YQEQRAQPCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQG

AT5G02020.2 Encodes a protein involved in salt tolerance, names SIS (Salt Induced Serine rich).2.7e-1647.29Show/hide
Query:  MEGKKHVG------SSSSLTTELFGSKES-SCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSI
        MEG+K         SSSSLT+ELFGS+E+ S  S++GI GSIF P SKVLGR+S+  +T  G        W       G   + R +E QE      S  
Subjt:  MEGKKHVG------SSSSLTTELFGSKES-SCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSI

Query:  YQEQRAQPCQFASSIYYGGQDVYAHPQNS
         Q+QR QPC  +SSIYYGG DVY  PQNS
Subjt:  YQEQRAQPCQFASSIYYGGQDVYAHPQNS

AT5G59080.1 unknown protein2.2e-1338.51Show/hide
Query:  MEGKKHVGS----SSSLTTELFGSKE-SSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQ
        MEGK  VGS    SSS T ELFGSK+ S  SS++GIF ++F   SK   RD   S +K G                       Q + +E +N       Q
Subjt:  MEGKKHVGS----SSSLTTELFGSKE-SSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQ

Query:  EQRAQPCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSG-----SASRGNWWQG
        E R +PC  +SS+YYGGQDVYA    +         ++  GEDD+        SRGNWWQG
Subjt:  EQRAQPCQFASSIYYGGQDVYAHPQNSHNSGVNSPFKKDGGEDDSG-----SASRGNWWQG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGAAAGAAACATGTGGGTTCCTCTTCTTCTCTCACTACTGAGCTGTTTGGCTCTAAAGAGAGTTCGTGTTCTTCAAACACTGGGATTTTTGGCTCTATATTTGC
ACCTTCTTCCAAGGTGTTAGGGAGAGACTCTCTGCTCTCTCAAACCAAAGAGGGAGAGAGGGATTCTGCAAATGAGCCATGGATCCCCAACGCTGAAGACCAAGGTTATA
CTGCCAATCAAAGACAAAAGGAGAGTCAGGAGATGATGAATAAAGATATGAGTTCCATTTATCAGGAACAAAGAGCACAACCATGTCAATTTGCCTCATCAATCTATTAT
GGTGGCCAAGACGTTTATGCTCATCCTCAGAACTCCCACAATTCTGGGGTGAACTCGCCGTTCAAGAAGGATGGGGGAGAAGATGATTCTGGGAGTGCTTCAAGAGGAAA
TTGGTGGCAAGGTATGAAATGGAGGTACATAGGATCATAG
mRNA sequenceShow/hide mRNA sequence
AAAAAAAGAAAATCCTGAGAGAGAGAGAGAGAAAGAGAGAGAAAGAGCGAGAGATCGAGGCAAATGAGAAATTCGCAGAGATTTTGAAAGAGGTTTTAGTTTTTTGAGTC
TTTCTTCATATCACAACCTTTTTTCTGCCTTATAAATTGGTGGGGAAGTGGCAGTTTCTTCATCATTTCAACTCTGCATCGAGCATTCACGTGTTCTTCCCTTCTCCTCT
TTTTCTCTGCTTCTCTGTTTTTTCCATTTCCGTTTTCTCTTTGAAACCCAACAATCTTTTTTCCTCCATTAAAGGGGCTTCATCCGGAGTAACCTCGAATAGACGGGCAA
CTCTGTTTTTCACTGATTAGGATCAAAATGGAAGGAAAGAAACATGTGGGTTCCTCTTCTTCTCTCACTACTGAGCTGTTTGGCTCTAAAGAGAGTTCGTGTTCTTCAAA
CACTGGGATTTTTGGCTCTATATTTGCACCTTCTTCCAAGGTGTTAGGGAGAGACTCTCTGCTCTCTCAAACCAAAGAGGGAGAGAGGGATTCTGCAAATGAGCCATGGA
TCCCCAACGCTGAAGACCAAGGTTATACTGCCAATCAAAGACAAAAGGAGAGTCAGGAGATGATGAATAAAGATATGAGTTCCATTTATCAGGAACAAAGAGCACAACCA
TGTCAATTTGCCTCATCAATCTATTATGGTGGCCAAGACGTTTATGCTCATCCTCAGAACTCCCACAATTCTGGGGTGAACTCGCCGTTCAAGAAGGATGGGGGAGAAGA
TGATTCTGGGAGTGCTTCAAGAGGAAATTGGTGGCAAGGTATGAAATGGAGGTACATAGGATCATAGATCTGCAGCACTACTACAACTTCACCTCTTATTTTATATCGCT
TAGACGATAGCGAGATCGTTGTGAGAAAAAGCTGGTTGATCCATTTTCCAGCCAAACACAACAGAATCTCTTTTAGTTAATTAATAGTTATAAGCATCTTTCTCCATTTT
ACTGTAAAGAATGCTGCTGCTGCTGCTGCTTTGTTGTTTTGTTTGAACCTTGGTGTTTTTAGCATACAGGAAATCTAGTTTGCTTTTTGCTTATTACTATCACATTGGAA
CCTGAATCAAAATCTTGAACATATATCTCAGAATGATCAAGATCGCTTGGTGGTCATAAAAAAAATCTTTCATTTCTTGGCAATCTAAGCATAGCCCAACTGGTTAAGAC
ATTACATCTTCAGCCAAAAAGTTAGATGTTCGAATTGTTTTTCTTAGAAAGCCAAGCAAGACTCTTGAGTTATGGGAAAAGTCAAAAGTAGTTGGAAGTTGGAAGTGTCA
TTTTCCTATCCTTATCCGAGTTAAGGATATCTTGAGAGAGTTTCAGGGCCAAAACCAGTCATTTTCTGTGAAGGAGTTTTATGGTTCTTGCTTTTCCATGGTCGTTAAGT
TTGGCCGTTAAAGAAAGAAATAATTGAAAAGATGGGTGGGATGGGGACAGAGATTCTGCAGCCACAATTCTTGGCAATGTTTGATCTGATTTTTTTCTTTTCTTTTTTGA
TGTTACTTTTGGTTCTGCTCTATCTTGTTCTTGTCCACACCATTCATAATTTTAAAAGTATAAATAATGTGCAAACCAACATCATTGTTACGTCAAAAGATGAATCTTAT
GTTAATTATTAATTACC
Protein sequenceShow/hide protein sequence
MEGKKHVGSSSSLTTELFGSKESSCSSNTGIFGSIFAPSSKVLGRDSLLSQTKEGERDSANEPWIPNAEDQGYTANQRQKESQEMMNKDMSSIYQEQRAQPCQFASSIYY
GGQDVYAHPQNSHNSGVNSPFKKDGGEDDSGSASRGNWWQGMKWRYIGS