; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002894 (gene) of Snake gourd v1 genome

Gene IDTan0002894
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionEncodes a protein involved in salt tolerance, names SIS (Salt Induced Serine rich).
Genome locationLG01:29365809..29375217
RNA-Seq ExpressionTan0002894
SyntenyTan0002894
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583402.1 hypothetical protein SDJN03_19334, partial [Cucurbita argyrosperma subsp. sororia]1.9e-5458.99Show/hide
Query:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS
        ME KSKGYQASSFV+DLFDVKEPP +S+S VFA IFPS QKGG RNSS SGDWLKQ+NGNQP H +QGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAG S
Subjt:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS

Query:  PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGSTVPFIIRTSALPWWRMNSHLFSKKVSHLKMPMFFFFWCRNSFRSRE
        P P  T                     LKKSGGEDDPNGNN QPASRGNWWQG+     +                  V+H   P+F     +       
Subjt:  PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGSTVPFIIRTSALPWWRMNSHLFSKKVSHLKMPMFFFFWCRNSFRSRE

Query:  DGDLKLDLHYIVYTISN
            +LDL YIVYTISN
Subjt:  DGDLKLDLHYIVYTISN

KAG7019166.1 hypothetical protein SDJN02_18124 [Cucurbita argyrosperma subsp. argyrosperma]2.9e-6364.98Show/hide
Query:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS
        ME KSKGYQASSFV+DLFDVKEPP +S+S VFA IFPS QKGG RNSS SGDWLKQ+NGNQP H +QGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAG S
Subjt:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS

Query:  PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGSTVPFIIRTSALPWWRMNSHLFSKKVSHLKMPMFFFFWCRNSFRSRE
        P P  TV +++K  CSS VVA SLMTQLKKSGGEDDPNGNN QPASRGNWWQG+     +                  V+H   P+F     +       
Subjt:  PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGSTVPFIIRTSALPWWRMNSHLFSKKVSHLKMPMFFFFWCRNSFRSRE

Query:  DGDLKLDLHYIVYTISN
            +LDL YIVYTISN
Subjt:  DGDLKLDLHYIVYTISN

XP_022970583.1 uncharacterized protein LOC111469517 [Cucurbita maxima]4.6e-5374.68Show/hide
Query:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS
        ME KSKGYQASSFV+DLFDVKEPP +S+S VFA IFPS QKGG RNSS SGDWLKQ+NGNQP H +QGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS
Subjt:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS

Query:  PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS
        P P  T                     LKKSGGEDDPNGNN QPASRGNWWQGS
Subjt:  PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS

XP_023525537.1 uncharacterized protein LOC111789122 [Cucurbita pepo subsp. pepo]4.6e-5375.32Show/hide
Query:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS
        ME K  GYQ SSFV+DLFDV+E PLSSASGV AP+FPS QK GSR SS SGDWLKQ+NGNQPRHTK+GNSGGSLEPCHLSSSLYYGGQDGYS+APSAGPS
Subjt:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS

Query:  PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS
        PPPPS                      LKKSGGEDDPNGNNSQPASRGNWWQGS
Subjt:  PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS

XP_038894691.1 uncharacterized protein LOC120083160 [Benincasa hispida]9.3e-5476.62Show/hide
Query:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS
        ME KSKGYQASSFV+DLFDVKEPPLSS SGVFA IF S QKG  RNSS SGDWLKQ+NGNQPRHT+QGNS GSLEPCHLSSSLYYGGQDGYSQAPSAGPS
Subjt:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS

Query:  PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS
        PP P T                     +KKSGGEDDPNGNNSQPASRGNWWQGS
Subjt:  PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS

TrEMBL top hitse value%identityAlignment
A0A1S3C5M6 uncharacterized protein LOC1034971237.2e-5273.08Show/hide
Query:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS
        ME KSKGYQ SSFV+DLFDVKE PLSSASG FA IFPS QKG  RNSS S DWLKQ+NGNQP HT+QGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS
Subjt:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS

Query:  --PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS
          PPPP T+K                    K  G +DDPNGNNSQPASRGNWWQGS
Subjt:  --PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS

A0A5D3BEB7 Uncharacterized protein7.2e-5273.08Show/hide
Query:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS
        ME KSKGYQ SSFV+DLFDVKE PLSSASG FA IFPS QKG  RNSS S DWLKQ+NGNQP HT+QGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS
Subjt:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS

Query:  --PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS
          PPPP T+K                    K  G +DDPNGNNSQPASRGNWWQGS
Subjt:  --PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS

A0A6J1GAG9 uncharacterized protein LOC111452257 isoform X15.0e-5375.32Show/hide
Query:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS
        ME KS GYQ SSFV+DLFDV+E PLSSASGV A +FPS QK GSR SS SGDWLKQ+NGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGY +APS GPS
Subjt:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS

Query:  PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS
        PPPPST                     LKKSGGEDDPNGNNSQPASRGNWWQGS
Subjt:  PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS

A0A6J1HJ44 uncharacterized protein LOC1114649293.2e-5274.03Show/hide
Query:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS
        ME KSKGYQASSFV+DLFDVKEPP +S+S VFA IFPS QKGG RNSS SGDWLKQ+NGNQP H +QGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAG S
Subjt:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS

Query:  PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS
        P P  T                     LKKSGGEDDPNGNN QPASRGNWWQGS
Subjt:  PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS

A0A6J1I4B7 uncharacterized protein LOC1114695172.2e-5374.68Show/hide
Query:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS
        ME KSKGYQASSFV+DLFDVKEPP +S+S VFA IFPS QKGG RNSS SGDWLKQ+NGNQP H +QGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS
Subjt:  MEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS

Query:  PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS
        P P  T                     LKKSGGEDDPNGNN QPASRGNWWQGS
Subjt:  PPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39855.2 unknown protein1.7e-0531.36Show/hide
Query:  EEKSKGYQASSFVSDLFDVKEPPL-----SSASGVFAPIF--PSAQKGGSRNSSISGDWLKQSNGNQP---------RHTKQGNSGGSLEPCHLSSSLYY
        ++KS    +SS  S L  +  P +     SS +G+F  IF  PSA   G+  S       + +N   P         +  K   S  +  PC+LSSS+YY
Subjt:  EEKSKGYQASSFVSDLFDVKEPPL-----SSASGVFAPIF--PSAQKGGSRNSSISGDWLKQSNGNQP---------RHTKQGNSGGSLEPCHLSSSLYY

Query:  GGQDGYSQAPSAGPSPPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS
        GGQD YS            ST   ++                 KK G E D     S+ ASRGNWW+GS
Subjt:  GGQDGYSQAPSAGPSPPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS

AT3G55646.1 unknown protein3.5e-0631.61Show/hide
Query:  LHKMEEKSKGYQASSFVSDL--FD------VKEPPLSSASGVFAPIFP---SAQKGGSRNSSISGDWLKQSNGN------QPRHTKQGNSGGSLEPCHLS
        + K + K K   ASS  S L  FD      V     SSA+G+F  IFP   + Q G   + +  G  +K  + N        +  K   +  +  PCHLS
Subjt:  LHKMEEKSKGYQASSFVSDL--FD------VKEPPLSSASGVFAPIFP---SAQKGGSRNSSISGDWLKQSNGN------QPRHTKQGNSGGSLEPCHLS

Query:  SSLYYGGQDGYSQAPSAGPSPPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS
        SSLYYGGQ+ YS                              +     KK G E D     S+ ASRGNWW+GS
Subjt:  SSLYYGGQDGYSQAPSAGPSPPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS

AT5G02020.1 Encodes a protein involved in salt tolerance, names SIS (Salt Induced Serine rich).7.2e-1236.84Show/hide
Query:  KMEEKSKGYQASSFVSDLFDVKEPPLS-SASGVFAPIFPSAQKGGSRNS-----SISGDW---LKQSNGNQPRHTKQGNSGGS-------LEPCHLSSSL
        K    S    +SS  S+LF  +E P S S+SG+   IFP   K   R S        G W     ++ GN  R+ +Q  + GS       ++PCHLSSS+
Subjt:  KMEEKSKGYQASSFVSDLFDVKEPPLS-SASGVFAPIFPSAQKGGSRNS-----SISGDW---LKQSNGNQPRHTKQGNSGGS-------LEPCHLSSSL

Query:  YYGGQDGYSQAPSAGPSPPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS
        YYGG D Y Q        P  ST  S +                 KK GGEDD     S  ASRGNWWQGS
Subjt:  YYGGQDGYSQAPSAGPSPPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS

AT5G02020.2 Encodes a protein involved in salt tolerance, names SIS (Salt Induced Serine rich).7.0e-0737.27Show/hide
Query:  KMEEKSKGYQASSFVSDLFDVKEPPLS-SASGVFAPIFPSAQKGGSRNS-----SISGDW---LKQSNGNQPRHTKQGNSGGS-------LEPCHLSSSL
        K    S    +SS  S+LF  +E P S S+SG+   IFP   K   R S        G W     ++ GN  R+ +Q  + GS       ++PCHLSSS+
Subjt:  KMEEKSKGYQASSFVSDLFDVKEPPLS-SASGVFAPIFPSAQKGGSRNS-----SISGDW---LKQSNGNQPRHTKQGNSGGS-------LEPCHLSSSL

Query:  YYGGQDGYSQ
        YYGG D Y Q
Subjt:  YYGGQDGYSQ

AT5G59080.1 unknown protein5.2e-1840.13Show/hide
Query:  SKGYQASSFVSDLFDVKEP-PLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNS-GGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPP
        S    +SSF ++LF  K+P P SS+SG+F+ +FP   KG +R+ S S       +G+Q +  +  N+    +EPCHLSSSLYYGGQD Y+++ +    PP
Subjt:  SKGYQASSFVSDLFDVKEP-PLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQGNS-GGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPP

Query:  PPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS
            VK++                  ++  GEDD NG N Q  SRGNWWQGS
Subjt:  PPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTAGAAATCATGTGGGCTTTGCTCGCGCAGATATGTCCAATTGGGATCACATGTGGACCACACTTGGCACTTATCCTCCTATGTTCTCACAGGAAAGCATGGATGA
TATTATTATTTTGCATAAAATGGAGGAAAAGTCTAAAGGGTATCAAGCCTCCTCTTTCGTTTCTGATCTTTTCGATGTCAAGGAACCGCCATTGTCCTCAGCATCAGGAG
TTTTTGCACCAATCTTTCCTTCTGCTCAGAAGGGGGGAAGCAGGAACTCTTCTATTTCTGGGGATTGGCTAAAACAGAGCAACGGAAATCAACCACGCCACACCAAACAA
GGAAATTCAGGAGGGAGCTTGGAGCCTTGTCATCTGAGTTCATCTCTATACTATGGAGGACAAGATGGCTACTCCCAGGCCCCATCAGCTGGACCGTCCCCACCCCCACC
CTCCACTGTGAAAAGTGAAAGTAAAATTGGTTGTTCTTCTTCTGTTGTTGCCAATTCTTTGATGACCCAGCTGAAGAAAAGTGGGGGAGAAGATGATCCAAATGGAAACA
ACTCTCAACCTGCTTCTAGGGGAAATTGGTGGCAAGGTAGTACAGTTCCCTTTATTATTAGGACCTCTGCCTTGCCTTGGTGGCGCATGAATTCCCATCTATTTTCTAAG
AAGGTTTCCCACCTCAAAATGCCTATGTTTTTTTTTTTTTGGTGCAGGAATTCATTTAGAAGCAGAGAAGATGGAGATTTGAAGTTGGATTTACATTACATTGTGTATAC
CATTTCAAATTTGTTGTTTGGTTTTTACTTTTACTTTGAAAATGGTGTGGAGTTTTCTGTATTTTCTTTTCTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGACTAGAAATCATGTGGGCTTTGCTCGCGCAGATATGTCCAATTGGGATCACATGTGGACCACACTTGGCACTTATCCTCCTATGTTCTCACAGGAAAGCATGGATGA
TATTATTATTTTGCATAAAATGGAGGAAAAGTCTAAAGGGTATCAAGCCTCCTCTTTCGTTTCTGATCTTTTCGATGTCAAGGAACCGCCATTGTCCTCAGCATCAGGAG
TTTTTGCACCAATCTTTCCTTCTGCTCAGAAGGGGGGAAGCAGGAACTCTTCTATTTCTGGGGATTGGCTAAAACAGAGCAACGGAAATCAACCACGCCACACCAAACAA
GGAAATTCAGGAGGGAGCTTGGAGCCTTGTCATCTGAGTTCATCTCTATACTATGGAGGACAAGATGGCTACTCCCAGGCCCCATCAGCTGGACCGTCCCCACCCCCACC
CTCCACTGTGAAAAGTGAAAGTAAAATTGGTTGTTCTTCTTCTGTTGTTGCCAATTCTTTGATGACCCAGCTGAAGAAAAGTGGGGGAGAAGATGATCCAAATGGAAACA
ACTCTCAACCTGCTTCTAGGGGAAATTGGTGGCAAGGTAGTACAGTTCCCTTTATTATTAGGACCTCTGCCTTGCCTTGGTGGCGCATGAATTCCCATCTATTTTCTAAG
AAGGTTTCCCACCTCAAAATGCCTATGTTTTTTTTTTTTTGGTGCAGGAATTCATTTAGAAGCAGAGAAGATGGAGATTTGAAGTTGGATTTACATTACATTGTGTATAC
CATTTCAAATTTGTTGTTTGGTTTTTACTTTTACTTTGAAAATGGTGTGGAGTTTTCTGTATTTTCTTTTCTCTAA
Protein sequenceShow/hide protein sequence
MTRNHVGFARADMSNWDHMWTTLGTYPPMFSQESMDDIIILHKMEEKSKGYQASSFVSDLFDVKEPPLSSASGVFAPIFPSAQKGGSRNSSISGDWLKQSNGNQPRHTKQ
GNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPPPPSTVKSESKIGCSSSVVANSLMTQLKKSGGEDDPNGNNSQPASRGNWWQGSTVPFIIRTSALPWWRMNSHLFSK
KVSHLKMPMFFFFWCRNSFRSREDGDLKLDLHYIVYTISNLLFGFYFYFENGVEFSVFSFL