CuGenDBv2

Tan0004630 (gene) of Snake gourd v1 genome

Gene ID: Tan0004630
Organism: Trichosanthes anguina (Snake gourd v1)
Description: stress response protein NST1
Genome location: LG10:4090918..4096070
RNA-Seq Expression: Tan0004630
Synteny: Tan0004630
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits    e value    % identity
KAG6582480.1 hypothetical protein SDJN03_22482, partial [Cucurbita argyrosperma subsp. sororia]    1.0e-80    90.16
Query:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK
        MA  SRS+IRTATNLSNNLLRSFSTS K AHHNNHQQTHKYLEA+AFVGSWEAPKDPKEAQA+LA LRRDYAKQVKQVRKNYIQEVE+LRLEKQRKDEAK
Subjt:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK

Query:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL
        REALRVANEERKKLKAEAAKARAEERK+A+EEFRRTLMKER EK EHWRMMEK R+EKKKEKNE IRRQSSMW+DEKKLEEKLL+AIVNT PL
Subjt:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL

KAG6597284.1 hypothetical protein SDJN03_10464, partial [Cucurbita argyrosperma subsp. sororia]    2.8e-81    91.19
Query:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK
        MAFASRS+IR ATN S+ LLRSFSTSTK AHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLA LRRDYAK+VKQVRKNYIQEVELLRLEKQRKDEAK
Subjt:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK

Query:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL
        REALRV NEERKKLKAEAAKARAEERK+A+EEFR+TLMKERAEK EHWRMMEKTR+EKK EKNELIRRQS MWIDEKKLEEKLL+AIVNTTPL
Subjt:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL

XP_022924252.1 UPF0329 protein ECU05_1680/ECU11_0050-like [Cucurbita moschata]    2.3e-80    89.64
Query:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK
        MA  SRS+IRTATN+SNNLLRSFSTS K AHHNNHQQTHKYLEA+AFVGSWEAPKDPKEAQA+LA LRRDYAKQVKQVRKNYIQEVE+LRLEKQRKDEAK
Subjt:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK

Query:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL
        REALRVANEERKKLKAEAAKARAEERK+A+EEFRRTLMKER EK EHWRMMEK R+EKKKEKNE IRRQSSMW+DEKKLEEKLL+AIVNT PL
Subjt:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL

XP_022940859.1 UPF0329 protein ECU05_1680/ECU11_0050-like [Cucurbita moschata]    3.6e-81    91.19
Query:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK
        MAFASRS+IR ATN S+ LLRSFSTSTK AHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLA LRRDYAK+VKQVRKNYIQEVELLRLEKQRKDEAK
Subjt:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK

Query:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL
        REALRV NEERKKLKAEAAKARAEERK+A+EEFR+TLMKERAEK EHWRMMEKTR+EKK EKNELIRRQS MWIDEKKLEEKLL+AIVNTTPL
Subjt:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL

XP_038892259.1 chromatin assembly factor 1 subunit A [Benincasa hispida]    1.4e-80    88.6
Query:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK
        MAF  RS+IRTAT+LSNNLLRS STSTK AHHNNHQQTHKYLEANAF+GSWEAPKDPKEAQA+LAQLRRDYAKQ+KQVRKNYIQEVELLRLEK+RKDEAK
Subjt:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK

Query:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL
        REALRVANEERKKLKAEAAK RAEERK+A+EEFR TLMKERAEK EHWRMMEK RDEKKKEKN+L+RRQSS+W+DE KLEEKLL+AIVNTTPL
Subjt:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL

TrEMBL top hits    e value    % identity
A0A1S3AWS6 LOW QUALITY PROTEIN: chromatin assembly factor 1 subunit A    1.6e-74    85.49
Query:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK
        MAFAS S IR ATNLSNNLLRSFSTS K AH+NNH QTH+YLEAN+F+GSW+APKDPKEAQARLA+LRR+YAKQVKQVRKNYIQEVELLRLEK++KDEAK
Subjt:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK

Query:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL
        REALRVANEERKKLKAEAAK RAEERKIA+EEFR TLMKERAEK EHWRMMEK R+EK KEK EL+RRQSS WIDE KLEEKLLEAIVNT  L
Subjt:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL

A0A6J1CYM7 uncharacterized protein LOC111015987    6.2e-79    88.08
Query:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK
        MA +SRS+IR+ATNLSNNLLRSFSTSTK AHHN+HQQTHK+LEANAFVGSWE PKDP+EAQA+L QLRRDYAKQVKQVRKNYIQEVELLRLE QRKDEAK
Subjt:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK

Query:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL
        REALRVANEERKKLKAEAAKARAEERK+A+EEFRRTLMKER+EK EHWR MEK R+EKKKEKNELIRRQSS+WIDE KLE KLL+AIVNTTPL
Subjt:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL

A0A6J1E8D4 UPF0329 protein ECU05_1680/ECU11_0050-like    1.1e-80    89.64
Query:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK
        MA  SRS+IRTATN+SNNLLRSFSTS K AHHNNHQQTHKYLEA+AFVGSWEAPKDPKEAQA+LA LRRDYAKQVKQVRKNYIQEVE+LRLEKQRKDEAK
Subjt:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK

Query:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL
        REALRVANEERKKLKAEAAKARAEERK+A+EEFRRTLMKER EK EHWRMMEK R+EKKKEKNE IRRQSSMW+DEKKLEEKLL+AIVNT PL
Subjt:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL

A0A6J1FLI1 UPF0329 protein ECU05_1680/ECU11_0050-like    1.7e-81    91.19
Query:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK
        MAFASRS+IR ATN S+ LLRSFSTSTK AHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLA LRRDYAK+VKQVRKNYIQEVELLRLEKQRKDEAK
Subjt:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK

Query:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL
        REALRV NEERKKLKAEAAKARAEERK+A+EEFR+TLMKERAEK EHWRMMEKTR+EKK EKNELIRRQS MWIDEKKLEEKLL+AIVNTTPL
Subjt:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL

A0A6J1IZ92 uncharacterized protein LOC111479798    1.9e-80    89.64
Query:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK
        MA  SRS+IR+ATNLSNNLLRSFSTS K AHHNNHQQTHKYLEA+AFVGSWEAPKDPKEAQA+LA LRRDYAKQVKQVRKNYIQEVELLRLEKQ KDEAK
Subjt:  MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAK

Query:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL
        REALRVANEERKKLKAEAAKARAEERK+A+EEFRRTLMKER EK EHWRMMEK R+EKKKEKNE+IRRQSSMW+DEKKLEEKLL+AIVNT PL
Subjt:  REALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL

SwissProt top hits    e value    % identity
No hits found
Arabidopsis top hits    e value    % identity
AT5G49210.1 unknown protein    4.7e-47    58.08
Query:  MAFASRSSIRTATNLSNNL-----LRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQR
        MAF ++ S R     S N      LRSFS      HH  HQ+TH +LE   ++GSWEAP DPK+A+ +LAQLRRDYAK+V+  RK YI E+E+LR+EKQR
Subjt:  MAFASRSSIRTATNLSNNL-----LRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQR

Query:  KDEAKREALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL
        KDEA+  A R ANEER+ LKAEAAK RAEERKIA+EEFR+TL+KERAEK E W+MM + R+EK KE+ +L+R QSS+WI++K+LE K+ EA+V+   L
Subjt:  KDEAKREALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL

AT5G49210.2 unknown protein    4.7e-47    58.08
Query:  MAFASRSSIRTATNLSNNL-----LRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQR
        MAF ++ S R     S N      LRSFS      HH  HQ+TH +LE   ++GSWEAP DPK+A+ +LAQLRRDYAK+V+  RK YI E+E+LR+EKQR
Subjt:  MAFASRSSIRTATNLSNNL-----LRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQR

Query:  KDEAKREALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL
        KDEA+  A R ANEER+ LKAEAAK RAEERKIA+EEFR+TL+KERAEK E W+MM + R+EK KE+ +L+R QSS+WI++K+LE K+ EA+V+   L
Subjt:  KDEAKREALRVANEERKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL
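
The % identity column can be related to the middle row of each alignment block. The sketch below assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and uses only the first ten columns of the KAG6582480.1 alignment as an excerpt. The function name `midline_stats` is hypothetical and is not part of CuGenDB.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identical and positive columns from a BLAST-style midline.

    Assumes: letter = identical residue, '+' = conservative substitution,
    space = mismatch. The alignment length is taken from the query row,
    because trailing spaces in the midline are easily lost when copying.
    """
    length = len(query)
    midline = midline.ljust(length)[:length]  # pad or trim to the query length
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return {
        "length": length,
        "identical": identical,
        "positives": positives,
        "pct_identity": round(100 * identical / length, 2),
    }


# First ten alignment columns of the first GenBank hit (excerpt only).
query = "MAFASRSSIR"
midline = "MA  SRS+IR"
stats = midline_stats(query, midline)
print(stats)
```

Applied to the complete query and midline rows of a hit (both blocks concatenated), the same count gives a value that can be compared against the % identity reported in the table.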


Sequences
CDS sequence
ATGGCGTTTGCGTCTCGTTCTTCGATCCGCACTGCAACCAATCTCTCCAATAACCTTCTGCGATCCTTCTCCACTTCCACCAAACCCGCTCACCATAACAATCACCAGCA
GACGCACAAATACTTGGAGGCGAACGCCTTCGTTGGAAGCTGGGAGGCGCCGAAAGATCCCAAGGAAGCGCAGGCCAGGCTCGCTCAGCTTCGAAGGGACTACGCCAAAC
AGGTGAAGCAGGTGCGCAAGAATTACATCCAGGAGGTCGAACTCTTGAGACTCGAAAAGCAGCGCAAGGACGAAGCAAAGAGAGAGGCGCTTAGGGTTGCCAATGAAGAA
CGGAAGAAACTTAAAGCCGAAGCTGCTAAAGCCCGAGCTGAAGAGCGTAAGATTGCCGAGGAAGAGTTCCGACGGACTTTGATGAAAGAAAGAGCTGAGAAGCATGAGCA
TTGGAGAATGATGGAAAAGACGAGGGATGAAAAGAAGAAAGAAAAGAATGAGCTAATAAGACGGCAGAGTTCCATGTGGATTGATGAAAAGAAGTTGGAAGAGAAGCTAT
TAGAGGCCATTGTTAATACCACCCCTCTCTGA
mRNA sequence
CCTAGGGGGGGAGAGAACAAACAAGCTTAAACCTTCCTTCTTCAAGCCCAGAAGCTTGTATCTCTACCATCTCATACAACAATGGCGTTTGCGTCTCGTTCTTCGATCCG
CACTGCAACCAATCTCTCCAATAACCTTCTGCGATCCTTCTCCACTTCCACCAAACCCGCTCACCATAACAATCACCAGCAGACGCACAAATACTTGGAGGCGAACGCCT
TCGTTGGAAGCTGGGAGGCGCCGAAAGATCCCAAGGAAGCGCAGGCCAGGCTCGCTCAGCTTCGAAGGGACTACGCCAAACAGGTGAAGCAGGTGCGCAAGAATTACATC
CAGGAGGTCGAACTCTTGAGACTCGAAAAGCAGCGCAAGGACGAAGCAAAGAGAGAGGCGCTTAGGGTTGCCAATGAAGAACGGAAGAAACTTAAAGCCGAAGCTGCTAA
AGCCCGAGCTGAAGAGCGTAAGATTGCCGAGGAAGAGTTCCGACGGACTTTGATGAAAGAAAGAGCTGAGAAGCATGAGCATTGGAGAATGATGGAAAAGACGAGGGATG
AAAAGAAGAAAGAAAAGAATGAGCTAATAAGACGGCAGAGTTCCATGTGGATTGATGAAAAGAAGTTGGAAGAGAAGCTATTAGAGGCCATTGTTAATACCACCCCTCTC
TGAGTTCTCTTTGCGAGATTGATTTAGAGGACTACTCTTTAACTTTGAGTGATAATTGGCCAAGATCTTTTTAAATTCTCATATTTCTGGTTAGAAGCAATTAAGATCAT
TTTAATGTTGGATTCTAGCTGGTAAATTTCTGTACATCCAAGACTTGTACCAATCGTGATGAATTTGGTTTTGCATTAGTAGCTCAATTAATGCTGCTGATGGTCATAGT
TCAAATACTAAACTGATTAAGATTGGGTATTTAAACTCGGATGTAAGTTCAAATCTACTCCTTGTTTGTGAGATTGAACTTAACTAATGCATTTTACTTTTTGTACATAT
ATATATCGATGAGAGATTGAACCAGC
Protein sequence
MAFASRSSIRTATNLSNNLLRSFSTSTKPAHHNNHQQTHKYLEANAFVGSWEAPKDPKEAQARLAQLRRDYAKQVKQVRKNYIQEVELLRLEKQRKDEAKREALRVANEE
RKKLKAEAAKARAEERKIAEEEFRRTLMKERAEKHEHWRMMEKTRDEKKKEKNELIRRQSSMWIDEKKLEEKLLEAIVNTTPL
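
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. This is a minimal sketch, assuming the standard nuclear codon table and a CDS that starts in frame; `translate` is a hypothetical helper, and only the first 21 and last 18 nucleotides of the CDS are used as excerpts.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGCGTTTGCGTCTCGTTCT"  # first 7 codons of the CDS
cds_end = "AATACCACCCCTCTCTGA"       # last 5 codons plus the TGA stop
print(translate(cds_start))  # MAFASRS
print(translate(cds_end))    # NTTPL
```

The two excerpts give the first seven and last five residues of the protein sequence listed above. Passing the full CDS, with its line breaks, to `translate` allows the same comparison over the whole sequence.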