; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016670 (gene) of Snake gourd v1 genome

Gene IDTan0016670
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein SOB FIVE-LIKE 5
Genome locationLG02:80403381..80405068
RNA-Seq ExpressionTan0016670
SyntenyTan0016670
Gene Ontology termsGO:0009691 - cytokinin biosynthetic process (biological process)
InterPro domainsIPR044670 - SOB-five-Like (SOFL) family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598325.1 hypothetical protein SDJN03_08103, partial [Cucurbita argyrosperma subsp. sororia]1.0e-5070.62Show/hide
Query:  MSGFSGSHC-NSGCESGWTVYFEESLETP--------DYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE
        MS FSGSHC N+ CESGWT+Y EES +T         DY GGR   ++E+E EGDLSMISDASSGPR         EEN QSV RN GK AA KSKRKEE
Subjt:  MSGFSGSHC-NSGCESGWTVYFEESLETP--------DYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE

Query:  MGRRNQHSCLDDTASSPVFGLSK-RETNPHTKEGFVENVKEFSQSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQE
        MGRRN+HSCLDDTASSPVFGLSK RETNP+  EG VENVKEFSQ+HSRKQHGK  TS FQSSS KK  KNVSGDFQE
Subjt:  MGRRNQHSCLDDTASSPVFGLSK-RETNPHTKEGFVENVKEFSQSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQE

KAG7029296.1 hypothetical protein SDJN02_07634, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-5170.06Show/hide
Query:  MSGFSGSHC-NSGCESGWTVYFEESLETP--------DYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE
        MS FSGSHC N+ CESGWT+Y EES +T         DYGG  +  ++E+E EGDLSMISDASSGPR         EEN QSV RN GK+AA KSKRKEE
Subjt:  MSGFSGSHC-NSGCESGWTVYFEESLETP--------DYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE

Query:  MGRRNQHSCLDDTASSPVFGLSK-RETNPHTKEGFVENVKEFSQSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQE
        MGRRN+HSCLDDTASSPVFGLSK RETNP+  EG VENVKEFSQ+HSRKQHGK  TS FQSSS KK  KNVSGDFQE
Subjt:  MGRRNQHSCLDDTASSPVFGLSK-RETNPHTKEGFVENVKEFSQSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQE

XP_022962515.1 uncharacterized protein LOC111462920 [Cucurbita moschata]9.0e-5271.19Show/hide
Query:  MSGFSGSHC-NSGCESGWTVYFEESLETP--------DYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE
        MS FSGSHC N+ CESGWT+Y EES ET         DYGG   G ++E+E EGDLSMISDASSGPR         EEN QSV RN GK+AA KSKRKEE
Subjt:  MSGFSGSHC-NSGCESGWTVYFEESLETP--------DYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE

Query:  MGRRNQHSCLDDTASSPVFGLSK-RETNPHTKEGFVENVKEFSQSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQE
        MGRRN+HSCLDDTASSPVFGLSK RETNP+  EG VENVKEFSQ+HSRKQHGK  TS FQSSS KK  KNVSGDFQE
Subjt:  MGRRNQHSCLDDTASSPVFGLSK-RETNPHTKEGFVENVKEFSQSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQE

XP_022997597.1 uncharacterized protein LOC111492483 [Cucurbita maxima]5.3e-5270.62Show/hide
Query:  MSGFSGSHC-NSGCESGWTVYFEESLETP--------DYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE
        MS FSGSHC N+ CESGWT+Y EES ET         DY GGR   ++E+E EGDLSMISDASSGPR         EEN QSVRRNGGK+AA KSKRKE+
Subjt:  MSGFSGSHC-NSGCESGWTVYFEESLETP--------DYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE

Query:  MGRRNQHSCLDDTASSPVFGLSK-RETNPHTKEGFVENVKEFSQSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQE
        MGRRN+HSCLDDTASSPVFGLSK RETNP+  EG VENVKEFSQ+HSRKQHGK  T  FQSSS KK+ KNVSGDF+E
Subjt:  MGRRNQHSCLDDTASSPVFGLSK-RETNPHTKEGFVENVKEFSQSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQE

XP_038886134.1 uncharacterized protein LOC120076391 isoform X2 [Benincasa hispida]6.9e-5272.47Show/hide
Query:  MSGFSGSHC-NSGCESGWTVYFEESLETP--------DYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE
        MS FSGSHC NSGCESGWT+YFEES+ET         DYGGG      ++E EGDLSMISDASSG  P N Y +  E NYQ VRRNGGKSAA KSKRKEE
Subjt:  MSGFSGSHC-NSGCESGWTVYFEESLETP--------DYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE

Query:  MGRRNQHSCLDDTASSPVFGLSK-RETNPHTKEGFVENVKEFS-QSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQE
        MGRRNQHSCLDDTASSPVFGLS  RE +P+  EG VENVKEFS Q+HSRKQHGK  TS FQSSS KKL KNVSGD+QE
Subjt:  MGRRNQHSCLDDTASSPVFGLSK-RETNPHTKEGFVENVKEFS-QSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQE

TrEMBL top hitse value%identityAlignment
A0A0A0LR31 Uncharacterized protein4.5e-4970.06Show/hide
Query:  MSGFSGSHC-NSGCESGWTVYFEESLETP--------DYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE
        MS F+GSHC NSGCESGWT+YFEES+ET         DYGGG    + E+E E DLSMISDASSG  PRN Y +  E+N QSV R+GGK  A KSKR+EE
Subjt:  MSGFSGSHC-NSGCESGWTVYFEESLETP--------DYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE

Query:  MGRRNQHSCLDDTASSPVFGLSK-RETNPHTKEGFVENVKEFSQSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQE
        MG RNQHSCLDDTASSPVFGLSK RETNP+  EG  +NVKEFSQ+HSRK HGK  TS FQSSS KKLAKNVSGD+QE
Subjt:  MGRRNQHSCLDDTASSPVFGLSK-RETNPHTKEGFVENVKEFSQSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQE

A0A6J1BR82 uncharacterized protein LOC111005036 isoform X21.6e-4967.23Show/hide
Query:  MSGFSGSHCNSGCESGWTVYFEESLE-----TPDYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDD----VEENYQSV-RRNGGKSAALKSKRKE
        MS FSG+HCNSGC+SGWTVYF++S E       DYGGG E        EGDLSM+SDASSG  PRN + D      E N+Q V RRNGGKSAA K+KR++
Subjt:  MSGFSGSHCNSGCESGWTVYFEESLE-----TPDYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDD----VEENYQSV-RRNGGKSAALKSKRKE

Query:  EMGRRNQHSCLDDTASSPVFGLSKRETNPHTKEGFVENVKEFSQSHSRKQHGKTSIFQSSSTKKLAKNVSGDFQEAK
        E+GRRNQHS LDDTA+SPVF LSKRETNP+T E  VENVKEF+ SHSRKQ GKTS FQSSS KK  +NVSGDFQEAK
Subjt:  EMGRRNQHSCLDDTASSPVFGLSKRETNPHTKEGFVENVKEFSQSHSRKQHGKTSIFQSSSTKKLAKNVSGDFQEAK

A0A6J1BRF5 uncharacterized protein LOC111005036 isoform X15.0e-4866.48Show/hide
Query:  MSGFSGSHCNSGCESGWTVYFEESLE-----TPDYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDD----VEENYQSV-RRNGGKSAALKSKRKE
        MS FSG+HCNSGC+SGWTVYF++S E       DYGGG E        EGDLSM+SDASSG  PRN + D      E N+Q V RRNGGKSAA K+KR++
Subjt:  MSGFSGSHCNSGCESGWTVYFEESLE-----TPDYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDD----VEENYQSV-RRNGGKSAALKSKRKE

Query:  EMGRRNQHSCLDDTASSPVFGLSKRETNPHTKEGFVENVKEFSQSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQEAK
        E+GRRNQHS LDDTA+SPVF LSKRETNP+T E  VENVKEF+ SHSRKQ GK  TS FQSSS KK  +NVSGDFQEAK
Subjt:  EMGRRNQHSCLDDTASSPVFGLSKRETNPHTKEGFVENVKEFSQSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQEAK

A0A6J1HFC4 uncharacterized protein LOC1114629204.4e-5271.19Show/hide
Query:  MSGFSGSHC-NSGCESGWTVYFEESLETP--------DYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE
        MS FSGSHC N+ CESGWT+Y EES ET         DYGG   G ++E+E EGDLSMISDASSGPR         EEN QSV RN GK+AA KSKRKEE
Subjt:  MSGFSGSHC-NSGCESGWTVYFEESLETP--------DYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE

Query:  MGRRNQHSCLDDTASSPVFGLSK-RETNPHTKEGFVENVKEFSQSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQE
        MGRRN+HSCLDDTASSPVFGLSK RETNP+  EG VENVKEFSQ+HSRKQHGK  TS FQSSS KK  KNVSGDFQE
Subjt:  MGRRNQHSCLDDTASSPVFGLSK-RETNPHTKEGFVENVKEFSQSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQE

A0A6J1K5J3 uncharacterized protein LOC1114924832.6e-5270.62Show/hide
Query:  MSGFSGSHC-NSGCESGWTVYFEESLETP--------DYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE
        MS FSGSHC N+ CESGWT+Y EES ET         DY GGR   ++E+E EGDLSMISDASSGPR         EEN QSVRRNGGK+AA KSKRKE+
Subjt:  MSGFSGSHC-NSGCESGWTVYFEESLETP--------DYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE

Query:  MGRRNQHSCLDDTASSPVFGLSK-RETNPHTKEGFVENVKEFSQSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQE
        MGRRN+HSCLDDTASSPVFGLSK RETNP+  EG VENVKEFSQ+HSRKQHGK  T  FQSSS KK+ KNVSGDF+E
Subjt:  MGRRNQHSCLDDTASSPVFGLSK-RETNPHTKEGFVENVKEFSQSHSRKQHGK--TSIFQSSSTKKLAKNVSGDFQE

SwissProt top hitse value%identityAlignment
Q8L9K4 Protein SOB FIVE-LIKE 53.6e-1134.16Show/hide
Query:  NSGCESGWTVYFEESLETPDYGGGREGN------------------DDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE
        +SGCESGWT+Y ++S+ +P     R+ N                   +E+E E DLSMISDASSG  PRN+     E++ + +   G K    + K++ +
Subjt:  NSGCESGWTVYFEESLETPDYGGGREGN------------------DDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE

Query:  MGRRNQHSCLDDTASSPVFG----LSKRETNPHTKEGFVENVKEFSQSHSRKQHGKTSIFQ
          + N  S LDDTASSP+F     L K       ++ F E+  ++SQ  S  Q    + FQ
Subjt:  MGRRNQHSCLDDTASSPVFG----LSKRETNPHTKEGFVENVKEFSQSHSRKQHGKTSIFQ

Arabidopsis top hitse value%identityAlignment
AT1G58460.1 unknown protein3.3e-0430.73Show/hide
Query:  FSGSHCNSGCESGWTVYFEESLETPDYGGGREGNDDEQEFEGDLSMISDASSGPR---PRNVYCDDVEENYQ----SVRRNGGKSAALKSKRKEEMGRRN
        FS    +   +SGWT+Y   S     +       + +QE + D SM+SDASSGP       V+ D +++N Q    S  +N  K+   K K  EE G   
Subjt:  FSGSHCNSGCESGWTVYFEESLETPDYGGGREGNDDEQEFEGDLSMISDASSGPR---PRNVYCDDVEENYQ----SVRRNGGKSAALKSKRKEEMGRRN

Query:  Q-HSCLDDTASSPVFGLSKRETNPHTK-EGFVENVKEFSQSHSRKQHGKTSI-----FQSSSTKKLA-KNVSGDFQEAK
        + +S  DDTASS   G    E + H + +   +   +F QS+S ++  K  +      Q+    KLA  N  GD Q  +
Subjt:  Q-HSCLDDTASSPVFGLSKRETNPHTK-EGFVENVKEFSQSHSRKQHGKTSI-----FQSSSTKKLA-KNVSGDFQEAK

AT4G33800.1 unknown protein2.6e-1234.16Show/hide
Query:  NSGCESGWTVYFEESLETPDYGGGREGN------------------DDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE
        +SGCESGWT+Y ++S+ +P     R+ N                   +E+E E DLSMISDASSG  PRN+     E++ + +   G K    + K++ +
Subjt:  NSGCESGWTVYFEESLETPDYGGGREGN------------------DDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEE

Query:  MGRRNQHSCLDDTASSPVFG----LSKRETNPHTKEGFVENVKEFSQSHSRKQHGKTSIFQ
          + N  S LDDTASSP+F     L K       ++ F E+  ++SQ  S  Q    + FQ
Subjt:  MGRRNQHSCLDDTASSPVFG----LSKRETNPHTKEGFVENVKEFSQSHSRKQHGKTSIFQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGGTTTTCGGGTTCGCATTGTAATAGTGGCTGTGAATCGGGTTGGACGGTGTATTTCGAGGAATCCTTGGAGACGCCGGATTACGGCGGCGGAAGGGAGGGGAA
TGACGACGAACAGGAATTTGAAGGGGATTTGTCCATGATTTCGGACGCGTCGTCGGGGCCGAGGCCGAGGAATGTCTATTGTGACGACGTTGAGGAGAATTACCAGTCAG
TGCGGCGGAATGGGGGGAAATCGGCGGCTCTGAAAAGTAAGAGGAAAGAAGAAATGGGGCGGCGGAACCAACATTCTTGCCTTGATGACACTGCAAGCTCCCCTGTTTTT
GGGCTTTCTAAGAGAGAGACAAACCCACATACAAAGGAAGGTTTCGTGGAGAATGTAAAAGAATTCTCACAAAGCCATTCTCGAAAACAACATGGCAAGACAAGCATTTT
TCAGTCTTCCTCTACCAAAAAATTGGCCAAAAATGTCTCAGGCGATTTTCAGGAAGCCAAATGGGAACATTAG
mRNA sequenceShow/hide mRNA sequence
TTGATCCCCAAACGCCACACTCTATAACACACAAACCAAACCAAAAATTTTCACCATCCCCTCAACTCAACCGCGCCATTGCATTCTATTCGAATTCCCTTCCTCTGTTT
TTCTGGTTTCAAGATGAGTGGGTTTTCGGGTTCGCATTGTAATAGTGGCTGTGAATCGGGTTGGACGGTGTATTTCGAGGAATCCTTGGAGACGCCGGATTACGGCGGCG
GAAGGGAGGGGAATGACGACGAACAGGAATTTGAAGGGGATTTGTCCATGATTTCGGACGCGTCGTCGGGGCCGAGGCCGAGGAATGTCTATTGTGACGACGTTGAGGAG
AATTACCAGTCAGTGCGGCGGAATGGGGGGAAATCGGCGGCTCTGAAAAGTAAGAGGAAAGAAGAAATGGGGCGGCGGAACCAACATTCTTGCCTTGATGACACTGCAAG
CTCCCCTGTTTTTGGGCTTTCTAAGAGAGAGACAAACCCACATACAAAGGAAGGTTTCGTGGAGAATGTAAAAGAATTCTCACAAAGCCATTCTCGAAAACAACATGGCA
AGACAAGCATTTTTCAGTCTTCCTCTACCAAAAAATTGGCCAAAAATGTCTCAGGCGATTTTCAGGAAGCCAAATGGGAACATTAGAAAAGGAGTACAATGAAAATTATA
AGTAAATGAAATGAATAGAGACACGAGGATGATGGATATTATTATTTTGTTGGT
Protein sequenceShow/hide protein sequence
MSGFSGSHCNSGCESGWTVYFEESLETPDYGGGREGNDDEQEFEGDLSMISDASSGPRPRNVYCDDVEENYQSVRRNGGKSAALKSKRKEEMGRRNQHSCLDDTASSPVF
GLSKRETNPHTKEGFVENVKEFSQSHSRKQHGKTSIFQSSSTKKLAKNVSGDFQEAKWEH