; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy5G012717 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy5G012717
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionIleal sodium/bile acid cotransporter, putative
Genome locationGy14Chr5:14983420..14984968
RNA-Seq ExpressionCsGy5G012717
SyntenyCsGy5G012717
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050246.1 putative Ileal sodium/bile acid cotransporter [Cucumis melo var. makuwa]4.72e-4357.52Show/hide
Query:  MSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQ-SPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPPLLIHFFAAAVFISFAVSFAALFLHNS
        M+   SFI  I++RN  +       N LP+ I MQ+ SP  ++ +   +GK IL L+FQAVLALFI+SP +SPPLL H FAAAV ISFAVSFA +FL N 
Subjt:  MSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQ-SPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPPLLIHFFAAAVFISFAVSFAALFLHNS

Query:  FPRTAHLFEKIGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK
        FPR A LFEKIGAL +A GVC +AS LL+HQNFAWI W+A  FS++ F LSF+
Subjt:  FPRTAHLFEKIGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK

KAG6579132.1 hypothetical protein SDJN03_23580, partial [Cucurbita argyrosperma subsp. sororia]2.99e-3654.62Show/hide
Query:  TPSNCLPVFIRMQQSPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPPLLIHFFAAAVFISFAVSFAALFLHNSFPRTAHLFEKIGALFSAFGVCFI
        TP +CLP++   +Q  +     S  +GK I+ L+ QA+LA+FISSP +SPPLL   F A +FISF +SFA +FL N+FP+ A LFEK+GALF+A GV  I
Subjt:  TPSNCLPVFIRMQQSPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPPLLIHFFAAAVFISFAVSFAALFLHNSFPRTAHLFEKIGALFSAFGVCFI

Query:  ASFLLVHQNFAWICWVACTFSIIVFALSFK
        ASFLL+H+N+AWI  +AC FS+IVF LS+ 
Subjt:  ASFLLVHQNFAWICWVACTFSIIVFALSFK

KAG6579136.1 hypothetical protein SDJN03_23584, partial [Cucurbita argyrosperma subsp. sororia]5.40e-3349.08Show/hide
Query:  MASTSHQTSLDMSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQSPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPPLLIHFFAAAVFISFAV
        M ST    S+DM+ FSS    I   + GL                 + PQA  +    LGKI+  L+FQAVLALFIS+P + PPLLI+ FAAA+ IS A+
Subjt:  MASTSHQTSLDMSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQSPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPPLLIHFFAAAVFISFAV

Query:  SFAALFLHNSFPRTAHLFEKIGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK
        S AALFL  +FPR A  F KIGA+ +A G C +AS LL HQNF+WI W+AC F+++ F LSFK
Subjt:  SFAALFLHNSFPRTAHLFEKIGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK

KGN49803.1 hypothetical protein Csa_004681 [Cucumis sativus]7.61e-4254.27Show/hide
Query:  MASTSHQTSLDMSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQ-SPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPPLLIHFFAAAVFISFA
        M S+S Q S+DM+   S I +I++RN  +       N LP+ I MQ  SP  ++ +   +G  IL L+FQAVLALFI+S  +SPPLL H F AAV ISFA
Subjt:  MASTSHQTSLDMSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQ-SPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPPLLIHFFAAAVFISFA

Query:  VSFAALFLHNSFPRTAHLFEKIGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK
        VSF  +FL + FPR A LFEKIGAL +A GVC +AS LL+HQNFAWI W+AC FS++ F LSF+
Subjt:  VSFAALFLHNSFPRTAHLFEKIGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK

KGN49808.1 hypothetical protein Csa_004650 [Cucumis sativus]2.33e-10598.78Show/hide
Query:  MASTSHQTSLDMSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQSPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPP-LLIHFFAAAVFISFA
        MASTSHQTSLDMSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQSPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPP LLIHFFAAAVFISFA
Subjt:  MASTSHQTSLDMSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQSPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPP-LLIHFFAAAVFISFA

Query:  VSFAALFLHNSFPRTAHLFEKIGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK
        VSFAALFLHNSFPRTAHLFEK+GALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK
Subjt:  VSFAALFLHNSFPRTAHLFEKIGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK

TrEMBL top hitse value%identityAlignment
A0A0A0KJN8 Uncharacterized protein3.68e-4254.27Show/hide
Query:  MASTSHQTSLDMSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQ-SPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPPLLIHFFAAAVFISFA
        M S+S Q S+DM+   S I +I++RN  +       N LP+ I MQ  SP  ++ +   +G  IL L+FQAVLALFI+S  +SPPLL H F AAV ISFA
Subjt:  MASTSHQTSLDMSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQ-SPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPPLLIHFFAAAVFISFA

Query:  VSFAALFLHNSFPRTAHLFEKIGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK
        VSF  +FL + FPR A LFEKIGAL +A GVC +AS LL+HQNFAWI W+AC FS++ F LSF+
Subjt:  VSFAALFLHNSFPRTAHLFEKIGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK

A0A0A0KJP2 Uncharacterized protein1.13e-10598.78Show/hide
Query:  MASTSHQTSLDMSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQSPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPP-LLIHFFAAAVFISFA
        MASTSHQTSLDMSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQSPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPP LLIHFFAAAVFISFA
Subjt:  MASTSHQTSLDMSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQSPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPP-LLIHFFAAAVFISFA

Query:  VSFAALFLHNSFPRTAHLFEKIGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK
        VSFAALFLHNSFPRTAHLFEK+GALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK
Subjt:  VSFAALFLHNSFPRTAHLFEKIGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK

A0A0A0KQ03 Uncharacterized protein2.55e-2841.36Show/hide
Query:  HQTSLDMSAFSSFIRTISQRNYGLGCCC---TPSNCLPVFIRMQQSPQATADSSQKLGKIILSLSFQAVLALFIS-SPPTSPPLLIHFFAAAVFISFAVS
        H+ SLDM+  +S I  ++ RN G+  C      S+CLP++I+MQ+       +S + G   LSL+FQA++ LF+S +P +S PL    FAA +  SF  S
Subjt:  HQTSLDMSAFSSFIRTISQRNYGLGCCC---TPSNCLPVFIRMQQSPQATADSSQKLGKIILSLSFQAVLALFIS-SPPTSPPLLIHFFAAAVFISFAVS

Query:  FAALFLHNSFPRTAHLFEKIGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK
        +  + L   FP+TA L +  GALF+A G C I S LL++ NF WICW+A    +  F +SFK
Subjt:  FAALFLHNSFPRTAHLFEKIGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK

A0A5A7U7U1 Putative Ileal sodium/bile acid cotransporter2.28e-4357.52Show/hide
Query:  MSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQ-SPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPPLLIHFFAAAVFISFAVSFAALFLHNS
        M+   SFI  I++RN  +       N LP+ I MQ+ SP  ++ +   +GK IL L+FQAVLALFI+SP +SPPLL H FAAAV ISFAVSFA +FL N 
Subjt:  MSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQ-SPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPPLLIHFFAAAVFISFAVSFAALFLHNS

Query:  FPRTAHLFEKIGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK
        FPR A LFEKIGAL +A GVC +AS LL+HQNFAWI W+A  FS++ F LSF+
Subjt:  FPRTAHLFEKIGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK

A0A6J1CY58 uncharacterized protein LOC1110156585.19e-3259.52Show/hide
Query:  MASTSHQTSLDMSAFSSFIRTISQRNYGL------GCCCTPSNCLPVFIRMQQSPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPPLLIHFFAAAV
        MAST  QTS+DM+A SSFI  I++RN G+      G   TPS+CLP+ IRMQ+ P A   SSQ LGK IL L+FQAVLALFIS P + P L    F AAV
Subjt:  MASTSHQTSLDMSAFSSFIRTISQRNYGL------GCCCTPSNCLPVFIRMQQSPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPPLLIHFFAAAV

Query:  FISFAVSFAALFLHNSFPRTAHLFEK
         ISFAVSFA LFL  ++PR A LFEK
Subjt:  FISFAVSFAALFLHNSFPRTAHLFEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCAACTTCACACCAAACCTCTTTAGATATGAGTGCATTTTCTTCTTTTATCCGAACAATCAGCCAAAGAAACTATGGACTTGGCTGTTGCTGCACACCATCAAA
CTGCTTACCAGTTTTCATTAGAATGCAACAAAGCCCTCAAGCAACAGCCGATAGCTCTCAAAAACTTGGCAAGATAATTCTTAGCCTTAGTTTTCAAGCAGTTTTAGCCT
TGTTCATTAGCTCACCACCAACTTCCCCTCCACTTTTGATACACTTTTTTGCTGCTGCTGTTTTCATTAGCTTTGCTGTTTCATTTGCTGCTCTTTTCCTTCATAACTCC
TTCCCGAGAACCGCCCATTTATTCGAGAAGATTGGTGCGCTTTTTTCTGCATTTGGTGTGTGTTTCATAGCAAGCTTTCTTCTAGTTCATCAGAACTTTGCTTGGATTTG
TTGGGTGGCATGTACCTTCTCCATCATTGTCTTTGCTTTATCATTTAAGTGA
mRNA sequenceShow/hide mRNA sequence
AGTTGAATCTTTAGTCAAATTTCAAAAACAAAATCTTCATTATTTTGGCTTTGGTTAGAAGAATATAATACAACAATAAACTTAAGGTAGAAGAAATGTTTCTAGCCTCC
AATTTCAAAAACTAAAAATCAAACACAACTGATTACCGAACGAAATCTAATATATTTTCATTTTTATATGCTTTATTCGATCCTTTCAACATTAAAACTCAATGACACAA
TACTCAGAAGTTCAAATCCTTTTACATGATGTTCAACTCAAATTCATACATATTTACACATACATATAATGAAGAAACTTCTTTAAACAAGCAAAATCTTCTCTGATAAA
TGCAAGTGTGTGGCTTAGTATTTACCATGAAAGTTTAATTGATTGATGATTGTGTGTGAAAGAGAAGCAAGAGTCAAACAACTTGACTAAAAAAGTGTTACAAAAAAAAC
CCTAACAACTTCTAAGAAACTTCTAATAAATTGCACTCGAATTACCAAAACAATTGTAGTAGAATCAAATATCACCAATAGAAAAAGGGCAACCCATACTGAATGGGTCA
AAGGGAAACCCCTTGAGAGTGTATTGAAAAAACTATCATTTTCTTTCCTTTTTTTAGCTTTCTTTTCCAATGGCTTCAACTTCACACCAAACCTCTTTAGATATGAGTGC
ATTTTCTTCTTTTATCCGAACAATCAGCCAAAGAAACTATGGACTTGGCTGTTGCTGCACACCATCAAACTGCTTACCAGTTTTCATTAGAATGCAACAAAGCCCTCAAG
CAACAGCCGATAGCTCTCAAAAACTTGGCAAGATAATTCTTAGCCTTAGTTTTCAAGCAGTTTTAGCCTTGTTCATTAGCTCACCACCAACTTCCCCTCCACTTTTGATA
CACTTTTTTGCTGCTGCTGTTTTCATTAGCTTTGCTGTTTCATTTGCTGCTCTTTTCCTTCATAACTCCTTCCCGAGAACCGCCCATTTATTCGAGAAGATTGGTGCGCT
TTTTTCTGCATTTGGTGTGTGTTTCATAGCAAGCTTTCTTCTAGTTCATCAGAACTTTGCTTGGATTTGTTGGGTGGCATGTACCTTCTCCATCATTGTCTTTGCTTTAT
CATTTAAGTGACATTTTTTAGACATAAAATGGCCATTTCTTTACCCATCCATTAGGCATTTTGTTTGACCTTCTTCCATGGCTCCATCAAGAATAATGAGCTCTCTCTGT
ATGGATTTTTTTTCCCAGTTGAGATAATATTATTGAAGAAAAAGTTGTTCCCACAGATGTATAGGTTGAAATAGCATTTTTGATGTTGTATGTTGAATTTCGTTCCATTT
TAATTCTCGTATACAATACTTTCTAATGCTAAATGTCTAGTTGTCTAAATGAGTTTGATTAAGTGATATTGACGTCATTTTCGTTTTTAGAAGTTGAAAGTTCAATCCTT
ATTGTTTTACTATTAAAGAAATTATTAACGTCTAATAGTGTGATTCAACGAATCTTAGAAACAGTCCATATTATTGGTTTGTTATATATATATATATATATATATTAAAT
AAAGAGTTT
Protein sequenceShow/hide protein sequence
MASTSHQTSLDMSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQSPQATADSSQKLGKIILSLSFQAVLALFISSPPTSPPLLIHFFAAAVFISFAVSFAALFLHNS
FPRTAHLFEKIGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK