; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015213 (gene) of Snake gourd v1 genome

Gene IDTan0015213
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionIleal sodium/bile acid cotransporter, putative
Genome locationLG11:15837569..15840392
RNA-Seq ExpressionTan0015213
SyntenyTan0015213
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050244.1 putative Ileal sodium/bile acid cotransporter [Cucumis melo var. makuwa]1.2e-3556.98Show/hide
Query:  MAST-TQEASLEMNE-YSFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGFPTSSPSLPTLLF
        MA T   E SL+MNE YSFITA+  RN GISS  WG   + ASDCLPIYI+MQRP P  SP        G T +S  FQAIL LF+  P S P L  L+ 
Subjt:  MAST-TQEASLEMNE-YSFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGFPTSSPSLPTLLF

Query:  RTAVLISFAVSFAGVFLQNAFPRMALLLEKFGALFAAIGVCIISSLLVHQNLAWICWLACGSCLLAFIISFK
         T +L +F  SF GV LQ  FP++A LL+ FGAL AAIGVCI+ S L+H NL WI WLACG  L AFI SFK
Subjt:  RTAVLISFAVSFAGVFLQNAFPRMALLLEKFGALFAAIGVCIISSLLVHQNLAWICWLACGSCLLAFIISFK

KAA0050246.1 putative Ileal sodium/bile acid cotransporter [Cucumis melo var. makuwa]7.9e-4063.75Show/hide
Query:  MNE-YSFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGFPTSSPSLPTLLFRTAVLISFAVSF
        MNE YSFITAINERN  I+S             LPI I MQRP P+ S  + +  N+GKTIL LTFQA+LALFI  P SSP L T LF  AVLISFAVSF
Subjt:  MNE-YSFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGFPTSSPSLPTLLFRTAVLISFAVSF

Query:  AGVFLQNAFPRMALLLEKFGALFAAIGVCIISSLLVHQNLAWICWLACGSCLLAFIISFK
        AG+FLQN FPR+ALL EK GAL AAIGVCI++SLL+HQN AWI WLA G  L+AF++SF+
Subjt:  AGVFLQNAFPRMALLLEKFGALFAAIGVCIISSLLVHQNLAWICWLACGSCLLAFIISFK

KGN49803.1 hypothetical protein Csa_004681 [Cucumis sativus]4.8e-3757.89Show/hide
Query:  MASTTQEASLEMNE-YSFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGFPTSSPSLPTLLFR
        M S++Q+ S++MN  YS IT+INERN  I+              LPI I MQ   P+ S  + +  N+G TIL LTFQA+LALFI   TSSP L T LF 
Subjt:  MASTTQEASLEMNE-YSFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGFPTSSPSLPTLLFR

Query:  TAVLISFAVSFAGVFLQNAFPRMALLLEKFGALFAAIGVCIISSLLVHQNLAWICWLACGSCLLAFIISFK
         AVLISFAVSF GVFLQ+ FPR+ALL EK GAL AAIGVCI++SLL+HQN AWI WLACG  L+AF++SF+
Subjt:  TAVLISFAVSFAGVFLQNAFPRMALLLEKFGALFAAIGVCIISSLLVHQNLAWICWLACGSCLLAFIISFK

KGN49806.1 hypothetical protein Csa_004683 [Cucumis sativus]2.0e-3556.65Show/hide
Query:  MAST-TQEASLEMNEY-SFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGF-PTSSPSLPTLL
        MA T   E SL+MN+  S I A+  RN GISS  WG     ASDCLPIYI+MQRP P  SP        G T LSLTFQAI+ LF+   P+SS  LP+ L
Subjt:  MAST-TQEASLEMNEY-SFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGF-PTSSPSLPTLL

Query:  FRTAVLISFAVSFAGVFLQNAFPRMALLLEKFGALFAAIGVCIISSLLVHQNLAWICWLACGSCLLAFIISFK
        F   +L SF  S+ GV LQ  FP+ A LL+ FGALFAAIG CII SLL++ N  WICWLA G  L AFIISFK
Subjt:  FRTAVLISFAVSFAGVFLQNAFPRMALLLEKFGALFAAIGVCIISSLLVHQNLAWICWLACGSCLLAFIISFK

XP_022146444.1 uncharacterized protein LOC111015658 [Momordica charantia]1.3e-3772.09Show/hide
Query:  MASTTQEASLEMNEY-SFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGFPTSSPSLPTLLFR
        MAST Q+ S++MN   SFITAINERN GISS  WG G ST SDCLPI IRMQRP     PPA S Q+LGKTIL LTFQA+LALFI  P+S P LPTLLF 
Subjt:  MASTTQEASLEMNEY-SFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGFPTSSPSLPTLLFR

Query:  TAVLISFAVSFAGVFLQNAFPRMALLLEK
         AVLISFAVSFAG+FLQ A+PRMALL EK
Subjt:  TAVLISFAVSFAGVFLQNAFPRMALLLEK

TrEMBL top hitse value%identityAlignment
A0A0A0KJN8 Uncharacterized protein2.3e-3757.89Show/hide
Query:  MASTTQEASLEMNE-YSFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGFPTSSPSLPTLLFR
        M S++Q+ S++MN  YS IT+INERN  I+              LPI I MQ   P+ S  + +  N+G TIL LTFQA+LALFI   TSSP L T LF 
Subjt:  MASTTQEASLEMNE-YSFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGFPTSSPSLPTLLFR

Query:  TAVLISFAVSFAGVFLQNAFPRMALLLEKFGALFAAIGVCIISSLLVHQNLAWICWLACGSCLLAFIISFK
         AVLISFAVSF GVFLQ+ FPR+ALL EK GAL AAIGVCI++SLL+HQN AWI WLACG  L+AF++SF+
Subjt:  TAVLISFAVSFAGVFLQNAFPRMALLLEKFGALFAAIGVCIISSLLVHQNLAWICWLACGSCLLAFIISFK

A0A0A0KQ03 Uncharacterized protein9.8e-3656.65Show/hide
Query:  MAST-TQEASLEMNEY-SFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGF-PTSSPSLPTLL
        MA T   E SL+MN+  S I A+  RN GISS  WG     ASDCLPIYI+MQRP P  SP        G T LSLTFQAI+ LF+   P+SS  LP+ L
Subjt:  MAST-TQEASLEMNEY-SFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGF-PTSSPSLPTLL

Query:  FRTAVLISFAVSFAGVFLQNAFPRMALLLEKFGALFAAIGVCIISSLLVHQNLAWICWLACGSCLLAFIISFK
        F   +L SF  S+ GV LQ  FP+ A LL+ FGALFAAIG CII SLL++ N  WICWLA G  L AFIISFK
Subjt:  FRTAVLISFAVSFAGVFLQNAFPRMALLLEKFGALFAAIGVCIISSLLVHQNLAWICWLACGSCLLAFIISFK

A0A5A7U7U1 Putative Ileal sodium/bile acid cotransporter3.8e-4063.75Show/hide
Query:  MNE-YSFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGFPTSSPSLPTLLFRTAVLISFAVSF
        MNE YSFITAINERN  I+S             LPI I MQRP P+ S  + +  N+GKTIL LTFQA+LALFI  P SSP L T LF  AVLISFAVSF
Subjt:  MNE-YSFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGFPTSSPSLPTLLFRTAVLISFAVSF

Query:  AGVFLQNAFPRMALLLEKFGALFAAIGVCIISSLLVHQNLAWICWLACGSCLLAFIISFK
        AG+FLQN FPR+ALL EK GAL AAIGVCI++SLL+HQN AWI WLA G  L+AF++SF+
Subjt:  AGVFLQNAFPRMALLLEKFGALFAAIGVCIISSLLVHQNLAWICWLACGSCLLAFIISFK

A0A5D3BEH8 Putative Ileal sodium/bile acid cotransporter5.7e-3656.98Show/hide
Query:  MAST-TQEASLEMNE-YSFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGFPTSSPSLPTLLF
        MA T   E SL+MNE YSFITA+  RN GISS  WG   + ASDCLPIYI+MQRP P  SP        G T +S  FQAIL LF+  P S P L  L+ 
Subjt:  MAST-TQEASLEMNE-YSFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGFPTSSPSLPTLLF

Query:  RTAVLISFAVSFAGVFLQNAFPRMALLLEKFGALFAAIGVCIISSLLVHQNLAWICWLACGSCLLAFIISFK
         T +L +F  SF GV LQ  FP++A LL+ FGAL AAIGVCI+ S L+H NL WI WLACG  L AFI SFK
Subjt:  RTAVLISFAVSFAGVFLQNAFPRMALLLEKFGALFAAIGVCIISSLLVHQNLAWICWLACGSCLLAFIISFK

A0A6J1CY58 uncharacterized protein LOC1110156586.1e-3872.09Show/hide
Query:  MASTTQEASLEMNEY-SFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGFPTSSPSLPTLLFR
        MAST Q+ S++MN   SFITAINERN GISS  WG G ST SDCLPI IRMQRP     PPA S Q+LGKTIL LTFQA+LALFI  P+S P LPTLLF 
Subjt:  MASTTQEASLEMNEY-SFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGFPTSSPSLPTLLFR

Query:  TAVLISFAVSFAGVFLQNAFPRMALLLEK
         AVLISFAVSFAG+FLQ A+PRMALL EK
Subjt:  TAVLISFAVSFAGVFLQNAFPRMALLLEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCAACCACACAAGAAGCCTCCTTGGAGATGAATGAATATTCCTTCATCACTGCAATCAATGAAAGAAATACTGGAATTAGCAGTCTTATCTGGGGATGGGGAGT
ATCCACAGCATCAGATTGCCTCCCAATTTACATCAGAATGCAGAGGCCTAAGCCTTCACCTTCACCACCAGCCAACAGCCTTCAGAATCTGGGGAAGACAATCCTTAGTC
TTACTTTCCAGGCAATTTTAGCCCTGTTCATCGGTTTTCCCACTTCATCTCCTTCACTTCCTACACTTCTCTTTAGGACTGCTGTGTTGATTAGCTTTGCAGTTTCGTTT
GCTGGAGTTTTCCTTCAAAATGCATTCCCGAGAATGGCGCTGTTGTTGGAAAAGTTTGGTGCTCTTTTTGCTGCAATTGGTGTCTGCATAATATCAAGTCTTCTAGTGCA
TCAAAACCTCGCTTGGATCTGTTGGCTGGCATGTGGCTCCTGCTTGCTAGCCTTTATTATCTCATTCAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATTTTAAAAATAGATGATTCTTTGGACATACATGTCATTTTACATTGTCTAATTACCAATTAGGGACGGAGTTCATACCAATGAAAGAGCAGTGGGTCATATGATTATTG
TACGAAAAAGAAGCAATAGTAAAAGAGAGAGAACTTGACTCAAAAAGTGAACACAATTTCTACGAAATTTAATCAAAAAACTTGTAGGAGAATCACTGAAACTTCTCTCT
CTGTATTCTAGAATACTCCAATGCAATATAAAAAGGAAACTGCAAGACAGAATCGATCCAAGGGAAACCACTTGAAGAGTATTGACCAAATCTTTTTTGTTCCCTCATGG
CATCAACCACACAAGAAGCCTCCTTGGAGATGAATGAATATTCCTTCATCACTGCAATCAATGAAAGAAATACTGGAATTAGCAGTCTTATCTGGGGATGGGGAGTATCC
ACAGCATCAGATTGCCTCCCAATTTACATCAGAATGCAGAGGCCTAAGCCTTCACCTTCACCACCAGCCAACAGCCTTCAGAATCTGGGGAAGACAATCCTTAGTCTTAC
TTTCCAGGCAATTTTAGCCCTGTTCATCGGTTTTCCCACTTCATCTCCTTCACTTCCTACACTTCTCTTTAGGACTGCTGTGTTGATTAGCTTTGCAGTTTCGTTTGCTG
GAGTTTTCCTTCAAAATGCATTCCCGAGAATGGCGCTGTTGTTGGAAAAGTTTGGTGCTCTTTTTGCTGCAATTGGTGTCTGCATAATATCAAGTCTTCTAGTGCATCAA
AACCTCGCTTGGATCTGTTGGCTGGCATGTGGCTCCTGCTTGCTAGCCTTTATTATCTCATTCAAGTGATTTTGACAGAAGATGCATCCAATGTCCGAAGAGGACAGAAG
GAGCACTTTTGTAGAATCCATGCCCATATGTTTGTTTGACTTCTTTTACAATCTCTGAATCTTGACCTGGTGTGGTTCATCACCAAGACAATATCAGCTCCAATTTCCCT
GAAAATCATGTTATTCTACAACTTACAATGACATGATAACTCTGCCTGTGTCATGCAGTAGGCATCATTTCTTCTACGCAGCCGGTCCCAAGCCCGGAACAAAGGAGGAG
GGTAGTGTCATCAGACCATTGGATCTTGAGCTTGTTCTAAAAGACAGAGGCTGAAAGTATAGCTAGCAGTATTTTCATCTTAGTAGTAATAAAGATGGCTGCATCGTGAA
AGATGAGCTCTCTCTATGGTGCATACAGACGAACCACATATTTCTGAAGCTTAGTCATCCCCTAAGGTATAAGTATAACTAAAATCTTGTCAAAATATATGTTTTGAGTC
GTGCAATTAATAGTGGAATGGGCATGCTATGTTATTAGAATGTTAGAAAATATCAATAGACTATGTTGTGGTTCAGATTACATGGGTCCTAAAAAAACACACACATACAC
ACACGGGAACTACAACAACAACTACGGCCACAATTGAAATCAACGTTTTCAAGAACACAGTCCAAAAAAAGAAAGATTAGCAAACAAAAGCCATTTCATTTCATGAAACA
AAAGAGAATTTCATCACATCCTCTCCAGTTCTGCCTCAAAGCCATTTCAGACGCCAACAATAAATAAGAAGAAAAGGAGAGATTTGATGCTGCACAATGCTCGTATGGAC
TGAGATACAAGTTCCCTGCAGACCATGAATTTTAGTGCCACCAGTATTATTGGGGGAAATCCAAAATTGGTAGTCAACTTACCCTATTAAACTAGAATTTAAGAGAAAAA
GATGAGAATAATGAATATGATAATTTCACATTCAAACAGAAACACCAAAAATATATTTCATATGATGGAAAAAAGTAAAACAGCAAAGCAATCAGAGAACTTTGAAGACC
AGAACGATATAAGTTCCACATCAGTTTCAGATGACAAAGTTTTCATCATTTTGCCTTGCTTTTCTTACAGCAAGTTAATCAAAACTCAATACAACAAGCTACACCAATGA
CTGAGTATACCCTCTCGTAGAGGATCAAATCACATCTGCATCGATTCAACCAAAATCACACTCTCCCTGTATTTCAAATTCAGGATAGATACAACAAAATCTAAGCCAAC
CATCAGGCAACAGCAATCAGAGAACTTTGAAGGCATTGAATGATATGAGTTAAGCACACCATAGGTTCACCTCAAAAAGCCTCATCATCTTGAATGGTCATTTCACATTT
TTTGTTAAAAAAAAAGATCGAGACTAAGAGTTGAGACAACGCCATGAAAATAGAACAGAGCATTTCGAAAATTCACATTCTGAATCTGAAGACGACCTTGTGATTGTTTC
TTTGGCCAAGCGGGGTTATGCTGGAGAAAATCTTCCCCAAAACCATAGATGCAAAAGCCAAAATGAATGTAGCACCGCAAGAATAATACCTTTTTCTCGATTTTCCTTGC
AACTAAACTCGAAGAAGCAAGGGGCTATTTGGTGCATCAAATCATGCATGAAAAGGAAACCTAAAACAGAAAAGCAAATGCTTATGCTTAATTGACGCGTCGCTTACCAG
AGAATCGATTTGCAGGAGACAATCATCGAGAGAAGCCCATAAACGAAGTTCCAGATTAGGTTGTGCTGATACTGAACCGACTTGGCGGCTTTTATAGAGGTTTGCTGGAT
TAGGGTTTTATCTGGATTCGCATTTTTTTATTGGGTTGAAGGATATTTTGGACCAGCCCAAATATTAGAGTCGGTCATTCCTTCAACACAATCAATCCTATTTATGAGAG
TAACCCAATCTAACCTATTGGGTCACTAACTAAAATCACATGAAATTCAAATATCAAACTTCAAAATTCATAAC
Protein sequenceShow/hide protein sequence
MASTTQEASLEMNEYSFITAINERNTGISSLIWGWGVSTASDCLPIYIRMQRPKPSPSPPANSLQNLGKTILSLTFQAILALFIGFPTSSPSLPTLLFRTAVLISFAVSF
AGVFLQNAFPRMALLLEKFGALFAAIGVCIISSLLVHQNLAWICWLACGSCLLAFIISFK