; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000643 (gene) of Snake gourd v1 genome

Gene IDTan0000643
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionIleal sodium/bile acid cotransporter, putative
Genome locationLG11:15854485..15857282
RNA-Seq ExpressionTan0000643
SyntenyTan0000643
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050246.1 putative Ileal sodium/bile acid cotransporter [Cucumis melo var. makuwa]8.5e-4772.44Show/hide
Query:  MNELYSFITTMNERNPGISSITWGTSTASDGLPICITMQRPPPPANS-PQNSLGKIILGLTFQAVLALFISTPSSCPPLLTHLFAAAVLISFALSCAGLF
        MNELYSFIT +NERNP I+S           LPICITMQRP P  +S  +N++GK ILGLTFQAVLALFI++P+S PPLLTHLFAAAVLISFA+S AG+F
Subjt:  MNELYSFITTMNERNPGISSITWGTSTASDGLPICITMQRPPPPANS-PQNSLGKIILGLTFQAVLALFISTPSSCPPLLTHLFAAAVLISFALSCAGLF

Query:  LQSALPRIALLFGKIGALIAAIGVFIIASLLIHHNFAWISWLACAFSLIAFVLSFK
        LQ+  PRIALLF KIGALIAAIGV I+ASLLIH NFAWISWLA  FSL+AFVLSF+
Subjt:  LQSALPRIALLFGKIGALIAAIGVFIIASLLIHHNFAWISWLACAFSLIAFVLSFK

KAG6579137.1 hypothetical protein SDJN03_23585, partial [Cucurbita argyrosperma subsp. sororia]3.0e-3657.99Show/hide
Query:  MESNSQQISLDMNELYSFITTMNERNPGISSITWGTSTASDGLPICITMQRPPPPANSPQ--NSLGKIILGLTFQAVLALFISTPSSCPPLLTHLFAAAV
        M S  Q +S+DMN+                        ASDGLP    M RPPPP +S Q  N+LGKI+ GLTFQAVLALFIS P+SCPPLL H+FAAA+
Subjt:  MESNSQQISLDMNELYSFITTMNERNPGISSITWGTSTASDGLPICITMQRPPPPANSPQ--NSLGKIILGLTFQAVLALFISTPSSCPPLLTHLFAAAV

Query:  LISFALSCAGLFLQSALPRIALLFGKIGALIAAIGVFIIASLLI-HHNFAWISWLACAFSLIAFVLSFK
        LISFALS A LFLQ A PRIAL  GKIGAL+AAIG   I S+L+ H +F+WI WLAC F+L+AF+LSFK
Subjt:  LISFALSCAGLFLQSALPRIALLFGKIGALIAAIGVFIIASLLI-HHNFAWISWLACAFSLIAFVLSFK

KGN49803.1 hypothetical protein Csa_004681 [Cucumis sativus]2.5e-4667.66Show/hide
Query:  MESNSQQISLDMNELYSFITTMNERNPGISSITWGTSTASDGLPICITMQRPPPPANS-PQNSLGKIILGLTFQAVLALFISTPSSCPPLLTHLFAAAVL
        MES+SQQIS+DMN LYS IT++NERNP I           +GLPICI MQ   P  +S  +N++G  ILGLTFQAVLALFI++ +S PPLLTHLF AAVL
Subjt:  MESNSQQISLDMNELYSFITTMNERNPGISSITWGTSTASDGLPICITMQRPPPPANS-PQNSLGKIILGLTFQAVLALFISTPSSCPPLLTHLFAAAVL

Query:  ISFALSCAGLFLQSALPRIALLFGKIGALIAAIGVFIIASLLIHHNFAWISWLACAFSLIAFVLSFK
        ISFA+S  G+FLQ   PRIALLF KIGALIAAIGV I+ASLLIH NFAWISWLAC FSL+AF+LSF+
Subjt:  ISFALSCAGLFLQSALPRIALLFGKIGALIAAIGVFIIASLLIHHNFAWISWLACAFSLIAFVLSFK

KGN49808.1 hypothetical protein Csa_004650 [Cucumis sativus]1.8e-3356.55Show/hide
Query:  MESNSQQISLDMNELYSFITTMNERNPGISSITWGTSTASDGLPICITMQRPPPPANSPQNSLGKIILGLTFQAVLALFIST-PSSCPPLLTHLFAAAVL
        M S S Q SLDM+   SFI T+++RN G+        T S+ LP+ I MQ+ P         LGKIIL L+FQAVLALFIS+ P+S PPLL H FAAAV 
Subjt:  MESNSQQISLDMNELYSFITTMNERNPGISSITWGTSTASDGLPICITMQRPPPPANSPQNSLGKIILGLTFQAVLALFIST-PSSCPPLLTHLFAAAVL

Query:  ISFALSCAGLFLQSALPRIALLFGKIGALIAAIGVFIIAS-LLIHHNFAWISWLACAFSLIAFVLSFK
        ISFA+S A LFL ++ PR A LF K+GAL +A GV  IAS LL+H NFAWI W+AC FS+I F LSFK
Subjt:  ISFALSCAGLFLQSALPRIALLFGKIGALIAAIGVFIIAS-LLIHHNFAWISWLACAFSLIAFVLSFK

XP_022146444.1 uncharacterized protein LOC111015658 [Momordica charantia]1.6e-3272.22Show/hide
Query:  MESNSQQISLDMNELYSFITTMNERNPGISSITW--GTSTASDGLPICITMQRPPPPANSPQNSLGKIILGLTFQAVLALFISTPSSCPPLLTHLFAAAV
        M S  QQ S+DMN L SFIT +NERN GISS TW  GTST SD LPICI MQRPPP  +S   SLGK ILGLTFQAVLALFIS PSS P L T LF AAV
Subjt:  MESNSQQISLDMNELYSFITTMNERNPGISSITW--GTSTASDGLPICITMQRPPPPANSPQNSLGKIILGLTFQAVLALFISTPSSCPPLLTHLFAAAV

Query:  LISFALSCAGLFLQSALPRIALLFGK
        LISFA+S AGLFLQ+A PR+ALLF K
Subjt:  LISFALSCAGLFLQSALPRIALLFGK

TrEMBL top hitse value%identityAlignment
A0A0A0KJN8 Uncharacterized protein1.2e-4667.66Show/hide
Query:  MESNSQQISLDMNELYSFITTMNERNPGISSITWGTSTASDGLPICITMQRPPPPANS-PQNSLGKIILGLTFQAVLALFISTPSSCPPLLTHLFAAAVL
        MES+SQQIS+DMN LYS IT++NERNP I           +GLPICI MQ   P  +S  +N++G  ILGLTFQAVLALFI++ +S PPLLTHLF AAVL
Subjt:  MESNSQQISLDMNELYSFITTMNERNPGISSITWGTSTASDGLPICITMQRPPPPANS-PQNSLGKIILGLTFQAVLALFISTPSSCPPLLTHLFAAAVL

Query:  ISFALSCAGLFLQSALPRIALLFGKIGALIAAIGVFIIASLLIHHNFAWISWLACAFSLIAFVLSFK
        ISFA+S  G+FLQ   PRIALLF KIGALIAAIGV I+ASLLIH NFAWISWLAC FSL+AF+LSF+
Subjt:  ISFALSCAGLFLQSALPRIALLFGKIGALIAAIGVFIIASLLIHHNFAWISWLACAFSLIAFVLSFK

A0A0A0KJP2 Uncharacterized protein8.9e-3456.55Show/hide
Query:  MESNSQQISLDMNELYSFITTMNERNPGISSITWGTSTASDGLPICITMQRPPPPANSPQNSLGKIILGLTFQAVLALFIST-PSSCPPLLTHLFAAAVL
        M S S Q SLDM+   SFI T+++RN G+        T S+ LP+ I MQ+ P         LGKIIL L+FQAVLALFIS+ P+S PPLL H FAAAV 
Subjt:  MESNSQQISLDMNELYSFITTMNERNPGISSITWGTSTASDGLPICITMQRPPPPANSPQNSLGKIILGLTFQAVLALFIST-PSSCPPLLTHLFAAAVL

Query:  ISFALSCAGLFLQSALPRIALLFGKIGALIAAIGVFIIAS-LLIHHNFAWISWLACAFSLIAFVLSFK
        ISFA+S A LFL ++ PR A LF K+GAL +A GV  IAS LL+H NFAWI W+AC FS+I F LSFK
Subjt:  ISFALSCAGLFLQSALPRIALLFGKIGALIAAIGVFIIAS-LLIHHNFAWISWLACAFSLIAFVLSFK

A0A5A7U7U1 Putative Ileal sodium/bile acid cotransporter4.1e-4772.44Show/hide
Query:  MNELYSFITTMNERNPGISSITWGTSTASDGLPICITMQRPPPPANS-PQNSLGKIILGLTFQAVLALFISTPSSCPPLLTHLFAAAVLISFALSCAGLF
        MNELYSFIT +NERNP I+S           LPICITMQRP P  +S  +N++GK ILGLTFQAVLALFI++P+S PPLLTHLFAAAVLISFA+S AG+F
Subjt:  MNELYSFITTMNERNPGISSITWGTSTASDGLPICITMQRPPPPANS-PQNSLGKIILGLTFQAVLALFISTPSSCPPLLTHLFAAAVLISFALSCAGLF

Query:  LQSALPRIALLFGKIGALIAAIGVFIIASLLIHHNFAWISWLACAFSLIAFVLSFK
        LQ+  PRIALLF KIGALIAAIGV I+ASLLIH NFAWISWLA  FSL+AFVLSF+
Subjt:  LQSALPRIALLFGKIGALIAAIGVFIIASLLIHHNFAWISWLACAFSLIAFVLSFK

A0A5D3BEH8 Putative Ileal sodium/bile acid cotransporter1.9e-3153.94Show/hide
Query:  ESNSQQISLDMNELYSFITTMNERNPGISSITWGTSTASDGLPICITMQRPPPPANSPQNSLGKIILGLTFQAVLALFISTPSSCPPLLTHLFAAAVLIS
        E+   +ISLDMNELYSFIT +  RNPGISS TWGT+ ASD LPI I MQR P P NSPQ   G   +   FQA+L LF++ P S PP L+ L    +L +
Subjt:  ESNSQQISLDMNELYSFITTMNERNPGISSITWGTSTASDGLPICITMQRPPPPANSPQNSLGKIILGLTFQAVLALFISTPSSCPPLLTHLFAAAVLIS

Query:  FALSCAGLFLQSALPRIALLFGKIGALIAAIGVFIIASLLIHHNFAWISWLACAFSLIAFVLSFK
        F  S  G+ LQ   P+IA L    GAL+AAIGV I+ S L+H N  WI WLAC   L AF+ SFK
Subjt:  FALSCAGLFLQSALPRIALLFGKIGALIAAIGVFIIASLLIHHNFAWISWLACAFSLIAFVLSFK

A0A6J1CY58 uncharacterized protein LOC1110156587.5e-3372.22Show/hide
Query:  MESNSQQISLDMNELYSFITTMNERNPGISSITW--GTSTASDGLPICITMQRPPPPANSPQNSLGKIILGLTFQAVLALFISTPSSCPPLLTHLFAAAV
        M S  QQ S+DMN L SFIT +NERN GISS TW  GTST SD LPICI MQRPPP  +S   SLGK ILGLTFQAVLALFIS PSS P L T LF AAV
Subjt:  MESNSQQISLDMNELYSFITTMNERNPGISSITW--GTSTASDGLPICITMQRPPPPANSPQNSLGKIILGLTFQAVLALFISTPSSCPPLLTHLFAAAV

Query:  LISFALSCAGLFLQSALPRIALLFGK
        LISFA+S AGLFLQ+A PR+ALLF K
Subjt:  LISFALSCAGLFLQSALPRIALLFGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCAAATTCACAGCAAATTTCCCTAGATATGAATGAACTATATTCCTTCATCACTACAATGAATGAAAGAAATCCTGGAATCAGCAGTATTACCTGGGGAACATC
CACAGCATCAGATGGCCTTCCAATTTGCATCACAATGCAGAGGCCTCCTCCACCAGCCAACAGCCCTCAGAATAGCCTTGGGAAGATAATTCTTGGCCTTACTTTTCAGG
CTGTTTTGGCCTTGTTCATCAGCACACCCAGTTCATGTCCTCCACTTTTGACACATCTTTTTGCTGCTGCTGTGTTGATTAGCTTTGCACTTTCATGTGCTGGACTTTTC
CTTCAAAGTGCACTCCCGAGAATCGCACTTTTGTTCGGAAAGATCGGTGCTCTTATCGCAGCAATTGGCGTGTTCATCATAGCAAGTCTTCTAATCCATCACAACTTCGC
TTGGATCTCTTGGCTGGCATGTGCCTTTTCCTTGATAGCCTTTGTTCTATCATTCAAGTGA
mRNA sequenceShow/hide mRNA sequence
GTGATCAAAACATCATCCCTTCACAATTTCTACGAAATTTCCTCTCTATAACTTGTACTAGAATCACTGATAAATTTCTCTCTCTAAAATCTAGAACATTCCATCCCTCC
ACATAAAAAGGAACTGCAAGGCTTTAATCATCAAGTCAAGACAGAGAAGTTAGAACCCACTAGAGATATTGAGATTTTTTGCCCCCAATGGAATCAAATTCACAGCAAAT
TTCCCTAGATATGAATGAACTATATTCCTTCATCACTACAATGAATGAAAGAAATCCTGGAATCAGCAGTATTACCTGGGGAACATCCACAGCATCAGATGGCCTTCCAA
TTTGCATCACAATGCAGAGGCCTCCTCCACCAGCCAACAGCCCTCAGAATAGCCTTGGGAAGATAATTCTTGGCCTTACTTTTCAGGCTGTTTTGGCCTTGTTCATCAGC
ACACCCAGTTCATGTCCTCCACTTTTGACACATCTTTTTGCTGCTGCTGTGTTGATTAGCTTTGCACTTTCATGTGCTGGACTTTTCCTTCAAAGTGCACTCCCGAGAAT
CGCACTTTTGTTCGGAAAGATCGGTGCTCTTATCGCAGCAATTGGCGTGTTCATCATAGCAAGTCTTCTAATCCATCACAACTTCGCTTGGATCTCTTGGCTGGCATGTG
CCTTTTCCTTGATAGCCTTTGTTCTATCATTCAAGTGATTTTGACAGAAATGGGATTTTTCTGTAGCATCCATAACCATCTGCTTGACCTGTTGGAGTGGTTCATCACTA
ATACAAATAAGCTCCAATCTACTTGAAAACTATGTTATTCTACAACTTACAATGGCATGATAACTCTGCCTGTGTCATGCATCAAGCATCATTTCTTCTACGCAGCCGGT
CCCAAGCCCGGACAAAGGAGGAGGGTAGTGTCATCAGATCATTGGTTCTTGAGCTTCTTCTACAAGACAGACTCTGGAAGTTGTGTAGTTAAGCAATATTATCATCTAAG
AAATAAAGATGGCAGCATCCAGAAAGATGGGCTCTCTCTCTGGTGCTTACAGATCCAGAGGAACCAGATATTGCTGAAGCTTAGTCATCCTAAGATATACCTAAAATCTT
GTCCAACAAAAAAAAAGATACATATATTTGAGTGTGTAATTAAAAGTGGTATTGGATGCTATGTTGTTATATTGATAGCAAATATATATATATAAACCTATGTGGTTGTC
TATATTCCATGGGATCTAAAATGAAAACAAAAAATAAAACAATTGAGAATCAATCTTTACAAGAACATCAGCTAGAAAGAAATTCTGCAAAGGTGCAAAGAAAAGTGATT
GCTTCAAGCGTGTAAACTCCAATTGAGAAACCAGATTAGCAAATAAAAAGAGAATTTCATTTTGTTCTCCCCAGTTCTAACAGTTCTCACTCTAAACCATTTCATACAAA
TCACTGAGGTAATCATGACAAGAAAAGGAGAGATTTTGATGTTGCACAATGCTCGTATGGCCTGAGATACAAGTTCCTTGCAAACCAAGAATTTTAGTGCTACCAATATT
TTTTATGTCCCCAAAATTATTAGGAATATCCAAATTGATAGTCAAATCAACCAATTAAACATTGTTTAAAATGAAAGAAACCAGTAGTTTAAACACTAAGAAGAATGAAT
ATGAGAATCTCACCTTCAAACAGAAATCCCCAACATTTGCTATTATAAAAAACAGCAAAATAGTACAGCAATCAGAGAACTCTGAAGACCAGAATGATACAGAATCCACA
TCAATTTCAGATAACGAAGTTTTCATCATTTCGATTTGCTTTTCTTCCAGCATGTTAATCAATTCAAAACAATCCAATCATCGAGTTTACACTCTTGTCAGGCCTAAACA
AGCACGAACTATAAAATGGGAAAAAAGAAACATTATTCATCCATGTAAACATCTGCCATCATTTTAAAAAACAGAAACAAAATCAGAATTTAATTGGAAGTTTAAACGCA
AACAAACAGTCTTATCATAATGGTATTGGCACACAATCTCAAAAGTAAAAACTAAATTAATAATGTATCATTACACGAGCTGTACTGTGAACAAGATTGTTTTACATATC
AGCATCGATTCAAAGATTAGTCTAAATCCCCTTTTAAATCAGGAGAGATTGCAACAACACTTAATATCAAACATTAGGAAACAGCAATCAGGGAACTTTGAAGGCATTTC
ATCATATGGGTCAGGCACATCATAGGTTCTTAACGGGTAGAGAAATCGTACCAGATCAAGAAACAAAGGAAGCTAAAGATTTTGCATTCTAGTATTGAATATTACTCAAT
GATAAATGCAGTAATCACTACTACTCTTTATAAACAACAAAAACTAGTTCCTGGTGCAGAAATTCAAGGAACCAGTTTCTCCTAATTTGTAGCAAGAGATCGTTGGTGTC
CGAGTTTTGTGCAAGTGGCTGTTGACGGCTTCTGTGTTGTGGAATATTGAAACGAACCATTGAGCTGGAGAAAATCTTCCCCAAAACCAAAGATGAAAAGGCCAAAATGA
AACCTAGTAGGGAAATTTCAGTACACCATTTTCTCGATTTTTACTCGCAACCAAACTCAAAAGAAGTAAGGGGCTATTTGGTGAATCAGATCATGGAGGAAAAAGGAAAT
GAATGCTTATGCTTGAGTTGAGTAATGGAATTGTAGGAGACAATCGTTGAGAAGAGCCTGCTTCAGGTTTTGATGATTATGAACGAACAGAATTTATTCTTTTCTTTTCT
TTCTTTCTTTCTTTCTTTTTTTTTGGGGGATAAATTTTTGATAAATTG
Protein sequenceShow/hide protein sequence
MESNSQQISLDMNELYSFITTMNERNPGISSITWGTSTASDGLPICITMQRPPPPANSPQNSLGKIILGLTFQAVLALFISTPSSCPPLLTHLFAAAVLISFALSCAGLF
LQSALPRIALLFGKIGALIAAIGVFIIASLLIHHNFAWISWLACAFSLIAFVLSFK