; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g29000 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g29000
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionIleal sodium/bile acid cotransporter, putative
Genome locationchr4:21506443..21513728
RNA-Seq ExpressionMoc04g29000
SyntenyMoc04g29000
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR036259 - MFS transporter superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050246.1 putative Ileal sodium/bile acid cotransporter [Cucumis melo var. makuwa]1.6e-4166.24Show/hide
Query:  MNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQ---SLGKTILGLTFQAVLALFISLPSSPPQLPTLLFGAAVLISFAVSFAG
        MN L SFITAINERN  I+S             LPICI MQRP PA SS+   ++GKTILGLTFQAVLALFI+ P+S P L T LF AAVLISFAVSFAG
Subjt:  MNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQ---SLGKTILGLTFQAVLALFISLPSSPPQLPTLLFGAAVLISFAVSFAG

Query:  LFLQTAYPRMALLFEKVSALFAAIGVCIIASLLVHQNFAWICWLACAFTFIVFLLSF
        +FLQ  +PR+ALLFEK+ AL AAIGVCI+ASLL+HQNFAWI WLA  F+ + F+LSF
Subjt:  LFLQTAYPRMALLFEKVSALFAAIGVCIIASLLVHQNFAWICWLACAFTFIVFLLSF

KGN49803.1 hypothetical protein Csa_004681 [Cucumis sativus]3.3e-3963.12Show/hide
Query:  SVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQ---SLGKTILGLTFQAVLALFISLPSSPPQLPTLLFGAAVLISFAVS
        S+DMN L S IT+INERN  I+              LPICI MQ   PA SS+   ++G TILGLTFQAVLALFI+  +S P L T LFGAAVLISFAVS
Subjt:  SVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQ---SLGKTILGLTFQAVLALFISLPSSPPQLPTLLFGAAVLISFAVS

Query:  FAGLFLQTAYPRMALLFEKVSALFAAIGVCIIASLLVHQNFAWICWLACAFTFIVFLLSF
        F G+FLQ  +PR+ALLFEK+ AL AAIGVCI+ASLL+HQNFAWI WLAC F+ + FLLSF
Subjt:  FAGLFLQTAYPRMALLFEKVSALFAAIGVCIIASLLVHQNFAWICWLACAFTFIVFLLSF

KGN49806.1 hypothetical protein Csa_004683 [Cucumis sativus]7.9e-3354.43Show/hide
Query:  SVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQSLGKTILGLTFQAVLALFISL-PSSPPQLPTLLFGAAVLISFAVSFA
        S+DMN L+S I A+  RN GISS TWGG     SDCLPI I+MQRP P  S Q  G T L LTFQA++ LF+SL PSS   LP+ LF A +L SF  S+ 
Subjt:  SVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQSLGKTILGLTFQAVLALFISL-PSSPPQLPTLLFGAAVLISFAVSFA

Query:  GLFLQTAYPRMALLFEKVSALFAAIGVCIIASLLVHQNFAWICWLACAFTFIVFLLSF
        G+ LQ  +P+ A L +   ALFAAIG CII SLL++ NF WICWLA       F++SF
Subjt:  GLFLQTAYPRMALLFEKVSALFAAIGVCIIASLLVHQNFAWICWLACAFTFIVFLLSF

KGN49808.1 hypothetical protein Csa_004650 [Cucumis sativus]6.7e-4063.8Show/hide
Query:  KTSVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPP--PAKSSQSLGKTILGLTFQAVLALFISL-PSSPPQLPTLLFGAAVLISFA
        +TS+DM+A SSFI  I++RN G+      G   TPS+CLP+ IRMQ+ P   A SSQ LGK IL L+FQAVLALFIS  P+SPP L    F AAV ISFA
Subjt:  KTSVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPP--PAKSSQSLGKTILGLTFQAVLALFISL-PSSPPQLPTLLFGAAVLISFA

Query:  VSFAGLFLQTAYPRMALLFEKVSALFAAIGVCIIAS-LLVHQNFAWICWLACAFTFIVFLLSF
        VSFA LFL  ++PR A LFEKV ALF+A GVC IAS LLVHQNFAWICW+AC F+ IVF LSF
Subjt:  VSFAGLFLQTAYPRMALLFEKVSALFAAIGVCIIAS-LLVHQNFAWICWLACAFTFIVFLLSF

XP_022146444.1 uncharacterized protein LOC111015658 [Momordica charantia]1.9e-5599.15Show/hide
Query:  KTSVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQSLGKTILGLTFQAVLALFISLPSSPPQLPTLLFGAAVLISFAVSF
        +TSVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQSLGKTILGLTFQAVLALFISLPSSPPQLPTLLFGAAVLISFAVSF
Subjt:  KTSVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQSLGKTILGLTFQAVLALFISLPSSPPQLPTLLFGAAVLISFAVSF

Query:  AGLFLQTAYPRMALLFEK
        AGLFLQTAYPRMALLFEK
Subjt:  AGLFLQTAYPRMALLFEK

TrEMBL top hitse value%identityAlignment
A0A0A0KJN8 Uncharacterized protein1.6e-3963.12Show/hide
Query:  SVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQ---SLGKTILGLTFQAVLALFISLPSSPPQLPTLLFGAAVLISFAVS
        S+DMN L S IT+INERN  I+              LPICI MQ   PA SS+   ++G TILGLTFQAVLALFI+  +S P L T LFGAAVLISFAVS
Subjt:  SVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQ---SLGKTILGLTFQAVLALFISLPSSPPQLPTLLFGAAVLISFAVS

Query:  FAGLFLQTAYPRMALLFEKVSALFAAIGVCIIASLLVHQNFAWICWLACAFTFIVFLLSF
        F G+FLQ  +PR+ALLFEK+ AL AAIGVCI+ASLL+HQNFAWI WLAC F+ + FLLSF
Subjt:  FAGLFLQTAYPRMALLFEKVSALFAAIGVCIIASLLVHQNFAWICWLACAFTFIVFLLSF

A0A0A0KJP2 Uncharacterized protein3.3e-4063.8Show/hide
Query:  KTSVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPP--PAKSSQSLGKTILGLTFQAVLALFISL-PSSPPQLPTLLFGAAVLISFA
        +TS+DM+A SSFI  I++RN G+      G   TPS+CLP+ IRMQ+ P   A SSQ LGK IL L+FQAVLALFIS  P+SPP L    F AAV ISFA
Subjt:  KTSVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPP--PAKSSQSLGKTILGLTFQAVLALFISL-PSSPPQLPTLLFGAAVLISFA

Query:  VSFAGLFLQTAYPRMALLFEKVSALFAAIGVCIIAS-LLVHQNFAWICWLACAFTFIVFLLSF
        VSFA LFL  ++PR A LFEKV ALF+A GVC IAS LLVHQNFAWICW+AC F+ IVF LSF
Subjt:  VSFAGLFLQTAYPRMALLFEKVSALFAAIGVCIIAS-LLVHQNFAWICWLACAFTFIVFLLSF

A0A0A0KQ03 Uncharacterized protein3.8e-3354.43Show/hide
Query:  SVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQSLGKTILGLTFQAVLALFISL-PSSPPQLPTLLFGAAVLISFAVSFA
        S+DMN L+S I A+  RN GISS TWGG     SDCLPI I+MQRP P  S Q  G T L LTFQA++ LF+SL PSS   LP+ LF A +L SF  S+ 
Subjt:  SVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQSLGKTILGLTFQAVLALFISL-PSSPPQLPTLLFGAAVLISFAVSFA

Query:  GLFLQTAYPRMALLFEKVSALFAAIGVCIIASLLVHQNFAWICWLACAFTFIVFLLSF
        G+ LQ  +P+ A L +   ALFAAIG CII SLL++ NF WICWLA       F++SF
Subjt:  GLFLQTAYPRMALLFEKVSALFAAIGVCIIASLLVHQNFAWICWLACAFTFIVFLLSF

A0A5A7U7U1 Putative Ileal sodium/bile acid cotransporter7.7e-4266.24Show/hide
Query:  MNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQ---SLGKTILGLTFQAVLALFISLPSSPPQLPTLLFGAAVLISFAVSFAG
        MN L SFITAINERN  I+S             LPICI MQRP PA SS+   ++GKTILGLTFQAVLALFI+ P+S P L T LF AAVLISFAVSFAG
Subjt:  MNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQ---SLGKTILGLTFQAVLALFISLPSSPPQLPTLLFGAAVLISFAVSFAG

Query:  LFLQTAYPRMALLFEKVSALFAAIGVCIIASLLVHQNFAWICWLACAFTFIVFLLSF
        +FLQ  +PR+ALLFEK+ AL AAIGVCI+ASLL+HQNFAWI WLA  F+ + F+LSF
Subjt:  LFLQTAYPRMALLFEKVSALFAAIGVCIIASLLVHQNFAWICWLACAFTFIVFLLSF

A0A6J1CY58 uncharacterized protein LOC1110156589.4e-5699.15Show/hide
Query:  KTSVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQSLGKTILGLTFQAVLALFISLPSSPPQLPTLLFGAAVLISFAVSF
        +TSVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQSLGKTILGLTFQAVLALFISLPSSPPQLPTLLFGAAVLISFAVSF
Subjt:  KTSVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQSLGKTILGLTFQAVLALFISLPSSPPQLPTLLFGAAVLISFAVSF

Query:  AGLFLQTAYPRMALLFEK
        AGLFLQTAYPRMALLFEK
Subjt:  AGLFLQTAYPRMALLFEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACTCCCCCTTAACAACACGCAAAAGTCAAAGGGAAAAACTTTGCTTCTATTCAATGGAGCTACAAACGTGAAGACCATGCTTGGCAAGAAGATTTGGCCAAGGGT
ATCCAGATCTCCATTTATTAAAGAGATTGAGGTGCAAGAATCCGATTCCACGGAGAGGCTAGAACAACCAAAAAGTGGCCAAAGTGTGCCTTTGAGGGACCAAAAGTGCA
CTCAAAGCAGCACAGAAAGCAAAGGCAATGGAGATGAGCAAGGCATCATAGGCAGTACCGATCCCCACTCCACTTTCTGCAAACACCACAAGAATTCTGCATGCCATTCA
GAGAGGAGAGGAAGGGGAATGAATAGTGAAGTTTTTGTTTTTGATCAGCTCATTATTGGTTTTTTTTCAACCATTTGGAAATTTGGTTTTATAGAGGGCAAAAGGGGGAA
GGGAGAAGGAAGTTTCTTGGAAATTAAGACCTCTGTGGATATGAATGCACTCTCTTCCTTTATCACTGCAATCAACGAAAGAAATCTTGGAATCAGCAGCTTTACCTGGG
GAGGGGGAACATCCACACCATCAGATTGCCTACCAATTTGCATCAGAATGCAGAGGCCACCACCAGCCAAGAGCTCTCAGAGCCTTGGGAAGACGATTCTTGGTCTTACC
TTTCAGGCAGTTCTAGCCTTGTTCATCAGCTTACCCAGTTCACCTCCGCAACTCCCTACACTTCTTTTTGGAGCTGCCGTGTTGATCAGCTTTGCAGTTTCGTTTGCTGG
ACTTTTCCTTCAAACTGCATACCCGAGAATGGCGCTTTTGTTTGAAAAGGTCAGTGCTCTTTTTGCTGCAATTGGTGTCTGCATCATAGCTAGTCTTCTTGTCCATCAGA
ACTTCGCTTGGATCTGTTGGCTGGCATGTGCCTTCACCTTCATAGTCTTTCTTCTATCATTCAACCGGTCCCAAGCCCGGACAAAGGAGGAGGGTAGTGTCATCAGATCA
TTGGATCTTGATCTTGTTCTAAAAGACAGGCTGAAGAAAGTTAAACTGGAAGAGGAGGACTCGGGTGGATCTGTCGAGATGGAAGAGGGAAAGTGGATATTAGAAATTGT
GAAGGAGGAAGGAGACTTGGATCAATCCAACATGTTAAAAAGGAACTCCAATTCGGTGGCTTGTGGGATTGTTCAGTTTGCGATGGAAGAGGGAGAAGCAAGGGAATGGT
CTTCTGATCTCCCGCTTTGGTTATCGAGTCTTGTTGACTCTGATTTTTCATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAACTCCCCCTTAACAACACGCAAAAGTCAAAGGGAAAAACTTTGCTTCTATTCAATGGAGCTACAAACGTGAAGACCATGCTTGGCAAGAAGATTTGGCCAAGGGT
ATCCAGATCTCCATTTATTAAAGAGATTGAGGTGCAAGAATCCGATTCCACGGAGAGGCTAGAACAACCAAAAAGTGGCCAAAGTGTGCCTTTGAGGGACCAAAAGTGCA
CTCAAAGCAGCACAGAAAGCAAAGGCAATGGAGATGAGCAAGGCATCATAGGCAGTACCGATCCCCACTCCACTTTCTGCAAACACCACAAGAATTCTGCATGCCATTCA
GAGAGGAGAGGAAGGGGAATGAATAGTGAAGTTTTTGTTTTTGATCAGCTCATTATTGGTTTTTTTTCAACCATTTGGAAATTTGGTTTTATAGAGGGCAAAAGGGGGAA
GGGAGAAGGAAGTTTCTTGGAAATTAAGACCTCTGTGGATATGAATGCACTCTCTTCCTTTATCACTGCAATCAACGAAAGAAATCTTGGAATCAGCAGCTTTACCTGGG
GAGGGGGAACATCCACACCATCAGATTGCCTACCAATTTGCATCAGAATGCAGAGGCCACCACCAGCCAAGAGCTCTCAGAGCCTTGGGAAGACGATTCTTGGTCTTACC
TTTCAGGCAGTTCTAGCCTTGTTCATCAGCTTACCCAGTTCACCTCCGCAACTCCCTACACTTCTTTTTGGAGCTGCCGTGTTGATCAGCTTTGCAGTTTCGTTTGCTGG
ACTTTTCCTTCAAACTGCATACCCGAGAATGGCGCTTTTGTTTGAAAAGGTCAGTGCTCTTTTTGCTGCAATTGGTGTCTGCATCATAGCTAGTCTTCTTGTCCATCAGA
ACTTCGCTTGGATCTGTTGGCTGGCATGTGCCTTCACCTTCATAGTCTTTCTTCTATCATTCAACCGGTCCCAAGCCCGGACAAAGGAGGAGGGTAGTGTCATCAGATCA
TTGGATCTTGATCTTGTTCTAAAAGACAGGCTGAAGAAAGTTAAACTGGAAGAGGAGGACTCGGGTGGATCTGTCGAGATGGAAGAGGGAAAGTGGATATTAGAAATTGT
GAAGGAGGAAGGAGACTTGGATCAATCCAACATGTTAAAAAGGAACTCCAATTCGGTGGCTTGTGGGATTGTTCAGTTTGCGATGGAAGAGGGAGAAGCAAGGGAATGGT
CTTCTGATCTCCCGCTTTGGTTATCGAGTCTTGTTGACTCTGATTTTTCATAG
Protein sequenceShow/hide protein sequence
MKLPLNNTQKSKGKTLLLFNGATNVKTMLGKKIWPRVSRSPFIKEIEVQESDSTERLEQPKSGQSVPLRDQKCTQSSTESKGNGDEQGIIGSTDPHSTFCKHHKNSACHS
ERRGRGMNSEVFVFDQLIIGFFSTIWKFGFIEGKRGKGEGSFLEIKTSVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSSQSLGKTILGLT
FQAVLALFISLPSSPPQLPTLLFGAAVLISFAVSFAGLFLQTAYPRMALLFEKVSALFAAIGVCIIASLLVHQNFAWICWLACAFTFIVFLLSFNRSQARTKEEGSVIRS
LDLDLVLKDRLKKVKLEEEDSGGSVEMEEGKWILEIVKEEGDLDQSNMLKRNSNSVACGIVQFAMEEGEAREWSSDLPLWLSSLVDSDFS