CuGenDBv2

MC09g1703 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC09g1703
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages;
Genome location: MC09:22371905..22374116
RNA-Seq Expression: MC09g1703
Synteny: MC09g1703
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009577 - Putative small multi-drug export


Homology
GenBank top hits (e value, % identity, alignment)
KAG7013172.1 hypothetical protein SDJN02_25928 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.22e-179 | % identity: 84.35
Query:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS
        M TS+  T P+MSAFS RKT I L LNRPS+SQ KQSL  SS CIN+RHFN F+P+F+TSR+  TVTR  SN F+E DDI+PSFEEKPVK+LLLVLFWAS
Subjt:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS

Query:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR
        LSL+WFAASGDAKAA DSIRASNFGLKIA+ L+SSGWPAEA+VFALATLPV+ELRGAIPVGYWMQLKPVALTVLSVLGNMVPVP IILYLKKFATFLAGR
Subjt:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR

Query:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM
        NA+AS+FLDMLFKR K KAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSG+SANFFGVVLAGLLVNLLVNLGLKEA+ TGV LFI+STFM
Subjt:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM

Query:  WSILRLISKAFRK
        WSILRLI KAF K
Subjt:  WSILRLISKAFRK

XP_004135199.1 uncharacterized protein LOC101204187 [Cucumis sativus] | e value: 2.03e-180 | % identity: 84.66
Query:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS
        M TS+  T P+ SAFSPRKT   LKLNRPS+++S QSLH SSP +NV HFN F P+  TSRI RTV R+ SNGF+E+D+I+PSFEEKPVK+LLLVLFWAS
Subjt:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS

Query:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR
        LSL+WFAASGDAKAA DSIRASNFGLKIA+ L++SGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVP IILYLKKFATFLAGR
Subjt:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR

Query:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM
        NASAS+FLDMLFKR KEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVV+AGLLVNLLVNLGLKEAIVTGV LFI+STFM
Subjt:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM

Query:  WSILRLISKAFRK
        WSILR+I K+F K
Subjt:  WSILRLISKAFRK

XP_022151043.1 uncharacterized protein LOC111019066 [Momordica charantia] | e value: 8.44e-216 | % identity: 100
Query:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS
        MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS
Subjt:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS

Query:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR
        LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR
Subjt:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR

Query:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM
        NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM
Subjt:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM

Query:  WSILRLISKAFRK
        WSILRLISKAFRK
Subjt:  WSILRLISKAFRK

XP_023542277.1 uncharacterized protein LOC111802220 [Cucurbita pepo subsp. pepo] | e value: 7.02e-179 | % identity: 84.03
Query:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS
        M TS+  T P+MSAFS RKT I L LNRPS+SQ KQ L  SS CIN+RHFN F+P+F+TSR+  TVTR  SN F+E DDI+PSFEEKPVK+LLLVLFWAS
Subjt:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS

Query:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR
        LSL+WFAASGDAKAA DSIRASNFGLKIA+ L+SSGWPAEA+VFALATLPV+ELRGAIPVGYWMQLKPVALTVLSVLGNMVPVP IILYLKKFATFLAGR
Subjt:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR

Query:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM
        NA+AS+FLDMLFKR K KAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSG+SANFFGVVLAGLLVNLLVNLGLKEA+ TGV LFI+STFM
Subjt:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM

Query:  WSILRLISKAFRK
        WSILRLI KAF K
Subjt:  WSILRLISKAFRK

XP_038893409.1 uncharacterized protein LOC120082205 [Benincasa hispida] | e value: 4.17e-184 | % identity: 85.3
Query:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS
        M TS+  T P++S FSPRKT   LKLNRPS++QSKQSLH SS C+NVRHFN+F  +F+TSRIFRTVTR+ SNGF+E+D+I+PSFEEKPVK++LLVLFWAS
Subjt:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS

Query:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR
        LSL+WFAASGDAKAA DSIRASNFGLKIA+ L+SSGW  EAVVFALATLPVIELRGAIPVGYW+ LKPVALTVLSVLGNMVPVP IILYL+KFATFLAGR
Subjt:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR

Query:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM
        NASAS+FLDMLFKR KEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVV+AGLLVNLLVNLGLKEAIVTGV LFI+STFM
Subjt:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM

Query:  WSILRLISKAFRK
        WSILRLI KAFRK
Subjt:  WSILRLISKAFRK
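
The % identity values in these tables are derived from the alignment midline (the unlabelled row between Query and Subjt), where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. The following is a minimal Python sketch of that counting, assuming this standard BLAST midline convention; the function names `alignment_stats` and `percent_identity` are illustrative and not part of CuGenDBv2. It works on one alignment block at a time, so the figure for a single block will generally differ from the whole-alignment value reported in the table.

```python
def alignment_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, length) for one alignment block.

    Assumes the BLAST midline convention: a letter is an identity,
    '+' is a conservative substitution, a space is a mismatch.
    """
    # Trailing spaces of the midline are often lost in copied text,
    # so pad it back to the length of the query row.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)


def percent_identity(query: str, midline: str) -> float:
    """Percentage of aligned columns that are identical."""
    identities, _, length = alignment_stats(query, midline)
    return 100.0 * identities / length


if __name__ == "__main__":
    # Last block of the first GenBank hit (KAG7013172.1) above.
    print(alignment_stats("WSILRLISKAFRK", "WSILRLI KAF K"))
    print(round(percent_identity("WSILRLISKAFRK", "WSILRLI KAF K"), 2))
```

To score a whole hit, sum the identities and lengths over all of its blocks before dividing.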

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KQH3 Uncharacterized protein | e value: 9.83e-181 | % identity: 84.66
Query:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS
        M TS+  T P+ SAFSPRKT   LKLNRPS+++S QSLH SSP +NV HFN F P+  TSRI RTV R+ SNGF+E+D+I+PSFEEKPVK+LLLVLFWAS
Subjt:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS

Query:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR
        LSL+WFAASGDAKAA DSIRASNFGLKIA+ L++SGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVP IILYLKKFATFLAGR
Subjt:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR

Query:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM
        NASAS+FLDMLFKR KEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVV+AGLLVNLLVNLGLKEAIVTGV LFI+STFM
Subjt:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM

Query:  WSILRLISKAFRK
        WSILR+I K+F K
Subjt:  WSILRLISKAFRK

A0A1S3BER4 uncharacterized protein LOC103489082 | e value: 3.13e-177 | % identity: 83.07
Query:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS
        M TS+  T P+ SAFSPRKT   LKLNRPS++QS  SLH SSP +NV H N   P+ +TSRI RTV R+ SNGF+E+D+I+PSFEEKP+K+L+LVLFWAS
Subjt:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS

Query:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR
        LSL+WFAASGDAKAA DSIRASNFGLKIA+ L+SSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPV LTVLSVLGNMVPVP IILYLKKFATFLAGR
Subjt:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR

Query:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM
        NASAS+FLDMLFKR KEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVV+AGLLVNLLVNLGLKEAIVTG  LFI+STFM
Subjt:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM

Query:  WSILRLISKAFRK
        WSILR+I K+F K
Subjt:  WSILRLISKAFRK

A0A5A7SV67 Sm_multidrug_ex domain-containing protein | e value: 3.13e-177 | % identity: 83.07
Query:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS
        M TS+  T P+ SAFSPRKT   LKLNRPS++QS  SLH SSP +NV H N   P+ +TSRI RTV R+ SNGF+E+D+I+PSFEEKP+K+L+LVLFWAS
Subjt:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS

Query:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR
        LSL+WFAASGDAKAA DSIRASNFGLKIA+ L+SSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPV LTVLSVLGNMVPVP IILYLKKFATFLAGR
Subjt:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR

Query:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM
        NASAS+FLDMLFKR KEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVV+AGLLVNLLVNLGLKEAIVTG  LFI+STFM
Subjt:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM

Query:  WSILRLISKAFRK
        WSILR+I K+F K
Subjt:  WSILRLISKAFRK

A0A6J1DB35 uncharacterized protein LOC111019066 | e value: 4.09e-216 | % identity: 100
Query:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS
        MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS
Subjt:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS

Query:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR
        LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR
Subjt:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR

Query:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM
        NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM
Subjt:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM

Query:  WSILRLISKAFRK
        WSILRLISKAFRK
Subjt:  WSILRLISKAFRK

A0A6J1FZI4 uncharacterized protein LOC111449349 | e value: 1.38e-178 | % identity: 83.71
Query:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS
        M TS+  T P+MSAFS RKT I L LNRPS+SQ  QSL  SS CIN+RHFN F+P+F+TSR+  TVTR  SN F+E DDI+PSFEEKPVK+LLLVLFWAS
Subjt:  MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWAS

Query:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR
        LSL+WFAASGDAKAA DSIRASNFGLKIA+ L+SSGWPAEA+VFALATLPV+ELRGAIPVGYWMQLKPVALTVLSVLGNMVPVP IILYLKK ATFLAGR
Subjt:  LSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGR

Query:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM
        NA+AS+FLDMLFKR K KAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSG+SANFFGVVLAGLLVNLLVNLGLKEA+ TGV LFI+STFM
Subjt:  NASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM

Query:  WSILRLISKAFRK
        WSILRLI KAF K
Subjt:  WSILRLISKAFRK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G02590.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Putative small multi-drug export (InterPro:IPR009577); Has 405 Blast hits to 405 proteins in 185 species: Archae - 65; Bacteria - 295; Metazoa - 0; Fungi - 0; Plants - 23; Viruses - 0; Other Eukaryotes - 22 (source: NCBI BLink). | e value: 3.4e-95 | % identity: 76.29
Query:  MPSFEEKPVKILLLVLFWASLSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM
        +PS    PVK  + V+ WAS SL WFA SGDAKAA DSI++S+FGL+IA+TLR  GWP EAVVFALATLPVIELRGAIPVGYWMQLKPV LT  SVLGNM
Subjt:  MPSFEEKPVKILLLVLFWASLSLSWFAASGDAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM

Query:  VPVPLIILYLKKFATFLAGRNASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVN
        VPVP I+LYLK FA+F+AG++ +AS+ LD+LFKR KEKA PVEEF+WLGLMLFVAVPFPGTGAWTGAIIASILDMPFWS VS+NF GVVLAGLLVNLLVN
Subjt:  VPVPLIILYLKKFATFLAGRNASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVN

Query:  LGLKEAIVTGVFLFIVSTFMWSILRLISKAFR
        LGLK+AIV G+ LF VSTFMWS+LR I K+ +
Subjt:  LGLKEAIVTGVFLFIVSTFMWSILRLISKAFR


Sequences
CDS sequence
ATGGGTACTTCTGTAGCATCAACTCCACCAATAATGTCGGCATTTTCGCCGAGAAAGACCCATATCTTCCTCAAGCTCAATCGACCCTCCGTTAGTCAGAGTAAACAATC
TCTTCATAGCTCTAGTCCATGTATAAACGTTCGCCATTTCAACCATTTCAGTCCTATTTTCGCGACTTCTCGGATCTTTCGTACTGTCACTCGAGCTTTTTCAAATGGGT
TTGTTGAAGAAGACGATATTATGCCTTCTTTCGAGGAGAAGCCGGTCAAGATTCTGCTGTTGGTTCTGTTTTGGGCTTCTCTATCCCTTTCTTGGTTTGCTGCTTCTGGG
GATGCCAAAGCTGCCGGTGATTCTATCAGAGCTTCGAATTTTGGCCTAAAGATCGCCACCACATTGCGGAGCTCGGGCTGGCCTGCCGAGGCTGTAGTATTTGCCCTCGC
TACACTTCCTGTAATTGAGCTCCGTGGGGCGATCCCTGTTGGTTACTGGATGCAGCTTAAGCCTGTTGCTCTAACGGTTCTCTCCGTTCTTGGGAACATGGTCCCGGTGC
CCCTTATCATCCTCTATTTGAAGAAGTTTGCAACTTTCCTTGCGGGAAGGAATGCTTCTGCCTCTCGATTCCTTGACATGTTATTCAAGAGGGGGAAAGAGAAAGCTGCA
CCAGTTGAAGAGTTCCAATGGCTTGGTCTAATGCTGTTTGTGGCTGTGCCTTTCCCTGGAACAGGAGCTTGGACCGGCGCCATCATAGCTTCCATCCTAGATATGCCATT
CTGGTCAGGTGTCTCTGCAAATTTCTTTGGTGTTGTATTGGCAGGTCTTCTGGTGAACTTGTTGGTGAATCTTGGTCTTAAGGAGGCCATTGTCACTGGAGTGTTTCTTT
TCATTGTATCGACATTCATGTGGAGCATTCTCCGACTGATTAGTAAAGCTTTCAGAAAATGA
mRNA sequence
CAACAAGATTGAACCATAAATTGAATAACGTATATCTGATCGAACTCCATATTCTTTAATTATATTTTATTTGCTCATCTTCTTGTTCCTAATTCCCAAGTTCGTCTTGA
AAATCGTTCGCCGTCCTCTTCTACGTATTGCTTGTTTTTCTTCCGAAGCTCATTACGGACTCTCTCTTATCCAAAGTCCAAACAAAACCTCGGTAAATCCTCCCTCTTGT
TGCTTATGCCGATATCTCCCACTTCCCTCTCTACCCAATTTTGAATATCTTCGAACTTTTTATAGGGTAAAATTAGCAATGCCCTTTCCAATTCATCCCAGTTTCTAGGA
GGCACGAACTCCGCTCGCAATCTTGTGATTGGCAGCGGAGAAATCTCCTCTAAACCCCCTTCCATTATCATCCTCTGAATCAATTTCCAATGCCACAATCCCACCCAGTT
AAGTGACTCGGCAAAATCCCGCTGGAGCAGAAGAGGAATTTTCTTGAAGAATTCAACCCCAGGCGTGTCCGTCAGTCTCCGGCCAGCGATGGGTACTTCTGTAGCATCAA
CTCCACCAATAATGTCGGCATTTTCGCCGAGAAAGACCCATATCTTCCTCAAGCTCAATCGACCCTCCGTTAGTCAGAGTAAACAATCTCTTCATAGCTCTAGTCCATGT
ATAAACGTTCGCCATTTCAACCATTTCAGTCCTATTTTCGCGACTTCTCGGATCTTTCGTACTGTCACTCGAGCTTTTTCAAATGGGTTTGTTGAAGAAGACGATATTAT
GCCTTCTTTCGAGGAGAAGCCGGTCAAGATTCTGCTGTTGGTTCTGTTTTGGGCTTCTCTATCCCTTTCTTGGTTTGCTGCTTCTGGGGATGCCAAAGCTGCCGGTGATT
CTATCAGAGCTTCGAATTTTGGCCTAAAGATCGCCACCACATTGCGGAGCTCGGGCTGGCCTGCCGAGGCTGTAGTATTTGCCCTCGCTACACTTCCTGTAATTGAGCTC
CGTGGGGCGATCCCTGTTGGTTACTGGATGCAGCTTAAGCCTGTTGCTCTAACGGTTCTCTCCGTTCTTGGGAACATGGTCCCGGTGCCCCTTATCATCCTCTATTTGAA
GAAGTTTGCAACTTTCCTTGCGGGAAGGAATGCTTCTGCCTCTCGATTCCTTGACATGTTATTCAAGAGGGGGAAAGAGAAAGCTGCACCAGTTGAAGAGTTCCAATGGC
TTGGTCTAATGCTGTTTGTGGCTGTGCCTTTCCCTGGAACAGGAGCTTGGACCGGCGCCATCATAGCTTCCATCCTAGATATGCCATTCTGGTCAGGTGTCTCTGCAAAT
TTCTTTGGTGTTGTATTGGCAGGTCTTCTGGTGAACTTGTTGGTGAATCTTGGTCTTAAGGAGGCCATTGTCACTGGAGTGTTTCTTTTCATTGTATCGACATTCATGTG
GAGCATTCTCCGACTGATTAGTAAAGCTTTCAGAAAATGAATCAAATTGAAAGGGCGAATTAATGTTACATCGCACGACAATGAAAAAGGCCATGTTTGATCGTAGTCTC
ATTTGTCTTGACATAGACTTGTTACTTGAGTTAGCACTCTGATCAATGATAACTTACTATTTTCTTGGATGCACAGTTTTTTTCTCTTGTATCATATGACTGAATGATAG
AGCCGTCAAATTTAAATATAAAGTTG
Protein sequence
MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATSRIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWASLSLSWFAASG
DAKAAGDSIRASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPLIILYLKKFATFLAGRNASASRFLDMLFKRGKEKAA
PVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFMWSILRLISKAFRK
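
The protein sequence above is the conceptual translation of the CDS sequence. The following is a minimal Python sketch of how that correspondence can be checked, assuming the standard nuclear genetic code (NCBI translation table 1). The function name `translate` is illustrative; it expects the CDS with line breaks removed and stops at the first in-frame stop codon.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons that contain anything other than A, C, G, T become 'X'.
    A trailing partial codon is ignored.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 39 nt and last 12 nt of the CDS listed above.
    print(translate("ATGGGTACTTCTGTAGCATCAACTCCACCAATAATGTCG"))
    print(translate("TTCAGAAAATGA"))
```

Passing the full CDS, joined into one string, should reproduce the protein sequence listed above if the record is internally consistent.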