; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g29540 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g29540
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionArabidopsis protein of unknown function (DUF241)
Genome locationchr1:20938488..20940072
RNA-Seq ExpressionMoc01g29540
SyntenyMoc01g29540
Gene Ontology termsNA
InterPro domainsIPR004320 - Protein of unknown function DUF241, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7034047.1 hypothetical protein SDJN02_03773, partial [Cucurbita argyrosperma subsp. argyrosperma]1.0e-12981.41Show/hide
Query:  SIYKPKMAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEAL
        +I  P+M ISSIFT AHQP+RSVSLP RVELEPEPLL+SLKSFQVSSFNAK  +F LE I+AALVGLAELYNSVGEL QSSSTQQALVHYKEGKLVEEAL
Subjt:  SIYKPKMAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEAL

Query:  NESVVLIDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDR-ASFPILDLPNHDLLPLIRLLREARTIS
         ESV+LIDSC SARD+ILMMKQ+IQ LQSALRRK ADSSIESHVRA+FSFRRKA+KDIGS LG+LK+M+++R  SFP+LDLPNHDLLPLIRLLREARTIS
Subjt:  NESVVLIDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDR-ASFPILDLPNHDLLPLIRLLREARTIS

Query:  ISIFSELLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRC
        ISIF ELLA LS  VAK +ASGWSLVSQLMPTIRS SGKG+K+ NELESVD+AL SLLGH R N  +  KAEVQMAQRRL TLA+SFEGIE  LDCMFRC
Subjt:  ISIFSELLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRC

Query:  LVKHRVCFLNML
        LVK+RVCFLNML
Subjt:  LVKHRVCFLNML

XP_022132594.1 uncharacterized protein LOC111005419 [Momordica charantia]5.1e-169100Show/hide
Query:  MHAQRHSIYKPKMAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGK
        MHAQRHSIYKPKMAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGK
Subjt:  MHAQRHSIYKPKMAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGK

Query:  LVEEALNESVVLIDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRASFPILDLPNHDLLPLIRLLRE
        LVEEALNESVVLIDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRASFPILDLPNHDLLPLIRLLRE
Subjt:  LVEEALNESVVLIDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRASFPILDLPNHDLLPLIRLLRE

Query:  ARTISISIFSELLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLD
        ARTISISIFSELLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLD
Subjt:  ARTISISIFSELLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLD

Query:  CMFRCLVKHRVCFLNMLAH
        CMFRCLVKHRVCFLNMLAH
Subjt:  CMFRCLVKHRVCFLNMLAH

XP_022949794.1 uncharacterized protein LOC111453085 [Cucurbita moschata]2.3e-12982.14Show/hide
Query:  MAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVL
        M ISSIFT AHQP+RSVSLP RVELEPEPLL+SLKSFQVSSFNAK  +F LE I+AALVGLAELYNSVGEL QSSSTQQALVHYKEGKLVEEAL ESV+L
Subjt:  MAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVL

Query:  IDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDR-ASFPILDLPNHDLLPLIRLLREARTISISIFSE
        IDSC SARD+ILMMKQ+IQ LQSALRRK ADSSIESHVRA+FSFRRKA+KDIGS LG+LK+M+++R  SFP+LDLPNHDLLPLIRLLREARTISISIF E
Subjt:  IDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDR-ASFPILDLPNHDLLPLIRLLREARTISISIFSE

Query:  LLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV
        LLA LS  VAK +ASGWSLVSQLMPTIRS SGKG+K+ NELESVD+AL SLLGH R N  +  KAEVQMAQRRL TLA+SFEGIE  LDCMFRCLVK+RV
Subjt:  LLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV

Query:  CFLNMLAH
        CFLNML H
Subjt:  CFLNMLAH

XP_023544975.1 uncharacterized protein LOC111804411 isoform X1 [Cucurbita pepo subsp. pepo]9.4e-13181.21Show/hide
Query:  SIYKPKMAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEAL
        +IYKP+M ISSIFT AHQP+RSVSLP RVELEPEPLL+SLKSFQVSS NAK   F LE I+AALVGLAELYNSVGEL QSSSTQQALVHYKEGKLVEEAL
Subjt:  SIYKPKMAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEAL

Query:  NESVVLIDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDR-ASFPILDLPNHDLLPLIRLLREARTIS
         ESV+LIDSC SARD+IL MKQ+IQ LQSALRRK ADSSIESHVRAYFSFRRKA+KDIGS LG+LK+M+++R  SFP+LDLPNHDLLPLIRLLREARTIS
Subjt:  NESVVLIDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDR-ASFPILDLPNHDLLPLIRLLREARTIS

Query:  ISIFSELLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRC
        ISIF ELLA LS  V K +ASGWSLVSQLMPTIRS SGKG+K+ NELESVD+ L SLLGH R N  +  KAEVQMAQRRL TLA+SFEGIE  LDCMFRC
Subjt:  ISIFSELLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRC

Query:  LVKHRVCFLNMLAH
        LVKHRVCFLNML H
Subjt:  LVKHRVCFLNMLAH

XP_038883062.1 uncharacterized protein LOC120074116 [Benincasa hispida]2.5e-13181.82Show/hide
Query:  MAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVL
        M ++SIFT AH P+RSVSLPTRVELEP+PLLQSLKSFQ SS+NAKV  F LE IQAAL+GLAELYNSVGEL QSSSTQQALVHYKEGKLVEEALNESVVL
Subjt:  MAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVL

Query:  IDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDR-ASFPILDLPNHDLLPLIRLLREARTISISIFSE
        IDSCSSARD+ILMMKQ+IQ LQSALRRKGA+SSIESH+RAY+SFRRKA+KDI SCL ALKRMENDR  +F +LD+PNHDLLPLIRLLREAR+ISISIF E
Subjt:  IDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDR-ASFPILDLPNHDLLPLIRLLREARTISISIFSE

Query:  LLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV
        LLA LSAPV K +A GW LVSQLMP I+SGS KG+K+ NELE+VD+ALRSLLG  RGN G+ NKAEV++AQRRLGTLA+SFEGIE GLDCMFRCLVKHRV
Subjt:  LLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV

Query:  CFLNMLAH
        CFLNML H
Subjt:  CFLNMLAH

TrEMBL top hitse value%identityAlignment
A0A0A0KMP4 Uncharacterized protein8.0e-12879.94Show/hide
Query:  MAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVL
        M I SIFTGAHQP+RSVSLPTRVEL+PEPLLQSLKSFQVSS NAK   F LE IQAALVGLAELYNSVGEL QSSSTQQALVHYKEGKLVEEALNESVVL
Subjt:  MAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVL

Query:  IDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRAS--FPILDLPNHDLLPLIRLLREARTISISIFS
        IDSCSSARD+IL MKQ+IQ LQSALRRK A+S +E+HVRAYFSFRRKA+KDIG+ +  LKRMENDR +  F + D+ NHDLLPLI+LLREAR++SISIF 
Subjt:  IDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRAS--FPILDLPNHDLLPLIRLLREARTISISIFS

Query:  ELLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHR
        ELLA LSAPV K  A GWSLVSQLMP I+SGSGKGQK  NELE+VD+AL SLLG  RG  G+ NKAEVQ+AQRR+GTLA+SFEGIE GLDCMFRCLVKHR
Subjt:  ELLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHR

Query:  VCFLNMLAH
        VCFLNML H
Subjt:  VCFLNMLAH

A0A5A7SKH1 Putative Selection and upkeep of intraepithelial T-cells protein 66.0e-12377.67Show/hide
Query:  MAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVL
        M I+SIFTGAHQPIRSVSLPTRVE E EPLLQSLKSFQVSS  AK     LE IQ ALVGLAELYNSVG+L QSSSTQQALVHYKEGKLVEEALNESVVL
Subjt:  MAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVL

Query:  IDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRAS--FPILDLPNHDLLPLIRLLREARTISISIFS
        IDSCSSARD+IL MKQ IQ LQSALRRK A+S +ESHVRAYFS+RRKA+K+IGS +G LKRMENDR +  F + D+ NHDLLP+I+LLREAR++SISIF 
Subjt:  IDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRAS--FPILDLPNHDLLPLIRLLREARTISISIFS

Query:  ELLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHR
        ELLA LSAPV K +A GWSLVSQLMP I+SGSGKGQK  NE+E+VD+AL SLLG  RG  G+ NKAEVQ+AQRR+GTLA+SFEGIE GLD MF+CLVKHR
Subjt:  ELLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHR

Query:  VCFLNMLAH
        VCFLNML H
Subjt:  VCFLNMLAH

A0A6J1BTH5 uncharacterized protein LOC1110054192.5e-169100Show/hide
Query:  MHAQRHSIYKPKMAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGK
        MHAQRHSIYKPKMAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGK
Subjt:  MHAQRHSIYKPKMAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGK

Query:  LVEEALNESVVLIDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRASFPILDLPNHDLLPLIRLLRE
        LVEEALNESVVLIDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRASFPILDLPNHDLLPLIRLLRE
Subjt:  LVEEALNESVVLIDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRASFPILDLPNHDLLPLIRLLRE

Query:  ARTISISIFSELLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLD
        ARTISISIFSELLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLD
Subjt:  ARTISISIFSELLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLD

Query:  CMFRCLVKHRVCFLNMLAH
        CMFRCLVKHRVCFLNMLAH
Subjt:  CMFRCLVKHRVCFLNMLAH

A0A6J1GDT1 uncharacterized protein LOC1114530851.1e-12982.14Show/hide
Query:  MAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVL
        M ISSIFT AHQP+RSVSLP RVELEPEPLL+SLKSFQVSSFNAK  +F LE I+AALVGLAELYNSVGEL QSSSTQQALVHYKEGKLVEEAL ESV+L
Subjt:  MAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVL

Query:  IDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDR-ASFPILDLPNHDLLPLIRLLREARTISISIFSE
        IDSC SARD+ILMMKQ+IQ LQSALRRK ADSSIESHVRA+FSFRRKA+KDIGS LG+LK+M+++R  SFP+LDLPNHDLLPLIRLLREARTISISIF E
Subjt:  IDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDR-ASFPILDLPNHDLLPLIRLLREARTISISIFSE

Query:  LLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV
        LLA LS  VAK +ASGWSLVSQLMPTIRS SGKG+K+ NELESVD+AL SLLGH R N  +  KAEVQMAQRRL TLA+SFEGIE  LDCMFRCLVK+RV
Subjt:  LLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV

Query:  CFLNMLAH
        CFLNML H
Subjt:  CFLNMLAH

A0A6J1II33 uncharacterized protein LOC1114776706.8e-12781.17Show/hide
Query:  MAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVL
        M ISSIFT AHQP+RSVSLP RVEL+PEPLL+SLKSFQV SFNAK  +F L+ IQAALVGLAELYNSVGEL QSSSTQQALVHYKEGKLVEEAL ESV+L
Subjt:  MAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVL

Query:  IDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDR-ASFPILDLPNHDLLPLIRLLREARTISISIFSE
        IDSC SARD+IL+MKQ+IQ LQSALRRK ADSSIESHVRAYFSFRRKA+KDIGS L +LK+M++DR  SFP+LDLPNHDLLPLIRLLREARTISISIF E
Subjt:  IDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDR-ASFPILDLPNHDLLPLIRLLREARTISISIFSE

Query:  LLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV
        LLA LS  VAK +ASGWSLVSQLM TIRS SGKG+K+ NELESVD+AL SLLGH R N  +  KAEVQMAQRRL TLA+SFEGIE  LDCMFRCLVK+RV
Subjt:  LLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV

Query:  CFLNMLAH
        CFLNML H
Subjt:  CFLNMLAH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G17080.1 Arabidopsis protein of unknown function (DUF241)2.3e-1828.52Show/hide
Query:  IRSVSLPTRVELE----PEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSSARD
        +RS S P+R   +     E L +   S Q SS ++     RL+ +Q       EL+ S+ +L     TQQAL      K VE+ L+ S+ ++D C+ ++D
Subjt:  IRSVSLPTRVELE----PEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSSARD

Query:  VILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRASFPILDLPNHDLLPLIRLLREARTISISIFSELLALLSAPVA
         +  MK+ +  +QS LRRK  D S E  V+ Y + R+  +K       +LK  + +          N+D    + +  EA  I++S+F  LL+ +S    
Subjt:  VILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRASFPILDLPNHDLLPLIRLLREARTISISIFSELLALLSAPVA

Query:  KVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRVCFLNMLAH
            S WS+VS+LM          +  ENE   VD   +S               E  +    +  L +  + +E GL+ + + L+K+RV FLN+L H
Subjt:  KVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRVCFLNMLAH

AT3G51410.1 Arabidopsis protein of unknown function (DUF241)4.8e-2431.13Show/hide
Query:  TGAHQPIRSVSLPTRV---ELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVLIDSC
        T    P+RS+SLP+R+     + +  L  +  FQ SS +        +++ A+L+ L+ELY+S+ +L  S  T QA          E +L+ S  L+DSC
Subjt:  TGAHQPIRSVSLPTRV---ELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVLIDSC

Query:  SSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRASFPILDLPNHDLLPLIRLLREARTISISIFSELLALL
         +AR+++L +++ +  LQSALRRK  D S+E  ++ YFSFR+K +K+    L  LK+  +D                      E   +S+SIF  L   L
Subjt:  SSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRASFPILDLPNHDLLPLIRLLREARTISISIFSELLALL

Query:  S-APVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRVCFLN
        S     K +      VS+L   I  G      + +EL+++D  LRS         G  +K   +M +R    L    E +E  LD +F+ LV++RV  LN
Subjt:  S-APVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRVCFLN

Query:  ML
        +L
Subjt:  ML

AT4G35660.1 Arabidopsis protein of unknown function (DUF241)3.1e-2330.74Show/hide
Query:  SSIFTGAHQPIRSVSLPTR-VELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGE-LFQSSSTQQALVHYKEGKLVEEALNESVVLI
        SS     H P RS+SLPTR +  + + + + LK  Q  + ++  ++     IQ  L  L ELY+ V E +  S   QQAL   +  KLVE+AL+ES+VL+
Subjt:  SSIFTGAHQPIRSVSLPTR-VELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGE-LFQSSSTQQALVHYKEGKLVEEALNESVVLI

Query:  DSCSSARDVILMMKQSIQVLQSAL-RRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRASFPILD---LPNHDLLPLIRLLREARTISISIF
        D     RD+I  + + IQ LQSAL RR+G  SS++S +R+Y SF +K++ +    + +L R +  + ++ I     L  H  + +  +LR++   +ISI 
Subjt:  DSCSSARDVILMMKQSIQVLQSAL-RRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRASFPILD---LPNHDLLPLIRLLREARTISISIF

Query:  SELLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVA-LRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVK
          LL  LS                      +     +K   E+  VD + +RS  G   G      + + Q    RL  +  S E I+  L  + R L++
Subjt:  SELLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVA-LRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVK

Query:  HRVCFLNML
        HR   LN++
Subjt:  HRVCFLNML

AT4G35680.1 Arabidopsis protein of unknown function (DUF241)4.8e-3234.3Show/hide
Query:  HQPIRSVSLPTRV---ELEPEPLLQSLKSFQVSSFNAKV-ATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSS
        HQP+RS SLP+R+    ++    L  L  ++ SS +  V A+F  E +   LV L ELY  V EL +S   +  L+H++EGKL++E+L+ SV+L+D    
Subjt:  HQPIRSVSLPTRV---ELEPEPLLQSLKSFQVSSFNAKV-ATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSS

Query:  ARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRASFPILDLPNHDLLPLIRLLREARTISISIFSELLALLS-
         R+VI+ M++ +  L+SALRRKG   S+E   +AYF+ R+KA+K+I   + ALK+ME    S    +      +    +LRE   I++S+F  LL  LS 
Subjt:  ARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRASFPILDLPNHDLLPLIRLLREARTISISIFSELLALLS-

Query:  -----APVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESV-DVALRSLLGHTRGNSGSGNKAEVQMAQR---RLGTLAASFEGIEGGLDCMFRCLVK
             +P       G   +  + P++   S K   +  E++S+ DV L S+L   +         EV+  +    R   +   F  +E  LD + +CLVK
Subjt:  -----APVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESV-DVALRSLLGHTRGNSGSGNKAEVQMAQR---RLGTLAASFEGIEGGLDCMFRCLVK

Query:  HRVCFLNML
        +RV FLN+L
Subjt:  HRVCFLNML

AT4G35690.1 Arabidopsis protein of unknown function (DUF241)1.5e-1726Show/hide
Query:  IRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSSARDVILM
        +RS+SLP+        + +SL   +V + N    T   E++   L GL ELYN   +  +  STQ+ +      + +EE L+ S+ L+D CS +RD+++ 
Subjt:  IRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRLEAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSSARDVILM

Query:  MKQSIQVLQSALRRK---GADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRASFPILDLPNHD--LLPLIRLLREARTISISIFSELLALLSAPV
         ++ ++ +QS +RRK   G +  ++  V  Y  FR+  RK+    LG+LK ++   +S   ++    +  L+ ++  +R+  ++S+++    L  LS   
Subjt:  MKQSIQVLQSALRRK---GADSSIESHVRAYFSFRRKARKDIGSCLGALKRMENDRASFPILDLPNHD--LLPLIRLLREARTISISIFSELLALLSAPV

Query:  AKVRASGWSLVSQLMPTIRSGS-GKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRVCFLNMLAH
           R S  ++ S+L   ++       ++ +NELE++D+ +            S N       Q++L  +  S +G E  L+ +FR L++ R   LN+++H
Subjt:  AKVRASGWSLVSQLMPTIRSGS-GKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRVCFLNMLAH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCCGCCCTCTGTCTGCTGCTGCTGCATTAGAATGTAAACACTGCCAATTCAATCTAGGCTATGGGAGTAGCAATGTTTTCTCATTGAGAATCAGAGAGAGAGAGCT
GAGGCTATGTTCGCAAGGCTGTGATTTTGACATGCATGCACAGAGACATTCAATTTACAAACCAAAGATGGCGATCTCGTCAATATTCACCGGTGCCCACCAGCCCATTA
GGTCTGTTAGCCTGCCAACGAGGGTGGAGCTCGAGCCAGAACCATTGCTGCAAAGTCTGAAATCCTTTCAAGTTTCGTCTTTCAATGCCAAAGTGGCTACATTCAGGCTC
GAGGCGATTCAAGCTGCCTTGGTTGGGCTGGCAGAGTTGTACAACTCTGTTGGAGAGCTTTTTCAGTCCTCTTCCACACAGCAGGCTCTTGTTCACTATAAGGAGGGGAA
GCTTGTGGAAGAGGCTTTAAATGAGTCCGTTGTATTGATAGACTCTTGCAGCTCTGCCAGGGACGTAATCCTTATGATGAAACAAAGTATACAAGTCCTTCAGTCAGCTT
TACGTCGGAAGGGTGCAGATTCAAGCATCGAAAGCCATGTTCGTGCGTACTTCAGCTTCCGAAGGAAGGCAAGGAAAGACATTGGAAGCTGCCTCGGCGCACTGAAACGA
ATGGAGAATGATAGAGCAAGCTTCCCTATATTGGATCTACCAAATCATGATTTGTTGCCTCTGATCAGACTGTTGAGAGAAGCAAGAACCATCAGCATTTCCATCTTTAG
CGAGCTTTTGGCACTCCTATCGGCACCCGTGGCAAAGGTGAGGGCTAGCGGGTGGTCGTTGGTCTCACAGTTGATGCCAACAATCAGGTCAGGATCAGGGAAGGGACAGA
AGATGGAAAACGAGTTGGAGAGCGTGGATGTCGCTCTTCGTTCGCTCCTTGGCCACACGAGAGGCAACAGCGGCAGTGGTAACAAAGCTGAAGTTCAAATGGCACAGAGA
AGGCTTGGAACATTAGCTGCAAGTTTTGAAGGAATTGAGGGTGGATTGGATTGCATGTTCAGATGTTTGGTTAAACATAGAGTGTGTTTTCTCAACATGTTAGCTCATTG
A
mRNA sequenceShow/hide mRNA sequence
ATGCCCCGCCCTCTGTCTGCTGCTGCTGCATTAGAATGTAAACACTGCCAATTCAATCTAGGCTATGGGAGTAGCAATGTTTTCTCATTGAGAATCAGAGAGAGAGAGCT
GAGGCTATGTTCGCAAGGCTGTGATTTTGACATGCATGCACAGAGACATTCAATTTACAAACCAAAGATGGCGATCTCGTCAATATTCACCGGTGCCCACCAGCCCATTA
GGTCTGTTAGCCTGCCAACGAGGGTGGAGCTCGAGCCAGAACCATTGCTGCAAAGTCTGAAATCCTTTCAAGTTTCGTCTTTCAATGCCAAAGTGGCTACATTCAGGCTC
GAGGCGATTCAAGCTGCCTTGGTTGGGCTGGCAGAGTTGTACAACTCTGTTGGAGAGCTTTTTCAGTCCTCTTCCACACAGCAGGCTCTTGTTCACTATAAGGAGGGGAA
GCTTGTGGAAGAGGCTTTAAATGAGTCCGTTGTATTGATAGACTCTTGCAGCTCTGCCAGGGACGTAATCCTTATGATGAAACAAAGTATACAAGTCCTTCAGTCAGCTT
TACGTCGGAAGGGTGCAGATTCAAGCATCGAAAGCCATGTTCGTGCGTACTTCAGCTTCCGAAGGAAGGCAAGGAAAGACATTGGAAGCTGCCTCGGCGCACTGAAACGA
ATGGAGAATGATAGAGCAAGCTTCCCTATATTGGATCTACCAAATCATGATTTGTTGCCTCTGATCAGACTGTTGAGAGAAGCAAGAACCATCAGCATTTCCATCTTTAG
CGAGCTTTTGGCACTCCTATCGGCACCCGTGGCAAAGGTGAGGGCTAGCGGGTGGTCGTTGGTCTCACAGTTGATGCCAACAATCAGGTCAGGATCAGGGAAGGGACAGA
AGATGGAAAACGAGTTGGAGAGCGTGGATGTCGCTCTTCGTTCGCTCCTTGGCCACACGAGAGGCAACAGCGGCAGTGGTAACAAAGCTGAAGTTCAAATGGCACAGAGA
AGGCTTGGAACATTAGCTGCAAGTTTTGAAGGAATTGAGGGTGGATTGGATTGCATGTTCAGATGTTTGGTTAAACATAGAGTGTGTTTTCTCAACATGTTAGCTCATTG
A
Protein sequenceShow/hide protein sequence
MPRPLSAAAALECKHCQFNLGYGSSNVFSLRIRERELRLCSQGCDFDMHAQRHSIYKPKMAISSIFTGAHQPIRSVSLPTRVELEPEPLLQSLKSFQVSSFNAKVATFRL
EAIQAALVGLAELYNSVGELFQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSSARDVILMMKQSIQVLQSALRRKGADSSIESHVRAYFSFRRKARKDIGSCLGALKR
MENDRASFPILDLPNHDLLPLIRLLREARTISISIFSELLALLSAPVAKVRASGWSLVSQLMPTIRSGSGKGQKMENELESVDVALRSLLGHTRGNSGSGNKAEVQMAQR
RLGTLAASFEGIEGGLDCMFRCLVKHRVCFLNMLAH