; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017159 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017159
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionArabidopsis protein of unknown function (DUF241)
Genome locationtig00153031:697293..698349
RNA-Seq ExpressionSgr017159
SyntenySgr017159
Gene Ontology termsNA
InterPro domainsIPR004320 - Protein of unknown function DUF241, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7034047.1 hypothetical protein SDJN02_03773, partial [Cucurbita argyrosperma subsp. argyrosperma]1.9e-13082.69Show/hide
Query:  SIYKPKMVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEAL
        +I  P+MVISSIFT AHQPVRSVSLPARVELEPEPLL+SLKSF+VSSFNAKT  FGLE I+AALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEAL
Subjt:  SIYKPKMVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEAL

Query:  NESVVLIDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDR-ASFPLLDLPNHDLLPLIRLLREARATS
         ESV+LIDSC SARDIILMMKQ+IQ LQSALRRK ADSS+ESHV A+FSFRRKAKKDI S LG+LK+M+++R  SFPLLDLPNHDLLPLIRLLREAR  S
Subjt:  NESVVLIDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDR-ASFPLLDLPNHDLLPLIRLLREARATS

Query:  ISIFGELLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHAR---SNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRC
        ISIFGELLAFLST VAK +ASGWSLVSQLMP I S SGKG+K+VNELESVD+ALHSLLGH R   SN  +A++QMAQRRL TLA+SFEGIE  LDCMFRC
Subjt:  ISIFGELLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHAR---SNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRC

Query:  LVKHRVCFLNML
        LVK+RVCFLNML
Subjt:  LVKHRVCFLNML

XP_022132594.1 uncharacterized protein LOC111005419 [Momordica charantia]6.3e-14789.46Show/hide
Query:  SIYKPKMVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEAL
        SIYKPKM ISSIFTGAHQP+RSVSLP RVELEPEPLLQSLKSF+VSSFNAK A F LEAIQAALVGLAELYNSVGEL QSSSTQQALVHYKEGKLVEEAL
Subjt:  SIYKPKMVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEAL

Query:  NESVVLIDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRASFPLLDLPNHDLLPLIRLLREARATSI
        NESVVLIDSCSSARD+ILMMKQSIQ+LQSALRRKGADSS+ESHV AYFSFRRKA+KDI SCLGALKRMENDRASFP+LDLPNHDLLPLIRLLREAR  SI
Subjt:  NESVVLIDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRASFPLLDLPNHDLLPLIRLLREARATSI

Query:  SIFGELLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHARSN---GNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCL
        SIF ELLA LS PVAK RASGWSLVSQLMP I SGSGKGQKM NELESVDVAL SLLGH R N   GN+A++QMAQRRLGTLAASFEGIEGGLDCMFRCL
Subjt:  SIFGELLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHARSN---GNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCL

Query:  VKHRVCFLNMLAH
        VKHRVCFLNMLAH
Subjt:  VKHRVCFLNMLAH

XP_022949794.1 uncharacterized protein LOC111453085 [Cucurbita moschata]4.1e-13083.44Show/hide
Query:  MVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVL
        MVISSIFT AHQPVRSVSLPARVELEPEPLL+SLKSF+VSSFNAKT  FGLE I+AALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEAL ESV+L
Subjt:  MVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVL

Query:  IDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDR-ASFPLLDLPNHDLLPLIRLLREARATSISIFGE
        IDSC SARDIILMMKQ+IQ LQSALRRK ADSS+ESHV A+FSFRRKAKKDI S LG+LK+M+++R  SFPLLDLPNHDLLPLIRLLREAR  SISIFGE
Subjt:  IDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDR-ASFPLLDLPNHDLLPLIRLLREARATSISIFGE

Query:  LLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHAR---SNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV
        LLAFLST VAK +ASGWSLVSQLMP I S SGKG+K+VNELESVD+ALHSLLGH R   SN  +A++QMAQRRL TLA+SFEGIE  LDCMFRCLVK+RV
Subjt:  LLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHAR---SNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV

Query:  CFLNMLAH
        CFLNML H
Subjt:  CFLNMLAH

XP_023544975.1 uncharacterized protein LOC111804411 isoform X1 [Cucurbita pepo subsp. pepo]8.9e-13382.8Show/hide
Query:  SIYKPKMVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEAL
        +IYKP+MVISSIFT AHQPVRSVSLPARVELEPEPLL+SLKSF+VSS NAKT PFGLE I+AALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEAL
Subjt:  SIYKPKMVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEAL

Query:  NESVVLIDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDR-ASFPLLDLPNHDLLPLIRLLREARATS
         ESV+LIDSC SARDIIL MKQ+IQ LQSALRRK ADSS+ESHV AYFSFRRKAKKDI S LG+LK+M+++R  SFPLLDLPNHDLLPLIRLLREAR  S
Subjt:  NESVVLIDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDR-ASFPLLDLPNHDLLPLIRLLREARATS

Query:  ISIFGELLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHAR---SNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRC
        ISIFGELLAFLST V K +ASGWSLVSQLMP I S SGKG+K+VNELESVD+ LHSLLGH R   SN  +A++QMAQRRL TLA+SFEGIE  LDCMFRC
Subjt:  ISIFGELLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHAR---SNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRC

Query:  LVKHRVCFLNMLAH
        LVKHRVCFLNML H
Subjt:  LVKHRVCFLNMLAH

XP_023544976.1 uncharacterized protein LOC111804411 isoform X2 [Cucurbita pepo subsp. pepo]1.2e-12983.12Show/hide
Query:  MVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVL
        MVISSIFT AHQPVRSVSLPARVELEPEPLL+SLKSF+VSS NAKT PFGLE I+AALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEAL ESV+L
Subjt:  MVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVL

Query:  IDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDR-ASFPLLDLPNHDLLPLIRLLREARATSISIFGE
        IDSC SARDIIL MKQ+IQ LQSALRRK ADSS+ESHV AYFSFRRKAKKDI S LG+LK+M+++R  SFPLLDLPNHDLLPLIRLLREAR  SISIFGE
Subjt:  IDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDR-ASFPLLDLPNHDLLPLIRLLREARATSISIFGE

Query:  LLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHAR---SNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV
        LLAFLST V K +ASGWSLVSQLMP I S SGKG+K+VNELESVD+ LHSLLGH R   SN  +A++QMAQRRL TLA+SFEGIE  LDCMFRCLVKHRV
Subjt:  LLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHAR---SNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV

Query:  CFLNMLAH
        CFLNML H
Subjt:  CFLNMLAH

TrEMBL top hitse value%identityAlignment
A0A0A0KMP4 Uncharacterized protein1.6e-12781.23Show/hide
Query:  MVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVL
        MVI SIFTGAHQPVRSVSLP RVEL+PEPLLQSLKSF+VSS NAKT PFGLE IQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVL
Subjt:  MVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVL

Query:  IDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRAS--FPLLDLPNHDLLPLIRLLREARATSISIFG
        IDSCSSARDIIL MKQ+IQ LQSALRRK A+S VE+HV AYFSFRRKAKKDI + +  LKRMENDR +  F L D+ NHDLLPLI+LLREAR+ SISIFG
Subjt:  IDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRAS--FPLLDLPNHDLLPLIRLLREARATSISIFG

Query:  ELLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHAR---SNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHR
        ELLAFLS PV K  A GWSLVSQLMP I SGSGKGQK VNELE+VD+AL+SLLG  R    N N+A++Q+AQRR+GTLA+SFEGIE GLDCMFRCLVKHR
Subjt:  ELLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHAR---SNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHR

Query:  VCFLNMLAH
        VCFLNML H
Subjt:  VCFLNMLAH

A0A5A7SKH1 Putative Selection and upkeep of intraepithelial T-cells protein 62.0e-12278.32Show/hide
Query:  MVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVL
        MVI+SIFTGAHQP+RSVSLP RVE E EPLLQSLKSF+VSS  AKT P GLE IQ ALVGLAELYNSVG+LVQSSSTQQALVHYKEGKLVEEALNESVVL
Subjt:  MVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVL

Query:  IDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRAS--FPLLDLPNHDLLPLIRLLREARATSISIFG
        IDSCSSARDIIL MKQ IQ LQSALRRK A+S VESHV AYFS+RRKAKK+I S +G LKRMENDR +  F L D+ NHDLLP+I+LLREAR+ SISIFG
Subjt:  IDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRAS--FPLLDLPNHDLLPLIRLLREARATSISIFG

Query:  ELLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHAR---SNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHR
        ELLAFLS PV K +A GWSLVSQLMP I SGSGKGQK VNE+E+VD+AL+SLLG  R    N N+A++Q+AQRR+GTLA+SFEGIE GLD MF+CLVKHR
Subjt:  ELLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHAR---SNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHR

Query:  VCFLNMLAH
        VCFLNML H
Subjt:  VCFLNMLAH

A0A6J1BTH5 uncharacterized protein LOC1110054193.1e-14789.46Show/hide
Query:  SIYKPKMVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEAL
        SIYKPKM ISSIFTGAHQP+RSVSLP RVELEPEPLLQSLKSF+VSSFNAK A F LEAIQAALVGLAELYNSVGEL QSSSTQQALVHYKEGKLVEEAL
Subjt:  SIYKPKMVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEAL

Query:  NESVVLIDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRASFPLLDLPNHDLLPLIRLLREARATSI
        NESVVLIDSCSSARD+ILMMKQSIQ+LQSALRRKGADSS+ESHV AYFSFRRKA+KDI SCLGALKRMENDRASFP+LDLPNHDLLPLIRLLREAR  SI
Subjt:  NESVVLIDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRASFPLLDLPNHDLLPLIRLLREARATSI

Query:  SIFGELLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHARSN---GNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCL
        SIF ELLA LS PVAK RASGWSLVSQLMP I SGSGKGQKM NELESVDVAL SLLGH R N   GN+A++QMAQRRLGTLAASFEGIEGGLDCMFRCL
Subjt:  SIFGELLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHARSN---GNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCL

Query:  VKHRVCFLNMLAH
        VKHRVCFLNMLAH
Subjt:  VKHRVCFLNMLAH

A0A6J1GDT1 uncharacterized protein LOC1114530852.0e-13083.44Show/hide
Query:  MVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVL
        MVISSIFT AHQPVRSVSLPARVELEPEPLL+SLKSF+VSSFNAKT  FGLE I+AALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEAL ESV+L
Subjt:  MVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVL

Query:  IDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDR-ASFPLLDLPNHDLLPLIRLLREARATSISIFGE
        IDSC SARDIILMMKQ+IQ LQSALRRK ADSS+ESHV A+FSFRRKAKKDI S LG+LK+M+++R  SFPLLDLPNHDLLPLIRLLREAR  SISIFGE
Subjt:  IDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDR-ASFPLLDLPNHDLLPLIRLLREARATSISIFGE

Query:  LLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHAR---SNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV
        LLAFLST VAK +ASGWSLVSQLMP I S SGKG+K+VNELESVD+ALHSLLGH R   SN  +A++QMAQRRL TLA+SFEGIE  LDCMFRCLVK+RV
Subjt:  LLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHAR---SNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV

Query:  CFLNMLAH
        CFLNML H
Subjt:  CFLNMLAH

A0A6J1II33 uncharacterized protein LOC1114776709.3e-12882.47Show/hide
Query:  MVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVL
        MVISSIFT AHQPVRSVSLPARVEL+PEPLL+SLKSF+V SFNAKT  FGL+ IQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEAL ESV+L
Subjt:  MVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVL

Query:  IDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDR-ASFPLLDLPNHDLLPLIRLLREARATSISIFGE
        IDSC SARDIIL+MKQ+IQ LQSALRRK ADSS+ESHV AYFSFRRKAKKDI S L +LK+M++DR  SFPLLDLPNHDLLPLIRLLREAR  SISIFGE
Subjt:  IDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDR-ASFPLLDLPNHDLLPLIRLLREARATSISIFGE

Query:  LLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHAR---SNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV
        LLAFLST VAK +ASGWSLVSQLM  I S SGKG+K+VNELESVD+ALHSLLGH R   SN  +A++QMAQRRL TLA+SFEGIE  LDCMFRCLVK+RV
Subjt:  LLAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHAR---SNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV

Query:  CFLNMLAH
        CFLNML H
Subjt:  CFLNMLAH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G17080.1 Arabidopsis protein of unknown function (DUF241)2.7e-1829.21Show/hide
Query:  VRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSSARDIILM
        VRS S P+R   +   + + L   R S    + +     +I   L  L EL+ S+ +L+    TQQAL      K VE+ L+ S+ ++D C+ ++D +  
Subjt:  VRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSSARDIILM

Query:  MKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRASFPLLDLPNHDLLPLIRLLREARATSISIFGELLAFLSTPVAKTRA
        MK+ +  +QS LRRK  D S E  V  Y + R+  KK  +    +LK  + +          N+D    + +  EA A ++S+F  LL+++S        
Subjt:  MKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRASFPLLDLPNHDLLPLIRLLREARATSISIFGELLAFLSTPVAKTRA

Query:  SGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHARSNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRVCFLNMLAH
        S WS+VS+LM          +   NE   VD    S            KM   Q     L +  + +E GL+ + + L+K+RV FLN+L H
Subjt:  SGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHARSNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRVCFLNMLAH

AT3G51410.1 Arabidopsis protein of unknown function (DUF241)7.7e-2631.76Show/hide
Query:  TGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSSA
        T    PVRS+SLP+R+         +L    +   ++ +     +++ A+L+ L+ELY+S+ +L  S  T QA          E +L+ S  L+DSC +A
Subjt:  TGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSSA

Query:  RDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRASFPLLDLPNHDLLPLIRLLREARATSISIFGELLAFLS-T
        R+++L +++ +  LQSALRRK  D S+E  +  YFSFR+K KK+    L  LK+  +D                      E    S+SIF  L  FLS T
Subjt:  RDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRASFPLLDLPNHDLLPLIRLLREARATSISIFGELLAFLS-T

Query:  PVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHARSNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRVCFLNML
           KT+      VS+L   I  G      +++EL+++D  L       RS+G+ +K    ++ L  L    E +E  LD +F+ LV++RV  LN+L
Subjt:  PVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHARSNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRVCFLNML

AT4G35660.1 Arabidopsis protein of unknown function (DUF241)1.5e-2431.7Show/hide
Query:  SSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGE-LVQSSSTQQALVHYKEGKLVEEALNESVVLID
        SS     H P RS+SLP R+ + P+      +  ++ + N+ ++      IQ  L  L ELY+ V E ++ S   QQAL   +  KLVE+AL+ES+VL+D
Subjt:  SSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGE-LVQSSSTQQALVHYKEGKLVEEALNESVVLID

Query:  SCSSARDIILMMKQSIQILQSAL-RRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRASFPLLD---LPNHDLLPLIRLLREARATSISIFG
             RD+I  + + IQ LQSAL RR+G  SSV+S + +Y SF +K+K +    + +L R +  + ++ +     L  H  + +  +LR++ A++ISI  
Subjt:  SCSSARDIILMMKQSIQILQSAL-RRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRASFPLLD---LPNHDLLPLIRLLREARATSISIFG

Query:  ELLAFLSTPVAKTRASGWSL--VSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHARSNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV
         LL FLST           +  V   M     G   G+KMV E+++      ++LG                RL  +  S E I+  L  + R L++HR 
Subjt:  ELLAFLSTPVAKTRASGWSL--VSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHARSNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRV

Query:  CFLNML
          LN++
Subjt:  CFLNML

AT4G35680.1 Arabidopsis protein of unknown function (DUF241)2.8e-3635.64Show/hide
Query:  HQPVRSVSLPARVELEPEPLLQSLKSF----RVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSS
        HQPVRS SLP+R+      L  +L       R SS  + +A FG E +   LV L ELY  V EL++S   +  L+H++EGKL++E+L+ SV+L+D    
Subjt:  HQPVRSVSLPARVELEPEPLLQSLKSF----RVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSS

Query:  ARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRASFPLLDLPNHDLLPLIRLLREARATSISIFGELLAFLST
         R++I+ M++ +  L+SALRRKG   S+E    AYF+ R+KAKK+I   + ALK+ME    S    +      +    +LRE    ++S+F  LL FLST
Subjt:  ARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRASFPLLDLPNHDLLPLIRLLREARATSISIFGELLAFLST

Query:  ------PVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESV-DVALHSLLGHARSNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRVCFL
              P       G   +  + P++   S K   ++ E++S+ DV L S+L   ++      M+  + R   +   F  +E  LD + +CLVK+RV FL
Subjt:  ------PVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESV-DVALHSLLGHARSNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRVCFL

Query:  NML
        N+L
Subjt:  NML

AT4G35690.1 Arabidopsis protein of unknown function (DUF241)1.6e-1825.59Show/hide
Query:  VRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSSARDIILM
        +RS+SLP+        + +SL   +V + N  T     E++   L GL ELYN   + ++  STQ+ +      + +EE L+ S+ L+D CS +RD+++ 
Subjt:  VRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSSARDIILM

Query:  MKQSIQILQSALRRK---GADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRASFPLLDLPNHD--LLPLIRLLREARATSISIFGELLAFLSTPV
         ++ ++ +QS +RRK   G +  ++  V  Y  FR+  +K+ +  LG+LK ++   +S   ++    +  L+ ++  +R+  + S+++    L FLS   
Subjt:  MKQSIQILQSALRRK---GADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRASFPLLDLPNHD--LLPLIRLLREARATSISIFGELLAFLSTPV

Query:  AKTRASGWSLVSQLMPAIMSGS-GKGQKMVNELESVDVALHSLLGHARSNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRVCFLNMLAH
           R S  ++ S+L   +        ++  NELE++D+ +                   Q++L  +  S +G E  L+ +FR L++ R   LN+++H
Subjt:  AKTRASGWSLVSQLMPAIMSGS-GKGQKMVNELESVDVALHSLLGHARSNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRVCFLNMLAH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCATAACGCTGTGGCTTTGACATGCATGCACGGACTTTCTATTTACAAACCAAAGATGGTGATCTCATCGATATTCACCGGTGCCCACCAGCCCGTTAGGTCCGT
TAGCTTGCCGGCGAGGGTGGAGCTCGAGCCAGAACCGTTGCTGCAGAGCCTAAAATCCTTTCGAGTTTCGTCTTTCAATGCGAAAACGGCTCCTTTCGGGCTCGAGGCTA
TTCAAGCCGCCTTGGTTGGGCTTGCAGAGTTGTATAACTCTGTGGGGGAGCTTGTTCAGTCTTCTTCCACCCAGCAGGCTCTCGTTCACTATAAGGAGGGGAAGCTTGTG
GAAGAGGCTTTAAATGAGTCTGTTGTATTGATAGACTCATGCAGCTCTGCAAGAGACATAATCCTTATGATGAAACAAAGTATACAAATCCTTCAGTCGGCTTTACGTCG
AAAGGGTGCAGATTCGAGTGTCGAAAGCCATGTTCATGCCTACTTCAGCTTCCGAAGGAAGGCAAAGAAAGACATCAGAAGCTGCCTCGGCGCGCTGAAGCGAATGGAGA
ATGACAGAGCAAGCTTCCCTTTACTGGATCTACCAAATCATGATTTGTTGCCTCTGATCAGACTGCTGAGAGAAGCAAGAGCCACCAGCATCTCCATCTTTGGCGAGCTT
CTAGCATTCCTTTCAACGCCGGTGGCAAAGACAAGGGCTAGCGGGTGGTCGTTGGTCTCGCAATTGATGCCAGCCATCATGTCGGGATCGGGAAAGGGACAGAAGATGGT
GAACGAGTTGGAAAGTGTGGATGTTGCTCTTCACTCCCTCCTCGGCCATGCGAGAAGCAATGGTAACAGAGCCAAGATGCAAATGGCACAGAGAAGGCTTGGAACATTGG
CTGCAAGTTTTGAAGGAATAGAGGGTGGGTTGGATTGCATGTTTAGATGTTTAGTTAAACATAGAGTGTGTTTTCTTAACATGCTGGCTCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCCATAACGCTGTGGCTTTGACATGCATGCACGGACTTTCTATTTACAAACCAAAGATGGTGATCTCATCGATATTCACCGGTGCCCACCAGCCCGTTAGGTCCGT
TAGCTTGCCGGCGAGGGTGGAGCTCGAGCCAGAACCGTTGCTGCAGAGCCTAAAATCCTTTCGAGTTTCGTCTTTCAATGCGAAAACGGCTCCTTTCGGGCTCGAGGCTA
TTCAAGCCGCCTTGGTTGGGCTTGCAGAGTTGTATAACTCTGTGGGGGAGCTTGTTCAGTCTTCTTCCACCCAGCAGGCTCTCGTTCACTATAAGGAGGGGAAGCTTGTG
GAAGAGGCTTTAAATGAGTCTGTTGTATTGATAGACTCATGCAGCTCTGCAAGAGACATAATCCTTATGATGAAACAAAGTATACAAATCCTTCAGTCGGCTTTACGTCG
AAAGGGTGCAGATTCGAGTGTCGAAAGCCATGTTCATGCCTACTTCAGCTTCCGAAGGAAGGCAAAGAAAGACATCAGAAGCTGCCTCGGCGCGCTGAAGCGAATGGAGA
ATGACAGAGCAAGCTTCCCTTTACTGGATCTACCAAATCATGATTTGTTGCCTCTGATCAGACTGCTGAGAGAAGCAAGAGCCACCAGCATCTCCATCTTTGGCGAGCTT
CTAGCATTCCTTTCAACGCCGGTGGCAAAGACAAGGGCTAGCGGGTGGTCGTTGGTCTCGCAATTGATGCCAGCCATCATGTCGGGATCGGGAAAGGGACAGAAGATGGT
GAACGAGTTGGAAAGTGTGGATGTTGCTCTTCACTCCCTCCTCGGCCATGCGAGAAGCAATGGTAACAGAGCCAAGATGCAAATGGCACAGAGAAGGCTTGGAACATTGG
CTGCAAGTTTTGAAGGAATAGAGGGTGGGTTGGATTGCATGTTTAGATGTTTAGTTAAACATAGAGTGTGTTTTCTTAACATGCTGGCTCATTGA
Protein sequenceShow/hide protein sequence
MVHNAVALTCMHGLSIYKPKMVISSIFTGAHQPVRSVSLPARVELEPEPLLQSLKSFRVSSFNAKTAPFGLEAIQAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLV
EEALNESVVLIDSCSSARDIILMMKQSIQILQSALRRKGADSSVESHVHAYFSFRRKAKKDIRSCLGALKRMENDRASFPLLDLPNHDLLPLIRLLREARATSISIFGEL
LAFLSTPVAKTRASGWSLVSQLMPAIMSGSGKGQKMVNELESVDVALHSLLGHARSNGNRAKMQMAQRRLGTLAASFEGIEGGLDCMFRCLVKHRVCFLNMLAH