; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC05G095000 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC05G095000
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionProtein of unknown function (DUF3754)
Genome locationCicolChr05:17039092..17043589
RNA-Seq ExpressionCcUC05G095000
SyntenyCcUC05G095000
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR022227 - Protein of unknown function DUF3754


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581139.1 hypothetical protein SDJN03_21141, partial [Cucurbita argyrosperma subsp. sororia]4.4e-23588.82Show/hide
Query:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK
        RTMT KK+EVIRLERESVIPILKP LISALSS LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGK
Subjt:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK

Query:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKISG
        LFQVMEKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFY TKVNAII RIWMFFL I G
Subjt:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKISG

Query:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEM-ERGIFVKHFKNIPMADLEIVLPE
        LK LL  ASR+ QSQVFSKQIDIST+S+DDGLYVERIRVENM+LG S L  KITIQEPTFDRIIVVYRPA+  +E+ ERGIF+KHFKNIPMADLEIVLPE
Subjt:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEM-ERGIFVKHFKNIPMADLEIVLPE

Query:  KKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVI
        KK+PGLTPMDWV FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YCVKTYLSFQG+LVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVI
Subjt:  KKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVI

Query:  ISFYILMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKE
        ISFYILMKQG+ATKQELD RCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY C DL SAN IIGITTEEIV+KAK+
Subjt:  ISFYILMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKE

XP_022934332.1 uncharacterized protein LOC111441529 [Cucurbita moschata]2.0e-23589.03Show/hide
Query:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK
        RTMT KK+EVIRLERESVIPILKP LISALSS LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGK
Subjt:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK

Query:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKISG
        LFQVMEKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFY TKVNAII RIWMFFL I G
Subjt:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKISG

Query:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEM-ERGIFVKHFKNIPMADLEIVLPE
        LK LL  ASR+ QSQVFSKQIDIST+S+DDGLYVERIRVENM+LG S L  KITIQEPTFDRIIVVYRPA+  +E+ ERGIF+KHFKNIPMADLEIVLPE
Subjt:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEM-ERGIFVKHFKNIPMADLEIVLPE

Query:  KKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVI
        KK+PGLTPMDWV FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YCVKTYLSFQG+LVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVI
Subjt:  KKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVI

Query:  ISFYILMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKE
        ISFYILMKQG+ATKQELD RCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY C DL SAN IIGITTEEIV+KAKE
Subjt:  ISFYILMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKE

XP_022983456.1 uncharacterized protein LOC111482053 [Cucurbita maxima]4.8e-23488.91Show/hide
Query:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM
        KK+EVIRLERESVIPILKP LISAL+S LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGKLFQVM
Subjt:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM

Query:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKISGLKSLL
        EKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFY TKVNAII RIWMFFL I GLK LL
Subjt:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKISGLKSLL

Query:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPG
          ASR+ QSQVFSKQIDIST+S DDGLYVERIRVENM+LG S L  KITIQEPTFDRIIVVYRPA+  +E+ ERGIF+KHFKNIPMADLEIVLPEKK+PG
Subjt:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPG

Query:  LTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYI
        LTPMDW+ FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YCVKTYLSFQG+LVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYI
Subjt:  LTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYI

Query:  LMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKE
        LMKQG+ATKQELDMRCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY C DL SAN IIGITTEEIV+KAKE
Subjt:  LMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKE

XP_023528251.1 uncharacterized protein LOC111791222 isoform X1 [Cucurbita pepo subsp. pepo]1.7e-23489.12Show/hide
Query:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM
        KK+EVIRLERESVIPILKP LISALSS LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGKLFQVM
Subjt:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM

Query:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKISGLKSLL
        EKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFY TKVNAII RIWMFFL I GLK LL
Subjt:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKISGLKSLL

Query:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPG
          ASR+ QSQVFSKQIDIST+S+DDGLYVERIRVENM+LG S L  KITIQEPTFDRIIVVYRPA+  +E+ ERGIF+KHFKNIPMADLEIVLPEKK+PG
Subjt:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPG

Query:  LTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYI
        LTPMDWV FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YCVKTYLSFQG+LVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYI
Subjt:  LTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYI

Query:  LMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKE
        LMKQG+ATKQELD RCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY C DL SAN IIGITTEEIV+KAKE
Subjt:  LMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKE

XP_038892952.1 uncharacterized protein LOC120081846 [Benincasa hispida]1.1e-25789.56Show/hide
Query:  FICHKLVNQIKAPSIDL------------RTMTKK-REVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLF
        FICH+LVNQI+APSI +            RTMTKK REVIRLERESVIPILKPTLI+ALSSHLDT DR EFLMFCQRVEYSIRAWYLLQFDDLLHLYSLF
Subjt:  FICHKLVNQIKAPSIDL------------RTMTKK-REVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLF

Query:  DPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIG
        +P+HGARKLE++NLSPEEIDV+EQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYF ENPHDNLPYFADKYIIFRRGIG
Subjt:  DPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIG

Query:  IDQMNDHFYHTKVNAIIIRIWMFFLKISGLKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPAN
        IDQMND+FY TKVNAII+RIWMFFLK++GLK+LL GASR+RQSQVFSKQIDISTESEDDGLYVERIRVENMT GIS LL KITIQEPTFDRIIV+YRPAN
Subjt:  IDQMNDHFYHTKVNAIIIRIWMFFLKISGLKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPAN

Query:  TTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCV
        TTKEMERGIFVKHFKNIPMADLEIVLPEK NPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YCVKTYLSFQG+LVSYQSLITSCV
Subjt:  TTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCV

Query:  YDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGI
        YDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+ATKQELD+RCEELI+ +FDQSCNFDVDDAVHKL+KLGI+V+GADGAY C DL SANKIIGI
Subjt:  YDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGI

Query:  TTEEIVSKAKEGDSSST
        TTEEIVSKAKEGD+S+T
Subjt:  TTEEIVSKAKEGDSSST

TrEMBL top hitse value%identityAlignment
A0A6J1D1Z1 uncharacterized protein LOC111016855 isoform X21.3e-22985.31Show/hide
Query:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL-DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLG
        RTM+KK+ EVIRLERESVIPILKP LIS LS+HL D SDR+EF+  CQRVEYSIRAWYLL FDDLLHLY+LFDP+HGA KLEQQNLS EE DVLEQKFLG
Subjt:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL-DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLG

Query:  KLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKIS
         LFQVM+KSNF++TTD+EIAVALSAQYRLNLPISVDESKLDKKLLTKYF ENPHDNLPYFADKYIIFRRGIGIDQM DHFY TKVN II+RIW FFLKIS
Subjt:  KLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKIS

Query:  GLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEMERGIFVKHFKNIPMADLEIVLP
        GL  L+  GASR+ +SQVF+KQIDIST+SEDDGLYVERIRVENM LGIS LL +ITIQEPTFDRIIVVYRPAN   EMERGIFVKHFKNIPMADLEIVLP
Subjt:  GLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEMERGIFVKHFKNIPMADLEIVLP

Query:  EKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEV
        EKKNP LTPMDWVKFLVSAAIGLVTVIGSLSVP AD++VIFAI+SAV  Y VKTYLSFQ +LVSYQ+LIT CVYDKQLDSGRGTLLHLCDEVIQQEVKEV
Subjt:  EKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEV

Query:  IISFYILMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKEGDSSST
        IISFYILMKQGKAT QELD RCEELIQ QF QSCNFDVDDAVHKLEKLGIVV+ ADGAY C DL SANKIIG TTEEI+SKAKE D+S+T
Subjt:  IISFYILMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKEGDSSST

A0A6J1D2Z1 uncharacterized protein LOC111016855 isoform X11.7e-22985.13Show/hide
Query:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL--DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFL
        RTM+KK+ EVIRLERESVIPILKP LIS LS+HL  D SDR+EF+  CQRVEYSIRAWYLL FDDLLHLY+LFDP+HGA KLEQQNLS EE DVLEQKFL
Subjt:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL--DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFL

Query:  GKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKI
        G LFQVM+KSNF++TTD+EIAVALSAQYRLNLPISVDESKLDKKLLTKYF ENPHDNLPYFADKYIIFRRGIGIDQM DHFY TKVN II+RIW FFLKI
Subjt:  GKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKI

Query:  SGLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEMERGIFVKHFKNIPMADLEIVL
        SGL  L+  GASR+ +SQVF+KQIDIST+SEDDGLYVERIRVENM LGIS LL +ITIQEPTFDRIIVVYRPAN   EMERGIFVKHFKNIPMADLEIVL
Subjt:  SGLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEMERGIFVKHFKNIPMADLEIVL

Query:  PEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKE
        PEKKNP LTPMDWVKFLVSAAIGLVTVIGSLSVP AD++VIFAI+SAV  Y VKTYLSFQ +LVSYQ+LIT CVYDKQLDSGRGTLLHLCDEVIQQEVKE
Subjt:  PEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKE

Query:  VIISFYILMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKEGDSSST
        VIISFYILMKQGKAT QELD RCEELIQ QF QSCNFDVDDAVHKLEKLGIVV+ ADGAY C DL SANKIIG TTEEI+SKAKE D+S+T
Subjt:  VIISFYILMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKEGDSSST

A0A6J1D4B7 uncharacterized protein LOC111016855 isoform X38.1e-21180.32Show/hide
Query:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL--DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFL
        RTM+KK+ EVIRLERESVIPILKP LIS LS+HL  D SDR+EF+  CQRVEYSIRAWYLL FDDLLHLY+LFDP+HGA KLEQQNLS EE DVLEQKFL
Subjt:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL--DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFL

Query:  GKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKI
        G LFQVM+KSNF++TTD+EIAVALSAQYRLNLPISVDESKLDKKLLTKYF ENPHDNLPYFADKYIIFRRGIGIDQM DHFY TKVN II+RIW FFLKI
Subjt:  GKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKI

Query:  SGLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEMERGIFVKHFKNIPMADLEIVL
        SGL  L+  GASR+ +SQVF+KQIDIST+SEDDGLYVERIRVENM LGIS LL +ITIQEPTFDRIIVVYRPAN   EMERGIFVKHFKNIPMADLEIVL
Subjt:  SGLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEMERGIFVKHFKNIPMADLEIVL

Query:  PEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYL--SFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEV
        PEKKNP LTPMDWVKFLVSAAIGLV   G    P +    ++       S    + L   FQ +LVSYQ+LIT CVYDKQLDSGRGTLLHLCDEVIQQEV
Subjt:  PEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYL--SFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEV

Query:  KEVIISFYILMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKEGDSSST
        KEVIISFYILMKQGKAT QELD RCEELIQ QF QSCNFDVDDAVHKLEKLGIVV+ ADGAY C DL SANKIIG TTEEI+SKAKE D+S+T
Subjt:  KEVIISFYILMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKEGDSSST

A0A6J1F2F4 uncharacterized protein LOC1114415299.5e-23689.03Show/hide
Query:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK
        RTMT KK+EVIRLERESVIPILKP LISALSS LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGK
Subjt:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK

Query:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKISG
        LFQVMEKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFY TKVNAII RIWMFFL I G
Subjt:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKISG

Query:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEM-ERGIFVKHFKNIPMADLEIVLPE
        LK LL  ASR+ QSQVFSKQIDIST+S+DDGLYVERIRVENM+LG S L  KITIQEPTFDRIIVVYRPA+  +E+ ERGIF+KHFKNIPMADLEIVLPE
Subjt:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEM-ERGIFVKHFKNIPMADLEIVLPE

Query:  KKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVI
        KK+PGLTPMDWV FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YCVKTYLSFQG+LVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVI
Subjt:  KKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVI

Query:  ISFYILMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKE
        ISFYILMKQG+ATKQELD RCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY C DL SAN IIGITTEEIV+KAKE
Subjt:  ISFYILMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKE

A0A6J1J295 uncharacterized protein LOC1114820532.3e-23488.91Show/hide
Query:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM
        KK+EVIRLERESVIPILKP LISAL+S LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGKLFQVM
Subjt:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM

Query:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKISGLKSLL
        EKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFY TKVNAII RIWMFFL I GLK LL
Subjt:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKISGLKSLL

Query:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPG
          ASR+ QSQVFSKQIDIST+S DDGLYVERIRVENM+LG S L  KITIQEPTFDRIIVVYRPA+  +E+ ERGIF+KHFKNIPMADLEIVLPEKK+PG
Subjt:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPG

Query:  LTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYI
        LTPMDW+ FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YCVKTYLSFQG+LVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYI
Subjt:  LTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYI

Query:  LMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKE
        LMKQG+ATKQELDMRCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY C DL SAN IIGITTEEIV+KAKE
Subjt:  LMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G46915.1 Protein of unknown function (DUF3754)9.7e-1527.6Show/hide
Query:  ISTLLKKITIQEPTFDRIIVVYRPANTTK------EMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSL------SVPKA
        IS LL   T+QEP F+ +I++Y    + K      E    + ++ F+ IP+ DL ++ P KK      +D V+  +++ +GL     +       S P A
Subjt:  ISTLLKKITIQEPTFDRIIVVYRPANTTK------EMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSL------SVPKA

Query:  DVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGK---ATKQELDMRCEELIQRQFDQ
            + A+ +A+  Y  +  L ++ +   YQ L+   +Y+K L SG G++  L D   QQ+ KE I+++ I+++ GK    + + +  RCE  +   F  
Subjt:  DVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGK---ATKQELDMRCEELIQRQFDQ

Query:  SCNFDVDDAVHKLEKLGIVVQ
             V+ A+  L +LG+V +
Subjt:  SCNFDVDDAVHKLEKLGIVVQ

AT3G19340.1 Protein of unknown function (DUF3754)8.6e-17362.73Show/hide
Query:  REVIRLERESVIPILKPTLISALSSHLD-TSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVME
        +EVIRLE ESVIPILKP LI  L++ ++ ++DR EFL  C+R+EY++RAWYLLQF+DL+ LYSLFDPVHGA+K++QQNL+ +EIDVLEQ FL  LFQVME
Subjt:  REVIRLERESVIPILKPTLISALSSHLD-TSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVME

Query:  KSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKISGLKSL--
        KSNFK+T++EE+ VA S QY LNLPI VDESKLDKKLL +YF E+PH+N+P F+DKY+IFRRGIG+D+  D+F+  K++ II R W F ++I+ L+ L  
Subjt:  KSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKISGLKSL--

Query:  -LGASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPG
           +S N++     K  + + ++++D LYVERIR+EN  L   + L K+TIQEPTFDR+IVVYR A++   +ERGI+VKHFKNIPMAD+EIVLPEK+NPG
Subjt:  -LGASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPG

Query:  LTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYI
        LTPMDWVKFL+SA +GLV V+ S+ +PK+D  VI AILS V  YC KTY +FQ ++ +YQ+LIT  +YDKQLDSGRGTLLHLCD+VIQQEVKEV+I FYI
Subjt:  LTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYI

Query:  LMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKEGDSSS
        LM+QGKAT ++LD+RCEELI+ +F   CNFDV+DAV KLEKLGIV +   G Y+C  L  AN+IIG TTEE+V KAK+G + S
Subjt:  LMKQGKATKQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKEGDSSS

AT5G13940.1 aminopeptidases7.8e-16665.92Show/hide
Query:  DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISV
        D  +R+EFL FCQRVE +IRAWY L F+DL+ LYSLF+PV GA +L QQNLS  EID LE +FL  LFQVMEKSNFK+ T+EEI VALSAQYRLNLPI V
Subjt:  DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISV

Query:  DESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKISGLKSLLGASRNRQSQVFSKQIDISTESEDDGLYVE
        +E+KLD KLLT+YF + P D+LP+FADKYIIFRRG GID M  +F+  K++ I++RIW F L I+ LK L+   +N      S+QIDIS E+E D LY+E
Subjt:  DESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLKISGLKSLLGASRNRQSQVFSKQIDISTESEDDGLYVE

Query:  RIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADV
        RIR+E + L +S L+KKITIQEPTF+RIIVVYR  +  KE ER I+VKHFK IPMAD+EIVLPEKKNPGLTP+DWVKFLVSAAIGLVTV+ S+S+ KAD+
Subjt:  RIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADV

Query:  KVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKAT-KQELDMRCEELIQRQFDQSCNF
        +VI AILS V +YCVKTY +FQ +LV YQSLIT  VYDKQLDSGRGTLLHLCDEVIQQEVKEVIISF++L+K+G  T K+ELDM+ E  I+ +F++SCNF
Subjt:  KVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKAT-KQELDMRCEELIQRQFDQSCNF

Query:  DVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKEG
        DVDDA+ KLEKLG+V + ++  Y C ++  AN+I+G TTEE+V KA++G
Subjt:  DVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKEG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGCCATGGCGATGGATTCATCTGCCACAAACTGGTAAACCAGATCAAAGCTCCATCAATTGATTTGCGAACAATGACCAAGAAGAGGGAAGTCATACGCTTGGA
AAGGGAGTCGGTTATCCCCATCCTCAAGCCCACGCTTATCAGCGCCTTGTCCAGCCATCTCGATACTTCGGACCGGGATGAGTTTCTGATGTTTTGCCAGAGAGTTGAAT
ACTCAATTCGAGCTTGGTACCTTCTGCAATTTGATGATCTTTTGCATTTATATTCATTATTCGATCCTGTACATGGGGCACGAAAATTGGAGCAGCAAAATCTCTCGCCT
GAAGAAATCGATGTTTTGGAACAAAAATTTCTAGGGAAGCTGTTTCAGGTGATGGAGAAGAGCAATTTTAAATTAACAACAGACGAGGAAATCGCGGTTGCACTATCTGC
ACAATATCGTCTAAATCTTCCCATCTCTGTGGATGAGTCCAAGCTTGACAAGAAGCTTTTGACGAAATACTTCATGGAGAATCCTCACGACAATCTTCCATATTTTGCTG
ATAAGTACATAATTTTCCGCCGTGGTATTGGGATTGATCAAATGAACGATCACTTTTACCACACGAAAGTAAATGCCATCATTATACGAATATGGATGTTCTTTCTCAAA
ATCTCAGGGTTAAAGAGCCTATTAGGAGCATCAAGAAACCGCCAAAGTCAGGTATTTTCAAAACAAATTGACATCAGTACCGAGTCAGAGGATGATGGCTTGTATGTCGA
GCGGATTCGCGTCGAGAACATGACACTTGGGATCTCTACGCTATTGAAGAAGATTACGATCCAAGAACCCACATTTGATAGAATTATTGTTGTTTACAGGCCGGCAAATA
CGACAAAGGAAATGGAACGGGGTATCTTCGTGAAGCATTTCAAGAATATACCGATGGCAGATCTTGAGATTGTGCTTCCCGAAAAGAAAAATCCAGGTTTAACTCCAATG
GACTGGGTGAAGTTTCTCGTGTCTGCTGCAATCGGGCTGGTTACTGTTATTGGCTCGCTTAGCGTCCCTAAAGCAGATGTCAAAGTCATTTTTGCTATCCTCTCTGCAGT
CGGTAGTTACTGTGTGAAAACATATCTCTCGTTTCAGGGTAGTTTAGTGTCATATCAGAGCCTAATCACAAGCTGCGTGTATGACAAACAACTAGACAGTGGAAGGGGCA
CTCTTCTTCACTTGTGTGACGAAGTTATTCAGCAAGAAGTAAAGGAGGTGATTATTTCCTTCTACATATTGATGAAACAGGGAAAGGCGACGAAACAGGAGCTTGACATG
CGGTGTGAGGAGCTGATTCAAAGACAGTTTGATCAGAGCTGTAATTTTGACGTGGATGATGCAGTTCACAAGTTAGAGAAGTTGGGGATCGTTGTCCAGGGTGCGGATGG
GGCATATTTCTGTGCAGATTTGGGGAGTGCCAACAAGATCATAGGCATCACCACAGAGGAGATAGTTTCCAAAGCTAAAGAAGGCGATTCCTCCAGTACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAGCCATGGCGATGGATTCATCTGCCACAAACTGGTAAACCAGATCAAAGCTCCATCAATTGATTTGCGAACAATGACCAAGAAGAGGGAAGTCATACGCTTGGA
AAGGGAGTCGGTTATCCCCATCCTCAAGCCCACGCTTATCAGCGCCTTGTCCAGCCATCTCGATACTTCGGACCGGGATGAGTTTCTGATGTTTTGCCAGAGAGTTGAAT
ACTCAATTCGAGCTTGGTACCTTCTGCAATTTGATGATCTTTTGCATTTATATTCATTATTCGATCCTGTACATGGGGCACGAAAATTGGAGCAGCAAAATCTCTCGCCT
GAAGAAATCGATGTTTTGGAACAAAAATTTCTAGGGAAGCTGTTTCAGGTGATGGAGAAGAGCAATTTTAAATTAACAACAGACGAGGAAATCGCGGTTGCACTATCTGC
ACAATATCGTCTAAATCTTCCCATCTCTGTGGATGAGTCCAAGCTTGACAAGAAGCTTTTGACGAAATACTTCATGGAGAATCCTCACGACAATCTTCCATATTTTGCTG
ATAAGTACATAATTTTCCGCCGTGGTATTGGGATTGATCAAATGAACGATCACTTTTACCACACGAAAGTAAATGCCATCATTATACGAATATGGATGTTCTTTCTCAAA
ATCTCAGGGTTAAAGAGCCTATTAGGAGCATCAAGAAACCGCCAAAGTCAGGTATTTTCAAAACAAATTGACATCAGTACCGAGTCAGAGGATGATGGCTTGTATGTCGA
GCGGATTCGCGTCGAGAACATGACACTTGGGATCTCTACGCTATTGAAGAAGATTACGATCCAAGAACCCACATTTGATAGAATTATTGTTGTTTACAGGCCGGCAAATA
CGACAAAGGAAATGGAACGGGGTATCTTCGTGAAGCATTTCAAGAATATACCGATGGCAGATCTTGAGATTGTGCTTCCCGAAAAGAAAAATCCAGGTTTAACTCCAATG
GACTGGGTGAAGTTTCTCGTGTCTGCTGCAATCGGGCTGGTTACTGTTATTGGCTCGCTTAGCGTCCCTAAAGCAGATGTCAAAGTCATTTTTGCTATCCTCTCTGCAGT
CGGTAGTTACTGTGTGAAAACATATCTCTCGTTTCAGGGTAGTTTAGTGTCATATCAGAGCCTAATCACAAGCTGCGTGTATGACAAACAACTAGACAGTGGAAGGGGCA
CTCTTCTTCACTTGTGTGACGAAGTTATTCAGCAAGAAGTAAAGGAGGTGATTATTTCCTTCTACATATTGATGAAACAGGGAAAGGCGACGAAACAGGAGCTTGACATG
CGGTGTGAGGAGCTGATTCAAAGACAGTTTGATCAGAGCTGTAATTTTGACGTGGATGATGCAGTTCACAAGTTAGAGAAGTTGGGGATCGTTGTCCAGGGTGCGGATGG
GGCATATTTCTGTGCAGATTTGGGGAGTGCCAACAAGATCATAGGCATCACCACAGAGGAGATAGTTTCCAAAGCTAAAGAAGGCGATTCCTCCAGTACTTGA
Protein sequenceShow/hide protein sequence
MASHGDGFICHKLVNQIKAPSIDLRTMTKKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSP
EEIDVLEQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIIRIWMFFLK
ISGLKSLLGASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGISTLLKKITIQEPTFDRIIVVYRPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPM
DWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCVKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQELDM
RCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCADLGSANKIIGITTEEIVSKAKEGDSSST