; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G012580 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G012580
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionProtein of unknown function (DUF3754)
Genome locationCG_Chr05:16817585..16822259
RNA-Seq ExpressionClCG05G012580
SyntenyClCG05G012580
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR022227 - Protein of unknown function DUF3754


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581139.1 hypothetical protein SDJN03_21141, partial [Cucurbita argyrosperma subsp. sororia]1.1e-22080.62Show/hide
Query:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK
        RTMT KK+EVIRLERESVIPILKP LISALSS LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGK
Subjt:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK

Query:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKISG
        LFQVMEKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGID MNDHFY TKVNAII RIWMFFL I G
Subjt:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKISG

Query:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFDHLL
        LK LL  ASR+ QSQVFSKQIDIST+S+DDGLYVERIRVENM+LGF +              W  +T + P F                D +I  +    
Subjt:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFDHLL

Query:  PANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLI
        PA+  +E+ ERGIF+KHFKNIPMADLEIVLPEKK+PGLTPMDWV FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YC+KTYLSFQG+LVSYQSLI
Subjt:  PANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLI

Query:  TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSAN
        TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+ATK QELD RCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY CVDL SAN
Subjt:  TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSAN

Query:  KIIGITTEEIVSKAKE
         IIGITTEEIV+KAK+
Subjt:  KIIGITTEEIVSKAKE

XP_022934332.1 uncharacterized protein LOC111441529 [Cucurbita moschata]5.0e-22180.81Show/hide
Query:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK
        RTMT KK+EVIRLERESVIPILKP LISALSS LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGK
Subjt:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK

Query:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKISG
        LFQVMEKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGID MNDHFY TKVNAII RIWMFFL I G
Subjt:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKISG

Query:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFDHLL
        LK LL  ASR+ QSQVFSKQIDIST+S+DDGLYVERIRVENM+LGF +              W  +T + P F                D +I  +    
Subjt:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFDHLL

Query:  PANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLI
        PA+  +E+ ERGIF+KHFKNIPMADLEIVLPEKK+PGLTPMDWV FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YC+KTYLSFQG+LVSYQSLI
Subjt:  PANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLI

Query:  TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSAN
        TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+ATK QELD RCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY CVDL SAN
Subjt:  TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSAN

Query:  KIIGITTEEIVSKAKE
         IIGITTEEIV+KAKE
Subjt:  KIIGITTEEIVSKAKE

XP_022983456.1 uncharacterized protein LOC111482053 [Cucurbita maxima]1.2e-21980.63Show/hide
Query:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM
        KK+EVIRLERESVIPILKP LISAL+S LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGKLFQVM
Subjt:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM

Query:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKISGLKSLL
        EKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGID MNDHFY TKVNAII RIWMFFL I GLK LL
Subjt:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKISGLKSLL

Query:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFDHLLPANTT
          ASR+ QSQVFSKQIDIST+S DDGLYVERIRVENM+LGF +              W  +T + P F                D +I  +    PA+  
Subjt:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFDHLLPANTT

Query:  KEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVY
        +E+ ERGIF+KHFKNIPMADLEIVLPEKK+PGLTPMDW+ FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YC+KTYLSFQG+LVSYQSLITSCVY
Subjt:  KEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVY

Query:  DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGI
        DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+ATK QELDMRCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY CVDL SAN IIGI
Subjt:  DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGI

Query:  TTEEIVSKAKE
        TTEEIV+KAKE
Subjt:  TTEEIVSKAKE

XP_023528251.1 uncharacterized protein LOC111791222 isoform X1 [Cucurbita pepo subsp. pepo]4.2e-22080.82Show/hide
Query:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM
        KK+EVIRLERESVIPILKP LISALSS LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGKLFQVM
Subjt:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM

Query:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKISGLKSLL
        EKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGID MNDHFY TKVNAII RIWMFFL I GLK LL
Subjt:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKISGLKSLL

Query:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFDHLLPANTT
          ASR+ QSQVFSKQIDIST+S+DDGLYVERIRVENM+LGF +              W  +T + P F                D +I  +    PA+  
Subjt:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFDHLLPANTT

Query:  KEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVY
        +E+ ERGIF+KHFKNIPMADLEIVLPEKK+PGLTPMDWV FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YC+KTYLSFQG+LVSYQSLITSCVY
Subjt:  KEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVY

Query:  DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGI
        DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+ATK QELD RCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY CVDL SAN IIGI
Subjt:  DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGI

Query:  TTEEIVSKAKE
        TTEEIV+KAKE
Subjt:  TTEEIVSKAKE

XP_038892952.1 uncharacterized protein LOC120081846 [Benincasa hispida]1.7e-24081.45Show/hide
Query:  FICHKLVNQIKAPSIDL------------RTMTKK-REVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLF
        FICH+LVNQI+APSI +            RTMTKK REVIRLERESVIPILKPTLI+ALSSHLDT DR EFLMFCQRVEYSIRAWYLLQFDDLLHLYSLF
Subjt:  FICHKLVNQIKAPSIDL------------RTMTKK-REVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLF

Query:  DPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIG
        +P+HGARKLE++NLSPEEIDV+EQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYF ENPHDNLPYFADKYIIFRRGIG
Subjt:  DPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIG

Query:  IDLMNDHFYHTKVNAIIMRIWMFFLKISGLKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGP
        ID MND+FY TKVNAIIMRIWMFFLK++GLK+LL GASR+RQSQVFSKQIDISTESEDDGLYVERIRVENMT G  +  +              +T + P
Subjt:  IDLMNDHFYHTKVNAIIMRIWMFFLKISGLKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGP

Query:  CFLPSNREVKLGHAFCISDSLIACFDHLLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIF
         F                D +I  +    PANTTKEMERGIFVKHFKNIPMADLEIVLPEK NPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIF
Subjt:  CFLPSNREVKLGHAFCISDSLIACFDHLLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIF

Query:  AILSAVGSYCLKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDD
        AILSAVG YC+KTYLSFQG+LVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+ATK QELD+RCEELI+ +FDQSCNFDVDD
Subjt:  AILSAVGSYCLKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDD

Query:  AVHKLEKLGIVVQGADGAYFCVDLGSANKIIGITTEEIVSKAKEGDSSST
        AVHKL+KLGI+V+GADGAY CVDL SANKIIGITTEEIVSKAKEGD+S+T
Subjt:  AVHKLEKLGIVVQGADGAYFCVDLGSANKIIGITTEEIVSKAKEGDSSST

TrEMBL top hitse value%identityAlignment
A0A6J1D1Z1 uncharacterized protein LOC111016855 isoform X25.4e-21377.25Show/hide
Query:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL-DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLG
        RTM+KK+ EVIRLERESVIPILKP LIS LS+HL D SDR+EF+  CQRVEYSIRAWYLL FDDLLHLY+LFDP+HGA KLEQQNLS EE DVLEQKFLG
Subjt:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL-DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLG

Query:  KLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKIS
         LFQVM+KSNF++TTD+EIAVALSAQYRLNLPISVDESKLDKKLLTKYF ENPHDNLPYFADKYIIFRRGIGID M DHFY TKVN IIMRIW FFLKIS
Subjt:  KLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKIS

Query:  GLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFDH
        GL  L+  GASR+ +SQVF+KQIDIST+SEDDGLYVERIRVENM LG  +  S              +T + P F                D +I  +  
Subjt:  GLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFDH

Query:  LLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSL
          PAN   EMERGIFVKHFKNIPMADLEIVLPEKKNP LTPMDWVKFLVSAAIGLVTVIGSLSVP AD++VIFAI+SAV  Y +KTYLSFQ +LVSYQ+L
Subjt:  LLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSL

Query:  ITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSA
        IT CVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKAT  QELD RCEELIQ QF QSCNFDVDDAVHKLEKLGIVV+ ADGAY CVDL SA
Subjt:  ITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSA

Query:  NKIIGITTEEIVSKAKEGDSSST
        NKIIG TTEEI+SKAKE D+S+T
Subjt:  NKIIGITTEEIVSKAKEGDSSST

A0A6J1D2Z1 uncharacterized protein LOC111016855 isoform X17.0e-21377.1Show/hide
Query:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL--DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFL
        RTM+KK+ EVIRLERESVIPILKP LIS LS+HL  D SDR+EF+  CQRVEYSIRAWYLL FDDLLHLY+LFDP+HGA KLEQQNLS EE DVLEQKFL
Subjt:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL--DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFL

Query:  GKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKI
        G LFQVM+KSNF++TTD+EIAVALSAQYRLNLPISVDESKLDKKLLTKYF ENPHDNLPYFADKYIIFRRGIGID M DHFY TKVN IIMRIW FFLKI
Subjt:  GKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKI

Query:  SGLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFD
        SGL  L+  GASR+ +SQVF+KQIDIST+SEDDGLYVERIRVENM LG  +  S              +T + P F                D +I  + 
Subjt:  SGLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFD

Query:  HLLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQS
           PAN   EMERGIFVKHFKNIPMADLEIVLPEKKNP LTPMDWVKFLVSAAIGLVTVIGSLSVP AD++VIFAI+SAV  Y +KTYLSFQ +LVSYQ+
Subjt:  HLLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQS

Query:  LITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGS
        LIT CVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKAT  QELD RCEELIQ QF QSCNFDVDDAVHKLEKLGIVV+ ADGAY CVDL S
Subjt:  LITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGS

Query:  ANKIIGITTEEIVSKAKEGDSSST
        ANKIIG TTEEI+SKAKE D+S+T
Subjt:  ANKIIGITTEEIVSKAKEGDSSST

A0A6J1D4B7 uncharacterized protein LOC111016855 isoform X31.1e-19473.28Show/hide
Query:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL--DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFL
        RTM+KK+ EVIRLERESVIPILKP LIS LS+HL  D SDR+EF+  CQRVEYSIRAWYLL FDDLLHLY+LFDP+HGA KLEQQNLS EE DVLEQKFL
Subjt:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL--DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFL

Query:  GKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKI
        G LFQVM+KSNF++TTD+EIAVALSAQYRLNLPISVDESKLDKKLLTKYF ENPHDNLPYFADKYIIFRRGIGID M DHFY TKVN IIMRIW FFLKI
Subjt:  GKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKI

Query:  SGLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFD
        SGL  L+  GASR+ +SQVF+KQIDIST+SEDDGLYVERIRVENM LG  +  S              +T + P F                D +I  + 
Subjt:  SGLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFD

Query:  HLLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQS
           PAN   EMERGIFVKHFKNIPMADLEIVLPEKKNP LTPMDWVKFLVSAAIGLV   G    P +         S   S  L     FQ +LVSYQ+
Subjt:  HLLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQS

Query:  LITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGS
        LIT CVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKAT  QELD RCEELIQ QF QSCNFDVDDAVHKLEKLGIVV+ ADGAY CVDL S
Subjt:  LITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGS

Query:  ANKIIGITTEEIVSKAKEGDSSST
        ANKIIG TTEEI+SKAKE D+S+T
Subjt:  ANKIIGITTEEIVSKAKEGDSSST

A0A6J1F2F4 uncharacterized protein LOC1114415292.4e-22180.81Show/hide
Query:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK
        RTMT KK+EVIRLERESVIPILKP LISALSS LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGK
Subjt:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK

Query:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKISG
        LFQVMEKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGID MNDHFY TKVNAII RIWMFFL I G
Subjt:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKISG

Query:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFDHLL
        LK LL  ASR+ QSQVFSKQIDIST+S+DDGLYVERIRVENM+LGF +              W  +T + P F                D +I  +    
Subjt:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFDHLL

Query:  PANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLI
        PA+  +E+ ERGIF+KHFKNIPMADLEIVLPEKK+PGLTPMDWV FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YC+KTYLSFQG+LVSYQSLI
Subjt:  PANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLI

Query:  TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSAN
        TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+ATK QELD RCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY CVDL SAN
Subjt:  TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSAN

Query:  KIIGITTEEIVSKAKE
         IIGITTEEIV+KAKE
Subjt:  KIIGITTEEIVSKAKE

A0A6J1J295 uncharacterized protein LOC1114820536.0e-22080.63Show/hide
Query:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM
        KK+EVIRLERESVIPILKP LISAL+S LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGKLFQVM
Subjt:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM

Query:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKISGLKSLL
        EKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGID MNDHFY TKVNAII RIWMFFL I GLK LL
Subjt:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKISGLKSLL

Query:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFDHLLPANTT
          ASR+ QSQVFSKQIDIST+S DDGLYVERIRVENM+LGF +              W  +T + P F                D +I  +    PA+  
Subjt:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFDHLLPANTT

Query:  KEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVY
        +E+ ERGIF+KHFKNIPMADLEIVLPEKK+PGLTPMDW+ FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YC+KTYLSFQG+LVSYQSLITSCVY
Subjt:  KEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVY

Query:  DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGI
        DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+ATK QELDMRCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY CVDL SAN IIGI
Subjt:  DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGI

Query:  TTEEIVSKAKE
        TTEEIV+KAKE
Subjt:  TTEEIVSKAKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G46915.1 Protein of unknown function (DUF3754)5.8e-1026.34Show/hide
Query:  EMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSL------SVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLIT
        E    + ++ F+ IP+ DL ++ P KK      +D V+  +++ +GL     +       S P A    + A+ +A+  Y  +  L ++ +   YQ L+ 
Subjt:  EMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSL------SVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLIT

Query:  SCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQ--QELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQ
          +Y+K L SG G++  L D   QQ+ KE I+++ I+++ GK      + +  RCE  +   F       V+ A+  L +LG+V +
Subjt:  SCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQ--QELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQ

AT3G19340.1 Protein of unknown function (DUF3754)7.6e-15957.25Show/hide
Query:  REVIRLERESVIPILKPTLISALSSHLD-TSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVME
        +EVIRLE ESVIPILKP LI  L++ ++ ++DR EFL  C+R+EY++RAWYLLQF+DL+ LYSLFDPVHGA+K++QQNL+ +EIDVLEQ FL  LFQVME
Subjt:  REVIRLERESVIPILKPTLISALSSHLD-TSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVME

Query:  KSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKISGLKSL--
        KSNFK+T++EE+ VA S QY LNLPI VDESKLDKKLL +YF E+PH+N+P F+DKY+IFRRGIG+D   D+F+  K++ II R W F ++I+ L+ L  
Subjt:  KSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKISGLKSL--

Query:  -LGASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWIS-LTREGPCFLPSNREVKLGHAFCISDSLIACFDHLLPAN
           +S N++     K  + + ++++D LYVERIR+EN  L F              +S++S LT + P F                D +I  +     A+
Subjt:  -LGASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWIS-LTREGPCFLPSNREVKLGHAFCISDSLIACFDHLLPAN

Query:  TTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCV
        +   +ERGI+VKHFKNIPMAD+EIVLPEK+NPGLTPMDWVKFL+SA +GLV V+ S+ +PK+D  VI AILS V  YC KTY +FQ ++ +YQ+LIT  +
Subjt:  TTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCV

Query:  YDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIG
        YDKQLDSGRGTLLHLCD+VIQQEVKEV+I FYILM+QGKAT  ++LD+RCEELI+ +F   CNFDV+DAV KLEKLGIV +   G Y+C+ L  AN+IIG
Subjt:  YDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIG

Query:  ITTEEIVSKAKEGDSSS
         TTEE+V KAK+G + S
Subjt:  ITTEEIVSKAKEGDSSS

AT5G13940.1 aminopeptidases7.6e-15158.84Show/hide
Query:  DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISV
        D  +R+EFL FCQRVE +IRAWY L F+DL+ LYSLF+PV GA +L QQNLS  EID LE +FL  LFQVMEKSNFK+ T+EEI VALSAQYRLNLPI V
Subjt:  DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISV

Query:  DESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKISGLKSLLGASRNRQSQVFSKQIDISTESEDDGLYVE
        +E+KLD KLLT+YF + P D+LP+FADKYIIFRRG GID M  +F+  K++ I++RIW F L I+ LK L+   +N      S+QIDIS E+E D LY+E
Subjt:  DESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLKISGLKSLLGASRNRQSQVFSKQIDISTESEDDGLYVE

Query:  RIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFDHLLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNP
        RIR+E + L          L  LM +    +T + P F                + +I  +  +   +  KE ER I+VKHFK IPMAD+EIVLPEKKNP
Subjt:  RIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFDHLLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNP

Query:  GLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFY
        GLTP+DWVKFLVSAAIGLVTV+ S+S+ KAD++VI AILS V +YC+KTY +FQ +LV YQSLIT  VYDKQLDSGRGTLLHLCDEVIQQEVKEVIISF+
Subjt:  GLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFY

Query:  ILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGITTEEIVSKAKEG
        +L+K+G  T ++ELDM+ E  I+ +F++SCNFDVDDA+ KLEKLG+V + ++  Y CV++  AN+I+G TTEE+V KA++G
Subjt:  ILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGITTEEIVSKAKEG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGCCATGGCGATGGATTCATCTGCCACAAACTGGTAAACCAGATCAAAGCTCCATCAATTGATTTGCGAACAATGACCAAGAAGAGAGAAGTCATACGCTTGGA
AAGGGAGTCGGTTATCCCCATCCTCAAGCCCACGCTTATCAGCGCCTTGTCCAGCCATCTCGATACTTCGGACCGGGATGAGTTTCTGATGTTTTGCCAGAGAGTTGAAT
ACTCAATTCGAGCTTGGTACCTTCTGCAATTTGATGATCTTTTGCATTTATATTCATTATTCGATCCTGTACATGGGGCTCGAAAATTGGAGCAGCAAAATCTCTCGCCT
GAAGAAATCGATGTTTTGGAACAAAAATTTCTAGGGAAGCTGTTTCAGGTGATGGAGAAGAGCAATTTTAAACTAACAACAGACGAGGAAATCGCGGTTGCACTATCTGC
ACAATATCGTCTAAATCTTCCCATCTCTGTGGATGAGTCCAAGCTTGACAAGAAGCTTTTGACGAAATATTTCATGGAGAATCCTCACGACAATCTTCCATATTTTGCTG
ATAAGTACATAATTTTCCGCCGTGGTATTGGGATTGATCTAATGAACGACCACTTTTACCACACGAAAGTAAATGCCATCATTATGCGAATATGGATGTTCTTTCTCAAA
ATCTCAGGGTTAAAGAGCCTATTAGGAGCATCAAGAAACCGCCAAAGTCAGGTATTTTCAAAACAAATTGACATCAGTACCGAGTCAGAGGATGATGGCTTGTATGTCGA
GCGGATTCGCGTTGAGAACATGACACTTGGGTTTGAACTCGATCACTCTATAATTTGGCTTCATGCATTGATGGTTAGATCTTGGATTAGTTTAACTAGGGAAGGCCCTT
GCTTTCTTCCTAGCAACCGGGAGGTCAAGCTTGGGCATGCATTTTGTATTTCAGATTCCTTGATTGCTTGCTTTGATCATCTATTGCCGGCAAATACGACAAAGGAAATG
GAACGGGGTATCTTCGTGAAGCATTTCAAGAATATACCGATGGCAGATCTTGAGATTGTGCTTCCCGAAAAGAAAAATCCAGGTTTAACTCCAATGGACTGGGTGAAGTT
TCTCGTGTCTGCTGCAATCGGGCTGGTTACTGTTATTGGCTCGCTTAGCGTCCCTAAAGCAGATGTCAAAGTCATTTTTGCTATCCTCTCTGCAGTCGGTAGTTACTGTC
TGAAAACATATCTCTCGTTTCAGGGTAGTTTAGTGTCATATCAGAGCCTAATCACAAGCTGCGTGTATGACAAACAACTAGACAGTGGAAGGGGCACTCTTCTTCACTTG
TGTGACGAAGTTATTCAGCAAGAAGTAAAGGAGGTGATTATTTCCTTCTACATATTGATGAAACAGGGAAAGGCGACGAAACAGCAGGAGCTTGACATGCGATGTGAGGA
GCTGATTCAGAGACAGTTTGATCAGAGCTGTAATTTTGACGTGGATGATGCAGTTCACAAGTTAGAGAAGTTGGGGATCGTTGTCCAGGGTGCGGATGGGGCATATTTCT
GTGTAGATTTGGGGAGTGCCAACAAGATCATAGGCATCACCACAGAGGAGATAGTTTCCAAAGCTAAAGAAGGCGATTCCTCCAGTACTTGA
mRNA sequenceShow/hide mRNA sequence
AGAATTTCCCGCCAAATAGATCGGGAGGGTTTGTTGAACGAAAATGGGAAGCCATGGCGATGGATTCATCTGCCACAAACTGGTAAACCAGATCAAAGCTCCATCAATTG
ATTTGCGAACAATGACCAAGAAGAGAGAAGTCATACGCTTGGAAAGGGAGTCGGTTATCCCCATCCTCAAGCCCACGCTTATCAGCGCCTTGTCCAGCCATCTCGATACT
TCGGACCGGGATGAGTTTCTGATGTTTTGCCAGAGAGTTGAATACTCAATTCGAGCTTGGTACCTTCTGCAATTTGATGATCTTTTGCATTTATATTCATTATTCGATCC
TGTACATGGGGCTCGAAAATTGGAGCAGCAAAATCTCTCGCCTGAAGAAATCGATGTTTTGGAACAAAAATTTCTAGGGAAGCTGTTTCAGGTGATGGAGAAGAGCAATT
TTAAACTAACAACAGACGAGGAAATCGCGGTTGCACTATCTGCACAATATCGTCTAAATCTTCCCATCTCTGTGGATGAGTCCAAGCTTGACAAGAAGCTTTTGACGAAA
TATTTCATGGAGAATCCTCACGACAATCTTCCATATTTTGCTGATAAGTACATAATTTTCCGCCGTGGTATTGGGATTGATCTAATGAACGACCACTTTTACCACACGAA
AGTAAATGCCATCATTATGCGAATATGGATGTTCTTTCTCAAAATCTCAGGGTTAAAGAGCCTATTAGGAGCATCAAGAAACCGCCAAAGTCAGGTATTTTCAAAACAAA
TTGACATCAGTACCGAGTCAGAGGATGATGGCTTGTATGTCGAGCGGATTCGCGTTGAGAACATGACACTTGGGTTTGAACTCGATCACTCTATAATTTGGCTTCATGCA
TTGATGGTTAGATCTTGGATTAGTTTAACTAGGGAAGGCCCTTGCTTTCTTCCTAGCAACCGGGAGGTCAAGCTTGGGCATGCATTTTGTATTTCAGATTCCTTGATTGC
TTGCTTTGATCATCTATTGCCGGCAAATACGACAAAGGAAATGGAACGGGGTATCTTCGTGAAGCATTTCAAGAATATACCGATGGCAGATCTTGAGATTGTGCTTCCCG
AAAAGAAAAATCCAGGTTTAACTCCAATGGACTGGGTGAAGTTTCTCGTGTCTGCTGCAATCGGGCTGGTTACTGTTATTGGCTCGCTTAGCGTCCCTAAAGCAGATGTC
AAAGTCATTTTTGCTATCCTCTCTGCAGTCGGTAGTTACTGTCTGAAAACATATCTCTCGTTTCAGGGTAGTTTAGTGTCATATCAGAGCCTAATCACAAGCTGCGTGTA
TGACAAACAACTAGACAGTGGAAGGGGCACTCTTCTTCACTTGTGTGACGAAGTTATTCAGCAAGAAGTAAAGGAGGTGATTATTTCCTTCTACATATTGATGAAACAGG
GAAAGGCGACGAAACAGCAGGAGCTTGACATGCGATGTGAGGAGCTGATTCAGAGACAGTTTGATCAGAGCTGTAATTTTGACGTGGATGATGCAGTTCACAAGTTAGAG
AAGTTGGGGATCGTTGTCCAGGGTGCGGATGGGGCATATTTCTGTGTAGATTTGGGGAGTGCCAACAAGATCATAGGCATCACCACAGAGGAGATAGTTTCCAAAGCTAA
AGAAGGCGATTCCTCCAGTACTTGATACCATAATTATACAGAAACTGTATGTACTATTCTGCTTTCCATTCTCACACAAGGAAAGCAATATTCTACGATTAGTTCACAAC
TTAAAATATCACTGCATAGTTTGTTTCTAAAAGTTACGTATTCAATCCATCAAGCAC
Protein sequenceShow/hide protein sequence
MGSHGDGFICHKLVNQIKAPSIDLRTMTKKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSP
EEIDVLEQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDLMNDHFYHTKVNAIIMRIWMFFLK
ISGLKSLLGASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSLIACFDHLLPANTTKEM
ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHL
CDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGITTEEIVSKAKEGDSSST