CuGenDBv2

Clc05G14830 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc05G14830
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Protein of unknown function (DUF3754)
Genome location: ClcChr05:16080589..16085265
RNA-Seq Expression: Clc05G14830
Synteny: Clc05G14830
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR022227 - Protein of unknown function DUF3754
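
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, it can be split into its parts and the locus span computed as shown here. The helper name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("ClcChr05:16080589..16085265")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene model spans 4,677 bp of chromosome 5.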


Homology
GenBank top hits (e value, % identity, alignment)
KAG6581139.1 hypothetical protein SDJN03_21141, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.2e-221 | % identity: 80.81
Query:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK
        RTMT KK+EVIRLERESVIPILKP LISALSS LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGK
Subjt:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK

Query:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKISG
        LFQVMEKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFY TKVNAII RIWMFFL I G
Subjt:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKISG

Query:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFDHLL
        LK LL  ASR+ QSQVFSKQIDIST+S+DDGLYVERIRVENM+LGF +              W  +T + P F                D  I  +    
Subjt:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFDHLL

Query:  PANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLI
        PA+  +E+ ERGIF+KHFKNIPMADLEIVLPEKK+PGLTPMDWV FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YC+KTYLSFQG+LVSYQSLI
Subjt:  PANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLI

Query:  TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSAN
        TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+ATK QELD RCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY CVDL SAN
Subjt:  TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSAN

Query:  KIIGITTEEIVSKAKE
         IIGITTEEIV+KAK+
Subjt:  KIIGITTEEIVSKAKE
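
In BLAST-style pairwise output, the unlabeled middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank for a mismatch or gap. The sketch below counts these for the final alignment block above. The function name `midline_stats` is hypothetical, and the result covers only this 16-residue block, not the whole hit, so it is not expected to equal the % identity reported for the full alignment.

```python
def midline_stats(query: str, midline: str) -> dict[str, float]:
    """Count identities and conservative substitutions from a BLAST midline."""
    # Trailing blanks (mismatches) are often stripped, so pad to the query length.
    midline = midline.ljust(len(query))
    identities = sum(1 for m in midline if m.isalpha())
    conservative = sum(1 for m in midline if m == "+")
    length = len(query)
    return {
        "identities": identities,
        "conservative": conservative,
        "length": length,
        "percent_identity": 100.0 * identities / length,
    }


# Last block of the alignment, with the 8-character "Query:  " label removed.
query = "KIIGITTEEIVSKAKE"
midline = " IIGITTEEIV+KAK+"
stats = midline_stats(query, midline)
print(stats)
```

Summing identities and lengths over every block of a hit, rather than one block, is what a whole-alignment percentage would need.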

XP_022934332.1 uncharacterized protein LOC111441529 [Cucurbita moschata] | e value: 1.0e-221 | % identity: 81.01
Query:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK
        RTMT KK+EVIRLERESVIPILKP LISALSS LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGK
Subjt:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK

Query:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKISG
        LFQVMEKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFY TKVNAII RIWMFFL I G
Subjt:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKISG

Query:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFDHLL
        LK LL  ASR+ QSQVFSKQIDIST+S+DDGLYVERIRVENM+LGF +              W  +T + P F                D  I  +    
Subjt:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFDHLL

Query:  PANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLI
        PA+  +E+ ERGIF+KHFKNIPMADLEIVLPEKK+PGLTPMDWV FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YC+KTYLSFQG+LVSYQSLI
Subjt:  PANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLI

Query:  TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSAN
        TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+ATK QELD RCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY CVDL SAN
Subjt:  TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSAN

Query:  KIIGITTEEIVSKAKE
         IIGITTEEIV+KAKE
Subjt:  KIIGITTEEIVSKAKE

XP_022983456.1 uncharacterized protein LOC111482053 [Cucurbita maxima] | e value: 2.5e-220 | % identity: 80.82
Query:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM
        KK+EVIRLERESVIPILKP LISAL+S LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGKLFQVM
Subjt:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM

Query:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKISGLKSLL
        EKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFY TKVNAII RIWMFFL I GLK LL
Subjt:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKISGLKSLL

Query:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFDHLLPANTT
          ASR+ QSQVFSKQIDIST+S DDGLYVERIRVENM+LGF +              W  +T + P F                D  I  +    PA+  
Subjt:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFDHLLPANTT

Query:  KEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVY
        +E+ ERGIF+KHFKNIPMADLEIVLPEKK+PGLTPMDW+ FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YC+KTYLSFQG+LVSYQSLITSCVY
Subjt:  KEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVY

Query:  DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGI
        DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+ATK QELDMRCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY CVDL SAN IIGI
Subjt:  DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGI

Query:  TTEEIVSKAKE
        TTEEIV+KAKE
Subjt:  TTEEIVSKAKE

XP_023528251.1 uncharacterized protein LOC111791222 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 8.5e-221 | % identity: 81.02
Query:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM
        KK+EVIRLERESVIPILKP LISALSS LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGKLFQVM
Subjt:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM

Query:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKISGLKSLL
        EKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFY TKVNAII RIWMFFL I GLK LL
Subjt:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKISGLKSLL

Query:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFDHLLPANTT
          ASR+ QSQVFSKQIDIST+S+DDGLYVERIRVENM+LGF +              W  +T + P F                D  I  +    PA+  
Subjt:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFDHLLPANTT

Query:  KEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVY
        +E+ ERGIF+KHFKNIPMADLEIVLPEKK+PGLTPMDWV FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YC+KTYLSFQG+LVSYQSLITSCVY
Subjt:  KEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVY

Query:  DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGI
        DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+ATK QELD RCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY CVDL SAN IIGI
Subjt:  DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGI

Query:  TTEEIVSKAKE
        TTEEIV+KAKE
Subjt:  TTEEIVSKAKE

XP_038892952.1 uncharacterized protein LOC120081846 [Benincasa hispida] | e value: 3.3e-241 | % identity: 81.64
Query:  FICHKLVNQIKAPSIDL------------RTMTKK-REVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLF
        FICH+LVNQI+APSI +            RTMTKK REVIRLERESVIPILKPTLI+ALSSHLDT DR EFLMFCQRVEYSIRAWYLLQFDDLLHLYSLF
Subjt:  FICHKLVNQIKAPSIDL------------RTMTKK-REVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLF

Query:  DPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIG
        +P+HGARKLE++NLSPEEIDV+EQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYF ENPHDNLPYFADKYIIFRRGIG
Subjt:  DPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIG

Query:  IDQMNDHFYHTKVNAIIMRIWMFFLKISGLKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGP
        IDQMND+FY TKVNAIIMRIWMFFLK++GLK+LL GASR+RQSQVFSKQIDISTESEDDGLYVERIRVENMT G  +  +              +T + P
Subjt:  IDQMNDHFYHTKVNAIIMRIWMFFLKISGLKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGP

Query:  CFLPSNREVKLGHAFCISDSFIACFDHLLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIF
         F                D  I  +    PANTTKEMERGIFVKHFKNIPMADLEIVLPEK NPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIF
Subjt:  CFLPSNREVKLGHAFCISDSFIACFDHLLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIF

Query:  AILSAVGSYCLKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDD
        AILSAVG YC+KTYLSFQG+LVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+ATK QELD+RCEELI+ +FDQSCNFDVDD
Subjt:  AILSAVGSYCLKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDD

Query:  AVHKLEKLGIVVQGADGAYFCVDLGSANKIIGITTEEIVSKAKEGDSSST
        AVHKL+KLGI+V+GADGAY CVDL SANKIIGITTEEIVSKAKEGD+S+T
Subjt:  AVHKLEKLGIVVQGADGAYFCVDLGSANKIIGITTEEIVSKAKEGDSSST

TrEMBL top hits (e value, % identity, alignment)
A0A6J1D1Z1 uncharacterized protein LOC111016855 isoform X2 | e value: 1.1e-213 | % identity: 77.44
Query:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL-DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLG
        RTM+KK+ EVIRLERESVIPILKP LIS LS+HL D SDR+EF+  CQRVEYSIRAWYLL FDDLLHLY+LFDP+HGA KLEQQNLS EE DVLEQKFLG
Subjt:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL-DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLG

Query:  KLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKIS
         LFQVM+KSNF++TTD+EIAVALSAQYRLNLPISVDESKLDKKLLTKYF ENPHDNLPYFADKYIIFRRGIGIDQM DHFY TKVN IIMRIW FFLKIS
Subjt:  KLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKIS

Query:  GLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFDH
        GL  L+  GASR+ +SQVF+KQIDIST+SEDDGLYVERIRVENM LG  +  S              +T + P F                D  I  +  
Subjt:  GLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFDH

Query:  LLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSL
          PAN   EMERGIFVKHFKNIPMADLEIVLPEKKNP LTPMDWVKFLVSAAIGLVTVIGSLSVP AD++VIFAI+SAV  Y +KTYLSFQ +LVSYQ+L
Subjt:  LLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSL

Query:  ITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSA
        IT CVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKAT  QELD RCEELIQ QF QSCNFDVDDAVHKLEKLGIVV+ ADGAY CVDL SA
Subjt:  ITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSA

Query:  NKIIGITTEEIVSKAKEGDSSST
        NKIIG TTEEI+SKAKE D+S+T
Subjt:  NKIIGITTEEIVSKAKEGDSSST

A0A6J1D2Z1 uncharacterized protein LOC111016855 isoform X1 | e value: 1.4e-213 | % identity: 77.29
Query:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL--DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFL
        RTM+KK+ EVIRLERESVIPILKP LIS LS+HL  D SDR+EF+  CQRVEYSIRAWYLL FDDLLHLY+LFDP+HGA KLEQQNLS EE DVLEQKFL
Subjt:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL--DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFL

Query:  GKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKI
        G LFQVM+KSNF++TTD+EIAVALSAQYRLNLPISVDESKLDKKLLTKYF ENPHDNLPYFADKYIIFRRGIGIDQM DHFY TKVN IIMRIW FFLKI
Subjt:  GKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKI

Query:  SGLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFD
        SGL  L+  GASR+ +SQVF+KQIDIST+SEDDGLYVERIRVENM LG  +  S              +T + P F                D  I  + 
Subjt:  SGLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFD

Query:  HLLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQS
           PAN   EMERGIFVKHFKNIPMADLEIVLPEKKNP LTPMDWVKFLVSAAIGLVTVIGSLSVP AD++VIFAI+SAV  Y +KTYLSFQ +LVSYQ+
Subjt:  HLLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQS

Query:  LITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGS
        LIT CVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKAT  QELD RCEELIQ QF QSCNFDVDDAVHKLEKLGIVV+ ADGAY CVDL S
Subjt:  LITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGS

Query:  ANKIIGITTEEIVSKAKEGDSSST
        ANKIIG TTEEI+SKAKE D+S+T
Subjt:  ANKIIGITTEEIVSKAKEGDSSST

A0A6J1D4B7 uncharacterized protein LOC111016855 isoform X3 | e value: 2.3e-195 | % identity: 73.47
Query:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL--DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFL
        RTM+KK+ EVIRLERESVIPILKP LIS LS+HL  D SDR+EF+  CQRVEYSIRAWYLL FDDLLHLY+LFDP+HGA KLEQQNLS EE DVLEQKFL
Subjt:  RTMTKKR-EVIRLERESVIPILKPTLISALSSHL--DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFL

Query:  GKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKI
        G LFQVM+KSNF++TTD+EIAVALSAQYRLNLPISVDESKLDKKLLTKYF ENPHDNLPYFADKYIIFRRGIGIDQM DHFY TKVN IIMRIW FFLKI
Subjt:  GKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKI

Query:  SGLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFD
        SGL  L+  GASR+ +SQVF+KQIDIST+SEDDGLYVERIRVENM LG  +  S              +T + P F                D  I  + 
Subjt:  SGLKSLL--GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFD

Query:  HLLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQS
           PAN   EMERGIFVKHFKNIPMADLEIVLPEKKNP LTPMDWVKFLVSAAIGLV   G    P +         S   S  L     FQ +LVSYQ+
Subjt:  HLLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQS

Query:  LITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGS
        LIT CVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKAT  QELD RCEELIQ QF QSCNFDVDDAVHKLEKLGIVV+ ADGAY CVDL S
Subjt:  LITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGS

Query:  ANKIIGITTEEIVSKAKEGDSSST
        ANKIIG TTEEI+SKAKE D+S+T
Subjt:  ANKIIGITTEEIVSKAKEGDSSST

A0A6J1F2F4 uncharacterized protein LOC111441529 | e value: 4.9e-222 | % identity: 81.01
Query:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK
        RTMT KK+EVIRLERESVIPILKP LISALSS LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGK
Subjt:  RTMT-KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGK

Query:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKISG
        LFQVMEKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFY TKVNAII RIWMFFL I G
Subjt:  LFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKISG

Query:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFDHLL
        LK LL  ASR+ QSQVFSKQIDIST+S+DDGLYVERIRVENM+LGF +              W  +T + P F                D  I  +    
Subjt:  LKSLL-GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFDHLL

Query:  PANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLI
        PA+  +E+ ERGIF+KHFKNIPMADLEIVLPEKK+PGLTPMDWV FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YC+KTYLSFQG+LVSYQSLI
Subjt:  PANTTKEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLI

Query:  TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSAN
        TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+ATK QELD RCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY CVDL SAN
Subjt:  TSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSAN

Query:  KIIGITTEEIVSKAKE
         IIGITTEEIV+KAKE
Subjt:  KIIGITTEEIVSKAKE

A0A6J1J295 uncharacterized protein LOC111482053 | e value: 1.2e-220 | % identity: 80.82
Query:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM
        KK+EVIRLERESVIPILKP LISAL+S LD SDRDEFL FCQRVEYSIRAWYLL FDDLLHLYSLFDP+HGARKLEQQNLSPEE D LEQKFLGKLFQVM
Subjt:  KKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVM

Query:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKISGLKSLL
        EKSNFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFY TKVNAII RIWMFFL I GLK LL
Subjt:  EKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKISGLKSLL

Query:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFDHLLPANTT
          ASR+ QSQVFSKQIDIST+S DDGLYVERIRVENM+LGF +              W  +T + P F                D  I  +    PA+  
Subjt:  -GASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFDHLLPANTT

Query:  KEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVY
        +E+ ERGIF+KHFKNIPMADLEIVLPEKK+PGLTPMDW+ FLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVG YC+KTYLSFQG+LVSYQSLITSCVY
Subjt:  KEM-ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVY

Query:  DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGI
        DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+ATK QELDMRCEELIQ QFDQSCNF+VDDAVHKLEKLGI+++ ADGAY CVDL SAN IIGI
Subjt:  DKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGI

Query:  TTEEIVSKAKE
        TTEEIV+KAKE
Subjt:  TTEEIVSKAKE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G46915.1 Protein of unknown function (DUF3754) | e value: 4.5e-10 | % identity: 26.34
Query:  EMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSL------SVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLIT
        E    + ++ F+ IP+ DL ++ P KK      +D V+  +++ +GL     +       S P A    + A+ +A+  Y  +  L ++ +   YQ L+ 
Subjt:  EMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSL------SVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLIT

Query:  SCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQ--QELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQ
          +Y+K L SG G++  L D   QQ+ KE I+++ I+++ GK      + +  RCE  +   F       V+ A+  L +LG+V +
Subjt:  SCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQ--QELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQ

AT3G19340.1 Protein of unknown function (DUF3754) | e value: 4.4e-159 | % identity: 57.25
Query:  REVIRLERESVIPILKPTLISALSSHLD-TSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVME
        +EVIRLE ESVIPILKP LI  L++ ++ ++DR EFL  C+R+EY++RAWYLLQF+DL+ LYSLFDPVHGA+K++QQNL+ +EIDVLEQ FL  LFQVME
Subjt:  REVIRLERESVIPILKPTLISALSSHLD-TSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVME

Query:  KSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKISGLKSL--
        KSNFK+T++EE+ VA S QY LNLPI VDESKLDKKLL +YF E+PH+N+P F+DKY+IFRRGIG+D+  D+F+  K++ II R W F ++I+ L+ L  
Subjt:  KSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKISGLKSL--

Query:  -LGASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWIS-LTREGPCFLPSNREVKLGHAFCISDSFIACFDHLLPAN
           +S N++     K  + + ++++D LYVERIR+EN  L F              +S++S LT + P F                D  I  +     A+
Subjt:  -LGASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWIS-LTREGPCFLPSNREVKLGHAFCISDSFIACFDHLLPAN

Query:  TTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCV
        +   +ERGI+VKHFKNIPMAD+EIVLPEK+NPGLTPMDWVKFL+SA +GLV V+ S+ +PK+D  VI AILS V  YC KTY +FQ ++ +YQ+LIT  +
Subjt:  TTKEMERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCV

Query:  YDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIG
        YDKQLDSGRGTLLHLCD+VIQQEVKEV+I FYILM+QGKAT  ++LD+RCEELI+ +F   CNFDV+DAV KLEKLGIV +   G Y+C+ L  AN+IIG
Subjt:  YDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIG

Query:  ITTEEIVSKAKEGDSSS
         TTEE+V KAK+G + S
Subjt:  ITTEEIVSKAKEGDSSS

AT5G13940.1 aminopeptidases | e value: 5.8e-151 | % identity: 58.84
Query:  DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISV
        D  +R+EFL FCQRVE +IRAWY L F+DL+ LYSLF+PV GA +L QQNLS  EID LE +FL  LFQVMEKSNFK+ T+EEI VALSAQYRLNLPI V
Subjt:  DTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSPEEIDVLEQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISV

Query:  DESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKISGLKSLLGASRNRQSQVFSKQIDISTESEDDGLYVE
        +E+KLD KLLT+YF + P D+LP+FADKYIIFRRG GID M  +F+  K++ I++RIW F L I+ LK L+   +N      S+QIDIS E+E D LY+E
Subjt:  DESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLKISGLKSLLGASRNRQSQVFSKQIDISTESEDDGLYVE

Query:  RIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFDHLLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNP
        RIR+E + L          L  LM +    +T + P F                +  I  +  +   +  KE ER I+VKHFK IPMAD+EIVLPEKKNP
Subjt:  RIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFDHLLPANTTKEMERGIFVKHFKNIPMADLEIVLPEKKNP

Query:  GLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFY
        GLTP+DWVKFLVSAAIGLVTV+ S+S+ KAD++VI AILS V +YC+KTY +FQ +LV YQSLIT  VYDKQLDSGRGTLLHLCDEVIQQEVKEVIISF+
Subjt:  GLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFY

Query:  ILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGITTEEIVSKAKEG
        +L+K+G  T ++ELDM+ E  I+ +F++SCNFDVDDA+ KLEKLG+V + ++  Y CV++  AN+I+G TTEE+V KA++G
Subjt:  ILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGITTEEIVSKAKEG


Sequences
CDS sequence
ATGGGAAGCCATGGCGATGGATTCATCTGCCACAAACTGGTAAACCAGATCAAAGCTCCATCAATTGATTTGCGAACAATGACCAAGAAGAGAGAAGTCATACGCTTGGA
AAGGGAGTCGGTTATCCCCATCCTCAAGCCCACGCTTATCAGCGCCTTGTCCAGCCATCTCGATACTTCGGACCGGGATGAGTTTCTGATGTTTTGCCAGAGAGTTGAAT
ACTCAATTCGAGCTTGGTACCTTCTGCAATTTGATGATCTTTTGCATTTATATTCATTATTCGATCCTGTACATGGGGCTCGAAAATTGGAGCAGCAAAATCTCTCGCCT
GAAGAAATCGATGTTTTGGAACAAAAATTTCTAGGGAAGCTGTTTCAGGTGATGGAGAAGAGCAATTTTAAACTAACAACAGACGAGGAAATCGCGGTTGCACTATCTGC
ACAATATCGTCTAAATCTTCCCATCTCTGTGGATGAGTCCAAGCTTGACAAGAAGCTTTTGACGAAATATTTCATGGAGAATCCTCACGACAATCTTCCATATTTTGCTG
ATAAGTACATAATTTTCCGCCGTGGTATTGGGATTGATCAAATGAACGACCACTTTTACCACACGAAAGTAAATGCCATCATTATGCGAATATGGATGTTCTTTCTCAAA
ATCTCAGGGTTAAAGAGCCTATTAGGAGCATCAAGAAACCGCCAAAGTCAGGTATTTTCAAAACAAATTGACATCAGTACCGAGTCAGAGGATGATGGCTTGTATGTCGA
GCGGATTCGCGTTGAGAACATGACACTTGGGTTTGAACTCGATCACTCTATAATTTGGCTTCATGCATTGATGGTTAGATCTTGGATTAGTTTAACTAGGGAAGGCCCTT
GCTTTCTTCCTAGCAACCGGGAGGTCAAGCTTGGGCATGCATTTTGTATTTCAGATTCCTTTATTGCTTGCTTTGATCATCTATTGCCGGCAAATACGACAAAGGAAATG
GAACGGGGTATCTTCGTGAAGCATTTCAAGAATATACCGATGGCAGATCTTGAGATTGTGCTTCCCGAAAAGAAAAATCCAGGTTTAACTCCAATGGACTGGGTGAAGTT
TCTCGTGTCTGCTGCAATCGGGCTGGTTACTGTTATTGGCTCGCTTAGCGTCCCTAAAGCAGATGTCAAAGTCATTTTTGCTATCCTCTCTGCAGTCGGTAGTTACTGTC
TGAAAACATATCTCTCGTTTCAGGGTAGTTTAGTGTCATATCAGAGCCTAATCACAAGCTGCGTGTATGACAAACAACTAGACAGTGGAAGGGGCACTCTTCTTCACTTG
TGTGACGAAGTTATTCAGCAAGAAGTAAAGGAGGTGATTATTTCCTTCTACATATTGATGAAACAGGGAAAGGCGACGAAACAGCAGGAGCTTGACATGCGATGTGAGGA
GCTGATTCAGAGACAGTTTGATCAGAGCTGTAATTTTGACGTGGATGATGCAGTTCACAAGTTAGAGAAGTTGGGGATCGTTGTCCAGGGTGCGGATGGGGCATATTTCT
GTGTAGATTTGGGGAGTGCCAACAAGATCATAGGCATCACCACAGAGGAGATAGTTTCCAAAGCTAAAGAAGGCGATTCCTCCAGTACTTGA
mRNA sequence
AGAATTTCCCGCCAAATAGATCGGGAGGGTTTGTTGAACGAAAATGGGAAGCCATGGCGATGGATTCATCTGCCACAAACTGGTAAACCAGATCAAAGCTCCATCAATTG
ATTTGCGAACAATGACCAAGAAGAGAGAAGTCATACGCTTGGAAAGGGAGTCGGTTATCCCCATCCTCAAGCCCACGCTTATCAGCGCCTTGTCCAGCCATCTCGATACT
TCGGACCGGGATGAGTTTCTGATGTTTTGCCAGAGAGTTGAATACTCAATTCGAGCTTGGTACCTTCTGCAATTTGATGATCTTTTGCATTTATATTCATTATTCGATCC
TGTACATGGGGCTCGAAAATTGGAGCAGCAAAATCTCTCGCCTGAAGAAATCGATGTTTTGGAACAAAAATTTCTAGGGAAGCTGTTTCAGGTGATGGAGAAGAGCAATT
TTAAACTAACAACAGACGAGGAAATCGCGGTTGCACTATCTGCACAATATCGTCTAAATCTTCCCATCTCTGTGGATGAGTCCAAGCTTGACAAGAAGCTTTTGACGAAA
TATTTCATGGAGAATCCTCACGACAATCTTCCATATTTTGCTGATAAGTACATAATTTTCCGCCGTGGTATTGGGATTGATCAAATGAACGACCACTTTTACCACACGAA
AGTAAATGCCATCATTATGCGAATATGGATGTTCTTTCTCAAAATCTCAGGGTTAAAGAGCCTATTAGGAGCATCAAGAAACCGCCAAAGTCAGGTATTTTCAAAACAAA
TTGACATCAGTACCGAGTCAGAGGATGATGGCTTGTATGTCGAGCGGATTCGCGTTGAGAACATGACACTTGGGTTTGAACTCGATCACTCTATAATTTGGCTTCATGCA
TTGATGGTTAGATCTTGGATTAGTTTAACTAGGGAAGGCCCTTGCTTTCTTCCTAGCAACCGGGAGGTCAAGCTTGGGCATGCATTTTGTATTTCAGATTCCTTTATTGC
TTGCTTTGATCATCTATTGCCGGCAAATACGACAAAGGAAATGGAACGGGGTATCTTCGTGAAGCATTTCAAGAATATACCGATGGCAGATCTTGAGATTGTGCTTCCCG
AAAAGAAAAATCCAGGTTTAACTCCAATGGACTGGGTGAAGTTTCTCGTGTCTGCTGCAATCGGGCTGGTTACTGTTATTGGCTCGCTTAGCGTCCCTAAAGCAGATGTC
AAAGTCATTTTTGCTATCCTCTCTGCAGTCGGTAGTTACTGTCTGAAAACATATCTCTCGTTTCAGGGTAGTTTAGTGTCATATCAGAGCCTAATCACAAGCTGCGTGTA
TGACAAACAACTAGACAGTGGAAGGGGCACTCTTCTTCACTTGTGTGACGAAGTTATTCAGCAAGAAGTAAAGGAGGTGATTATTTCCTTCTACATATTGATGAAACAGG
GAAAGGCGACGAAACAGCAGGAGCTTGACATGCGATGTGAGGAGCTGATTCAGAGACAGTTTGATCAGAGCTGTAATTTTGACGTGGATGATGCAGTTCACAAGTTAGAG
AAGTTGGGGATCGTTGTCCAGGGTGCGGATGGGGCATATTTCTGTGTAGATTTGGGGAGTGCCAACAAGATCATAGGCATCACCACAGAGGAGATAGTTTCCAAAGCTAA
AGAAGGCGATTCCTCCAGTACTTGATACCATAATTATACAGAAACTGTATGTACTATTCTGCTTTCCATTCTCACACAAGGAAAGCAATATTCTACGATTAGTTCACAAC
TTAAAATATCACTGCATAGTTTGTTTCTAAAAGTTACGTATTCAATCCATCAAGCAC
Protein sequence
MGSHGDGFICHKLVNQIKAPSIDLRTMTKKREVIRLERESVIPILKPTLISALSSHLDTSDRDEFLMFCQRVEYSIRAWYLLQFDDLLHLYSLFDPVHGARKLEQQNLSP
EEIDVLEQKFLGKLFQVMEKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYHTKVNAIIMRIWMFFLK
ISGLKSLLGASRNRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELDHSIIWLHALMVRSWISLTREGPCFLPSNREVKLGHAFCISDSFIACFDHLLPANTTKEM
ERGIFVKHFKNIPMADLEIVLPEKKNPGLTPMDWVKFLVSAAIGLVTVIGSLSVPKADVKVIFAILSAVGSYCLKTYLSFQGSLVSYQSLITSCVYDKQLDSGRGTLLHL
CDEVIQQEVKEVIISFYILMKQGKATKQQELDMRCEELIQRQFDQSCNFDVDDAVHKLEKLGIVVQGADGAYFCVDLGSANKIIGITTEEIVSKAKEGDSSST
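
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below builds the standard genetic code (NCBI translation table 1) and translates the first and last 24 nucleotides of the CDS. Using the standard table for this nuclear gene is an assumption, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGGGAAGCCATGGCGATGGATTC"  # first 24 nt of the CDS above
cds_end = "GAAGGCGATTCCTCCAGTACTTGA"    # last 24 nt, including the stop codon
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MGSHGDGF` and `EGDSSST*`, matching the first and last residues of the listed protein sequence. Passing the full CDS, with its line breaks, to `translate` would check the rest of the sequence the same way.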