; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G00520 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G00520
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of unknown function (DUF707)
Genome locationClcChr11:565745..570732
RNA-Seq ExpressionClc11G00520
SyntenyClc11G00520
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR007877 - Protein of unknown function DUF707


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008464732.2 PREDICTED: uncharacterized protein LOC103502548, partial [Cucumis melo]5.4e-16670.34Show/hide
Query:  SFFHIDLVVLSFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFE
        SF +I   +L+FI S+SK+RL + TFF A+FLGAGVYFIASEFITK           EIFRWEVFYSARNVKSSTCKN+CRPPGSESLPEGIISKTSNFE
Subjt:  SFFHIDLVVLSFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFE

Query:  FQPLWGSTLQDKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLW
        FQPLWGS+LQ+KKPK SKNLLA+AVGIKQRHVVS+IIEKFP DDFDV+LFHYDGVVDEWR+F+W SRALHVSALNQTK WFAKRFLHPDIVAEYNYIFLW
Subjt:  FQPLWGSTLQDKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLW

Query:  DEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGR---NDGSCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFA
        DEDLGVEYFDPKRYVSILKEEGLEISQPALDP+KSKVHQPLTARKTGSKVHR   +    GR   N       G +EM         +  +    C  + 
Subjt:  DEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGR---NDGSCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFA

Query:  LQNDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSD-------------------ASTLSWRKDSSNFDGSEPKVDNRVKVS
        +QNDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDN LS                     +S    +KDSSN DGSEP+VDNRVK  
Subjt:  LQNDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSD-------------------ASTLSWRKDSSNFDGSEPKVDNRVKVS

Query:  YHNQSKYYCFSDFEWVRIQSSIEMQIFKDRWTDAAKKDRCWIDPY
                       VR+QSS+EMQIFKDRW DAAK DRCWIDPY
Subjt:  YHNQSKYYCFSDFEWVRIQSSIEMQIFKDRWTDAAKKDRCWIDPY

XP_031742290.1 uncharacterized protein LOC101205845 isoform X2 [Cucumis sativus]2.0e-16570.31Show/hide
Query:  IPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQDK
        I S++K+RL + TFF AL LGAGVYFIA+EFITK             FRWEVFYSA+NVKSSTCKN+CRPPGSESLPEGIISKTSNFEFQPLWGS+LQ+K
Subjt:  IPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQDK

Query:  KPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFDPK
        KPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDV+LFHYDGVVDEWR+FAW SRALHVSALNQTK WFAKRFLHPDIVAEYNYIFLWDEDLGV+YFDPK
Subjt:  KPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFDPK

Query:  RYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGR---NDGSCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWGLD
        RY+SILKEEGLEISQPALDP+KSKVHQPLTARKTGSKVHR   +    GR   N       G +EM         +  +    C  + +QNDLIHAWGLD
Subjt:  RYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGR---NDGSCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWGLD

Query:  RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRW
        RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDN L+SDA+    +KDSSN D SEP+V+NRVK                 VR+QSS+EMQIFKDRW
Subjt:  RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRW

Query:  TDAAKKDRCWIDPYLWNKAGSEGVPVSSNRAQTRPEHAASLDNILVVS
        TDAAK DRCWIDPY                AQT+ EH + LD I  +S
Subjt:  TDAAKKDRCWIDPYLWNKAGSEGVPVSSNRAQTRPEHAASLDNILVVS

XP_038893166.1 uncharacterized protein LOC120082028 isoform X1 [Benincasa hispida]4.2e-17175.24Show/hide
Query:  SFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ
        +FI  ESK+RL LCTFF+A+FLGAGVYFIAS+FITK           EIFRWEVFYSAR+VKSSTCKN+CRPPGSESLPEGIISKTSNFEF  LWGS +Q
Subjt:  SFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ

Query:  DKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
        +K+PK+SKNLLAIAVGI+QRHVVSKIIEKFPQD FDV+LFHYDGVVDEWRDFAWSSRALHVSALNQTK WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
Subjt:  DKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD

Query:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGR---NDGSCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWG
        PKRYVSILKEEGLEISQPALDP+KSKVHQPLTARK G KVHR   +    GR   N  +    G +EM         +  +    C  + +QNDLIHAWG
Subjt:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGR---NDGSCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWG

Query:  LDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKD
        LDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVL+SDAS  S +K+SSNFDGSEPKVDNRVK                 VR+QSS+EMQIFKD
Subjt:  LDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKD

Query:  RWTDAAKKDRCWIDPY
        RWT+AAKKDRCWIDPY
Subjt:  RWTDAAKKDRCWIDPY

XP_038893167.1 uncharacterized protein LOC120082028 isoform X2 [Benincasa hispida]3.6e-17074.64Show/hide
Query:  SFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ
        +FI  ESK+RL LCTFF+A+FLGAGVYFIAS+FITK           EIFRWEVFYSAR+VKSSTCKN+CRPPGSESLPEGIISKTSNFEF  LWGS +Q
Subjt:  SFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ

Query:  DKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
        +K+PK+SKNLLAIAVGI+QRHVVSKIIEKFPQD FDV+LFHYDGVVDEWRDFAWSSRALHVSALNQTK WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
Subjt:  DKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD

Query:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGRNDGSCVLKGSMEMHMVY-DSDRNLMPKYPGYCFLFALQNDLIHAWGLD
        PKRYVSILKEEGLEISQPALDP+KSKVHQPLTARK G KVHR      +  +  G C    +      + +    +  +    C  + +QNDLIHAWGLD
Subjt:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGRNDGSCVLKGSMEMHMVY-DSDRNLMPKYPGYCFLFALQNDLIHAWGLD

Query:  RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRW
        RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVL+SDAS  S +K+SSNFDGSEPKVDNRVK                 VR+QSS+EMQIFKDRW
Subjt:  RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRW

Query:  TDAAKKDRCWIDPY
        T+AAKKDRCWIDPY
Subjt:  TDAAKKDRCWIDPY

XP_038893169.1 uncharacterized protein LOC120082028 isoform X3 [Benincasa hispida]4.2e-17175.24Show/hide
Query:  SFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ
        +FI  ESK+RL LCTFF+A+FLGAGVYFIAS+FITK           EIFRWEVFYSAR+VKSSTCKN+CRPPGSESLPEGIISKTSNFEF  LWGS +Q
Subjt:  SFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ

Query:  DKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
        +K+PK+SKNLLAIAVGI+QRHVVSKIIEKFPQD FDV+LFHYDGVVDEWRDFAWSSRALHVSALNQTK WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
Subjt:  DKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD

Query:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGR---NDGSCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWG
        PKRYVSILKEEGLEISQPALDP+KSKVHQPLTARK G KVHR   +    GR   N  +    G +EM         +  +    C  + +QNDLIHAWG
Subjt:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGR---NDGSCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWG

Query:  LDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKD
        LDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVL+SDAS  S +K+SSNFDGSEPKVDNRVK                 VR+QSS+EMQIFKD
Subjt:  LDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKD

Query:  RWTDAAKKDRCWIDPY
        RWT+AAKKDRCWIDPY
Subjt:  RWTDAAKKDRCWIDPY

TrEMBL top hitse value%identityAlignment
A0A0A0KMF9 Uncharacterized protein2.2e-16573.91Show/hide
Query:  IPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQDK
        I S++K+RL + TFF AL LGAGVYFIA+EFITK             FRWEVFYSA+NVKSSTCKN+CRPPGSESLPEGIISKTSNFEFQPLWGS+LQ+K
Subjt:  IPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQDK

Query:  KPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFDPK
        KPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDV+LFHYDGVVDEWR+FAW SRALHVSALNQTK WFAKRFLHPDIVAEYNYIFLWDEDLGV+YFDPK
Subjt:  KPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFDPK

Query:  RYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGR---NDGSCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWGLD
        RY+SILKEEGLEISQPALDP+KSKVHQPLTARKTGSKVHR   +    GR   N       G +EM         +  +    C  + +QNDLIHAWGLD
Subjt:  RYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGR---NDGSCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWGLD

Query:  RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRW
        RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDN L+SDA+    +KDSSN D SEP+V+NRVK                 VR+QSS+EMQIFKDRW
Subjt:  RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRW

Query:  TDAAKKDRCWIDPY
        TDAAK DRCWIDPY
Subjt:  TDAAKKDRCWIDPY

A0A1S3CM49 uncharacterized protein LOC1035025482.6e-16670.34Show/hide
Query:  SFFHIDLVVLSFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFE
        SF +I   +L+FI S+SK+RL + TFF A+FLGAGVYFIASEFITK           EIFRWEVFYSARNVKSSTCKN+CRPPGSESLPEGIISKTSNFE
Subjt:  SFFHIDLVVLSFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFE

Query:  FQPLWGSTLQDKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLW
        FQPLWGS+LQ+KKPK SKNLLA+AVGIKQRHVVS+IIEKFP DDFDV+LFHYDGVVDEWR+F+W SRALHVSALNQTK WFAKRFLHPDIVAEYNYIFLW
Subjt:  FQPLWGSTLQDKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLW

Query:  DEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGR---NDGSCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFA
        DEDLGVEYFDPKRYVSILKEEGLEISQPALDP+KSKVHQPLTARKTGSKVHR   +    GR   N       G +EM         +  +    C  + 
Subjt:  DEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGR---NDGSCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFA

Query:  LQNDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSD-------------------ASTLSWRKDSSNFDGSEPKVDNRVKVS
        +QNDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDN LS                     +S    +KDSSN DGSEP+VDNRVK  
Subjt:  LQNDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSD-------------------ASTLSWRKDSSNFDGSEPKVDNRVKVS

Query:  YHNQSKYYCFSDFEWVRIQSSIEMQIFKDRWTDAAKKDRCWIDPY
                       VR+QSS+EMQIFKDRW DAAK DRCWIDPY
Subjt:  YHNQSKYYCFSDFEWVRIQSSIEMQIFKDRWTDAAKKDRCWIDPY

A0A6J1DXM8 uncharacterized protein LOC111024438 isoform X17.3e-16171.81Show/hide
Query:  IPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLW-GSTLQD
        I SESK RLHLCTFFIA+ LGAGVYFIASEFITK           E  RWEVFY+ARNVKSSTCK+RCRPPGSE+LPEGI+SKTSNFE QPLW GST+Q+
Subjt:  IPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLW-GSTLQD

Query:  KKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFDP
        K P+  KNLLAIAVGIKQ+HVVSKI+EKFP+DDFDV+LFHYDG+VDEW+D AWS   +H+SALNQTK WFAKRFLHPDIVAEYNYIFLWDEDLGVE FDP
Subjt:  KKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFDP

Query:  KRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGRNDGSCV---LKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWGL
        KRY+SILKEEGLEISQPALDP+KSKVHQ LTARKT SKVHR   +   MGR D +       G +EM         +  +    C  + +QNDLIHAWGL
Subjt:  KRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGRNDGSCV---LKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWGL

Query:  DRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDR
        DRQLGYCAQGDRT KVGVVDAEYIVHLGLPTLG SH NVL+S+A  LS +KDSSNFD SEPKVDNRV+                 VRIQSSIEMQIFK+R
Subjt:  DRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDR

Query:  WTDAAKKDRCWIDPY
        WTDAAKKDRCWIDPY
Subjt:  WTDAAKKDRCWIDPY

A0A6J1FRZ3 uncharacterized protein LOC111447910 isoform X11.4e-15970.19Show/hide
Query:  SFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ
        + I  ESK+RL LCTFF+A+ LGAGVYFIASEFITK           EIFRWEVFYSARNV SS CKN+CRPPGSE LPEGI+SKTSNFEFQPLWGSTL 
Subjt:  SFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ

Query:  DKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
        +K PKVSKNLL++AVGI QRH+VSKI+EKFP+DDFDV+LFHYDGVVDEW+DF+WSSRA+HVS+LNQTK WFAKRFLHPDIVAEYNYIFLWDEDLGVE FD
Subjt:  DKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD

Query:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGRNDGSCV---LKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWG
        PKRY+SILKEEGLEISQPALDP+KSKVHQ LTARKTGSKVHR   +     R D +       G +EM         +  +    C  + +QNDLIHAWG
Subjt:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGRNDGSCV---LKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWG

Query:  LDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKD
        LDRQLGYCAQGDRT+KVGVVDAEYIVHLGLPTLGAS+ NVL++DA   S +K+ SNF+  E KVDNRVK                 VRIQSS+EMQIFK+
Subjt:  LDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKD

Query:  RWTDAAKKDRCWIDPY
        RWT+AAK+DRCWIDPY
Subjt:  RWTDAAKKDRCWIDPY

A0A6J1FWB2 uncharacterized protein LOC111447910 isoform X31.4e-15970.19Show/hide
Query:  SFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ
        + I  ESK+RL LCTFF+A+ LGAGVYFIASEFITK           EIFRWEVFYSARNV SS CKN+CRPPGSE LPEGI+SKTSNFEFQPLWGSTL 
Subjt:  SFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ

Query:  DKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
        +K PKVSKNLL++AVGI QRH+VSKI+EKFP+DDFDV+LFHYDGVVDEW+DF+WSSRA+HVS+LNQTK WFAKRFLHPDIVAEYNYIFLWDEDLGVE FD
Subjt:  DKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD

Query:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGRNDGSCV---LKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWG
        PKRY+SILKEEGLEISQPALDP+KSKVHQ LTARKTGSKVHR   +     R D +       G +EM         +  +    C  + +QNDLIHAWG
Subjt:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGRNDGSCV---LKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWG

Query:  LDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKD
        LDRQLGYCAQGDRT+KVGVVDAEYIVHLGLPTLGAS+ NVL++DA   S +K+ SNF+  E KVDNRVK                 VRIQSS+EMQIFK+
Subjt:  LDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKD

Query:  RWTDAAKKDRCWIDPY
        RWT+AAK+DRCWIDPY
Subjt:  RWTDAAKKDRCWIDPY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11170.1 Protein of unknown function (DUF707)3.3e-7342.94Show/hide
Query:  ESLPEGIISKTSNFEFQPLW--GSTLQDKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAK
        + LP GII   S+ E +PLW  GS         ++NLLAI VG+KQ+  V  +++KF   +F +VLFHYDG +D+W D  WSS+++H+ A NQTK WFAK
Subjt:  ESLPEGIISKTSNFEFQPLW--GSTLQDKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAK

Query:  RFLHPDIVAEYNYIFLWDEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGRNDGSCVLKGSMEMHMVY-DSD
        RFLHPD+V+ Y+YIFLWDEDLGVE F+P+RY+ I+K  GLEISQPALD   +++H  +T R    K HR     V++ R    C    S      + +  
Subjt:  RFLHPDIVAEYNYIFLWDEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGRNDGSCVLKGSMEMHMVY-DSD

Query:  RNLMPKYPGYCFLFALQNDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQ
          +  K    C    +QNDL+H WG+D +LGYCAQGDRTK VG+VD+EYI+H G+ TLG S      +     + R+  + FD                 
Subjt:  RNLMPKYPGYCFLFALQNDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQ

Query:  SKYYCFSDFEWVRIQSSIEMQIFKDRWTDAAKKDRCWIDP
                   +R QS+ E+Q FK+RW+ A ++D  WIDP
Subjt:  SKYYCFSDFEWVRIQSSIEMQIFKDRWTDAAKKDRCWIDP

AT1G61240.1 Protein of unknown function (DUF707)3.7e-7242.77Show/hide
Query:  ESLPEGIISKTSNFEFQPLW-GSTLQDKKPKV-SKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAK
        + LP GI+   S+ E +PLW  S+L+ K  ++ ++NLLA+ VG+KQ+  V  +++KF   +F V+LFHYDG +D+W D  WSS+A+H+ A NQTK WFAK
Subjt:  ESLPEGIISKTSNFEFQPLW-GSTLQDKKPKV-SKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAK

Query:  RFLHPDIVAEYNYIFLWDEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGRNDGSCVLKGSMEMHMVYDSDR
        RFLHPDIV+ Y+Y+FLWDEDLGVE F+P++Y+ I+K  GLEISQPAL P  ++VH  +T R      HR V      G    S   +G      V +   
Subjt:  RFLHPDIVAEYNYIFLWDEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGRNDGSCVLKGSMEMHMVYDSDR

Query:  NLMPKYPGYCFLFALQNDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGAS-HDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQ
         +  +   +C    +QNDL+H WG+D +LGYCAQGDR+KKVG+VD+EYI H G+ TLG S + +  +S  S ++ R+ S+ FD                 
Subjt:  NLMPKYPGYCFLFALQNDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGAS-HDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQ

Query:  SKYYCFSDFEWVRIQSSIEMQIFKDRWTDAAKKDRCWID
                   +R QS+ E+Q FK+RW  A  +D+ W++
Subjt:  SKYYCFSDFEWVRIQSSIEMQIFKDRWTDAAKKDRCWID

AT4G12840.1 Protein of unknown function (DUF707)1.0e-9346Show/hide
Query:  SESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQDKKP
        ++ +  L L   F  +F  A ++ I + FIT    E        I  W         K   CK + RPPGSE+LP GI++ TS+ E +PLWG+  +DKKP
Subjt:  SESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQDKKP

Query:  KVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFDPKRY
        K S  LLA+AVGI+Q+  V+KI++KFP  +F V+LFHYDG VDEW++F WS  A+H+S +NQTK WFAKRFLHPDIV+ Y+YIFLWDEDLGV++FD +RY
Subjt:  KVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFDPKRY

Query:  VSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHR---SVLHGVFMGRNDGSCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWGLDRQ
        VSI+KEE LEISQPALDP  S+VH  LT+R   S+VHR    V+       N       G +EM         +  +    C    +QNDL H WG+D Q
Subjt:  VSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHR---SVLHGVFMGRNDGSCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWGLDRQ

Query:  LGYCAQGDRTKKVGVVDAEYIVHLGLPTL-GASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRWT
        LGYCAQGDRTK +G+VD+EYI+H+GLPTL G S +N   +D+  L   K     D S      R +                 VR Q+ +E++ FK RW 
Subjt:  LGYCAQGDRTKKVGVVDAEYIVHLGLPTL-GASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRWT

Query:  DAAKKDRCWIDPY
        +A K D CWID +
Subjt:  DAAKKDRCWIDPY

AT4G12840.2 Protein of unknown function (DUF707)1.0e-9346Show/hide
Query:  SESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQDKKP
        ++ +  L L   F  +F  A ++ I + FIT    E        I  W         K   CK + RPPGSE+LP GI++ TS+ E +PLWG+  +DKKP
Subjt:  SESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQDKKP

Query:  KVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFDPKRY
        K S  LLA+AVGI+Q+  V+KI++KFP  +F V+LFHYDG VDEW++F WS  A+H+S +NQTK WFAKRFLHPDIV+ Y+YIFLWDEDLGV++FD +RY
Subjt:  KVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFDPKRY

Query:  VSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHR---SVLHGVFMGRNDGSCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWGLDRQ
        VSI+KEE LEISQPALDP  S+VH  LT+R   S+VHR    V+       N       G +EM         +  +    C    +QNDL H WG+D Q
Subjt:  VSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHR---SVLHGVFMGRNDGSCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWGLDRQ

Query:  LGYCAQGDRTKKVGVVDAEYIVHLGLPTL-GASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRWT
        LGYCAQGDRTK +G+VD+EYI+H+GLPTL G S +N   +D+  L   K     D S      R +                 VR Q+ +E++ FK RW 
Subjt:  LGYCAQGDRTKKVGVVDAEYIVHLGLPTL-GASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRWT

Query:  DAAKKDRCWIDPY
        +A K D CWID +
Subjt:  DAAKKDRCWIDPY

AT4G18530.1 Protein of unknown function (DUF707)6.9e-10348.58Show/hide
Query:  SKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARN--------VKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWG-S
        S +R  LC+  I   L  G YFI + ++ K   E       ++ +WE+     N          +STCKN  +P G+E+LP+GII KTSN E Q LW   
Subjt:  SKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARN--------VKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWG-S

Query:  TLQDKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVE
          + ++P  S +LLA+AVGIKQ+ +V+K+I+KFP  DF V+LFHYDGVVD+W+ + W++ A+HVS +NQTK WFAKRFLHPDIVAEY YIFLWDEDLGV 
Subjt:  TLQDKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVE

Query:  YFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGRNDG-----SCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDL
        +F+P+RY+SI+KEEGLEISQPALD  KS+VH P+TAR+  SKVHR +      GR D       C+  G +EM         +  +    C  + +QNDL
Subjt:  YFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGRNDG-----SCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDL

Query:  IHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPK-VDNRVKVSYHNQSKYYCFSDFEWVRIQSSIE
        IHAWGLD QLGYCAQGDR K VGVVDAEYI+H GLPTLG     V+ + +S L    DS + +  E + VDNR +                 VR++S +E
Subjt:  IHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPK-VDNRVKVSYHNQSKYYCFSDFEWVRIQSSIE

Query:  MQIFKDRWTDAAKKDRCWIDPY
        M+ FK+RW  A + D CW+DPY
Subjt:  MQIFKDRWTDAAKKDRCWIDPY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTATCGTTGGTTTGATCTCTTGGATTCTGATATTCAGCTCATTGAGTTTTCTGTGTCTTCCTTTTTCCATATTGATCTTGTTGTTTTGAGTTTCATACCATCGGA
GTCAAAGCATAGATTGCACCTCTGTACTTTCTTCATTGCGTTATTCCTTGGTGCAGGAGTCTATTTCATTGCAAGTGAATTTATTACAAAGGTAGAAATTGAAGCATATC
CTTCCATAGTTCTTGAAATTTTTAGATGGGAGGTATTTTATTCTGCCCGAAATGTAAAATCCAGTACATGCAAGAATCGATGCAGGCCTCCTGGCAGTGAGTCTTTGCCA
GAAGGAATCATTAGTAAAACATCTAACTTTGAATTTCAGCCTTTATGGGGCTCGACTTTACAAGATAAAAAGCCGAAGGTTTCAAAGAACTTGTTAGCAATTGCTGTTGG
AATCAAACAAAGACACGTAGTGTCAAAAATTATTGAAAAGTTCCCGCAAGATGATTTTGACGTGGTTCTTTTTCATTACGACGGTGTTGTGGATGAATGGAGGGATTTTG
CATGGAGTTCTCGTGCACTACATGTCTCTGCATTGAATCAGACAAAGTGGTTTGCCAAGCGTTTCTTGCATCCAGATATAGTTGCTGAATATAATTATATATTTCTTTGG
GATGAGGACCTTGGTGTCGAGTATTTTGACCCAAAACGATATGTATCAATCCTCAAGGAGGAGGGGCTCGAGATATCACAACCAGCTCTTGATCCCATTAAGTCCAAGGT
GCACCAGCCACTTACTGCTCGCAAAACAGGATCGAAAGTTCACAGGTCAGTTTTACATGGGGTTTTTATGGGTCGAAATGATGGCTCCTGTGTTCTCAAGGGCAGCATGG
AGATGCACATGGTATATGATTCAGACAGAAATTTAATGCCAAAATACCCTGGCTATTGCTTTCTTTTTGCTTTACAGAATGACTTGATCCACGCATGGGGATTAGATAGA
CAGCTTGGCTATTGTGCACAAGGCGACCGAACAAAAAAAGTCGGCGTTGTTGATGCAGAGTACATAGTTCATTTAGGTCTGCCAACGCTCGGTGCTTCCCATGACAATGT
GCTGAGTTCTGATGCTTCAACTCTTTCGTGGAGGAAAGACTCGTCAAACTTCGATGGATCGGAACCCAAAGTGGATAATAGAGTTAAAGTAAGTTACCACAATCAATCCA
AGTATTATTGTTTTTCCGATTTCGAATGGGTGAGGATACAATCTTCCATAGAAATGCAGATCTTCAAGGACCGATGGACCGATGCAGCAAAGAAGGATAGATGTTGGATC
GACCCGTATCTTTGGAATAAAGCTGGAAGTGAAGGTGTTCCAGTCTCCAGTAACAGAGCTCAAACACGGCCTGAGCATGCAGCCAGCTTGGATAATATTCTAGTTGTCTC
AGGACAATTTATACCCAACAAATTCAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTATCGTTGGTTTGATCTCTTGGATTCTGATATTCAGCTCATTGAGTTTTCTGTGTCTTCCTTTTTCCATATTGATCTTGTTGTTTTGAGTTTCATACCATCGGA
GTCAAAGCATAGATTGCACCTCTGTACTTTCTTCATTGCGTTATTCCTTGGTGCAGGAGTCTATTTCATTGCAAGTGAATTTATTACAAAGGTAGAAATTGAAGCATATC
CTTCCATAGTTCTTGAAATTTTTAGATGGGAGGTATTTTATTCTGCCCGAAATGTAAAATCCAGTACATGCAAGAATCGATGCAGGCCTCCTGGCAGTGAGTCTTTGCCA
GAAGGAATCATTAGTAAAACATCTAACTTTGAATTTCAGCCTTTATGGGGCTCGACTTTACAAGATAAAAAGCCGAAGGTTTCAAAGAACTTGTTAGCAATTGCTGTTGG
AATCAAACAAAGACACGTAGTGTCAAAAATTATTGAAAAGTTCCCGCAAGATGATTTTGACGTGGTTCTTTTTCATTACGACGGTGTTGTGGATGAATGGAGGGATTTTG
CATGGAGTTCTCGTGCACTACATGTCTCTGCATTGAATCAGACAAAGTGGTTTGCCAAGCGTTTCTTGCATCCAGATATAGTTGCTGAATATAATTATATATTTCTTTGG
GATGAGGACCTTGGTGTCGAGTATTTTGACCCAAAACGATATGTATCAATCCTCAAGGAGGAGGGGCTCGAGATATCACAACCAGCTCTTGATCCCATTAAGTCCAAGGT
GCACCAGCCACTTACTGCTCGCAAAACAGGATCGAAAGTTCACAGGTCAGTTTTACATGGGGTTTTTATGGGTCGAAATGATGGCTCCTGTGTTCTCAAGGGCAGCATGG
AGATGCACATGGTATATGATTCAGACAGAAATTTAATGCCAAAATACCCTGGCTATTGCTTTCTTTTTGCTTTACAGAATGACTTGATCCACGCATGGGGATTAGATAGA
CAGCTTGGCTATTGTGCACAAGGCGACCGAACAAAAAAAGTCGGCGTTGTTGATGCAGAGTACATAGTTCATTTAGGTCTGCCAACGCTCGGTGCTTCCCATGACAATGT
GCTGAGTTCTGATGCTTCAACTCTTTCGTGGAGGAAAGACTCGTCAAACTTCGATGGATCGGAACCCAAAGTGGATAATAGAGTTAAAGTAAGTTACCACAATCAATCCA
AGTATTATTGTTTTTCCGATTTCGAATGGGTGAGGATACAATCTTCCATAGAAATGCAGATCTTCAAGGACCGATGGACCGATGCAGCAAAGAAGGATAGATGTTGGATC
GACCCGTATCTTTGGAATAAAGCTGGAAGTGAAGGTGTTCCAGTCTCCAGTAACAGAGCTCAAACACGGCCTGAGCATGCAGCCAGCTTGGATAATATTCTAGTTGTCTC
AGGACAATTTATACCCAACAAATTCAAATAA
Protein sequenceShow/hide protein sequence
MDYRWFDLLDSDIQLIEFSVSSFFHIDLVVLSFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLP
EGIISKTSNFEFQPLWGSTLQDKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTKWFAKRFLHPDIVAEYNYIFLW
DEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGVFMGRNDGSCVLKGSMEMHMVYDSDRNLMPKYPGYCFLFALQNDLIHAWGLDR
QLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRWTDAAKKDRCWI
DPYLWNKAGSEGVPVSSNRAQTRPEHAASLDNILVVSGQFIPNKFK