CuGenDBv2

CaUC11G218200 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC11G218200
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Protein of unknown function (DUF707)
Genome location: Ciama_Chr11:31836137..31841119
RNA-Seq Expression: CaUC11G218200
Synteny: CaUC11G218200
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR007877 - Protein of unknown function DUF707
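
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below splits such a string into its parts and derives the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; `Region` and `parse_region` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_region(text: str) -> Region:
    """Parse a location string such as 'Ciama_Chr11:31836137..31841119'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Region(seqid, start, end)


region = parse_region("Ciama_Chr11:31836137..31841119")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption, the gene spans 4983 bp on Ciama_Chr11.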


Homology
GenBank top hits (columns: hit | e value | % identity; the alignment follows each hit)
KGN49567.1 hypothetical protein Csa_018500 [Cucumis sativus] | 2.4e-166 | 73.91
Query:  NFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ
        N I S++K+RL + TFF AL LGAGVYFIA+EFITK             FRWEVFYSA+NVKSSTCKN+CRPPGSESLPEGIISKTSNFEFQPLWGS+LQ
Subjt:  NFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ

Query:  NKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
        NKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDV+LFHYDGVVDEWR+FAW SRALHVSALNQTK WFAKRFLHPDIVAEYNYIFLWDEDLGV+YFD
Subjt:  NKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD

Query:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVYDSDRNLIPKYPGY-CFLFALQNDLIHAWGLD
        PKRY+SILKEEGLEISQPALDP+KSKVHQPLTARKTGSKVHR     F+  +  G C    +      +      +    G+ C  + +QNDLIHAWGLD
Subjt:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVYDSDRNLIPKYPGY-CFLFALQNDLIHAWGLD

Query:  RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRW
        RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDN L+SDA+    +KDSSN D SEP+V+NRVK                 VR+QSS+EMQIFKDRW
Subjt:  RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRW

Query:  TDAAKKDRCWIDPY
        TDAAK DRCWIDPY
Subjt:  TDAAKKDRCWIDPY

XP_008464732.2 PREDICTED: uncharacterized protein LOC103502548, partial [Cucumis melo] | 9.8e-168 | 70.65
Query:  SFFHIALVVLNFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFE
        SF +I   +LNFI S+SK+RL + TFF A+FLGAGVYFIASEFITK           EIFRWEVFYSARNVKSSTCKN+CRPPGSESLPEGIISKTSNFE
Subjt:  SFFHIALVVLNFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFE

Query:  FQPLWGSTLQNKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLW
        FQPLWGS+LQNKKPK SKNLLA+AVGIKQRHVVS+IIEKFP DDFDV+LFHYDGVVDEWR+F+W SRALHVSALNQTK WFAKRFLHPDIVAEYNYIFLW
Subjt:  FQPLWGSTLQNKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLW

Query:  DEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVYDSDRNLIPKYPGY-CFLFALQ
        DEDLGVEYFDPKRYVSILKEEGLEISQPALDP+KSKVHQPLTARKTGSKVHR     F+  +  G C    +      +      +    G+ C  + +Q
Subjt:  DEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVYDSDRNLIPKYPGY-CFLFALQ

Query:  NDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSD-------------------ASTLSWRKDSSNFDGSEPKVDNRVKVSYH
        NDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDN LS                     +S    +KDSSN DGSEP+VDNRVK    
Subjt:  NDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSD-------------------ASTLSWRKDSSNFDGSEPKVDNRVKVSYH

Query:  NQSKYYCFSDFEWVRIQSSIEMQIFKDRWTDAAKKDRCWIDPY
                     VR+QSS+EMQIFKDRW DAAK DRCWIDPY
Subjt:  NQSKYYCFSDFEWVRIQSSIEMQIFKDRWTDAAKKDRCWIDPY

XP_038893166.1 uncharacterized protein LOC120082028 isoform X1 [Benincasa hispida] | 2.9e-172 | 75.36
Query:  NFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ
        NFI  ESK+RL LCTFF+A+FLGAGVYFIAS+FITK           EIFRWEVFYSAR+VKSSTCKN+CRPPGSESLPEGIISKTSNFEF  LWGS +Q
Subjt:  NFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ

Query:  NKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
        NK+PK+SKNLLAIAVGI+QRHVVSKIIEKFPQD FDV+LFHYDGVVDEWRDFAWSSRALHVSALNQTK WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
Subjt:  NKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD

Query:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVY-DSDRNLIPKYPGYCFLFALQNDLIHAWGLD
        PKRYVSILKEEGLEISQPALDP+KSKVHQPLTARK G KVHR     F+  +  G C    +      + +    +  +    C  + +QNDLIHAWGLD
Subjt:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVY-DSDRNLIPKYPGYCFLFALQNDLIHAWGLD

Query:  RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRW
        RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVL+SDAS  S +K+SSNFDGSEPKVDNRVK                 VR+QSS+EMQIFKDRW
Subjt:  RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRW

Query:  TDAAKKDRCWIDPY
        T+AAKKDRCWIDPY
Subjt:  TDAAKKDRCWIDPY

XP_038893167.1 uncharacterized protein LOC120082028 isoform X2 [Benincasa hispida] | 2.9e-172 | 75.36
Query:  NFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ
        NFI  ESK+RL LCTFF+A+FLGAGVYFIAS+FITK           EIFRWEVFYSAR+VKSSTCKN+CRPPGSESLPEGIISKTSNFEF  LWGS +Q
Subjt:  NFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ

Query:  NKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
        NK+PK+SKNLLAIAVGI+QRHVVSKIIEKFPQD FDV+LFHYDGVVDEWRDFAWSSRALHVSALNQTK WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
Subjt:  NKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD

Query:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVY-DSDRNLIPKYPGYCFLFALQNDLIHAWGLD
        PKRYVSILKEEGLEISQPALDP+KSKVHQPLTARK G KVHR     F+  +  G C    +      + +    +  +    C  + +QNDLIHAWGLD
Subjt:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVY-DSDRNLIPKYPGYCFLFALQNDLIHAWGLD

Query:  RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRW
        RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVL+SDAS  S +K+SSNFDGSEPKVDNRVK                 VR+QSS+EMQIFKDRW
Subjt:  RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRW

Query:  TDAAKKDRCWIDPY
        T+AAKKDRCWIDPY
Subjt:  TDAAKKDRCWIDPY

XP_038893169.1 uncharacterized protein LOC120082028 isoform X3 [Benincasa hispida] | 2.9e-172 | 75.36
Query:  NFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ
        NFI  ESK+RL LCTFF+A+FLGAGVYFIAS+FITK           EIFRWEVFYSAR+VKSSTCKN+CRPPGSESLPEGIISKTSNFEF  LWGS +Q
Subjt:  NFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ

Query:  NKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
        NK+PK+SKNLLAIAVGI+QRHVVSKIIEKFPQD FDV+LFHYDGVVDEWRDFAWSSRALHVSALNQTK WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
Subjt:  NKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD

Query:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVY-DSDRNLIPKYPGYCFLFALQNDLIHAWGLD
        PKRYVSILKEEGLEISQPALDP+KSKVHQPLTARK G KVHR     F+  +  G C    +      + +    +  +    C  + +QNDLIHAWGLD
Subjt:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVY-DSDRNLIPKYPGYCFLFALQNDLIHAWGLD

Query:  RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRW
        RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVL+SDAS  S +K+SSNFDGSEPKVDNRVK                 VR+QSS+EMQIFKDRW
Subjt:  RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRW

Query:  TDAAKKDRCWIDPY
        T+AAKKDRCWIDPY
Subjt:  TDAAKKDRCWIDPY
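
In the alignment blocks above, the unlabeled middle line carries a letter wherever the two sequences share a residue, a `+` for a conservative substitution, and a space for a mismatch. As a rough sketch of how an identity percentage can be derived from one block, the hypothetical helper below counts midline letters that match the query. It assumes the query and midline strings are already stripped of the `Query:` label and leading padding, and it pads the midline because trailing spaces are easily lost in copied text. The percentages reported per hit presumably cover the whole alignment, so a single block is not expected to reproduce them.

```python
from typing import Tuple


def identity_stats(query: str, midline: str) -> Tuple[int, int, float]:
    """Return (identical, aligned_columns, percent_identity) for one block.

    Hypothetical helper: both strings must start at the same alignment column.
    """
    if not query:
        raise ValueError("empty alignment block")
    # Restore trailing spaces that may have been trimmed from the midline.
    midline = midline.ljust(len(query))
    # A column is identical when the midline repeats the query residue.
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    columns = len(query)  # gap columns ('-') count toward alignment length
    return identical, columns, 100.0 * identical / columns


# Last block of the first GenBank hit above.
identical, columns, percent = identity_stats("TDAAKKDRCWIDPY", "TDAAK DRCWIDPY")
print(f"{identical}/{columns} identical ({percent:.2f}%)")
```

To score a full hit, the counts from every block would be summed before dividing, rather than averaging per-block percentages.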

TrEMBL top hits (columns: hit | e value | % identity; the alignment follows each hit)
A0A0A0KMF9 Uncharacterized protein | 1.2e-166 | 73.91
Query:  NFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ
        N I S++K+RL + TFF AL LGAGVYFIA+EFITK             FRWEVFYSA+NVKSSTCKN+CRPPGSESLPEGIISKTSNFEFQPLWGS+LQ
Subjt:  NFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ

Query:  NKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
        NKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDV+LFHYDGVVDEWR+FAW SRALHVSALNQTK WFAKRFLHPDIVAEYNYIFLWDEDLGV+YFD
Subjt:  NKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD

Query:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVYDSDRNLIPKYPGY-CFLFALQNDLIHAWGLD
        PKRY+SILKEEGLEISQPALDP+KSKVHQPLTARKTGSKVHR     F+  +  G C    +      +      +    G+ C  + +QNDLIHAWGLD
Subjt:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVYDSDRNLIPKYPGY-CFLFALQNDLIHAWGLD

Query:  RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRW
        RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDN L+SDA+    +KDSSN D SEP+V+NRVK                 VR+QSS+EMQIFKDRW
Subjt:  RQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRW

Query:  TDAAKKDRCWIDPY
        TDAAK DRCWIDPY
Subjt:  TDAAKKDRCWIDPY

A0A1S3CM49 uncharacterized protein LOC103502548 | 4.7e-168 | 70.65
Query:  SFFHIALVVLNFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFE
        SF +I   +LNFI S+SK+RL + TFF A+FLGAGVYFIASEFITK           EIFRWEVFYSARNVKSSTCKN+CRPPGSESLPEGIISKTSNFE
Subjt:  SFFHIALVVLNFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFE

Query:  FQPLWGSTLQNKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLW
        FQPLWGS+LQNKKPK SKNLLA+AVGIKQRHVVS+IIEKFP DDFDV+LFHYDGVVDEWR+F+W SRALHVSALNQTK WFAKRFLHPDIVAEYNYIFLW
Subjt:  FQPLWGSTLQNKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLW

Query:  DEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVYDSDRNLIPKYPGY-CFLFALQ
        DEDLGVEYFDPKRYVSILKEEGLEISQPALDP+KSKVHQPLTARKTGSKVHR     F+  +  G C    +      +      +    G+ C  + +Q
Subjt:  DEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVYDSDRNLIPKYPGY-CFLFALQ

Query:  NDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSD-------------------ASTLSWRKDSSNFDGSEPKVDNRVKVSYH
        NDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDN LS                     +S    +KDSSN DGSEP+VDNRVK    
Subjt:  NDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSD-------------------ASTLSWRKDSSNFDGSEPKVDNRVKVSYH

Query:  NQSKYYCFSDFEWVRIQSSIEMQIFKDRWTDAAKKDRCWIDPY
                     VR+QSS+EMQIFKDRW DAAK DRCWIDPY
Subjt:  NQSKYYCFSDFEWVRIQSSIEMQIFKDRWTDAAKKDRCWIDPY

A0A6J1DXM8 uncharacterized protein LOC111024438 isoform X1 | 8.6e-162 | 71.94
Query:  NFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLW-GSTL
        N I SESK RLHLCTFFIA+ LGAGVYFIASEFITK           E  RWEVFY+ARNVKSSTCK+RCRPPGSE+LPEGI+SKTSNFE QPLW GST+
Subjt:  NFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLW-GSTL

Query:  QNKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYF
        QNK P+  KNLLAIAVGIKQ+HVVSKI+EKFP+DDFDV+LFHYDG+VDEW+D AWS   +H+SALNQTK WFAKRFLHPDIVAEYNYIFLWDEDLGVE F
Subjt:  QNKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYF

Query:  DPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCV---LKGSMEMHMVYDSDRNLIPKYPGYCFLFALQNDLIHAW
        DPKRY+SILKEEGLEISQPALDP+KSKVHQ LTARKT SKVHR   +   MGR D +       G +EM         +  +    C  + +QNDLIHAW
Subjt:  DPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCV---LKGSMEMHMVYDSDRNLIPKYPGYCFLFALQNDLIHAW

Query:  GLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFK
        GLDRQLGYCAQGDRT KVGVVDAEYIVHLGLPTLG SH NVL+S+A  LS +KDSSNFD SEPKVDNRV+                 VRIQSSIEMQIFK
Subjt:  GLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFK

Query:  DRWTDAAKKDRCWIDPY
        +RWTDAAKKDRCWIDPY
Subjt:  DRWTDAAKKDRCWIDPY

A0A6J1FRZ3 uncharacterized protein LOC111447910 isoform X1 | 9.5e-161 | 70.67
Query:  NFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ
        N I  ESK+RL LCTFF+A+ LGAGVYFIASEFITK           EIFRWEVFYSARNV SS CKN+CRPPGSE LPEGI+SKTSNFEFQPLWGSTL 
Subjt:  NFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ

Query:  NKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
        NK PKVSKNLL++AVGI QRH+VSKI+EKFP+DDFDV+LFHYDGVVDEW+DF+WSSRA+HVS+LNQTK WFAKRFLHPDIVAEYNYIFLWDEDLGVE FD
Subjt:  NKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD

Query:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCV---LKGSMEMHMVYDSDRNLIPKYPGYCFLFALQNDLIHAWG
        PKRY+SILKEEGLEISQPALDP+KSKVHQ LTARKTGSKVHR   +     R D +       G +EM         +  +    C  + +QNDLIHAWG
Subjt:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCV---LKGSMEMHMVYDSDRNLIPKYPGYCFLFALQNDLIHAWG

Query:  LDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKD
        LDRQLGYCAQGDRT+KVGVVDAEYIVHLGLPTLGAS+ NVL++DA   S +K+ SNF+  E KVDNRVK                 VRIQSS+EMQIFK+
Subjt:  LDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKD

Query:  RWTDAAKKDRCWIDPY
        RWT+AAK+DRCWIDPY
Subjt:  RWTDAAKKDRCWIDPY

A0A6J1FWB2 uncharacterized protein LOC111447910 isoform X3 | 9.5e-161 | 70.67
Query:  NFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ
        N I  ESK+RL LCTFF+A+ LGAGVYFIASEFITK           EIFRWEVFYSARNV SS CKN+CRPPGSE LPEGI+SKTSNFEFQPLWGSTL 
Subjt:  NFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQ

Query:  NKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD
        NK PKVSKNLL++AVGI QRH+VSKI+EKFP+DDFDV+LFHYDGVVDEW+DF+WSSRA+HVS+LNQTK WFAKRFLHPDIVAEYNYIFLWDEDLGVE FD
Subjt:  NKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFD

Query:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCV---LKGSMEMHMVYDSDRNLIPKYPGYCFLFALQNDLIHAWG
        PKRY+SILKEEGLEISQPALDP+KSKVHQ LTARKTGSKVHR   +     R D +       G +EM         +  +    C  + +QNDLIHAWG
Subjt:  PKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCV---LKGSMEMHMVYDSDRNLIPKYPGYCFLFALQNDLIHAWG

Query:  LDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKD
        LDRQLGYCAQGDRT+KVGVVDAEYIVHLGLPTLGAS+ NVL++DA   S +K+ SNF+  E KVDNRVK                 VRIQSS+EMQIFK+
Subjt:  LDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKD

Query:  RWTDAAKKDRCWIDPY
        RWT+AAK+DRCWIDPY
Subjt:  RWTDAAKKDRCWIDPY

SwissProt top hits (columns: hit | e value | % identity; the alignment follows each hit)
No hits found
Arabidopsis top hits (columns: hit | e value | % identity; the alignment follows each hit)
AT1G11170.1 Protein of unknown function (DUF707) | 1.9e-73 | 42.94
Query:  ESLPEGIISKTSNFEFQPLW--GSTLQNKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAK
        + LP GII   S+ E +PLW  GS         ++NLLAI VG+KQ+  V  +++KF   +F +VLFHYDG +D+W D  WSS+++H+ A NQTK WFAK
Subjt:  ESLPEGIISKTSNFEFQPLW--GSTLQNKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAK

Query:  RFLHPDIVAEYNYIFLWDEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVY-DSD
        RFLHPD+V+ Y+YIFLWDEDLGVE F+P+RY+ I+K  GLEISQPALD   +++H  +T R    K HR V    ++ R    C    S      + +  
Subjt:  RFLHPDIVAEYNYIFLWDEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVY-DSD

Query:  RNLIPKYPGYCFLFALQNDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQ
          +  K    C    +QNDL+H WG+D +LGYCAQGDRTK VG+VD+EYI+H G+ TLG S      +     + R+  + FD                 
Subjt:  RNLIPKYPGYCFLFALQNDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQ

Query:  SKYYCFSDFEWVRIQSSIEMQIFKDRWTDAAKKDRCWIDP
                   +R QS+ E+Q FK+RW+ A ++D  WIDP
Subjt:  SKYYCFSDFEWVRIQSSIEMQIFKDRWTDAAKKDRCWIDP

AT1G61240.1 Protein of unknown function (DUF707) | 2.2e-72 | 42.77
Query:  ESLPEGIISKTSNFEFQPLW-GSTLQNKKPKV-SKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAK
        + LP GI+   S+ E +PLW  S+L++K  ++ ++NLLA+ VG+KQ+  V  +++KF   +F V+LFHYDG +D+W D  WSS+A+H+ A NQTK WFAK
Subjt:  ESLPEGIISKTSNFEFQPLW-GSTLQNKKPKV-SKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAK

Query:  RFLHPDIVAEYNYIFLWDEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVYDSDR
        RFLHPDIV+ Y+Y+FLWDEDLGVE F+P++Y+ I+K  GLEISQPAL P  ++VH  +T R      HR V      G    S   +G      V +   
Subjt:  RFLHPDIVAEYNYIFLWDEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVYDSDR

Query:  NLIPKYPGYCFLFALQNDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGAS-HDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQ
         +  +   +C    +QNDL+H WG+D +LGYCAQGDR+KKVG+VD+EYI H G+ TLG S + +  +S  S ++ R+ S+ FD                 
Subjt:  NLIPKYPGYCFLFALQNDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGAS-HDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQ

Query:  SKYYCFSDFEWVRIQSSIEMQIFKDRWTDAAKKDRCWID
                   +R QS+ E+Q FK+RW  A  +D+ W++
Subjt:  SKYYCFSDFEWVRIQSSIEMQIFKDRWTDAAKKDRCWID

AT4G12840.1 Protein of unknown function (DUF707) | 4.9e-93 | 45.76
Query:  SESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQNKKP
        ++ +  L L   F  +F  A ++ I + FIT    E        I  W         K   CK + RPPGSE+LP GI++ TS+ E +PLWG+  ++KKP
Subjt:  SESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQNKKP

Query:  KVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFDPKRY
        K S  LLA+AVGI+Q+  V+KI++KFP  +F V+LFHYDG VDEW++F WS  A+H+S +NQTK WFAKRFLHPDIV+ Y+YIFLWDEDLGV++FD +RY
Subjt:  KVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFDPKRY

Query:  VSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHR---SVLHGFFMGRNDGSCVLKGSMEMHMVYDSDRNLIPKYPGYCFLFALQNDLIHAWGLDRQ
        VSI+KEE LEISQPALDP  S+VH  LT+R   S+VHR    V+       N       G +EM         +  +    C    +QNDL H WG+D Q
Subjt:  VSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHR---SVLHGFFMGRNDGSCVLKGSMEMHMVYDSDRNLIPKYPGYCFLFALQNDLIHAWGLDRQ

Query:  LGYCAQGDRTKKVGVVDAEYIVHLGLPTL-GASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRWT
        LGYCAQGDRTK +G+VD+EYI+H+GLPTL G S +N   +D+  L   K     D S      R +                 VR Q+ +E++ FK RW 
Subjt:  LGYCAQGDRTKKVGVVDAEYIVHLGLPTL-GASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRWT

Query:  DAAKKDRCWIDPY
        +A K D CWID +
Subjt:  DAAKKDRCWIDPY

AT4G12840.2 Protein of unknown function (DUF707) | 4.9e-93 | 45.76
Query:  SESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQNKKP
        ++ +  L L   F  +F  A ++ I + FIT    E        I  W         K   CK + RPPGSE+LP GI++ TS+ E +PLWG+  ++KKP
Subjt:  SESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWGSTLQNKKP

Query:  KVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFDPKRY
        K S  LLA+AVGI+Q+  V+KI++KFP  +F V+LFHYDG VDEW++F WS  A+H+S +NQTK WFAKRFLHPDIV+ Y+YIFLWDEDLGV++FD +RY
Subjt:  KVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVEYFDPKRY

Query:  VSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHR---SVLHGFFMGRNDGSCVLKGSMEMHMVYDSDRNLIPKYPGYCFLFALQNDLIHAWGLDRQ
        VSI+KEE LEISQPALDP  S+VH  LT+R   S+VHR    V+       N       G +EM         +  +    C    +QNDL H WG+D Q
Subjt:  VSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHR---SVLHGFFMGRNDGSCVLKGSMEMHMVYDSDRNLIPKYPGYCFLFALQNDLIHAWGLDRQ

Query:  LGYCAQGDRTKKVGVVDAEYIVHLGLPTL-GASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRWT
        LGYCAQGDRTK +G+VD+EYI+H+GLPTL G S +N   +D+  L   K     D S      R +                 VR Q+ +E++ FK RW 
Subjt:  LGYCAQGDRTKKVGVVDAEYIVHLGLPTL-GASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSIEMQIFKDRWT

Query:  DAAKKDRCWIDPY
        +A K D CWID +
Subjt:  DAAKKDRCWIDPY

AT4G18530.1 Protein of unknown function (DUF707) | 6.9e-103 | 48.58
Query:  SKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARN--------VKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWG-S
        S +R  LC+  I   L  G YFI + ++ K   E       ++ +WE+     N          +STCKN  +P G+E+LP+GII KTSN E Q LW   
Subjt:  SKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARN--------VKSSTCKNRCRPPGSESLPEGIISKTSNFEFQPLWG-S

Query:  TLQNKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVE
          + ++P  S +LLA+AVGIKQ+ +V+K+I+KFP  DF V+LFHYDGVVD+W+ + W++ A+HVS +NQTK WFAKRFLHPDIVAEY YIFLWDEDLGV 
Subjt:  TLQNKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTK-WFAKRFLHPDIVAEYNYIFLWDEDLGVE

Query:  YFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDG-----SCVLKGSMEMHMVYDSDRNLIPKYPGYCFLFALQNDL
        +F+P+RY+SI+KEEGLEISQPALD  KS+VH P+TAR+  SKVHR +      GR D       C+  G +EM         +  +    C  + +QNDL
Subjt:  YFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDG-----SCVLKGSMEMHMVYDSDRNLIPKYPGYCFLFALQNDL

Query:  IHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPK-VDNRVKVSYHNQSKYYCFSDFEWVRIQSSIE
        IHAWGLD QLGYCAQGDR K VGVVDAEYI+H GLPTLG     V+ + +S L    DS + +  E + VDNR +                 VR++S +E
Subjt:  IHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPK-VDNRVKVSYHNQSKYYCFSDFEWVRIQSSIE

Query:  MQIFKDRWTDAAKKDRCWIDPY
        M+ FK+RW  A + D CW+DPY
Subjt:  MQIFKDRWTDAAKKDRCWIDPY


Sequences
CDS sequence
ATGGATTATCGTTGGTTTGATCTCTTGGATTCTGATATTCAGCTCATTGAGTTTTCTGTGTCTTCCTTTTTCCATATTGCTCTTGTTGTTTTGAATTTCATACCA
TCGGAGTCAAAGCATAGATTGCACCTCTGTACTTTCTTCATTGCGTTATTCCTTGGTGCAGGAGTCTATTTCATTGCAAGTGAATTTATTACAAAGGTAGAAATT
GAAGCATATCCTTCCATAGTTCTTGAAATTTTTAGATGGGAGGTATTTTATTCTGCCCGAAATGTAAAATCCAGTACATGCAAGAATCGATGCAGGCCTCCTGGG
AGCGAGTCTTTGCCAGAAGGAATCATTAGTAAAACATCTAACTTTGAATTTCAGCCTTTATGGGGCTCGACTTTACAAAATAAAAAGCCGAAGGTTTCAAAGAAC
TTGTTAGCAATTGCTGTTGGAATCAAACAAAGACACGTAGTGTCAAAAATTATTGAAAAGTTCCCGCAAGATGATTTTGACGTGGTTCTTTTTCATTACGACGGT
GTTGTGGATGAATGGAGGGATTTTGCATGGAGTTCTCGTGCACTACATGTCTCTGCATTGAATCAGACAAAGTGGTTTGCCAAGCGTTTCTTGCATCCAGATATA
GTTGCTGAATATAATTATATATTTCTTTGGGATGAGGACCTTGGTGTCGAGTATTTTGACCCAAAACGATATGTATCAATCCTCAAGGAGGAGGGGCTCGAGATA
TCACAACCAGCTCTCGATCCCATTAAGTCCAAGGTGCACCAGCCACTTACTGCACGCAAAACAGGATCGAAAGTTCACAGGTCAGTTTTACATGGGTTTTTTATG
GGTCGAAATGATGGCTCCTGTGTTCTCAAGGGCAGCATGGAGATGCACATGGTATATGATTCAGACAGAAATTTAATTCCAAAATACCCTGGCTATTGCTTTCTT
TTTGCTTTACAGAATGACTTGATCCACGCATGGGGATTAGATAGACAGCTTGGCTATTGTGCACAAGGCGACCGAACAAAAAAAGTCGGCGTTGTTGATGCAGAG
TACATAGTTCATTTAGGTCTGCCAACGCTCGGTGCTTCCCATGACAATGTGCTGAGTTCTGATGCTTCAACTCTTTCGTGGAGGAAAGACTCGTCAAACTTCGAT
GGATCGGAACCCAAAGTGGATAATAGAGTTAAAGTAAGTTACCACAATCAATCCAAGTATTATTGTTTTTCCGATTTCGAATGGGTGAGGATACAATCTTCCATA
GAAATGCAGATCTTCAAGGACCGATGGACCGATGCAGCAAAGAAGGATAGATGTTGGATCGACCCGTATCTTTGGAATAAAGCTGGAAGTGAAGGTGTTCCAGTC
TCCATTAACAGAGCTCAAACACGGCCTGAGCATGCAGCCAGCTTGGATATTCTAGTTGTCTCAGGACAATTTATACCCAACAAATTCAAATAA
mRNA sequence
ATGGATTATCGTTGGTTTGATCTCTTGGATTCTGATATTCAGCTCATTGAGTTTTCTGTGTCTTCCTTTTTCCATATTGCTCTTGTTGTTTTGAATTTCATACCA
TCGGAGTCAAAGCATAGATTGCACCTCTGTACTTTCTTCATTGCGTTATTCCTTGGTGCAGGAGTCTATTTCATTGCAAGTGAATTTATTACAAAGGTAGAAATT
GAAGCATATCCTTCCATAGTTCTTGAAATTTTTAGATGGGAGGTATTTTATTCTGCCCGAAATGTAAAATCCAGTACATGCAAGAATCGATGCAGGCCTCCTGGG
AGCGAGTCTTTGCCAGAAGGAATCATTAGTAAAACATCTAACTTTGAATTTCAGCCTTTATGGGGCTCGACTTTACAAAATAAAAAGCCGAAGGTTTCAAAGAAC
TTGTTAGCAATTGCTGTTGGAATCAAACAAAGACACGTAGTGTCAAAAATTATTGAAAAGTTCCCGCAAGATGATTTTGACGTGGTTCTTTTTCATTACGACGGT
GTTGTGGATGAATGGAGGGATTTTGCATGGAGTTCTCGTGCACTACATGTCTCTGCATTGAATCAGACAAAGTGGTTTGCCAAGCGTTTCTTGCATCCAGATATA
GTTGCTGAATATAATTATATATTTCTTTGGGATGAGGACCTTGGTGTCGAGTATTTTGACCCAAAACGATATGTATCAATCCTCAAGGAGGAGGGGCTCGAGATA
TCACAACCAGCTCTCGATCCCATTAAGTCCAAGGTGCACCAGCCACTTACTGCACGCAAAACAGGATCGAAAGTTCACAGGTCAGTTTTACATGGGTTTTTTATG
GGTCGAAATGATGGCTCCTGTGTTCTCAAGGGCAGCATGGAGATGCACATGGTATATGATTCAGACAGAAATTTAATTCCAAAATACCCTGGCTATTGCTTTCTT
TTTGCTTTACAGAATGACTTGATCCACGCATGGGGATTAGATAGACAGCTTGGCTATTGTGCACAAGGCGACCGAACAAAAAAAGTCGGCGTTGTTGATGCAGAG
TACATAGTTCATTTAGGTCTGCCAACGCTCGGTGCTTCCCATGACAATGTGCTGAGTTCTGATGCTTCAACTCTTTCGTGGAGGAAAGACTCGTCAAACTTCGAT
GGATCGGAACCCAAAGTGGATAATAGAGTTAAAGTAAGTTACCACAATCAATCCAAGTATTATTGTTTTTCCGATTTCGAATGGGTGAGGATACAATCTTCCATA
GAAATGCAGATCTTCAAGGACCGATGGACCGATGCAGCAAAGAAGGATAGATGTTGGATCGACCCGTATCTTTGGAATAAAGCTGGAAGTGAAGGTGTTCCAGTC
TCCATTAACAGAGCTCAAACACGGCCTGAGCATGCAGCCAGCTTGGATATTCTAGTTGTCTCAGGACAATTTATACCCAACAAATTCAAATAA
Protein sequence
MDYRWFDLLDSDIQLIEFSVSSFFHIALVVLNFIPSESKHRLHLCTFFIALFLGAGVYFIASEFITKVEIEAYPSIVLEIFRWEVFYSARNVKSSTCKNRCRPPG
SESLPEGIISKTSNFEFQPLWGSTLQNKKPKVSKNLLAIAVGIKQRHVVSKIIEKFPQDDFDVVLFHYDGVVDEWRDFAWSSRALHVSALNQTKWFAKRFLHPDI
VAEYNYIFLWDEDLGVEYFDPKRYVSILKEEGLEISQPALDPIKSKVHQPLTARKTGSKVHRSVLHGFFMGRNDGSCVLKGSMEMHMVYDSDRNLIPKYPGYCFL
FALQNDLIHAWGLDRQLGYCAQGDRTKKVGVVDAEYIVHLGLPTLGASHDNVLSSDASTLSWRKDSSNFDGSEPKVDNRVKVSYHNQSKYYCFSDFEWVRIQSSI
EMQIFKDRWTDAAKKDRCWIDPYLWNKAGSEGVPVSINRAQTRPEHAASLDILVVSGQFIPNKFK
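
The protein sequence above is the conceptual translation of the CDS. As a minimal sketch of that relationship, the snippet below translates a nucleotide string with the standard genetic code and checks the opening and closing codons of the CDS listed in this record. It assumes the standard (nuclear) code applies to this gene; `translate` is a hypothetical helper written for illustration, not a database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a CDS in frame 1; unknown codons become 'X'."""
    # Drop line breaks and spaces so multi-line sequences can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 21 and last 12 nucleotides of the CDS above.
print(translate("ATGGATTATCGTTGGTTTGAT"))  # start of the protein
print(translate("AAATTCAAATAA"))           # end of the protein, stop removed
```

Passing the full CDS, line breaks included, to `translate` gives a string that can be compared directly with the protein sequence after its own line breaks are removed.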