; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0004473 (gene) of Chayote v1 genome

Gene IDSed0004473
OrganismSechium edule (Chayote v1)
DescriptionProtein of unknown function (DUF688)
Genome locationLG02:52421701..52422757
RNA-Seq ExpressionSed0004473
SyntenySed0004473
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019077.1 Exportin-2, partial [Cucurbita argyrosperma subsp. argyrosperma]5.5e-8773.71Show/hide
Query:  SQREKQSKKRGEMGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTP-KNMATQRKCLELPPRLLLMDPKIKKH
        + R+   ++ GEMGSEAK W+NSKVPKLPLLA AAMESPDRSGM TPPI  SVSVPFRWEE+PGKPRFCFRTP  + ATQ KCLELPPRLLL+DPKI K 
Subjt:  SQREKQSKKRGEMGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTP-KNMATQRKCLELPPRLLLMDPKIKKH

Query:  PPPINAQKGMFQLPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEKENESEGGIVADKSSSRLKIAETLKEGS
        PPPI A KG+FQ PKQ  GSF +DK QLGAMVLRKRGV  + EWF WL K SF  K EV SA+GSVFP SLEKENE   G+VA++SSS +K+AET KEGS
Subjt:  PPPINAQKGMFQLPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEKENESEGGIVADKSSSRLKIAETLKEGS

Query:  FSSCLVQTRTEFWGSIGEGLKQINIPWKSRKA
        FSSCLV+ +TEFWGSIGEGLKQIN+PWKS+KA
Subjt:  FSSCLVQTRTEFWGSIGEGLKQINIPWKSRKA

XP_022924712.1 uncharacterized protein At4g00950-like [Cucurbita moschata]2.3e-8576.36Show/hide
Query:  MGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTP-KNMATQRKCLELPPRLLLMDPKIKKHPPPINAQKGMFQ
        MGSEAK W+NSKVPKLPLLA AAMESPDRSGM TPPI  SVSVPFRWEE+PGKPRFCFRTP  + ATQ KCLELPPRLLL+DPKI K PPPI A KG+FQ
Subjt:  MGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTP-KNMATQRKCLELPPRLLLMDPKIKKHPPPINAQKGMFQ

Query:  LPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRTEF
         PKQ  GSF +DK QLGAMVLRKRGV  + EWF WL K SF  K EV SA+GSVFP SLEKENE   G+VA++SSS +K+AET KEGSFSSCLV+ +TEF
Subjt:  LPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRTEF

Query:  WGSIGEGLKQINIPWKSRKA
        WGSIGEGLKQIN+PWKS+KA
Subjt:  WGSIGEGLKQINIPWKSRKA

XP_022980847.1 uncharacterized protein At4g00950 [Cucurbita maxima]2.9e-8877.27Show/hide
Query:  MGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTP-KNMATQRKCLELPPRLLLMDPKIKKHPPPINAQKGMFQ
        MGSEAK W+NS+VPKLPLLA AAMESPDRSGM TPPI  SVSVPFRWEE+PGKPRFCFRTP  + ATQ KCLELPPRLLL+DPKI K PPPI A KG+FQ
Subjt:  MGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTP-KNMATQRKCLELPPRLLLMDPKIKKHPPPINAQKGMFQ

Query:  LPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRTEF
        LPKQ CGSF +DK QLGAMVLRKRGV  +KEWF WL K SF  K EV SA GSVFP SLEKENE   G+VA++SSS +K+AET KEGSFSSCLV+ +TEF
Subjt:  LPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRTEF

Query:  WGSIGEGLKQINIPWKSRKA
        WGSIGEGLKQIN+PWKS+KA
Subjt:  WGSIGEGLKQINIPWKSRKA

XP_023527385.1 uncharacterized protein At4g00950-like [Cucurbita pepo subsp. pepo]9.3e-8775.91Show/hide
Query:  MGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTP-KNMATQRKCLELPPRLLLMDPKIKKHPPPINAQKGMFQ
        MGSEAK W+NS+VPKLPLLA AAMESPDRSGM TPPI  SVSVPFRWEE+PGKPRFCFRTP  + ATQ KCLELPPRLLL+DPKI K PPPI A KG+FQ
Subjt:  MGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTP-KNMATQRKCLELPPRLLLMDPKIKKHPPPINAQKGMFQ

Query:  LPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRTEF
         PKQ CGSF +DK QLGAMVLRKRGV  +KEWF WL K SF  K EV SA+GSVFP SLEKENE    +VA++SSS +K+AET KEGSFSSCLV+ +TEF
Subjt:  LPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRTEF

Query:  WGSIGEGLKQINIPWKSRKA
        WGSIGEG KQIN+PWKS+KA
Subjt:  WGSIGEGLKQINIPWKSRKA

XP_038896089.1 uncharacterized protein At4g00950-like [Benincasa hispida]2.2e-8068.67Show/hide
Query:  SVHSQREKQSKKRGEMGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTPKNMATQRKCLELPPRLLLMDPKIK
        SVH+    Q +  GEMGSE K W NS+VPKLPLLA AAMESPDRSGM TPP+  SVSVPFRWEE+PGKPR CF +P N+ TQ   LELPPRL LMDPKI 
Subjt:  SVHSQREKQSKKRGEMGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTPKNMATQRKCLELPPRLLLMDPKIK

Query:  KHPPPINAQKGMFQLPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFPSLEKENESEGGIVADKSSSRLKIAETLKEG
        K PPPI +QKG FQ PKQC GSF F K QLGAMVLRKRGV  +KEWF WL K +F  K EV SAFGSVFPS       E GIVA++SSSR+K+A T K G
Subjt:  KHPPPINAQKGMFQLPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFPSLEKENESEGGIVADKSSSRLKIAETLKEG

Query:  SFSSCLVQTRTEFWGSIGEGLKQINIPWKSRKA
        +FSSCL+Q + +FWGSIGEG KQINIPWKS+KA
Subjt:  SFSSCLVQTRTEFWGSIGEGLKQINIPWKSRKA

TrEMBL top hitse value%identityAlignment
A0A0A0L338 Uncharacterized protein1.4e-7263.71Show/hide
Query:  SQREKQSKKRGEMGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRT---PKNMATQRKCLELPPRLLLMDPKIK
        ++RE++ ++R +MGSEAK  ++ +VPKLPLLA  AMESPDRSGM TPP+ +SVSVPFRWEE+PGKPRFCF +   P    TQ   LELPPRLLLMDPKI 
Subjt:  SQREKQSKKRGEMGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRT---PKNMATQRKCLELPPRLLLMDPKIK

Query:  KHPPPINAQKGMFQLPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEKENESEGGIVADKSSSRLKIAETLKE
        K  PPI +QKG FQ  K CC SF FDK QLGAMVLRKRGV  +KEWF WL K SF SK EV S +G+VFP SL+KE          KSSSR+K+A   K 
Subjt:  KHPPPINAQKGMFQLPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEKENESEGGIVADKSSSRLKIAETLKE

Query:  GSFSSCLVQTRTEFW---GSIGEGLKQINIPWKSRKA
        GSFSSC VQ +TEFW   G+IGEG KQINIPWKS++A
Subjt:  GSFSSCLVQTRTEFW---GSIGEGLKQINIPWKSRKA

A0A5D3DBI9 Uncharacterized protein5.0e-7065.32Show/hide
Query:  MGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRT---PKNMATQRKCLELPPRLLLMDPKIKKHPPPINAQKGM
        MGSEAK  +N +VPKLPLLA  AMESPDRSGM TPPI +SVSVPFRWEE+PGKPRFCF +   P    TQ   LELPPRLLLMDPKI K  P I +QKG 
Subjt:  MGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRT---PKNMATQRKCLELPPRLLLMDPKIKKHPPPINAQKGM

Query:  FQLPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRT
        FQ  K CC SF F K QLGAMVLRKRG+  +KEWF W RK +F  K EV S + +VFP SL+KE          +SSSR+K+A T K GSFSSC VQ +T
Subjt:  FQLPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRT

Query:  EFWGSIGEGLKQINIPWKSRKA
        EFWG+IGEG KQINIPWKS+KA
Subjt:  EFWGSIGEGLKQINIPWKSRKA

A0A6J1D0Z4 uncharacterized protein At4g009504.1e-8071.68Show/hide
Query:  MGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTP--KNMATQRKCLELPPRLLLMDPKIKKHPPPINAQ--KG
        MGSEAK W+NS++PKLPLLA  AMESPDRSGMQTPPI++SVSVPFRWEE+PGKPRFCF TP  K    QRKCLELPPRLLLM+PKI K PPPI A   KG
Subjt:  MGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTP--KNMATQRKCLELPPRLLLMDPKIKKHPPPINAQ--KG

Query:  MFQLPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEK----ENESEGGIVADKSSSRLKIAETLKEGSFSSCL
        MFQ   QCCGSF FDK+QLGAMVLRKRGV  +KEWF WL   SF  + ++ SA GSVFP SLEK    E+E+EGG     SSSR+K+A+T  EGSFSSCL
Subjt:  MFQLPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEK----ENESEGGIVADKSSSRLKIAETLKEGSFSSCL

Query:  VQTRTEFWGSIGEGLKQINIPWKSRK
        VQ +TEFWGSIGEGLKQINIPWK +K
Subjt:  VQTRTEFWGSIGEGLKQINIPWKSRK

A0A6J1E9R6 uncharacterized protein At4g00950-like1.1e-8576.36Show/hide
Query:  MGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTP-KNMATQRKCLELPPRLLLMDPKIKKHPPPINAQKGMFQ
        MGSEAK W+NSKVPKLPLLA AAMESPDRSGM TPPI  SVSVPFRWEE+PGKPRFCFRTP  + ATQ KCLELPPRLLL+DPKI K PPPI A KG+FQ
Subjt:  MGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTP-KNMATQRKCLELPPRLLLMDPKIKKHPPPINAQKGMFQ

Query:  LPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRTEF
         PKQ  GSF +DK QLGAMVLRKRGV  + EWF WL K SF  K EV SA+GSVFP SLEKENE   G+VA++SSS +K+AET KEGSFSSCLV+ +TEF
Subjt:  LPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRTEF

Query:  WGSIGEGLKQINIPWKSRKA
        WGSIGEGLKQIN+PWKS+KA
Subjt:  WGSIGEGLKQINIPWKSRKA

A0A6J1ISE2 uncharacterized protein At4g009501.4e-8877.27Show/hide
Query:  MGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTP-KNMATQRKCLELPPRLLLMDPKIKKHPPPINAQKGMFQ
        MGSEAK W+NS+VPKLPLLA AAMESPDRSGM TPPI  SVSVPFRWEE+PGKPRFCFRTP  + ATQ KCLELPPRLLL+DPKI K PPPI A KG+FQ
Subjt:  MGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTP-KNMATQRKCLELPPRLLLMDPKIKKHPPPINAQKGMFQ

Query:  LPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRTEF
        LPKQ CGSF +DK QLGAMVLRKRGV  +KEWF WL K SF  K EV SA GSVFP SLEKENE   G+VA++SSS +K+AET KEGSFSSCLV+ +TEF
Subjt:  LPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFP-SLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRTEF

Query:  WGSIGEGLKQINIPWKSRKA
        WGSIGEGLKQIN+PWKS+KA
Subjt:  WGSIGEGLKQINIPWKSRKA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G46535.1 unknown protein1.1e-0827.46Show/hide
Query:  SPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTPKNMATQRKCLELPPRLLLMDPKIKKHPPPINAQKGMFQLPKQCCGSFSF-DKKQLGAMVLRKRG
        SP    +   PI    SVPF WE++PGKP+     P    +  KCL+LPPRLLL              +     LP++  G   F  +K  G +V+R   
Subjt:  SPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTPKNMATQRKCLELPPRLLLMDPKIKKHPPPINAQKGMFQLPKQCCGSFSF-DKKQLGAMVLRKRG

Query:  VFTDKEWFFWLRKGSFWSKREVRSAFGSVFPSLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRTEFWGSIGEGLKQINIPWKSRK
        VF                                 EN+  G  +   + + +KI +  + GS+        + FWGS+ +GLK + +PWK++K
Subjt:  VFTDKEWFFWLRKGSFWSKREVRSAFGSVFPSLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRTEFWGSIGEGLKQINIPWKSRK

AT3G61840.1 Protein of unknown function (DUF688)8.6e-0634.44Show/hide
Query:  LAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTPKNMATQRKCLELPPRLLLMD-----PKIKKHPPPINAQKGMFQLPKQ
        ++ + M SPD S        +  S+PF WEE+PGKP+   R P    +  KCL+LPPR+L  D     P+   H   +   K  F+  K+
Subjt:  LAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTPKNMATQRKCLELPPRLLLMD-----PKIKKHPPPINAQKGMFQLPKQ

AT4G27810.1 unknown protein2.2e-0935Show/hide
Query:  PKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTPKNMATQR----------KCLELPPRLLLMDPKIKKHPPPINAQKGMFQLPKQ
        PKLPL +     + D  G+ TPP+  + SVPF WEE PGKPR         + Q           +CLELPPRL          P P     G + +P++
Subjt:  PKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTPKNMATQR----------KCLELPPRLLLMDPKIKKHPPPINAQKGMFQLPKQ

AT5G53030.1 unknown protein9.5e-1330.24Show/hide
Query:  GMQTPPIQASVSVPFRWEEKPGKPRFCFRTPK-NMATQRKCLELPPRLLLM-DPKIKKHPPPINAQKGMFQLPKQCCGSFSFDKKQLGAMVLRKRGV---
        G+ TPP+  + SVPF WEE PGKPR   +  + N     + LELPPRL+L  +      P P     G + L ++     S    +  A++ + RGV   
Subjt:  GMQTPPIQASVSVPFRWEEKPGKPRFCFRTPK-NMATQRKCLELPPRLLLM-DPKIKKHPPPINAQKGMFQLPKQCCGSFSFDKKQLGAMVLRKRGV---

Query:  FTDKEWFF---WLRKGSFWSKREVR------SAFGSVFPSLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRTEFW----GSIGEGLKQINIP
          +KE        R GSF + +EV       S F        ++    GG+      +++K+   +K+GSF +    T+++FW      + EG KQ+ IP
Subjt:  FTDKEWFF---WLRKGSFWSKREVR------SAFGSVFPSLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRTEFW----GSIGEGLKQINIP

Query:  WKSRK
        WK ++
Subjt:  WKSRK

AT5G53030.2 unknown protein9.8e-1029.73Show/hide
Query:  GMQTPPIQASVSVPFRWEEKPGKPRFCFRTPK-NMATQRKCLELPPRLLLM-DPKIKKHPPPINAQKGMFQLPKQCCGSFSFDKKQLGAMVLRKRGV---
        G+ TPP+  + SVPF WEE PGKPR   +  + N     + LELPPRL+L  +      P P     G + L ++     S    +  A++ + RGV   
Subjt:  GMQTPPIQASVSVPFRWEEKPGKPRFCFRTPK-NMATQRKCLELPPRLLLM-DPKIKKHPPPINAQKGMFQLPKQCCGSFSFDKKQLGAMVLRKRGV---

Query:  FTDKEWFF---WLRKGSFWSKREVR------SAFGSVFPSLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRTEFWGS
          +KE        R GSF + +EV       S F        ++    GG+      +++K+   +K+GSF +    T+++FW S
Subjt:  FTDKEWFF---WLRKGSFWSKREVR------SAFGSVFPSLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRTEFWGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACCACAACTTGTCATTTCAGTACACTCACAGAGAGAGAAACAGAGCAAAAAAAGAGGGGAAATGGGAAGTGAAGCAAAGCAATGGCAGAACTCAAAAGTACCAAA
GCTTCCATTACTTGCATTTGCAGCAATGGAGTCACCAGATCGCTCTGGAATGCAAACCCCACCAATCCAGGCTTCAGTTTCAGTCCCATTTCGATGGGAAGAGAAGCCTG
GAAAGCCCAGGTTCTGTTTCAGAACCCCCAAGAACATGGCCACTCAAAGAAAGTGCCTTGAACTTCCACCAAGATTGCTCCTTATGGACCCCAAAATTAAAAAACATCCT
CCTCCCATCAATGCCCAAAAGGGTATGTTTCAGCTTCCTAAGCAGTGCTGTGGCTCATTCAGTTTTGACAAAAAGCAGCTTGGAGCCATGGTTTTGAGAAAGAGAGGGGT
TTTCACAGACAAGGAGTGGTTTTTTTGGTTGAGGAAAGGAAGTTTTTGGAGCAAAAGAGAAGTTAGATCTGCCTTTGGAAGTGTTTTTCCTTCTTTGGAGAAGGAAAATG
AGAGTGAAGGTGGCATTGTTGCAGATAAAAGCAGTTCAAGGTTGAAGATTGCAGAGACACTCAAAGAAGGGAGCTTTTCTTCATGCCTTGTCCAAACCAGGACTGAGTTT
TGGGGAAGCATAGGAGAAGGCTTGAAACAGATCAACATTCCATGGAAGAGCAGAAAAGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGCCACCACAACTTGTCATTTCAGTACACTCACAGAGAGAGAAACAGAGCAAAAAAAGAGGGGAAATGGGAAGTGAAGCAAAGCAATGGCAGAACTCAAAAGTACCAAA
GCTTCCATTACTTGCATTTGCAGCAATGGAGTCACCAGATCGCTCTGGAATGCAAACCCCACCAATCCAGGCTTCAGTTTCAGTCCCATTTCGATGGGAAGAGAAGCCTG
GAAAGCCCAGGTTCTGTTTCAGAACCCCCAAGAACATGGCCACTCAAAGAAAGTGCCTTGAACTTCCACCAAGATTGCTCCTTATGGACCCCAAAATTAAAAAACATCCT
CCTCCCATCAATGCCCAAAAGGGTATGTTTCAGCTTCCTAAGCAGTGCTGTGGCTCATTCAGTTTTGACAAAAAGCAGCTTGGAGCCATGGTTTTGAGAAAGAGAGGGGT
TTTCACAGACAAGGAGTGGTTTTTTTGGTTGAGGAAAGGAAGTTTTTGGAGCAAAAGAGAAGTTAGATCTGCCTTTGGAAGTGTTTTTCCTTCTTTGGAGAAGGAAAATG
AGAGTGAAGGTGGCATTGTTGCAGATAAAAGCAGTTCAAGGTTGAAGATTGCAGAGACACTCAAAGAAGGGAGCTTTTCTTCATGCCTTGTCCAAACCAGGACTGAGTTT
TGGGGAAGCATAGGAGAAGGCTTGAAACAGATCAACATTCCATGGAAGAGCAGAAAAGCATAAGAAAGAGCACTGGTAATCGTCAGAATGTTCATGTATAAACCGAGTTA
AGCTTTCTTGTAATGTAAATGTATCAAATAATGGTAAAATGGAGAGGGCCAGATGCTTCAGCTAATAGCTGACTGAGCTCGTATCATATTTGGCAGTCTGTATAGCAATA
ATGTTAGGAC
Protein sequenceShow/hide protein sequence
MPPQLVISVHSQREKQSKKRGEMGSEAKQWQNSKVPKLPLLAFAAMESPDRSGMQTPPIQASVSVPFRWEEKPGKPRFCFRTPKNMATQRKCLELPPRLLLMDPKIKKHP
PPINAQKGMFQLPKQCCGSFSFDKKQLGAMVLRKRGVFTDKEWFFWLRKGSFWSKREVRSAFGSVFPSLEKENESEGGIVADKSSSRLKIAETLKEGSFSSCLVQTRTEF
WGSIGEGLKQINIPWKSRKA