CuGenDBv2

CmoCh04G027690 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh04G027690
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Protein of unknown function (DUF506)
Genome location: Cmo_Chr04:20006460..20007987
RNA-Seq Expression: CmoCh04G027690
Synteny: CmoCh04G027690
Gene Ontology terms: NA
InterPro domains: IPR006502 - Protein of unknown function PDDEXK-like
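
The genome location field packs a sequence name and a coordinate range into one string. As a minimal sketch, assuming the `seqid:start..end` layout seen above and 1-based inclusive coordinates (an assumption; this page does not state the convention), the field can be split as follows. `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Cmo_Chr04:20006460..20007987")
print(seqid, start, end, span_length(start, end))
```

Under that coordinate assumption the locus spans 1,528 bp on Cmo_Chr04.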


Homology
GenBank top hits | e value | % identity
KAG7033056.1 hypothetical protein SDJN02_07109 [Cucurbita argyrosperma subsp. argyrosperma] | 9.1e-107 | 98.99
Query:  MDCRVCVAGGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE
        MDCRVCVAGGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE
Subjt:  MDCRVCVAGGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE

Query:  KALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL
        KALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYV  L
Subjt:  KALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL

XP_004142760.2 uncharacterized protein LOC101214727 [Cucumis sativus] | 4.2e-96 | 89.5
Query:  MDCRVCVAGGDLWVKLSG-IGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMN
        MDCRVCVAGGDLWVK+   +GG GQMGGFSHESEHDLALMVSDFLENGSGG +S CSSDSDSGVSDLAHLA+KI+FYKNPVSQYESDLLSVVHSLTLSMN
Subjt:  MDCRVCVAGGDLWVKLSG-IGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMN

Query:  EKALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL
        EK LN+NK+GPCNASCIRF LVKLLR SGYDAAVCTTRWQGAGKVPGGD+EYIDVVNYT+GSSERLIVDIDFRSHFEIARAVESYDRIL+SLPVIYV  L
Subjt:  EKALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL

XP_022952529.1 uncharacterized protein LOC111455189 [Cucurbita moschata] | 9.1e-107 | 98.99
Query:  MDCRVCVAGGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE
        MDCRVCVAGGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE
Subjt:  MDCRVCVAGGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE

Query:  KALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL
        KALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYV  L
Subjt:  KALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL

XP_022990939.1 uncharacterized protein LOC111487679 [Cucurbita maxima] | 3.8e-105 | 97.49
Query:  MDCRVCVAGGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE
        MDCRVCVAGGDLWVKLSGIGG GQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE
Subjt:  MDCRVCVAGGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE

Query:  KALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL
        KALNVNKSGPCNASC RFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYT+GSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYV  L
Subjt:  KALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL

XP_023524832.1 uncharacterized protein LOC111788641 [Cucurbita pepo subsp. pepo] | 7.7e-106 | 97.99
Query:  MDCRVCVAGGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE
        MDCRVCVAGGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE
Subjt:  MDCRVCVAGGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE

Query:  KALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL
        KALNVNKSGPCNASCIRFALVKLLRLSGYDAA+CTTRWQGAGKVPGGDYEYIDVVNYT GSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYV  L
Subjt:  KALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL
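
The % identity values listed with each hit can be related to the alignment midlines. The sketch below assumes the common BLAST-style convention that identity is the number of identical columns divided by the alignment length, and that a letter in the midline marks an identical column while a space or `+` marks a mismatch, gap, or conservative substitution. `percent_identity` is a hypothetical helper, and the input strings are expected with the `Query:` label and leading indentation already removed.

```python
def percent_identity(blocks: list[tuple[str, str]]) -> float:
    """Percent identity over (query_segment, midline_segment) pairs.

    Each pair holds one alignment block; both strings must cover the
    same columns, so they must have equal length.
    """
    identical = 0
    columns = 0
    for query, midline in blocks:
        if len(query) != len(midline):
            raise ValueError("query and midline segments differ in length")
        columns += len(query)
        # Letters in the midline mark identical residues.
        identical += sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / columns, 2)


# Last nine columns of the second block of the hits above.
print(percent_identity([("LPVIYVAKL", "LPVIYV  L")]))
```

For example, the Cucurbita maxima hit shows 5 non-identical midline columns out of 199 aligned columns, and 194/199 rounds to the listed 97.49.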

TrEMBL top hits | e value | % identity
A0A0A0KNU6 Uncharacterized protein | 2.1e-96 | 89.5
Query:  MDCRVCVAGGDLWVKLSG-IGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMN
        MDCRVCVAGGDLWVK+   +GG GQMGGFSHESEHDLALMVSDFLENGSGG +S CSSDSDSGVSDLAHLA+KI+FYKNPVSQYESDLLSVVHSLTLSMN
Subjt:  MDCRVCVAGGDLWVKLSG-IGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMN

Query:  EKALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL
        EK LN+NK+GPCNASCIRF LVKLLR SGYDAAVCTTRWQGAGKVPGGD+EYIDVVNYT+GSSERLIVDIDFRSHFEIARAVESYDRIL+SLPVIYV  L
Subjt:  EKALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL

A0A5A7T7X2 Uncharacterized protein | 4.6e-96 | 88.5
Query:  MDCRVCVAGGDLWVKLSG-IGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMN
        MDCR+CVAGGDLWVK+ G +GG GQMGGFSHESEHDLALMVSDFLENGSGG ES CSSDSDSGVSDL HLA+KI+FYKNPVSQYESDLLSVVHSLTLSMN
Subjt:  MDCRVCVAGGDLWVKLSG-IGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMN

Query:  EKALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL
         K LN+NK+GPCNASCIRF LVKLLR SGYDAAVCTTRWQGAGKVPGGD+EYIDVVNYT+GSSERLI+DIDFRSHFEIARAVESYDRIL+SLPVIYV  L
Subjt:  EKALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL

A0A6J1FL63 uncharacterized protein LOC111445233 | 2.1e-96 | 89.22
Query:  MDCRVCVAGGDLWVKLSGIG-----GRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLT
        MDCR+CVAGGDLWVK+ G G     G GQMGGFSHESEHDLALMVSDFLENGSGGAES CSSDSDSGVSDL HLADKILFYKNPVSQYESDLLSVVHSLT
Subjt:  MDCRVCVAGGDLWVKLSGIG-----GRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLT

Query:  LSMNEKALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIY
        LSMNEK LN+NKSG CNASCIRF LVKLLRL GYDAAVCTTRWQGAGKVPGGD+EYIDVVNYT+GSSERLIVDIDFRSHFEIARAVESYDRIL+SLPVIY
Subjt:  LSMNEKALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIY

Query:  VAKL
        V  L
Subjt:  VAKL

A0A6J1GLY9 uncharacterized protein LOC111455189 | 4.4e-107 | 98.99
Query:  MDCRVCVAGGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE
        MDCRVCVAGGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE
Subjt:  MDCRVCVAGGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE

Query:  KALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL
        KALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYV  L
Subjt:  KALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL

A0A6J1JTE2 uncharacterized protein LOC111487679 | 1.9e-105 | 97.49
Query:  MDCRVCVAGGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE
        MDCRVCVAGGDLWVKLSGIGG GQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE
Subjt:  MDCRVCVAGGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNE

Query:  KALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL
        KALNVNKSGPCNASC RFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYT+GSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYV  L
Subjt:  KALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKL

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT2G38820.1 Protein of unknown function (DUF506) | 8.4e-18 | 32.66
Query:  GGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGG-----AESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNEKAL
        G D+   LS    RG  G F   S   LA MV +F+E+ +GG       SRC+  S SG               +       +   ++ SL L       
Subjt:  GGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGG-----AESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNEKAL

Query:  NVNKSGPCNASCIRFALVKLLRL--SGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKLD
               C +  +R  L  + ++  + YDAA+C +RW+ +   P G+YEY+DV+       ERL++DIDF+S FEIARA ++Y  +L +LP I+V K D
Subjt:  NVNKSGPCNASCIRFALVKLLRL--SGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKLD

AT2G38820.2 Protein of unknown function (DUF506) | 4.5e-19 | 33.65
Query:  GGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGG-----AESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNEKAL
        G D+   LS    RG  G F   S   LA MV +F+E+ +GG       SRC+  S SG               +       +   ++ SL L  + +  
Subjt:  GGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGG-----AESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNEKAL

Query:  NV--------NKSGPC---NASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLP
        N+          S  C   + SC++     L+ L GYDAA+C +RW+ +   P G+YEY+DV+       ERL++DIDF+S FEIARA ++Y  +L +LP
Subjt:  NV--------NKSGPC---NASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLP

Query:  VIYVAKLD
         I+V K D
Subjt:  VIYVAKLD

AT2G39650.1 Protein of unknown function (DUF506) | 2.9e-58 | 66.28
Query:  SHESEHDLALMVSDFLE--NGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNEKALNVNKSGPCNASCIRFALVKLLRL
        SH+ EHDL LMV+DFLE   GSGGA S CSSDSDSG  D ++L+DKI + K  ++Q+E+++LSVV +L L++ EK L+  KSG CNASCIRF L KLLRL
Subjt:  SHESEHDLALMVSDFLE--NGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNEKALNVNKSGPCNASCIRFALVKLLRL

Query:  SGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTT--GSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYV
        SGYDAAVC+ RWQG GKVPGGD EYID++   T  G  +RLIVDIDFRSHFEIARAV+SY RI++SLPV+YV
Subjt:  SGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTT--GSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYV

AT3G07350.1 Protein of unknown function (DUF506) | 1.4e-17 | 35.53
Query:  ESRCSSDSDSGVSDLAHLADKIL-FYKNPVSQYESDLLSVVHSLTLSMNEKALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYE
        +S   SDSDS + +L   AD I    +N + +       +VH   ++   + L+   S P   +  +  ++ LLR  G++AA+C T+W+ +G +  G++E
Subjt:  ESRCSSDSDSGVSDLAHLADKIL-FYKNPVSQYESDLLSVVHSLTLSMNEKALNVNKSGPCNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYE

Query:  YIDVVNYTTGSSE--RLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKLD
        +IDVV   + SS+  R IVD+DF S F+IAR    Y R+L SLP ++V K D
Subjt:  YIDVVNYTTGSSE--RLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKLD

AT4G14620.1 Protein of unknown function (DUF506) | 2.1e-16 | 27.92
Query:  IGGRGQMGGFSHESEHDLALMVSDFLENG------SGGAESRCSS-DSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNEKALNVNKSGPC
        I G G + G   E E  LA MV +++E        +G    RC+  + ++ +SD     D++ F+     +      S V    L    K +  NKS   
Subjt:  IGGRGQMGGFSHESEHDLALMVSDFLENG------SGGAESRCSS-DSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNEKALNVNKSGPC

Query:  NASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKLDAT---------
            +R  +V  L   GYD+++C ++W     +P G+YEYIDV+     + ERLI+DIDFRS FEIAR    Y  +L SLP+I+V K D           
Subjt:  NASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKLDAT---------

Query:  -------------PSMEISCILASEMAVTMPENGSSSRRTAAAAAAVDVKPQ----AVHGTFEETSISTPIRNRNGSVSKADE
                     P    +  + ++   +   N    + T  +AA V  +P+     +   FEE  +  P+++   SV + D+
Subjt:  -------------PSMEISCILASEMAVTMPENGSSSRRTAAAAAAVDVKPQ----AVHGTFEETSISTPIRNRNGSVSKADE


Sequences
CDS sequence
ATGGACTGCCGGGTATGCGTTGCGGGTGGGGATTTATGGGTCAAGCTAAGCGGAATCGGGGGTAGGGGTCAGATGGGTGGTTTTAGCCATGAAAGCGAGCATGATTTGGC
TCTTATGGTCAGCGATTTCTTGGAAAATGGCAGCGGTGGGGCTGAGTCTCGGTGTAGCAGCGATAGCGATTCTGGTGTCTCTGATCTTGCTCATCTTGCCGACAAGATTC
TGTTCTATAAGAATCCAGTATCCCAATACGAAAGCGATTTACTTTCGGTGGTTCATTCACTGACCTTGTCGATGAACGAGAAGGCCCTGAACGTTAACAAGTCGGGTCCC
TGCAATGCCAGTTGCATCCGGTTTGCTTTAGTCAAGCTATTGAGACTCTCTGGTTATGATGCTGCTGTGTGCACAACCAGATGGCAGGGCGCTGGCAAGGTTCCTGGAGG
AGATTACGAGTACATCGATGTCGTTAACTACACCACTGGAAGCTCAGAGCGACTGATAGTCGATATCGACTTCCGAAGCCACTTTGAAATCGCAAGGGCAGTTGAATCGT
ACGATAGGATATTGGATTCGCTTCCTGTAATCTACGTTGCTAAACTCGATGCCACTCCCTCCATGGAGATCTCTTGCATACTTGCAAGCGAAATGGCAGTCACCATGCCA
GAGAATGGTTCATCATCCAGAAGAACAGCAGCAGCAGCAGCAGCAGTTGATGTTAAGCCACAAGCAGTGCATGGGACATTTGAAGAGACTTCAATCAGTACTCCAATCAG
AAATCGAAATGGATCGGTTTCTAAGGCCGATGAACGGCGATAA
mRNA sequence
ATGGACTGCCGGGTATGCGTTGCGGGTGGGGATTTATGGGTCAAGCTAAGCGGAATCGGGGGTAGGGGTCAGATGGGTGGTTTTAGCCATGAAAGCGAGCATGATTTGGC
TCTTATGGTCAGCGATTTCTTGGAAAATGGCAGCGGTGGGGCTGAGTCTCGGTGTAGCAGCGATAGCGATTCTGGTGTCTCTGATCTTGCTCATCTTGCCGACAAGATTC
TGTTCTATAAGAATCCAGTATCCCAATACGAAAGCGATTTACTTTCGGTGGTTCATTCACTGACCTTGTCGATGAACGAGAAGGCCCTGAACGTTAACAAGTCGGGTCCC
TGCAATGCCAGTTGCATCCGGTTTGCTTTAGTCAAGCTATTGAGACTCTCTGGTTATGATGCTGCTGTGTGCACAACCAGATGGCAGGGCGCTGGCAAGGTTCCTGGAGG
AGATTACGAGTACATCGATGTCGTTAACTACACCACTGGAAGCTCAGAGCGACTGATAGTCGATATCGACTTCCGAAGCCACTTTGAAATCGCAAGGGCAGTTGAATCGT
ACGATAGGATATTGGATTCGCTTCCTGTAATCTACGTTGCTAAACTCGATGCCACTCCCTCCATGGAGATCTCTTGCATACTTGCAAGCGAAATGGCAGTCACCATGCCA
GAGAATGGTTCATCATCCAGAAGAACAGCAGCAGCAGCAGCAGCAGTTGATGTTAAGCCACAAGCAGTGCATGGGACATTTGAAGAGACTTCAATCAGTACTCCAATCAG
AAATCGAAATGGATCGGTTTCTAAGGCCGATGAACGGCGATAA
Protein sequence
MDCRVCVAGGDLWVKLSGIGGRGQMGGFSHESEHDLALMVSDFLENGSGGAESRCSSDSDSGVSDLAHLADKILFYKNPVSQYESDLLSVVHSLTLSMNEKALNVNKSGP
CNASCIRFALVKLLRLSGYDAAVCTTRWQGAGKVPGGDYEYIDVVNYTTGSSERLIVDIDFRSHFEIARAVESYDRILDSLPVIYVAKLDATPSMEISCILASEMAVTMP
ENGSSSRRTAAAAAAVDVKPQAVHGTFEETSISTPIRNRNGSVSKADERR
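
The CDS and protein sequences above can be cross-checked by translating one into the other. The following is a minimal sketch that assumes the standard nuclear genetic code and a CDS that starts in frame; `translate` and `CODON_TABLE` are hypothetical names introduced here for illustration, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS up to, but not including, the first stop codon."""
    # Drop line breaks and spaces so a wrapped sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGGACTGCCGGGTATGCGTT"))
```

Applied to the first seven codons of the CDS, this gives MDCRVCV, which matches the start of the listed protein sequence. The last four codons (GAACGGCGATAA) give ERR followed by a stop, which matches its end.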