; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cp4.1LG01g03260 (gene) of Cucurbita pepo (MU-CU-16) v4.1 genome

Gene IDCp4.1LG01g03260
OrganismCucurbita pepo var. pepo MU-CU-16 (Cucurbita pepo (MU-CU-16) v4.1)
DescriptionProtein of unknown function (DUF1218)
Genome locationCp4.1LG01:1917361..1919047
RNA-Seq ExpressionCp4.1LG01g03260
SyntenyCp4.1LG01g03260
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600217.1 Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia]1.55e-13396.21Show/hide
Query:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW
        MENKAL+VYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLN ATGCFCCFAAPRSS+SKWRIALICYVISW
Subjt:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
        ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFF +ATIVAT+SIVLGLAYYILLNS ETEPSVFGNPCIPPQANIAMGQPQFPPPP RSADPVFV
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV

Query:  PEDTYMRRQFT
        PEDTYMRRQFT
Subjt:  PEDTYMRRQFT

KAG7030876.1 hypothetical protein SDJN02_04913, partial [Cucurbita argyrosperma subsp. argyrosperma]3.50e-13995.73Show/hide
Query:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW
        MENKAL+VYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLN ATGCFCCFAAPRSS+SKWRIALICYVISW
Subjt:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
        ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFF +ATIVAT+SIVLGLAYYILLNS ETEPSVFGNPCIPPQANIAMGQPQFPPPP RSADPVFV
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV

Query:  PEDTYMRRQFT
        PEDTY+RRQFT
Subjt:  PEDTYMRRQFT

XP_022942637.1 uncharacterized protein LOC111447615 [Cucurbita moschata]1.43e-13895.73Show/hide
Query:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW
        ME KALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLN ATGCFCCFAAPRSS+SKWRIALICYVISW
Subjt:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
        ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFF +ATIVAT+SIVLGL YYILLNS ETEPSVFGNPCIPPQANIAMGQPQFPPPP RSADPVFV
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV

Query:  PEDTYMRRQFT
        PEDTYMRRQFT
Subjt:  PEDTYMRRQFT

XP_022984726.1 uncharacterized protein LOC111482918 [Cucurbita maxima]1.17e-13795.26Show/hide
Query:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW
        ME KALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLN ATGCFCCF+APRSS+SKWRIALICYVISW
Subjt:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
        ITFAKAFIMLLTGAALNDQRGEQSYFLG+CYVLRSGFF +ATIVATVSIVLGLAYYILLNS E EPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV

Query:  PEDTYMRRQFT
         EDTYMRRQFT
Subjt:  PEDTYMRRQFT

XP_023542651.1 uncharacterized protein LOC111802489 [Cucurbita pepo subsp. pepo]3.96e-145100Show/hide
Query:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW
        MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW
Subjt:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
        ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV

Query:  PEDTYMRRQFT
        PEDTYMRRQFT
Subjt:  PEDTYMRRQFT

TrEMBL top hitse value%identityAlignment
A0A0A0KU80 Uncharacterized protein1.39e-7760.09Show/hide
Query:  MENK-ALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVIS
        ME K AL+V  VV  LG+++IATGFAAE T+ K N V  V    CKYP+SPA+GLGL AALSLL A IT+  +TGC CC   PR   SKWR A+IC+ IS
Subjt:  MENK-ALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVIS

Query:  WITFAKAFIMLLTGAALNDQRGEQ-SYFLGY-CYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPS-VFGNPCIPPQANIAMGQPQFP---PPPHR
        W+T+  AF++ LTGAALN+ RGEQ +YF  Y CYVL+ G F  ATIV   S+ LG++Y+++LNS + +PS V+G+P +PPQ NIAM QPQFP   PPP R
Subjt:  WITFAKAFIMLLTGAALNDQRGEQ-SYFLGY-CYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPS-VFGNPCIPPQANIAMGQPQFP---PPPHR

Query:  SADPVFVPEDTYMRRQFT
        +ADPVFV EDTYMRRQFT
Subjt:  SADPVFVPEDTYMRRQFT

A0A6J1CG96 uncharacterized protein LOC1110114038.63e-9566.2Show/hide
Query:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW
        ME KA+ V +VV FLGLLV+ATGFAAEGT++KL+ VI V   TC YP+SPA+GLGL AALSLL+A +T+N +TGC CC   PR   SKWR  ++C+VISW
Subjt:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGY--CYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPP---HRSA
         TF  AF++LLTGAALND+RGE+SY+ GY  CYVL+ G F +ATI+AT SIVLGL YY++LNS +  P+V+GNP +PPQANIAMGQPQFPPPP    RS 
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGY--CYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPP---HRSA

Query:  DPVFVPEDTYMRRQFT
        DPVFV EDTYMRRQ+T
Subjt:  DPVFVPEDTYMRRQFT

A0A6J1E6P1 uncharacterized protein LOC1114304668.10e-8662.96Show/hide
Query:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW
        ME KAL V +VV FLGLL++ATGFAAEGT+VK N V+ V  T CKYP+SPA  LGL AALSLLLA I +N +TGC CC   PR   SKWR A++C+V+SW
Subjt:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGY--CYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHR---SA
         TF  AF++LLTGAALND R EQS +  Y  CYVL+ G F +AT+V   S+ LGL YY++LNS + +P+V+GNP IPP ANIAM QPQFPPPP     +A
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGY--CYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHR---SA

Query:  DPVFVPEDTYMRRQFT
        DPVFV EDTY RRQFT
Subjt:  DPVFVPEDTYMRRQFT

A0A6J1FQT9 uncharacterized protein LOC1114476156.90e-13995.73Show/hide
Query:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW
        ME KALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLN ATGCFCCFAAPRSS+SKWRIALICYVISW
Subjt:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
        ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFF +ATIVAT+SIVLGL YYILLNS ETEPSVFGNPCIPPQANIAMGQPQFPPPP RSADPVFV
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV

Query:  PEDTYMRRQFT
        PEDTYMRRQFT
Subjt:  PEDTYMRRQFT

A0A6J1J634 uncharacterized protein LOC1114829185.67e-13895.26Show/hide
Query:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW
        ME KALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLN ATGCFCCF+APRSS+SKWRIALICYVISW
Subjt:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
        ITFAKAFIMLLTGAALNDQRGEQSYFLG+CYVLRSGFF +ATIVATVSIVLGLAYYILLNS E EPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV

Query:  PEDTYMRRQFT
         EDTYMRRQFT
Subjt:  PEDTYMRRQFT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13380.1 Protein of unknown function (DUF1218)1.4e-0730.18Show/hide
Query:  ENKA-LVVYTVVVFLGLLVIATGFAAEGTKV--KLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVI
        E KA  +V+ +VV L L+      AAE  +   K     + N T C Y    A G G+ A L LL +   L   T C  CF  P +  S    ++I ++ 
Subjt:  ENKA-LVVYTVVVFLGLLVIATGFAAEGTKV--KLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVI

Query:  SWITFAKAFIMLLTGAALNDQRGEQ-SYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEP
        SW+TF  A   ++ GA  N    +  S     C  LR G FI   +    ++VL + YY+    + + P
Subjt:  SWITFAKAFIMLLTGAALNDQRGEQ-SYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEP

AT1G68220.1 Protein of unknown function (DUF1218)1.9e-0426.99Show/hide
Query:  VYTVVVFLGLLVIATGFAAEGTKVKLNHV--IVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISWITFAK
        + TVV  L LL     F AE  +     V      +T CKY    +   G+ A   LL++   +N  T C C      +  S    A++ +V+SW++F  
Subjt:  VYTVVVFLGLLVIATGFAAEGTKVKLNHV--IVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISWITFAK

Query:  AFIMLLTGAALNDQRGE-QSYFLG---YCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTET
        A   LL G+A N    + +  + G    C VL  G F        +S++  + YY+  +  +T
Subjt:  AFIMLLTGAALNDQRGE-QSYFLG---YCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTET

AT2G32280.1 Protein of unknown function (DUF1218)1.2e-0626.54Show/hide
Query:  LVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAP--RSSISKWRIALICYVISWITF
        ++V  V+V L +     G  AE  + ++ H+ +     C+ P   A  LGL AA  L++AH+ LN   GC C  +    + S S  +I++ C V++WI F
Subjt:  LVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAP--RSSISKWRIALICYVISWITF

Query:  AKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETE
        A  F  ++ G   N +          C      F  +  I+  +  +  +AYY+   + + E
Subjt:  AKAFIMLLTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETE

AT5G17210.1 Protein of unknown function (DUF1218)7.7e-4344.91Show/hide
Query:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVV---NRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYV
        ME + +V+  V+  LGLL   T F AE T++K + V V    + T C YP+SPA  LG  +AL L++A I ++ ++GCFCC   P  S S W I+LIC+V
Subjt:  MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVV---NRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYV

Query:  ISWITFAKAFIMLLTGAALNDQRGEQSYFLG--YCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSA
        +SW TF  AF++LL+GAALND+  E+S   G  +CY+++ G F    +++ V+I LG+ YY+ L S +   +            IAMGQPQ    P R  
Subjt:  ISWITFAKAFIMLLTGAALNDQRGEQSYFLG--YCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSA

Query:  DPVFVPEDTYMRRQFT
        DPVFV EDTYMRRQFT
Subjt:  DPVFVPEDTYMRRQFT

AT5G17210.2 Protein of unknown function (DUF1218)3.7e-3747.09Show/hide
Query:  TTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISWITFAKAFIMLLTGAALNDQRGEQSYFLG--YCYVLRSGFFI
        T C YP+SPA  LG  +AL L++A I ++ ++GCFCC   P  S S W I+LIC+V+SW TF  AF++LL+GAALND+  E+S   G  +CY+++ G F 
Subjt:  TTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISWITFAKAFIMLLTGAALNDQRGEQSYFLG--YCYVLRSGFFI

Query:  MATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFVPEDTYMRRQFT
           +++ V+I LG+ YY+ L S +   +            IAMGQPQ    P R  DPVFV EDTYMRRQFT
Subjt:  MATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFVPEDTYMRRQFT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAATAAGGCTCTGGTTGTGTACACTGTGGTCGTTTTTTTGGGGCTTTTGGTGATCGCCACTGGCTTCGCCGCTGAGGGCACCAAAGTTAAGCTTAATCATGTTAT
TGTAGTCAATCGTACTACGTGCAAATATCCCAAAAGTCCAGCGGTGGGCCTTGGTTTGGTTGCAGCTCTATCACTCTTGCTTGCTCATATAACGTTAAATTTTGCAACCG
GGTGCTTTTGCTGCTTTGCGGCCCCTCGCTCTTCTATTTCTAAATGGCGAATAGCCCTGATCTGCTACGTCATTTCCTGGATTACATTTGCGAAAGCGTTCATTATGTTG
CTCACCGGTGCTGCACTGAACGACCAACGGGGCGAACAAAGCTACTTTTTAGGCTATTGCTATGTCCTGAGATCAGGATTTTTTATTATGGCTACCATTGTGGCCACGGT
GAGCATAGTGCTGGGATTGGCCTATTACATTCTATTGAACTCAACAGAGACTGAGCCTTCTGTGTTTGGTAATCCCTGCATTCCTCCTCAAGCAAACATTGCAATGGGGC
AGCCCCAATTCCCTCCCCCTCCACACAGATCTGCTGACCCCGTATTCGTCCCCGAAGACACGTACATGAGACGACAATTCACGTGA
mRNA sequenceShow/hide mRNA sequence
TCTCTACCACGGTTTCTTCGCCCATTGAAATCGCGACATCAATCCGAAACTCCACCAAATTCCGCCAGTTTTTTCTCCCTTGAGTCTCGAGGTAGGCCATCGGCGGCGGC
GGAGATGGAGAATAAGGCTCTGGTTGTGTACACTGTGGTCGTTTTTTTGGGGCTTTTGGTGATCGCCACTGGCTTCGCCGCTGAGGGCACCAAAGTTAAGCTTAATCATG
TTATTGTAGTCAATCGTACTACGTGCAAATATCCCAAAAGTCCAGCGGTGGGCCTTGGTTTGGTTGCAGCTCTATCACTCTTGCTTGCTCATATAACGTTAAATTTTGCA
ACCGGGTGCTTTTGCTGCTTTGCGGCCCCTCGCTCTTCTATTTCTAAATGGCGAATAGCCCTGATCTGCTACGTCATTTCCTGGATTACATTTGCGAAAGCGTTCATTAT
GTTGCTCACCGGTGCTGCACTGAACGACCAACGGGGCGAACAAAGCTACTTTTTAGGCTATTGCTATGTCCTGAGATCAGGATTTTTTATTATGGCTACCATTGTGGCCA
CGGTGAGCATAGTGCTGGGATTGGCCTATTACATTCTATTGAACTCAACAGAGACTGAGCCTTCTGTGTTTGGTAATCCCTGCATTCCTCCTCAAGCAAACATTGCAATG
GGGCAGCCCCAATTCCCTCCCCCTCCACACAGATCTGCTGACCCCGTATTCGTCCCCGAAGACACGTACATGAGACGACAATTCACGTGATCGTTAATCGATAAATGTAG
GTCGATACCGAACGCGTTTAACAAAACTATGTAACTCAGACCAC
Protein sequenceShow/hide protein sequence
MENKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNFATGCFCCFAAPRSSISKWRIALICYVISWITFAKAFIML
LTGAALNDQRGEQSYFLGYCYVLRSGFFIMATIVATVSIVLGLAYYILLNSTETEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFVPEDTYMRRQFT