; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh04G003790 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh04G003790
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionProtein of unknown function (DUF1218)
Genome locationCma_Chr04:1898457..1900439
RNA-Seq ExpressionCmaCh04G003790
SyntenyCmaCh04G003790
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600217.1 Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia]2.4e-10796.21Show/hide
Query:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW
        ME KAL+VYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCF+APRSSVSKWRIALICYVISW
Subjt:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
        ITFAKAFIMLLTGAALNDQRGEQSYFLG+CYVLRSGFFTVATIVAT+SIVLGLAYYILLNSAE EPSVFGNPCIPPQANIAMGQPQFPPPP RSADPVFV
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV

Query:  HEDTYMRRQFT
         EDTYMRRQFT
Subjt:  HEDTYMRRQFT

KAG7030876.1 hypothetical protein SDJN02_04913, partial [Cucurbita argyrosperma subsp. argyrosperma]5.5e-10795.73Show/hide
Query:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW
        ME KAL+VYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCF+APRSSVSKWRIALICYVISW
Subjt:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
        ITFAKAFIMLLTGAALNDQRGEQSYFLG+CYVLRSGFFTVATIVAT+SIVLGLAYYILLNSAE EPSVFGNPCIPPQANIAMGQPQFPPPP RSADPVFV
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV

Query:  HEDTYMRRQFT
         EDTY+RRQFT
Subjt:  HEDTYMRRQFT

XP_022942637.1 uncharacterized protein LOC111447615 [Cucurbita moschata]8.4e-10896.68Show/hide
Query:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW
        MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCF+APRSSVSKWRIALICYVISW
Subjt:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
        ITFAKAFIMLLTGAALNDQRGEQSYFLG+CYVLRSGFFTVATIVAT+SIVLGL YYILLNSAE EPSVFGNPCIPPQANIAMGQPQFPPPP RSADPVFV
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV

Query:  HEDTYMRRQFT
         EDTYMRRQFT
Subjt:  HEDTYMRRQFT

XP_022984726.1 uncharacterized protein LOC111482918 [Cucurbita maxima]4.3e-112100Show/hide
Query:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW
        MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW
Subjt:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
        ITFAKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV

Query:  HEDTYMRRQFT
        HEDTYMRRQFT
Subjt:  HEDTYMRRQFT

XP_023542651.1 uncharacterized protein LOC111802489 [Cucurbita pepo subsp. pepo]9.3e-10795.26Show/hide
Query:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW
        ME KALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLN ATGCFCCF+APRSS+SKWRIALICYVISW
Subjt:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
        ITFAKAFIMLLTGAALNDQRGEQSYFLG+CYVLRSGFF +ATIVATVSIVLGLAYYILLNS E EPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV

Query:  HEDTYMRRQFT
         EDTYMRRQFT
Subjt:  HEDTYMRRQFT

TrEMBL top hitse value%identityAlignment
A0A0A0KU80 Uncharacterized protein8.0e-6460.93Show/hide
Query:  KKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISWIT
        K AL+V  VV  LG+++IATGFAAE T+ K N V  V    CKYP+SPA+GLGL AALSLL A IT+  +TGC CC   PR   SKWR A+IC+ ISW+T
Subjt:  KKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISWIT

Query:  FAKAFIMLLTGAALNDQRGEQ-SYFLGF-CYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPS-VFGNPCIPPQANIAMGQPQF---PPPPHRSAD
        +  AF++ LTGAALN+ RGEQ +YF  + CYVL+ G F+ ATIV   S+ LG++Y+++LNSA+N+PS V+G+P +PPQ NIAM QPQF   PPPP R+AD
Subjt:  FAKAFIMLLTGAALNDQRGEQ-SYFLGF-CYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPS-VFGNPCIPPQANIAMGQPQF---PPPPHRSAD

Query:  PVFVHEDTYMRRQFT
        PVFVHEDTYMRRQFT
Subjt:  PVFVHEDTYMRRQFT

A0A6J1CG96 uncharacterized protein LOC1110114031.7e-7768.06Show/hide
Query:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW
        ME+KA+ V +VV FLGLLV+ATGFAAEGT++KL+ VI V   TC YP+SPA+GLGL AALSLL+A +T+NV+TGC CC   PR   SKWR  ++C+VISW
Subjt:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGF--CYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPP---HRSA
         TF  AF++LLTGAALND+RGE+SY+ G+  CYVL+ G F VATI+AT SIVLGL YY++LNSA+N P+V+GNP +PPQANIAMGQPQFPPPP    RS 
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGF--CYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPP---HRSA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTYMRRQ+T
Subjt:  DPVFVHEDTYMRRQFT

A0A6J1E6P1 uncharacterized protein LOC1114304661.2e-7064.81Show/hide
Query:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW
        ME+KAL V +VV FLGLL++ATGFAAEGT+VK N V+ V  T CKYP+SPA  LGL AALSLLLA I +NV+TGC CC   PR   SKWR A++C+V+SW
Subjt:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLG--FCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPP---HRSA
         TF  AF++LLTGAALND R EQS +    +CYVL+ G F VAT+V   S+ LGL YY++LNSA+N+P+V+GNP IPP ANIAM QPQFPPPP     +A
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLG--FCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPP---HRSA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTY RRQFT
Subjt:  DPVFVHEDTYMRRQFT

A0A6J1FQT9 uncharacterized protein LOC1114476154.1e-10896.68Show/hide
Query:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW
        MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCF+APRSSVSKWRIALICYVISW
Subjt:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
        ITFAKAFIMLLTGAALNDQRGEQSYFLG+CYVLRSGFFTVATIVAT+SIVLGL YYILLNSAE EPSVFGNPCIPPQANIAMGQPQFPPPP RSADPVFV
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV

Query:  HEDTYMRRQFT
         EDTYMRRQFT
Subjt:  HEDTYMRRQFT

A0A6J1J634 uncharacterized protein LOC1114829182.1e-112100Show/hide
Query:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW
        MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW
Subjt:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
        ITFAKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFV

Query:  HEDTYMRRQFT
        HEDTYMRRQFT
Subjt:  HEDTYMRRQFT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13380.1 Protein of unknown function (DUF1218)8.1e-0828.74Show/hide
Query:  KKALVVYTVVVFLGLLVIATGFAAEGTKV--KLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW
        K + +V+ +VV L L+      AAE  +   K     + N T C Y    A G G+ A L LL +   L   T C  CF  P +  S    ++I ++ SW
Subjt:  KKALVVYTVVVFLGLLVIATGFAAEGTKV--KLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISW

Query:  ITFAKAFIMLLTGAALNDQRGEQSYFLGF-CYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEP
        +TF  A   ++ GA  N    +      F C  LR G F    +    ++VL + YY+    + + P
Subjt:  ITFAKAFIMLLTGAALNDQRGEQSYFLGF-CYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEP

AT2G32280.1 Protein of unknown function (DUF1218)1.6e-0827.78Show/hide
Query:  LVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAP--RSSVSKWRIALICYVISWITF
        ++V  V+V L +     G  AE  + ++ H+ +     C+ P   A  LGL AA  L++AH+ LN+  GC C  S    + S S  +I++ C V++WI F
Subjt:  LVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAP--RSSVSKWRIALICYVISWITF

Query:  AKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENE
        A  F  ++ G   N +          C      F ++  I+  +  +  +AYY+   +A++E
Subjt:  AKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENE

AT4G21310.1 Protein of unknown function (DUF1218)2.4e-0727.38Show/hide
Query:  FLGLLVIA-------TGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFS---APRSSVSKWRIALICYVISWITF
        F+ +L++A        G  AE  + K+ H + +    C+ P   A   GL A + L+LAH+T N   GC C  S     +SS +K ++A+   + +WI  
Subjt:  FLGLLVIA-------TGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFS---APRSSVSKWRIALICYVISWITF

Query:  AKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGN
        A AF ML+ G   N +  +       C +      ++  I+  V  +  +AYYI   ++  E +  G+
Subjt:  AKAFIMLLTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGN

AT5G17210.1 Protein of unknown function (DUF1218)7.5e-4647.22Show/hide
Query:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVV---NRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYV
        ME++ +V+  V+  LGLL   T F AE T++K + V V    + T C YP+SPA  LG  +AL L++A I ++V++GCFCC   P  S S W I+LIC+V
Subjt:  MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVV---NRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYV

Query:  ISWITFAKAFIMLLTGAALNDQRGEQSYFLG--FCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSA
        +SW TF  AF++LL+GAALND+  E+S   G  FCY+++ G F+   +++ V+I LG+ YY+ L S  N+  V           IAMGQPQ    P R  
Subjt:  ISWITFAKAFIMLLTGAALNDQRGEQSYFLG--FCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTYMRRQFT
Subjt:  DPVFVHEDTYMRRQFT

AT5G17210.2 Protein of unknown function (DUF1218)4.7e-4050Show/hide
Query:  TTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISWITFAKAFIMLLTGAALNDQRGEQSYFLG--FCYVLRSGFFT
        T C YP+SPA  LG  +AL L++A I ++V++GCFCC   P  S S W I+LIC+V+SW TF  AF++LL+GAALND+  E+S   G  FCY+++ G F+
Subjt:  TTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISWITFAKAFIMLLTGAALNDQRGEQSYFLG--FCYVLRSGFFT

Query:  VATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFVHEDTYMRRQFT
           +++ V+I LG+ YY+ L S  N+  V           IAMGQPQ    P R  DPVFVHEDTYMRRQFT
Subjt:  VATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFVHEDTYMRRQFT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAGAAGGCTCTGGTTGTGTACACTGTGGTCGTTTTTTTGGGGCTTTTGGTGATCGCCACTGGCTTCGCCGCTGAGGGCACCAAAGTTAAGCTTAATCATGTTAT
TGTAGTCAATCGTACTACGTGCAAATATCCCAAAAGTCCAGCGGTGGGCCTTGGTTTGGTTGCAGCTCTATCACTCTTGCTTGCTCATATAACGCTAAATGTTGCGACGG
GGTGCTTTTGCTGCTTCTCGGCCCCTCGCTCTTCTGTTTCTAAATGGCGAATAGCCTTGATCTGCTACGTCATTTCCTGGATTACATTTGCGAAAGCGTTCATCATGTTA
CTCACCGGTGCTGCACTGAACGACCAACGGGGCGAACAAAGCTACTTTTTAGGCTTCTGCTATGTCCTGAGATCAGGATTTTTTACTGTGGCTACCATTGTGGCCACGGT
GAGCATAGTGCTGGGATTGGCCTATTACATCCTATTGAACTCAGCAGAGAATGAGCCTTCTGTGTTTGGTAATCCCTGCATTCCTCCTCAAGCAAACATTGCAATGGGGC
AGCCCCAATTCCCTCCCCCTCCACACAGATCGGCTGACCCCGTATTCGTCCACGAAGATACGTACATGAGACGACAATTCACGTGA
mRNA sequenceShow/hide mRNA sequence
TCTTTCAGTTTCCTCTCAAGACCTTTTAGTACAAGTTTCTCTACCACGGTTTCTTCGCCCATTGAAATCACGACATCAATCCGAAACTCCACCAAATTCCGCCGGTTTTT
TCTCCCTTAGACTCGAGGTAGGCCATCGGCGGCGGAAATGGAGAAGAAGGCTCTGGTTGTGTACACTGTGGTCGTTTTTTTGGGGCTTTTGGTGATCGCCACTGGCTTCG
CCGCTGAGGGCACCAAAGTTAAGCTTAATCATGTTATTGTAGTCAATCGTACTACGTGCAAATATCCCAAAAGTCCAGCGGTGGGCCTTGGTTTGGTTGCAGCTCTATCA
CTCTTGCTTGCTCATATAACGCTAAATGTTGCGACGGGGTGCTTTTGCTGCTTCTCGGCCCCTCGCTCTTCTGTTTCTAAATGGCGAATAGCCTTGATCTGCTACGTCAT
TTCCTGGATTACATTTGCGAAAGCGTTCATCATGTTACTCACCGGTGCTGCACTGAACGACCAACGGGGCGAACAAAGCTACTTTTTAGGCTTCTGCTATGTCCTGAGAT
CAGGATTTTTTACTGTGGCTACCATTGTGGCCACGGTGAGCATAGTGCTGGGATTGGCCTATTACATCCTATTGAACTCAGCAGAGAATGAGCCTTCTGTGTTTGGTAAT
CCCTGCATTCCTCCTCAAGCAAACATTGCAATGGGGCAGCCCCAATTCCCTCCCCCTCCACACAGATCGGCTGACCCCGTATTCGTCCACGAAGATACGTACATGAGACG
ACAATTCACGTGATCATTAATTGGTAAATGTAGGTCAATACCGAACGCGTTTAACAAAACTATGTAACTCAGACCACATAATTTATGTATGAAGTTATGAATGTGTTTTC
TTGGCTCTTTAGAAGGTAGCTGTTTTGGAGTTGGTACACAAT
Protein sequenceShow/hide protein sequence
MEKKALVVYTVVVFLGLLVIATGFAAEGTKVKLNHVIVVNRTTCKYPKSPAVGLGLVAALSLLLAHITLNVATGCFCCFSAPRSSVSKWRIALICYVISWITFAKAFIML
LTGAALNDQRGEQSYFLGFCYVLRSGFFTVATIVATVSIVLGLAYYILLNSAENEPSVFGNPCIPPQANIAMGQPQFPPPPHRSADPVFVHEDTYMRRQFT