; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC02G044500 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC02G044500
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionYcf54-like protein
Genome locationCmU531Chr02:32520434..32523478
RNA-Seq ExpressionCmUC02G044500
SyntenyCmUC02G044500
Gene Ontology termsGO:0015995 - chlorophyll biosynthetic process (biological process)
GO:0048529 - magnesium-protoporphyrin IX monomethyl ester (oxidative) cyclase activity (molecular function)
InterPro domainsIPR019616 - Ycf54 protein
IPR038409 - Ycf54-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7010462.1 putative protein ycf54 [Cucurbita argyrosperma subsp. argyrosperma]7.9e-10287.5Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVN------SDSADKQESNKYYFVV
        MLGTVNLVMGSSSAAMATPT  AAVKSLASS+IG HNHCRT  LPLGL SAS SS+ F S  GSSLS  F+TAIAAVN      SDS DK+ESNKYYF+V
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVN------SDSADKQESNKYYFVV

Query:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
        ANAKFMLDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
Subjt:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF

Query:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        EKPEKWVAPYSKYEYGWWEAFLPPV K EAKV
Subjt:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

XP_008456263.1 PREDICTED: uncharacterized protein LOC103496259 [Cucumis melo]2.1e-10288.94Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM
        MLGT NLVMGSSSAA+AT TH AAVKSL +S IGHHN   T  LPL LPS SI SSCFLS P SSLSSPF+TAIAAVNSDSADKQES KYYF+VANAKFM
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM

Query:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRL+RPAVALVST+S+WITFMKLRLDRVLAESYEANS+EEALASTPTNLEFEKPEKW
Subjt:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEAFLPPVTKAEAKV
        VAPYSKYEYGWWEAFLPPVTKAEAKV
Subjt:  VAPYSKYEYGWWEAFLPPVTKAEAKV

XP_022944632.1 uncharacterized protein LOC111449035 [Cucurbita moschata]9.4e-10388.36Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVN------SDSADKQESNKYYFVV
        MLGTVNLVMGSSSAAMATPT  AAVKSLASS+IG HNHCRT  LPLGL SAS SS+ F S  GSSLS  F+TAIAAVN      SDSADK+ESNKYYF+V
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVN------SDSADKQESNKYYFVV

Query:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
        ANAKFMLDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
Subjt:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF

Query:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        EKPEKWVAPYSKYEYGWWEAFLPPV KAEAKV
Subjt:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

XP_022986936.1 uncharacterized protein LOC111484525 [Cucurbita maxima]3.6e-10288.41Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLS-SPFSTAIAAVN------SDSADKQESNKYYFV
        MLGTVNLVMGSSSAAMATPT  AAVKSLASS+IG HNHCRT  LPLGL SAS SSS FLS  GSSLS   F TA+AAVN      SDSADK+ESNKYYF+
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLS-SPFSTAIAAVN------SDSADKQESNKYYFV

Query:  VANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLE
        VANAKFMLDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVLAESYEANSLEEALASTPTNLE
Subjt:  VANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLE

Query:  FEKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        FEKPEKWVAPYSKYEYGWWEAFLPPV KAEAKV
Subjt:  FEKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

XP_038900405.1 uncharacterized protein LOC120087636 [Benincasa hispida]4.0e-11495.13Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM
        MLGTVNLVMGSSSAAMATPTH  AVKSLASSRIG+H+HCRT+ LPLGLPS+SISSSCFLSPPGSSLSSPF+T IAAVNSDSADKQESNKYYFVVANAKFM
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM

Query:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
Subjt:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEAFLPPVTKAEAKV
        VAPYSKYEYGWWEAFLPPVTKAEAKV
Subjt:  VAPYSKYEYGWWEAFLPPVTKAEAKV

TrEMBL top hitse value%identityAlignment
A0A1S3C2E4 uncharacterized protein LOC1034962591.0e-10288.94Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM
        MLGT NLVMGSSSAA+AT TH AAVKSL +S IGHHN   T  LPL LPS SI SSCFLS P SSLSSPF+TAIAAVNSDSADKQES KYYF+VANAKFM
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM

Query:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRL+RPAVALVST+S+WITFMKLRLDRVLAESYEANS+EEALASTPTNLEFEKPEKW
Subjt:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEAFLPPVTKAEAKV
        VAPYSKYEYGWWEAFLPPVTKAEAKV
Subjt:  VAPYSKYEYGWWEAFLPPVTKAEAKV

A0A5A7SVU2 Uncharacterized protein1.0e-10288.94Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM
        MLGT NLVMGSSSAA+AT TH AAVKSL +S IGHHN   T  LPL LPS SI SSCFLS P SSLSSPF+TAIAAVNSDSADKQES KYYF+VANAKFM
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM

Query:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRL+RPAVALVST+S+WITFMKLRLDRVLAESYEANS+EEALASTPTNLEFEKPEKW
Subjt:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEAFLPPVTKAEAKV
        VAPYSKYEYGWWEAFLPPVTKAEAKV
Subjt:  VAPYSKYEYGWWEAFLPPVTKAEAKV

A0A6J1D8T3 uncharacterized protein LOC1110179731.1e-10184.91Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVN------SDSADKQESNKYYFVV
        MLGTVNLVMGSSSAAM TP   AAVKSLAS+R  HH HCRTL LPLG  +AS S+SCFL   GSS SSPFSTAIAAVN      SD ADKQE+NKYYF+V
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVN------SDSADKQESNKYYFVV

Query:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
        ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFL+KFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVLAESYEANS EEA+AS PTN+EF
Subjt:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF

Query:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        EKPEKWVAPY KYEYGWWE FLPPVTKAEAKV
Subjt:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

A0A6J1FW50 uncharacterized protein LOC1114490354.5e-10388.36Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVN------SDSADKQESNKYYFVV
        MLGTVNLVMGSSSAAMATPT  AAVKSLASS+IG HNHCRT  LPLGL SAS SS+ F S  GSSLS  F+TAIAAVN      SDSADK+ESNKYYF+V
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVN------SDSADKQESNKYYFVV

Query:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
        ANAKFMLDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
Subjt:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF

Query:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        EKPEKWVAPYSKYEYGWWEAFLPPV KAEAKV
Subjt:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

A0A6J1JCP0 uncharacterized protein LOC1114845251.7e-10288.41Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLS-SPFSTAIAAVN------SDSADKQESNKYYFV
        MLGTVNLVMGSSSAAMATPT  AAVKSLASS+IG HNHCRT  LPLGL SAS SSS FLS  GSSLS   F TA+AAVN      SDSADK+ESNKYYF+
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLS-SPFSTAIAAVN------SDSADKQESNKYYFV

Query:  VANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLE
        VANAKFMLDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVLAESYEANSLEEALASTPTNLE
Subjt:  VANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLE

Query:  FEKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        FEKPEKWVAPYSKYEYGWWEAFLPPV KAEAKV
Subjt:  FEKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

SwissProt top hitse value%identityAlignment
P51204 Uncharacterized protein ycf546.7e-1141.94Show/hide
Query:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLD-----KFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYE
        YYF +A+  F+L EE   +E+  ER+ YY   NKE DFWL+  PKFL+     KF N+   +   A+A++STNS +I ++KLR+  V    +E
Subjt:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLD-----KFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYE

P72777 Ycf54-like protein1.3e-1448.04Show/hide
Query:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPA--VALVSTNSSWITFMKLRLDRVLAESYEA--NSLEEAL
        YY+ +A+ KF+L EEE F+E+L ER R YGE+NKE DFW VI+P FL+       + + P   VA+VSTN S+I ++KLRL+ VL   +EA  +++ + L
Subjt:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPA--VALVSTNSSWITFMKLRLDRVLAESYEA--NSLEEAL

Query:  AS
        AS
Subjt:  AS

Q1XDT3 Uncharacterized protein ycf543.7e-0938.04Show/hide
Query:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKR--LQRPAVALVSTNSSWITFMKLRLDRVLAESYEAN
        YYF +A+  F+L  +E  +E+  ER+ YY   NK  DFWL+  P FL+K   I+ +  + + AVA++STN  +I ++KLR+  +    +E N
Subjt:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKR--LQRPAVALVSTNSSWITFMKLRLDRVLAESYEAN

Arabidopsis top hitse value%identityAlignment
AT5G58250.1 unknown protein4.2e-6169.59Show/hide
Query:  SSCFLSPPGSSLSSPFSTAIAAVNSDSA-DKQESNKYYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVAL
        SS  LS P S  SS F TA  ++   S+ +K ES KY+F+VANAKFMLDEEEHF+E LFERLRY+GER   QDFWLVIEPKFLD FP IT+RL+RPAVAL
Subjt:  SSCFLSPPGSSLSSPFSTAIAAVNSDSA-DKQESNKYYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVAL

Query:  VSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKWVAPYSKYEYGWWEAFLPPVTKAEA
        VSTN +WITFMKLRLDRVL +S+EA SL+EALAS PT LEF+KP+ WVAPY KYE GWW+ FLP VT+  A
Subjt:  VSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKWVAPYSKYEYGWWEAFLPPVTKAEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGCAGTTGGCCAAATGTTAGGCACTGTGAATCTTGTCATGGGCTCATCTTCAGCCGCCATGGCTACTCCAACGCA
TTCTGCTGCCGTGAAATCTCTTGCGAGCTCCAGAATTGGTCACCACAACCACTGCCGGACTCTTCCATTGCCTTTGGGATTGCCTTCTGCTTCTATTTCCAGCTCCTGCT
TCTTGTCTCCGCCAGGTTCTTCTCTCTCTTCGCCCTTCAGCACAGCAATCGCCGCCGTTAACTCCGATTCGGCTGACAAGCAAGAATCGAACAAGTATTATTTTGTAGTT
GCAAATGCGAAGTTCATGCTTGATGAGGAGGAGCATTTCAAAGAACTCCTGTTCGAGCGGCTTCGGTACTATGGCGAGCGTAACAAGGAGCAGGACTTTTGGCTTGTCAT
TGAGCCTAAGTTCTTGGACAAGTTTCCTAATATCACAAAGAGATTGCAGAGACCTGCCGTTGCTCTTGTTTCAACCAATAGTTCCTGGATTACGTTCATGAAGCTGAGAC
TGGATCGAGTTTTAGCGGAAAGCTATGAAGCCAACAGCTTAGAAGAAGCATTGGCTTCGACCCCGACCAACCTCGAGTTTGAGAAGCCTGAAAAATGGGTGGCTCCCTAT
TCCAAATATGAATATGGATGGTGGGAGGCTTTCTTGCCGCCGGTGACAAAAGCAGAAGCAAAAGTATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCACAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGCAGTTGGCCAAATGTTAGGCACTGTGAATCTTGTCATGGGCTCATCTTCAGCCGCCATGGCTACTCCAACGCA
TTCTGCTGCCGTGAAATCTCTTGCGAGCTCCAGAATTGGTCACCACAACCACTGCCGGACTCTTCCATTGCCTTTGGGATTGCCTTCTGCTTCTATTTCCAGCTCCTGCT
TCTTGTCTCCGCCAGGTTCTTCTCTCTCTTCGCCCTTCAGCACAGCAATCGCCGCCGTTAACTCCGATTCGGCTGACAAGCAAGAATCGAACAAGTATTATTTTGTAGTT
GCAAATGCGAAGTTCATGCTTGATGAGGAGGAGCATTTCAAAGAACTCCTGTTCGAGCGGCTTCGGTACTATGGCGAGCGTAACAAGGAGCAGGACTTTTGGCTTGTCAT
TGAGCCTAAGTTCTTGGACAAGTTTCCTAATATCACAAAGAGATTGCAGAGACCTGCCGTTGCTCTTGTTTCAACCAATAGTTCCTGGATTACGTTCATGAAGCTGAGAC
TGGATCGAGTTTTAGCGGAAAGCTATGAAGCCAACAGCTTAGAAGAAGCATTGGCTTCGACCCCGACCAACCTCGAGTTTGAGAAGCCTGAAAAATGGGTGGCTCCCTAT
TCCAAATATGAATATGGATGGTGGGAGGCTTTCTTGCCGCCGGTGACAAAAGCAGAAGCAAAAGTATAAGCTGTATGTAGCTTTAATTTGTTTATTTACTAGTCTTTTTT
ACCCTCTCTGATCAAGTATGAATTTTGATGGTGTGAGGATTTCGTAAAACATCTGTAGTTGAAGGCAGCGTTTTTGGTAATATAGTGTGGTAAAAGTAAATCGTATTTCG
AAAGGCTCTAATATTAATTTTAAACTTTCAACTTTTGTAATATTAATTTGTG
Protein sequenceShow/hide protein sequence
MSQEEEEEEEEEAVGQMLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFVV
ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKWVAPY
SKYEYGWWEAFLPPVTKAEAKV