CuGenDBv2

Clc02G19990 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc02G19990
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Ycf54-like protein
Genome location: ClcChr02:32595586..32598631
RNA-Seq Expression: Clc02G19990
Synteny: Clc02G19990
Gene Ontology terms:
  GO:0015995 - chlorophyll biosynthetic process (biological process)
  GO:0048529 - magnesium-protoporphyrin IX monomethyl ester (oxidative) cyclase activity (molecular function)
InterPro domains:
  IPR019616 - Ycf54 protein
  IPR038409 - Ycf54-like superfamily
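
The genome location field packs a sequence ID and a coordinate range into one string. The following is a minimal sketch of how such a string could be split and the span length computed. The function names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("ClcChr02:32595586..32598631")
print(seqid, span_length(start, end))  # ClcChr02 3046
```

Under that assumption the gene spans 3046 bp on ClcChr02. This is longer than the mRNA listed under Sequences, which would be consistent with the gene containing introns.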


Homology
GenBank top hits (e value, % identity, alignment)
KAG7010462.1 putative protein ycf54 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.3e-101, % identity: 87.07)
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVN------SDSADKQESNKYYFVV
        MLGTVNLVMGSSSAAMATPT  AAVKSLASS+IG HNHCRT  LPLGL SAS SS+ F S  GSSLS  F+TAIAAVN      SDS DK+ESNKYYF+V
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVN------SDSADKQESNKYYFVV

Query:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEF
        ANAKFMLDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVL ESYEANSLEEALASTPTNLEF
Subjt:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEF

Query:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        EKPEKWVAPYSKYEYGWWEAFLPPV K EAKV
Subjt:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

XP_008456263.1 PREDICTED: uncharacterized protein LOC103496259 [Cucumis melo] (e value: 6.1e-102, % identity: 88.5)
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM
        MLGT NLVMGSSSAA+AT TH AAVKSL +S IGHHN   T  LPL LPS SI SSCFLS P SSLSSPF+TAIAAVNSDSADKQES KYYF+VANAKFM
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM

Query:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRL+RPAVALVST+S+WITFMKLRLDRVL ESYEANS+EEALASTPTNLEFEKPEKW
Subjt:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEAFLPPVTKAEAKV
        VAPYSKYEYGWWEAFLPPVTKAEAKV
Subjt:  VAPYSKYEYGWWEAFLPPVTKAEAKV

XP_022944632.1 uncharacterized protein LOC111449035 [Cucurbita moschata] (e value: 2.7e-102, % identity: 87.93)
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVN------SDSADKQESNKYYFVV
        MLGTVNLVMGSSSAAMATPT  AAVKSLASS+IG HNHCRT  LPLGL SAS SS+ F S  GSSLS  F+TAIAAVN      SDSADK+ESNKYYF+V
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVN------SDSADKQESNKYYFVV

Query:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEF
        ANAKFMLDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVL ESYEANSLEEALASTPTNLEF
Subjt:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEF

Query:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        EKPEKWVAPYSKYEYGWWEAFLPPV KAEAKV
Subjt:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

XP_022986936.1 uncharacterized protein LOC111484525 [Cucurbita maxima] (e value: 1.0e-101, % identity: 87.98)
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLS-SPFSTAIAAVN------SDSADKQESNKYYFV
        MLGTVNLVMGSSSAAMATPT  AAVKSLASS+IG HNHCRT  LPLGL SAS SSS FLS  GSSLS   F TA+AAVN      SDSADK+ESNKYYF+
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLS-SPFSTAIAAVN------SDSADKQESNKYYFV

Query:  VANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLE
        VANAKFMLDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVL ESYEANSLEEALASTPTNLE
Subjt:  VANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLE

Query:  FEKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        FEKPEKWVAPYSKYEYGWWEAFLPPV KAEAKV
Subjt:  FEKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

XP_038900405.1 uncharacterized protein LOC120087636 [Benincasa hispida] (e value: 1.2e-113, % identity: 94.69)
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM
        MLGTVNLVMGSSSAAMATPTH  AVKSLASSRIG+H+HCRT+ LPLGLPS+SISSSCFLSPPGSSLSSPF+T IAAVNSDSADKQESNKYYFVVANAKFM
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM

Query:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVL ESYEANSLEEALASTPTNLEFEKPEKW
Subjt:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEAFLPPVTKAEAKV
        VAPYSKYEYGWWEAFLPPVTKAEAKV
Subjt:  VAPYSKYEYGWWEAFLPPVTKAEAKV

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C2E4 uncharacterized protein LOC103496259 (e value: 2.9e-102, % identity: 88.5)
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM
        MLGT NLVMGSSSAA+AT TH AAVKSL +S IGHHN   T  LPL LPS SI SSCFLS P SSLSSPF+TAIAAVNSDSADKQES KYYF+VANAKFM
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM

Query:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRL+RPAVALVST+S+WITFMKLRLDRVL ESYEANS+EEALASTPTNLEFEKPEKW
Subjt:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEAFLPPVTKAEAKV
        VAPYSKYEYGWWEAFLPPVTKAEAKV
Subjt:  VAPYSKYEYGWWEAFLPPVTKAEAKV

A0A5A7SVU2 Uncharacterized protein (e value: 2.9e-102, % identity: 88.5)
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM
        MLGT NLVMGSSSAA+AT TH AAVKSL +S IGHHN   T  LPL LPS SI SSCFLS P SSLSSPF+TAIAAVNSDSADKQES KYYF+VANAKFM
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM

Query:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRL+RPAVALVST+S+WITFMKLRLDRVL ESYEANS+EEALASTPTNLEFEKPEKW
Subjt:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEAFLPPVTKAEAKV
        VAPYSKYEYGWWEAFLPPVTKAEAKV
Subjt:  VAPYSKYEYGWWEAFLPPVTKAEAKV

A0A6J1D8T3 uncharacterized protein LOC111017973 (e value: 3.3e-101, % identity: 84.48)
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVN------SDSADKQESNKYYFVV
        MLGTVNLVMGSSSAAM TP   AAVKSLAS+R  HH HCRTL LPLG  +AS S+SCFL   GSS SSPFSTAIAAVN      SD ADKQE+NKYYF+V
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVN------SDSADKQESNKYYFVV

Query:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEF
        ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFL+KFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVL ESYEANS EEA+AS PTN+EF
Subjt:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEF

Query:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        EKPEKWVAPY KYEYGWWE FLPPVTKAEAKV
Subjt:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

A0A6J1FW50 uncharacterized protein LOC111449035 (e value: 1.3e-102, % identity: 87.93)
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVN------SDSADKQESNKYYFVV
        MLGTVNLVMGSSSAAMATPT  AAVKSLASS+IG HNHCRT  LPLGL SAS SS+ F S  GSSLS  F+TAIAAVN      SDSADK+ESNKYYF+V
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVN------SDSADKQESNKYYFVV

Query:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEF
        ANAKFMLDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVL ESYEANSLEEALASTPTNLEF
Subjt:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEF

Query:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        EKPEKWVAPYSKYEYGWWEAFLPPV KAEAKV
Subjt:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

A0A6J1JCP0 uncharacterized protein LOC111484525 (e value: 5.0e-102, % identity: 87.98)
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLS-SPFSTAIAAVN------SDSADKQESNKYYFV
        MLGTVNLVMGSSSAAMATPT  AAVKSLASS+IG HNHCRT  LPLGL SAS SSS FLS  GSSLS   F TA+AAVN      SDSADK+ESNKYYF+
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLS-SPFSTAIAAVN------SDSADKQESNKYYFV

Query:  VANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLE
        VANAKFMLDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVL ESYEANSLEEALASTPTNLE
Subjt:  VANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLE

Query:  FEKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        FEKPEKWVAPYSKYEYGWWEAFLPPV KAEAKV
Subjt:  FEKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

SwissProt top hits (e value, % identity, alignment)
P51204 Uncharacterized protein ycf54 (e value: 2.3e-11, % identity: 41.94)
Query:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLD-----KFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYE
        YYF +A+  F+L EE   +E+  ER+ YY   NKE DFWL+  PKFL+     KF N+   +   A+A++STNS +I ++KLR+  V +  +E
Subjt:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLD-----KFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYE

P72777 Ycf54-like protein (e value: 1.3e-14, % identity: 48.04)
Query:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPA--VALVSTNSSWITFMKLRLDRVLVESYEA--NSLEEAL
        YY+ +A+ KF+L EEE F+E+L ER R YGE+NKE DFW VI+P FL+       + + P   VA+VSTN S+I ++KLRL+ VL   +EA  +++ + L
Subjt:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPA--VALVSTNSSWITFMKLRLDRVLVESYEA--NSLEEAL

Query:  AS
        AS
Subjt:  AS

Q1XDT3 Uncharacterized protein ycf54 (e value: 1.3e-09, % identity: 38.04)
Query:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKR--LQRPAVALVSTNSSWITFMKLRLDRVLVESYEAN
        YYF +A+  F+L  +E  +E+  ER+ YY   NK  DFWL+  P FL+K   I+ +  + + AVA++STN  +I ++KLR+  + +  +E N
Subjt:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKR--LQRPAVALVSTNSSWITFMKLRLDRVLVESYEAN

Arabidopsis top hits (e value, % identity, alignment)
AT5G58250.1 unknown protein (e value: 3.3e-61, % identity: 69.59)
Query:  SSCFLSPPGSSLSSPFSTAIAAVNSDSA-DKQESNKYYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVAL
        SS  LS P S  SS F TA  ++   S+ +K ES KY+F+VANAKFMLDEEEHF+E LFERLRY+GER   QDFWLVIEPKFLD FP IT+RL+RPAVAL
Subjt:  SSCFLSPPGSSLSSPFSTAIAAVNSDSA-DKQESNKYYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVAL

Query:  VSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEFEKPEKWVAPYSKYEYGWWEAFLPPVTKAEA
        VSTN +WITFMKLRLDRVL +S+EA SL+EALAS PT LEF+KP+ WVAPY KYE GWW+ FLP VT+  A
Subjt:  VSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEFEKPEKWVAPYSKYEYGWWEAFLPPVTKAEA
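
In the alignments above, the unlabeled middle line follows the usual BLAST text convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, percent identity for a single alignment segment can be estimated by counting the letters on that line. The function name `identity_from_midline` is hypothetical. The percentages reported on this page cover each whole alignment, so a single segment will generally give a different number.

```python
def identity_from_midline(midline: str, aligned_length: int) -> float:
    """Percent identity = identical columns / aligned columns * 100.

    Letters on the midline count as identities; '+' and spaces do not.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Final segment of the first GenBank hit (KAG7010462.1) shown above.
query = "EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV"
midline = "EKPEKWVAPYSKYEYGWWEAFLPPV K EAKV"
print(identity_from_midline(midline, len(query)))  # 93.75
```

The aligned length is passed in explicitly because leading and trailing spaces on the midline are significant columns and are easily lost when the text is copied.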


Sequences
CDS sequence
ATGTCACAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGCAGTTGGCCAAATGTTAGGCACTGTGAATCTTGTCATGGGCTCATCTTCAGCCGCCATGGCTACTCCAAC
GCATTCTGCTGCCGTGAAATCTCTTGCGAGCTCCAGAATTGGTCACCACAACCACTGCCGGACTCTTCCATTGCCTTTGGGATTGCCTTCTGCTTCTATTTCCAGCTCCT
GCTTCTTGTCTCCGCCAGGTTCTTCTCTCTCTTCGCCCTTCAGCACAGCAATCGCCGCCGTTAACTCCGATTCGGCTGACAAGCAAGAATCGAACAAGTATTATTTTGTA
GTTGCAAATGCGAAGTTCATGCTTGATGAGGAGGAGCATTTCAAAGAACTCCTGTTCGAGCGGCTTCGGTACTATGGCGAGCGTAACAAGGAGCAGGACTTTTGGCTTGT
CATTGAGCCTAAGTTCTTGGACAAGTTTCCTAATATCACAAAGAGATTGCAGAGACCTGCCGTTGCTCTTGTTTCAACCAATAGTTCCTGGATTACGTTCATGAAGCTGA
GACTGGATCGAGTTTTAGTGGAAAGCTATGAAGCCAACAGCTTAGAAGAAGCATTGGCTTCGACCCCGACCAACCTCGAGTTTGAGAAGCCTGAAAAATGGGTGGCTCCC
TATTCCAAATATGAATATGGATGGTGGGAGGCTTTCTTGCCGCCGGTGACAAAAGCAGAAGCAAAAGTATAA
mRNA sequence
ATGTCACAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGCAGTTGGCCAAATGTTAGGCACTGTGAATCTTGTCATGGGCTCATCTTCAGCCGCCATGGCTACTCCAAC
GCATTCTGCTGCCGTGAAATCTCTTGCGAGCTCCAGAATTGGTCACCACAACCACTGCCGGACTCTTCCATTGCCTTTGGGATTGCCTTCTGCTTCTATTTCCAGCTCCT
GCTTCTTGTCTCCGCCAGGTTCTTCTCTCTCTTCGCCCTTCAGCACAGCAATCGCCGCCGTTAACTCCGATTCGGCTGACAAGCAAGAATCGAACAAGTATTATTTTGTA
GTTGCAAATGCGAAGTTCATGCTTGATGAGGAGGAGCATTTCAAAGAACTCCTGTTCGAGCGGCTTCGGTACTATGGCGAGCGTAACAAGGAGCAGGACTTTTGGCTTGT
CATTGAGCCTAAGTTCTTGGACAAGTTTCCTAATATCACAAAGAGATTGCAGAGACCTGCCGTTGCTCTTGTTTCAACCAATAGTTCCTGGATTACGTTCATGAAGCTGA
GACTGGATCGAGTTTTAGTGGAAAGCTATGAAGCCAACAGCTTAGAAGAAGCATTGGCTTCGACCCCGACCAACCTCGAGTTTGAGAAGCCTGAAAAATGGGTGGCTCCC
TATTCCAAATATGAATATGGATGGTGGGAGGCTTTCTTGCCGCCGGTGACAAAAGCAGAAGCAAAAGTATAAGCTGTATGTAGCTTTAATTTGTTTATTTACTAGTCTTT
TTTACCCTCTCTGATCAAGTATGAATTTTGATGGTGTGAGGATTTCGTAAAACATCTGTAGTTGAAGGCAGCGTTTTTGGTAATATAGTGTGGTAAAAGTAAATCGTATT
TCGAAAGGCTCTAATATTAATTTTAAACTTTCAACTTTTGTAATATTAATTTGTG
Protein sequence
MSQEEEEEEEEEEAVGQMLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGHHNHCRTLPLPLGLPSASISSSCFLSPPGSSLSSPFSTAIAAVNSDSADKQESNKYYFV
VANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLVESYEANSLEEALASTPTNLEFEKPEKWVAP
YSKYEYGWWEAFLPPVTKAEAKV
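
The protein sequence should correspond to a translation of the CDS. The sketch below shows one way to check that, assuming the standard genetic code (NCBI translation table 1); the record does not say which table was used. The `translate` helper is hypothetical and is applied here only to the first and last few codons of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGTCACAAGAAGAA"))  # MSQEE (start of the CDS)
print(translate("GCAAAAGTATAA"))     # AKV   (end of the CDS, stop codon dropped)
```

Both fragments agree with the start (`MSQEE...`) and end (`...AKV`) of the protein sequence listed above. Passing the full CDS to the same function would allow a complete comparison.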