; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G036400 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G036400
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionYcf54-like protein
Genome locationCicolChr02:32181332..32184416
RNA-Seq ExpressionCcUC02G036400
SyntenyCcUC02G036400
Gene Ontology termsGO:0015995 - chlorophyll biosynthetic process (biological process)
GO:0048529 - magnesium-protoporphyrin IX monomethyl ester (oxidative) cyclase activity (molecular function)
InterPro domainsIPR019616 - Ycf54 protein
IPR038409 - Ycf54-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7010462.1 putative protein ycf54 [Cucurbita argyrosperma subsp. argyrosperma]1.3e-9986.21Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLS---PPLSSPFSTAIAAVN------SDSADKQESNKYYFVV
        MLGTVNLVMGSSSAAMATPT  AAVKSLASS+IG HNHCRT  LPLGL SAS SS+ F S     LS  F+TAIAAVN      SDS DK+ESNKYYF+V
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLS---PPLSSPFSTAIAAVN------SDSADKQESNKYYFVV

Query:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
        ANAKFMLDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
Subjt:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF

Query:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        EKPEKWVAPYSKYEYGWWEAFLPPV K EAKV
Subjt:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

XP_008456263.1 PREDICTED: uncharacterized protein LOC103496259 [Cucumis melo]3.3e-10087.61Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLSPP---LSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM
        MLGT NLVMGSSSAA+AT TH AAVKSL +S IG+HN   T  LPL LPS SI SSCFLS P   LSSPF+TAIAAVNSDSADKQES KYYF+VANAKFM
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLSPP---LSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM

Query:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRL+RPAVALVST+S+WITFMKLRLDRVLAESYEANS+EEALASTPTNLEFEKPEKW
Subjt:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEAFLPPVTKAEAKV
        VAPYSKYEYGWWEAFLPPVTKAEAKV
Subjt:  VAPYSKYEYGWWEAFLPPVTKAEAKV

XP_022944632.1 uncharacterized protein LOC111449035 [Cucurbita moschata]1.5e-10087.07Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLS---PPLSSPFSTAIAAVN------SDSADKQESNKYYFVV
        MLGTVNLVMGSSSAAMATPT  AAVKSLASS+IG HNHCRT  LPLGL SAS SS+ F S     LS  F+TAIAAVN      SDSADK+ESNKYYF+V
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLS---PPLSSPFSTAIAAVN------SDSADKQESNKYYFVV

Query:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
        ANAKFMLDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
Subjt:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF

Query:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        EKPEKWVAPYSKYEYGWWEAFLPPV KAEAKV
Subjt:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

XP_022986936.1 uncharacterized protein LOC111484525 [Cucurbita maxima]8.8e-10187.12Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLSPPLSS----PFSTAIAAVN------SDSADKQESNKYYFV
        MLGTVNLVMGSSSAAMATPT  AAVKSLASS+IG HNHCRT  LPLGL SAS SSS FLS   SS     F TA+AAVN      SDSADK+ESNKYYF+
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLSPPLSS----PFSTAIAAVN------SDSADKQESNKYYFV

Query:  VANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLE
        VANAKFMLDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVLAESYEANSLEEALASTPTNLE
Subjt:  VANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLE

Query:  FEKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        FEKPEKWVAPYSKYEYGWWEAFLPPV KAEAKV
Subjt:  FEKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

XP_038900405.1 uncharacterized protein LOC120087636 [Benincasa hispida]4.9e-11294.25Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLSPP---LSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM
        MLGTVNLVMGSSSAAMATPTH  AVKSLASSRIGYH+HCRT+ LPLGLPS+SISSSCFLSPP   LSSPF+T IAAVNSDSADKQESNKYYFVVANAKFM
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLSPP---LSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM

Query:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
Subjt:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEAFLPPVTKAEAKV
        VAPYSKYEYGWWEAFLPPVTKAEAKV
Subjt:  VAPYSKYEYGWWEAFLPPVTKAEAKV

TrEMBL top hitse value%identityAlignment
A0A1S3C2E4 uncharacterized protein LOC1034962591.6e-10087.61Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLSPP---LSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM
        MLGT NLVMGSSSAA+AT TH AAVKSL +S IG+HN   T  LPL LPS SI SSCFLS P   LSSPF+TAIAAVNSDSADKQES KYYF+VANAKFM
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLSPP---LSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM

Query:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRL+RPAVALVST+S+WITFMKLRLDRVLAESYEANS+EEALASTPTNLEFEKPEKW
Subjt:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEAFLPPVTKAEAKV
        VAPYSKYEYGWWEAFLPPVTKAEAKV
Subjt:  VAPYSKYEYGWWEAFLPPVTKAEAKV

A0A5A7SVU2 Uncharacterized protein1.6e-10087.61Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLSPP---LSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM
        MLGT NLVMGSSSAA+AT TH AAVKSL +S IG+HN   T  LPL LPS SI SSCFLS P   LSSPF+TAIAAVNSDSADKQES KYYF+VANAKFM
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLSPP---LSSPFSTAIAAVNSDSADKQESNKYYFVVANAKFM

Query:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRL+RPAVALVST+S+WITFMKLRLDRVLAESYEANS+EEALASTPTNLEFEKPEKW
Subjt:  LDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEAFLPPVTKAEAKV
        VAPYSKYEYGWWEAFLPPVTKAEAKV
Subjt:  VAPYSKYEYGWWEAFLPPVTKAEAKV

A0A6J1D8T3 uncharacterized protein LOC1110179738.8e-9983.69Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFL----SPPLSSPFSTAIAAVN------SDSADKQESNKYYFV
        MLGTVNLVMGSSSAAM TP   AAVKSLAS+R  +H HCRTL LPLG  +AS S+SCFL    S P SSPFSTAIAAVN      SD ADKQE+NKYYF+
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFL----SPPLSSPFSTAIAAVN------SDSADKQESNKYYFV

Query:  VANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLE
        VANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFL+KFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVLAESYEANS EEA+AS PTN+E
Subjt:  VANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLE

Query:  FEKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        FEKPEKWVAPY KYEYGWWE FLPPVTKAEAKV
Subjt:  FEKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

A0A6J1FW50 uncharacterized protein LOC1114490357.2e-10187.07Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLS---PPLSSPFSTAIAAVN------SDSADKQESNKYYFVV
        MLGTVNLVMGSSSAAMATPT  AAVKSLASS+IG HNHCRT  LPLGL SAS SS+ F S     LS  F+TAIAAVN      SDSADK+ESNKYYF+V
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLS---PPLSSPFSTAIAAVN------SDSADKQESNKYYFVV

Query:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
        ANAKFMLDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
Subjt:  ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF

Query:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        EKPEKWVAPYSKYEYGWWEAFLPPV KAEAKV
Subjt:  EKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

A0A6J1JCP0 uncharacterized protein LOC1114845254.2e-10187.12Show/hide
Query:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLSPPLSS----PFSTAIAAVN------SDSADKQESNKYYFV
        MLGTVNLVMGSSSAAMATPT  AAVKSLASS+IG HNHCRT  LPLGL SAS SSS FLS   SS     F TA+AAVN      SDSADK+ESNKYYF+
Subjt:  MLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLSPPLSS----PFSTAIAAVN------SDSADKQESNKYYFV

Query:  VANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLE
        VANAKFMLDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNS+WITFMKLRLDRVLAESYEANSLEEALASTPTNLE
Subjt:  VANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLE

Query:  FEKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV
        FEKPEKWVAPYSKYEYGWWEAFLPPV KAEAKV
Subjt:  FEKPEKWVAPYSKYEYGWWEAFLPPVTKAEAKV

SwissProt top hitse value%identityAlignment
P51204 Uncharacterized protein ycf546.7e-1141.94Show/hide
Query:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLD-----KFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYE
        YYF +A+  F+L EE   +E+  ER+ YY   NKE DFWL+  PKFL+     KF N+   +   A+A++STNS +I ++KLR+  V    +E
Subjt:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLD-----KFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYE

P72777 Ycf54-like protein1.3e-1448.04Show/hide
Query:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPA--VALVSTNSSWITFMKLRLDRVLAESYEA--NSLEEAL
        YY+ +A+ KF+L EEE F+E+L ER R YGE+NKE DFW VI+P FL+       + + P   VA+VSTN S+I ++KLRL+ VL   +EA  +++ + L
Subjt:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPA--VALVSTNSSWITFMKLRLDRVLAESYEA--NSLEEAL

Query:  AS
        AS
Subjt:  AS

Q1XDT3 Uncharacterized protein ycf543.7e-0938.04Show/hide
Query:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKR--LQRPAVALVSTNSSWITFMKLRLDRVLAESYEAN
        YYF +A+  F+L  +E  +E+  ER+ YY   NK  DFWL+  P FL+K   I+ +  + + AVA++STN  +I ++KLR+  +    +E N
Subjt:  YYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKR--LQRPAVALVSTNSSWITFMKLRLDRVLAESYEAN

Arabidopsis top hitse value%identityAlignment
AT5G58250.1 unknown protein1.9e-6166.67Show/hide
Query:  RTLPLPLGLPSASISSSCFLSPPLSSPFSTAIAAVNSDSA-DKQESNKYYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPN
        +T  L L  PS S+ SS F     SS F TA  ++   S+ +K ES KY+F+VANAKFMLDEEEHF+E LFERLRY+GER   QDFWLVIEPKFLD FP 
Subjt:  RTLPLPLGLPSASISSSCFLSPPLSSPFSTAIAAVNSDSA-DKQESNKYYFVVANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPN

Query:  ITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKWVAPYSKYEYGWWEAFLPPVTKAEA
        IT+RL+RPAVALVSTN +WITFMKLRLDRVL +S+EA SL+EALAS PT LEF+KP+ WVAPY KYE GWW+ FLP VT+  A
Subjt:  ITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKWVAPYSKYEYGWWEAFLPPVTKAEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGCAGTTGCCCAAATGTTAGGCACTGTGAATCTTGTCATGGGCTCATCTTCAGCCGCCATGGCTAC
TCCAACGCATTCTGCTGCCGTGAAATCTCTTGCGAGCTCCAGAATTGGTTACCACAACCACTGCCGGACTCTTCCATTGCCTTTGGGATTGCCTTCTGCTTCTATTTCCA
GCTCCTGCTTCTTGTCTCCGCCACTCTCTTCCCCCTTCAGCACAGCAATCGCCGCCGTTAACTCCGATTCGGCCGACAAGCAAGAATCGAACAAGTATTATTTTGTAGTT
GCAAATGCGAAGTTCATGCTTGATGAGGAGGAGCATTTCAAAGAACTCCTGTTCGAGCGGCTTCGGTACTATGGCGAGCGTAACAAGGAGCAGGACTTTTGGCTTGTCAT
TGAGCCTAAGTTCTTGGACAAGTTTCCTAATATCACAAAGAGATTGCAGAGACCTGCCGTTGCTCTTGTTTCAACCAATAGTTCCTGGATTACGTTCATGAAGCTGAGAC
TGGATCGAGTTTTAGCGGAAAGCTATGAAGCCAACAGCTTAGAAGAAGCATTGGCTTCGACCCCGACCAACCTCGAGTTTGAGAAGCCTGAAAAATGGGTGGCTCCCTAT
TCCAAATATGAATATGGATGGTGGGAGGCTTTCTTGCCGCCGGTGACAAAAGCAGAAGCGAAAGTATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCACAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGCAGTTGCCCAAATGTTAGGCACTGTGAATCTTGTCATGGGCTCATCTTCAGCCGCCATGGCTAC
TCCAACGCATTCTGCTGCCGTGAAATCTCTTGCGAGCTCCAGAATTGGTTACCACAACCACTGCCGGACTCTTCCATTGCCTTTGGGATTGCCTTCTGCTTCTATTTCCA
GCTCCTGCTTCTTGTCTCCGCCACTCTCTTCCCCCTTCAGCACAGCAATCGCCGCCGTTAACTCCGATTCGGCCGACAAGCAAGAATCGAACAAGTATTATTTTGTAGTT
GCAAATGCGAAGTTCATGCTTGATGAGGAGGAGCATTTCAAAGAACTCCTGTTCGAGCGGCTTCGGTACTATGGCGAGCGTAACAAGGAGCAGGACTTTTGGCTTGTCAT
TGAGCCTAAGTTCTTGGACAAGTTTCCTAATATCACAAAGAGATTGCAGAGACCTGCCGTTGCTCTTGTTTCAACCAATAGTTCCTGGATTACGTTCATGAAGCTGAGAC
TGGATCGAGTTTTAGCGGAAAGCTATGAAGCCAACAGCTTAGAAGAAGCATTGGCTTCGACCCCGACCAACCTCGAGTTTGAGAAGCCTGAAAAATGGGTGGCTCCCTAT
TCCAAATATGAATATGGATGGTGGGAGGCTTTCTTGCCGCCGGTGACAAAAGCAGAAGCGAAAGTATAAGCTGTATGTAGCTTTAATTTGTTTATTTACTAGTCTTTTTT
ACCCTCTCTGATCAAGTATGAATTTTGATGGTGTGAGGATTTCGTAAAACATCTGTACTTGGAGGCAGTTTTTTTGGTAATATAGTGTGGTAAAAGTAAATCGTATTTCA
AGAGGCTCTCATATTAATTTTAAACTTTGAACTTTTGTAATATTAATTTGTA
Protein sequenceShow/hide protein sequence
MSQEEEEEEEEEEEEAVAQMLGTVNLVMGSSSAAMATPTHSAAVKSLASSRIGYHNHCRTLPLPLGLPSASISSSCFLSPPLSSPFSTAIAAVNSDSADKQESNKYYFVV
ANAKFMLDEEEHFKELLFERLRYYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSSWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKWVAPY
SKYEYGWWEAFLPPVTKAEAKV