; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh20G003230 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh20G003230
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionYcf54-like protein
Genome locationCmo_Chr20:1578026..1580227
RNA-Seq ExpressionCmoCh20G003230
SyntenyCmoCh20G003230
Gene Ontology termsNA
InterPro domainsIPR019616 - Ycf54 protein
IPR038409 - Ycf54-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570612.1 putative protein ycf54, partial [Cucurbita argyrosperma subsp. sororia]4.2e-11698.67Show/hide
Query:  MGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYFLVANAKFM
        MGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDS DK  EESNKYYFLVANAKFM
Subjt:  MGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYFLVANAKFM

Query:  LDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
Subjt:  LDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEAFLPPVAKAEAKV
        VAPYSKYEYGWWEAFLPPVAKAEAKV
Subjt:  VAPYSKYEYGWWEAFLPPVAKAEAKV

KAG7010462.1 putative protein ycf54 [Cucurbita argyrosperma subsp. argyrosperma]8.1e-12098.29Show/hide
Query:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYF
        MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDS DK  EESNKYYF
Subjt:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYF

Query:  LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL
        LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL
Subjt:  LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL

Query:  EFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV
        EFEKPEKWVAPYSKYEYGWWEAFLPPVAK EAKV
Subjt:  EFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV

XP_022944632.1 uncharacterized protein LOC111449035 [Cucurbita moschata]9.6e-12199.15Show/hide
Query:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYF
        MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADK  EESNKYYF
Subjt:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYF

Query:  LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL
        LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL
Subjt:  LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL

Query:  EFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV
        EFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV
Subjt:  EFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV

XP_022986936.1 uncharacterized protein LOC111484525 [Cucurbita maxima]1.7e-11797.02Show/hide
Query:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLS-LRFNTAIAAVNSDQIVSSDSADKVIEESNKYY
        MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSS+FF SHRGSSLS LRF TA+AAVNSDQIVSSDSADK  EESNKYY
Subjt:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLS-LRFNTAIAAVNSDQIVSSDSADKVIEESNKYY

Query:  FLVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTN
        FLVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTN
Subjt:  FLVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTN

Query:  LEFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV
        LEFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV
Subjt:  LEFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV

XP_038900405.1 uncharacterized protein LOC120087636 [Benincasa hispida]3.1e-10388.03Show/hide
Query:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYF
        MLGTVNLVMGSSSAAMATPT C AVKSLASS+IG H+HCRT SLPLGL S+S SS+ F S  GSSLS  FNT IAAVN      SDSADK  +ESNKYYF
Subjt:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYF

Query:  LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL
        +VANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL
Subjt:  LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL

Query:  EFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV
        EFEKPEKWVAPYSKYEYGWWEAFLPPV KAEAKV
Subjt:  EFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV

TrEMBL top hitse value%identityAlignment
A0A5A7SVU2 Uncharacterized protein5.9e-9282.91Show/hide
Query:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYF
        MLGT NLVMGSSSAA+AT T  AAVKSL +S IG HN   T SLPL L S S  S+ F S   SSLS  FNTAIAAVN      SDSADK  +ES KYYF
Subjt:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYF

Query:  LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL
        LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRL+RPAVALVST+STWITFMKLRLDRVLAESYEANS+EEALASTPTNL
Subjt:  LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL

Query:  EFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV
        EFEKPEKWVAPYSKYEYGWWEAFLPPV KAEAKV
Subjt:  EFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV

A0A6J1D8T3 uncharacterized protein LOC1110179732.0e-10083.76Show/hide
Query:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYF
        MLGTVNLVMGSSSAAM TP Q AAVKSLAS++   H HCRT SLPLG A+ASGS++ F   RGSS S  F+TAIAAVNS+Q+VSSD ADK  +E+NKYYF
Subjt:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYF

Query:  LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL
        LVANAKFMLDEEEHFKELLFERLR YGERNKEQDFWLVIEPKFL+KFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANS EEA+AS PTN+
Subjt:  LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL

Query:  EFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV
        EFEKPEKWVAPY KYEYGWWE FLPPV KAEAKV
Subjt:  EFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV

A0A6J1FW50 uncharacterized protein LOC1114490354.7e-12199.15Show/hide
Query:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYF
        MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADK  EESNKYYF
Subjt:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYF

Query:  LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL
        LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL
Subjt:  LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL

Query:  EFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV
        EFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV
Subjt:  EFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV

A0A6J1G6K2 uncharacterized protein LOC1114512469.8e-9581.55Show/hide
Query:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYF
        MLGTVNLVMGSSSAAM T TQC A+KSLAS +IG H+HCR  SLP GL SASG S+ F + RGSS    FNTAIAAVN DQIVSSDSAD+  +ESNKYYF
Subjt:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYF

Query:  LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL
        +VANAKFMLDEEEHF+ELLFERLR Y ER+KEQDFWLVIEPKFL++FPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEA+SLE+ALAS PTN+
Subjt:  LVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNL

Query:  EFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAK
        EFEKPEKWVAPY KYEYGWWE FLPPV K EA+
Subjt:  EFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAK

A0A6J1JCP0 uncharacterized protein LOC1114845258.2e-11897.02Show/hide
Query:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLS-LRFNTAIAAVNSDQIVSSDSADKVIEESNKYY
        MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSS+FF SHRGSSLS LRF TA+AAVNSDQIVSSDSADK  EESNKYY
Subjt:  MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLS-LRFNTAIAAVNSDQIVSSDSADKVIEESNKYY

Query:  FLVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTN
        FLVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTN
Subjt:  FLVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTN

Query:  LEFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV
        LEFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV
Subjt:  LEFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEAKV

SwissProt top hitse value%identityAlignment
P51204 Uncharacterized protein ycf545.5e-1040.86Show/hide
Query:  YYFLVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLD-----KFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYE
        YYF +A+  F+L EE   +E+  ER+  Y   NKE DFWL+  PKFL+     KF N+   +   A+A++STNS +I ++KLR+  V    +E
Subjt:  YYFLVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLD-----KFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYE

P72777 Ycf54-like protein1.3e-1447.06Show/hide
Query:  YYFLVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPA--VALVSTNSTWITFMKLRLDRVLAESYEA--NSLEEAL
        YY+ +A+ KF+L EEE F+E+L ER R+YGE+NKE DFW VI+P FL+       + + P   VA+VSTN ++I ++KLRL+ VL   +EA  +++ + L
Subjt:  YYFLVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPA--VALVSTNSTWITFMKLRLDRVLAESYEA--NSLEEAL

Query:  AS
        AS
Subjt:  AS

Q1XDT3 Uncharacterized protein ycf543.0e-0836.96Show/hide
Query:  YYFLVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKR--LQRPAVALVSTNSTWITFMKLRLDRVLAESYEAN
        YYF +A+  F+L  +E  +E+  ER+  Y   NK  DFWL+  P FL+K   I+ +  + + AVA++STN  +I ++KLR+  +    +E N
Subjt:  YYFLVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKR--LQRPAVALVSTNSTWITFMKLRLDRVLAESYEAN

Arabidopsis top hitse value%identityAlignment
AT5G58250.1 unknown protein4.7e-5764.29Show/hide
Query:  ASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYFLVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNI
        +S S +  SSH  SS    F TA  ++     V+         ES KY+FLVANAKFMLDEEEHF+E LFERLR +GER   QDFWLVIEPKFLD FP I
Subjt:  ASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYFLVANAKFMLDEEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNI

Query:  TKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEA
        T+RL+RPAVALVSTN TWITFMKLRLDRVL +S+EA SL+EALAS PT LEF+KP+ WVAPY KYE GWW+ FLP V +  A
Subjt:  TKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKWVAPYSKYEYGWWEAFLPPVAKAEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGGCACGGTGAATCTAGTAATGGGCTCATCTTCAGCCGCCATGGCTACTCCAACGCAGTGTGCTGCCGTGAAATCTCTTGCGAGCTCCAAAATTGGTCTCCACAA
CCACTGCCGAACGTTTTCATTGCCTTTGGGATTGGCTTCTGCTTCTGGTTCCAGCGCTTTCTTCTCGTCTCATCGAGGTTCCTCTCTTTCTTTGCGGTTCAACACAGCAA
TCGCCGCCGTTAACTCCGATCAAATTGTGTCCTCCGATTCGGCGGACAAGGTAATCGAAGAATCGAACAAGTATTATTTTCTAGTTGCAAATGCGAAGTTCATGCTTGAT
GAGGAGGAGCATTTCAAAGAACTTCTGTTCGAGCGGCTTCGGAACTATGGCGAGCGTAACAAGGAGCAGGATTTTTGGCTGGTCATTGAGCCTAAGTTCTTGGACAAGTT
TCCTAATATCACAAAGAGATTGCAGAGACCTGCCGTTGCTCTTGTTTCAACCAACAGTACCTGGATTACGTTCATGAAGCTGAGACTGGACCGAGTTTTGGCCGAAAGTT
ACGAAGCCAATAGCTTAGAAGAAGCATTGGCTTCTACGCCCACCAACCTCGAGTTTGAGAAGCCTGAAAAATGGGTGGCTCCCTATTCCAAGTATGAATACGGATGGTGG
GAGGCTTTCTTGCCGCCGGTGGCAAAAGCAGAAGCAAAAGTATAA
mRNA sequenceShow/hide mRNA sequence
ATGTTAGGCACGGTGAATCTAGTAATGGGCTCATCTTCAGCCGCCATGGCTACTCCAACGCAGTGTGCTGCCGTGAAATCTCTTGCGAGCTCCAAAATTGGTCTCCACAA
CCACTGCCGAACGTTTTCATTGCCTTTGGGATTGGCTTCTGCTTCTGGTTCCAGCGCTTTCTTCTCGTCTCATCGAGGTTCCTCTCTTTCTTTGCGGTTCAACACAGCAA
TCGCCGCCGTTAACTCCGATCAAATTGTGTCCTCCGATTCGGCGGACAAGGTAATCGAAGAATCGAACAAGTATTATTTTCTAGTTGCAAATGCGAAGTTCATGCTTGAT
GAGGAGGAGCATTTCAAAGAACTTCTGTTCGAGCGGCTTCGGAACTATGGCGAGCGTAACAAGGAGCAGGATTTTTGGCTGGTCATTGAGCCTAAGTTCTTGGACAAGTT
TCCTAATATCACAAAGAGATTGCAGAGACCTGCCGTTGCTCTTGTTTCAACCAACAGTACCTGGATTACGTTCATGAAGCTGAGACTGGACCGAGTTTTGGCCGAAAGTT
ACGAAGCCAATAGCTTAGAAGAAGCATTGGCTTCTACGCCCACCAACCTCGAGTTTGAGAAGCCTGAAAAATGGGTGGCTCCCTATTCCAAGTATGAATACGGATGGTGG
GAGGCTTTCTTGCCGCCGGTGGCAAAAGCAGAAGCAAAAGTATAAGCTGTGTATGTAGCTTCGTTTCTTTAGTACGCCCTTTTTTTGTTTACCCCTTCTAATCAAGTATG
AATTTTGATTGTGTGAGGATTTCACAGAACATTTGGAGTTGAAGGTAGTTTTTTTTTTTTTTTGGTAATCTAATTGTATTTGAGAGGCTCGATATCTTCCGTGTGGGTCA
ATATCTGCAATGTAGTTCGTCGAACGATGAACGTAGTTCGAATGTCTTTTTATGTCTATTTTCTTTAATGGTTTAGCAAGTTTGTGCGTTGAAA
Protein sequenceShow/hide protein sequence
MLGTVNLVMGSSSAAMATPTQCAAVKSLASSKIGLHNHCRTFSLPLGLASASGSSAFFSSHRGSSLSLRFNTAIAAVNSDQIVSSDSADKVIEESNKYYFLVANAKFMLD
EEEHFKELLFERLRNYGERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKWVAPYSKYEYGWW
EAFLPPVAKAEAKV