CuGenDBv2

Lsi10G014930 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi10G014930
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Ycf54-like protein
Genome location: chr10:19126846..19130486
RNA-Seq Expression: Lsi10G014930
Synteny: Lsi10G014930
Gene Ontology terms:
  GO:0015995 - chlorophyll biosynthetic process (biological process)
  GO:0048529 - magnesium-protoporphyrin IX monomethyl ester (oxidative) cyclase activity (molecular function)
InterPro domains:
  IPR019616 - Ycf54 protein
  IPR038409 - Ycf54-like superfamily
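
The genome location in this record uses a chromosome:start..end form. As a rough sketch, the span of the locus can be derived from that string as shown here. The function name parse_location is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this record does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr10:19126846..19130486")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 3641 bp on chr10. This is the genomic span, so it would include any introns and untranslated regions in addition to the coding sequence.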


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7010462.1 putative protein ycf54 [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 5.1e-103 | identity: 88.36%
Query:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLSSPFNTAIAAVN------SDSANKQESNKYYFLV
        MLGTVNLVMGSSSAAMATPT CAAVKSLASS+IG HN+CRT SLPLGL SAS SS+ F S  GSSLS  FNTAIAAVN      SDS +K+ESNKYYFLV
Subjt:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLSSPFNTAIAAVN------SDSANKQESNKYYFLV

Query:  ANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
        ANAKFMLDEEEHFKELLFERLRNY ERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
Subjt:  ANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF

Query:  EKPEKWVAPYSKYEYGWWEVFLPPVTKAEAKV
        EKPEKWVAPYSKYEYGWWE FLPPV K EAKV
Subjt:  EKPEKWVAPYSKYEYGWWEVFLPPVTKAEAKV

XP_008456263.1 PREDICTED: uncharacterized protein LOC103496259 [Cucumis melo] | e-value: 6.7e-103 | identity: 89.82%
Query:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLSSPFNTAIAAVNSDSANKQESNKYYFLVANAKFM
        MLGT NLVMGSSSAA+AT TH AAVKSL +S IGHHN   T SLPL LPS SI SSCFLS P SSLSSPFNTAIAAVNSDSA+KQES KYYFLVANAKFM
Subjt:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLSSPFNTAIAAVNSDSANKQESNKYYFLVANAKFM

Query:  LDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLRNY ERNKEQDFWLVIEPKFLDKFPNITKRL+RPAVALVST+STWITFMKLRLDRVLAESYEANS+EEALASTPTNLEFEKPEKW
Subjt:  LDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEVFLPPVTKAEAKV
        VAPYSKYEYGWWE FLPPVTKAEAKV
Subjt:  VAPYSKYEYGWWEVFLPPVTKAEAKV

XP_022944632.1 uncharacterized protein LOC111449035 [Cucurbita moschata] | e-value: 6.0e-104 | identity: 89.22%
Query:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLSSPFNTAIAAVN------SDSANKQESNKYYFLV
        MLGTVNLVMGSSSAAMATPT CAAVKSLASS+IG HN+CRT SLPLGL SAS SS+ F S  GSSLS  FNTAIAAVN      SDSA+K+ESNKYYFLV
Subjt:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLSSPFNTAIAAVN------SDSANKQESNKYYFLV

Query:  ANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
        ANAKFMLDEEEHFKELLFERLRNY ERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
Subjt:  ANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF

Query:  EKPEKWVAPYSKYEYGWWEVFLPPVTKAEAKV
        EKPEKWVAPYSKYEYGWWE FLPPV KAEAKV
Subjt:  EKPEKWVAPYSKYEYGWWEVFLPPVTKAEAKV

XP_022986936.1 uncharacterized protein LOC111484525 [Cucurbita maxima] | e-value: 8.7e-103 | identity: 88.84%
Query:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLS-SPFNTAIAAVN------SDSANKQESNKYYFL
        MLGTVNLVMGSSSAAMATPT CAAVKSLASS+IG HN+CRT SLPLGL SAS SSS FLS  GSSLS   F TA+AAVN      SDSA+K+ESNKYYFL
Subjt:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLS-SPFNTAIAAVN------SDSANKQESNKYYFL

Query:  VANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLE
        VANAKFMLDEEEHFKELLFERLRNY ERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLE
Subjt:  VANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLE

Query:  FEKPEKWVAPYSKYEYGWWEVFLPPVTKAEAKV
        FEKPEKWVAPYSKYEYGWWE FLPPV KAEAKV
Subjt:  FEKPEKWVAPYSKYEYGWWEVFLPPVTKAEAKV

XP_038900405.1 uncharacterized protein LOC120087636 [Benincasa hispida] | e-value: 7.6e-115 | identity: 95.58%
Query:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLSSPFNTAIAAVNSDSANKQESNKYYFLVANAKFM
        MLGTVNLVMGSSSAAMATPTHC AVKSLASSRIG+H++CRTISLPLGLPS+SISSSCFLSPPGSSLSSPFNT IAAVNSDSA+KQESNKYYF+VANAKFM
Subjt:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLSSPFNTAIAAVNSDSANKQESNKYYFLVANAKFM

Query:  LDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLRNY ERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
Subjt:  LDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEVFLPPVTKAEAKV
        VAPYSKYEYGWWE FLPPVTKAEAKV
Subjt:  VAPYSKYEYGWWEVFLPPVTKAEAKV

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KB63 Uncharacterized protein | e-value: 8.8e-101 | identity: 86.28%
Query:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLSSPFNTAIAAVNSDSANKQESNKYYFLVANAKFM
        MLGTV+LVMGSSSAA+AT TH  A+KSL +SRIGHHN+  T+S P  LPS SI +S FLS P SSLSSPFNTAIAAVNSDSA+KQESNKYYFLVANAKFM
Subjt:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLSSPFNTAIAAVNSDSANKQESNKYYFLVANAKFM

Query:  LDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLRN+ ERNKEQ+FWLVIEPKFLDKFPNITKRL+RPAVALVST+STWITFMKLRLDRVLAESYEANS+EEALASTPTNLEFEKPE W
Subjt:  LDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEVFLPPVTKAEAKV
        VAPYSKYEYGWWE FLPP TKAEAKV
Subjt:  VAPYSKYEYGWWEVFLPPVTKAEAKV

A0A1S3C2E4 uncharacterized protein LOC103496259 | e-value: 3.2e-103 | identity: 89.82%
Query:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLSSPFNTAIAAVNSDSANKQESNKYYFLVANAKFM
        MLGT NLVMGSSSAA+AT TH AAVKSL +S IGHHN   T SLPL LPS SI SSCFLS P SSLSSPFNTAIAAVNSDSA+KQES KYYFLVANAKFM
Subjt:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLSSPFNTAIAAVNSDSANKQESNKYYFLVANAKFM

Query:  LDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLRNY ERNKEQDFWLVIEPKFLDKFPNITKRL+RPAVALVST+STWITFMKLRLDRVLAESYEANS+EEALASTPTNLEFEKPEKW
Subjt:  LDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEVFLPPVTKAEAKV
        VAPYSKYEYGWWE FLPPVTKAEAKV
Subjt:  VAPYSKYEYGWWEVFLPPVTKAEAKV

A0A5A7SVU2 Uncharacterized protein | e-value: 3.2e-103 | identity: 89.82%
Query:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLSSPFNTAIAAVNSDSANKQESNKYYFLVANAKFM
        MLGT NLVMGSSSAA+AT TH AAVKSL +S IGHHN   T SLPL LPS SI SSCFLS P SSLSSPFNTAIAAVNSDSA+KQES KYYFLVANAKFM
Subjt:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLSSPFNTAIAAVNSDSANKQESNKYYFLVANAKFM

Query:  LDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW
        LDEEEHFKELLFERLRNY ERNKEQDFWLVIEPKFLDKFPNITKRL+RPAVALVST+STWITFMKLRLDRVLAESYEANS+EEALASTPTNLEFEKPEKW
Subjt:  LDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKW

Query:  VAPYSKYEYGWWEVFLPPVTKAEAKV
        VAPYSKYEYGWWE FLPPVTKAEAKV
Subjt:  VAPYSKYEYGWWEVFLPPVTKAEAKV

A0A6J1FW50 uncharacterized protein LOC111449035 | e-value: 2.9e-104 | identity: 89.22%
Query:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLSSPFNTAIAAVN------SDSANKQESNKYYFLV
        MLGTVNLVMGSSSAAMATPT CAAVKSLASS+IG HN+CRT SLPLGL SAS SS+ F S  GSSLS  FNTAIAAVN      SDSA+K+ESNKYYFLV
Subjt:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLSSPFNTAIAAVN------SDSANKQESNKYYFLV

Query:  ANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
        ANAKFMLDEEEHFKELLFERLRNY ERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF
Subjt:  ANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEF

Query:  EKPEKWVAPYSKYEYGWWEVFLPPVTKAEAKV
        EKPEKWVAPYSKYEYGWWE FLPPV KAEAKV
Subjt:  EKPEKWVAPYSKYEYGWWEVFLPPVTKAEAKV

A0A6J1JCP0 uncharacterized protein LOC111484525 | e-value: 4.2e-103 | identity: 88.84%
Query:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLS-SPFNTAIAAVN------SDSANKQESNKYYFL
        MLGTVNLVMGSSSAAMATPT CAAVKSLASS+IG HN+CRT SLPLGL SAS SSS FLS  GSSLS   F TA+AAVN      SDSA+K+ESNKYYFL
Subjt:  MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLS-SPFNTAIAAVN------SDSANKQESNKYYFL

Query:  VANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLE
        VANAKFMLDEEEHFKELLFERLRNY ERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLE
Subjt:  VANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLE

Query:  FEKPEKWVAPYSKYEYGWWEVFLPPVTKAEAKV
        FEKPEKWVAPYSKYEYGWWE FLPPV KAEAKV
Subjt:  FEKPEKWVAPYSKYEYGWWEVFLPPVTKAEAKV

SwissProt top hits (e-value, % identity, alignment)
P51204 Uncharacterized protein ycf54 | e-value: 6.9e-10 | identity: 40.86%
Query:  YYFLVANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLD-----KFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYE
        YYF +A+  F+L EE   +E+  ER+  Y   NKE DFWL+  PKFL+     KF N+   +   A+A++STNS +I ++KLR+  V    +E
Subjt:  YYFLVANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLD-----KFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYE

P72777 Ycf54-like protein | e-value: 1.8e-13 | identity: 46.08%
Query:  YYFLVANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPA--VALVSTNSTWITFMKLRLDRVLAESYEA--NSLEEAL
        YY+ +A+ KF+L EEE F+E+L ER R+Y E+NKE DFW VI+P FL+       + + P   VA+VSTN ++I ++KLRL+ VL   +EA  +++ + L
Subjt:  YYFLVANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPA--VALVSTNSTWITFMKLRLDRVLAESYEA--NSLEEAL

Query:  AS
        AS
Subjt:  AS

Q1XDT3 Uncharacterized protein ycf54 | e-value: 3.8e-08 | identity: 36.96%
Query:  YYFLVANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKR--LQRPAVALVSTNSTWITFMKLRLDRVLAESYEAN
        YYF +A+  F+L  +E  +E+  ER+  Y   NK  DFWL+  P FL+K   I+ +  + + AVA++STN  +I ++KLR+  +    +E N
Subjt:  YYFLVANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKR--LQRPAVALVSTNSTWITFMKLRLDRVLAESYEAN

Arabidopsis top hits (e-value, % identity, alignment)
AT5G58250.1 unknown protein | e-value: 1.5e-60 | identity: 70.18%
Query:  SSCFLSPPGSSLSSPFNTAIAAVNSDSA-NKQESNKYYFLVANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVAL
        SS  LS P S  SS F TA  ++   S+ NK ES KY+FLVANAKFMLDEEEHF+E LFERLR + ER   QDFWLVIEPKFLD FP IT+RL+RPAVAL
Subjt:  SSCFLSPPGSSLSSPFNTAIAAVNSDSA-NKQESNKYYFLVANAKFMLDEEEHFKELLFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVAL

Query:  VSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKWVAPYSKYEYGWWEVFLPPVTKAEA
        VSTN TWITFMKLRLDRVL +S+EA SL+EALAS PT LEF+KP+ WVAPY KYE GWW+ FLP VT+  A
Subjt:  VSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKWVAPYSKYEYGWWEVFLPPVTKAEA


Sequences
CDS sequence
ATGTTAGGCACTGTGAATCTAGTTATGGGCTCATCTTCAGCCGCCATGGCTACTCCAACGCATTGTGCTGCCGTGAAATCTCTTGCGAGCTCCAGAATTGGTCACCACAA
TTACTGCCGAACGATTTCATTGCCTTTGGGATTGCCTTCTGCTTCTATTTCCAGCTCCTGCTTCTTGTCTCCACCAGGTTCTTCTCTCTCTTCGCCCTTCAACACAGCAA
TCGCCGCCGTTAACTCCGATTCGGCCAACAAGCAAGAATCGAACAAGTATTATTTTCTAGTTGCAAATGCGAAGTTCATGCTTGATGAGGAGGAGCATTTCAAAGAACTT
CTGTTCGAACGGCTTCGGAACTATGACGAGCGTAACAAGGAGCAGGATTTTTGGCTGGTCATCGAGCCTAAGTTCTTGGACAAGTTTCCTAATATCACAAAGAGATTGCA
GAGACCTGCCGTTGCTCTTGTTTCAACCAATAGTACCTGGATTACGTTCATGAAGCTGAGACTGGATCGAGTTTTAGCCGAAAGTTATGAAGCCAACAGCTTAGAAGAAG
CATTGGCTTCTACCCCAACCAACCTCGAGTTTGAGAAGCCTGAAAAGTGGGTGGCTCCCTATTCCAAGTATGAATATGGATGGTGGGAGGTTTTCTTGCCGCCAGTAACA
AAAGCAGAAGCAAAAGTATAA
mRNA sequence
GAAGATTTTGCTGATTAATTTAGTTAGCCATTTCATCCATTTTTGCTCTGCTCAATTTCAACATTCACAAGAGAAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAG
AAGAAGTTGGCCAAATGTTAGGCACTGTGAATCTAGTTATGGGCTCATCTTCAGCCGCCATGGCTACTCCAACGCATTGTGCTGCCGTGAAATCTCTTGCGAGCTCCAGA
ATTGGTCACCACAATTACTGCCGAACGATTTCATTGCCTTTGGGATTGCCTTCTGCTTCTATTTCCAGCTCCTGCTTCTTGTCTCCACCAGGTTCTTCTCTCTCTTCGCC
CTTCAACACAGCAATCGCCGCCGTTAACTCCGATTCGGCCAACAAGCAAGAATCGAACAAGTATTATTTTCTAGTTGCAAATGCGAAGTTCATGCTTGATGAGGAGGAGC
ATTTCAAAGAACTTCTGTTCGAACGGCTTCGGAACTATGACGAGCGTAACAAGGAGCAGGATTTTTGGCTGGTCATCGAGCCTAAGTTCTTGGACAAGTTTCCTAATATC
ACAAAGAGATTGCAGAGACCTGCCGTTGCTCTTGTTTCAACCAATAGTACCTGGATTACGTTCATGAAGCTGAGACTGGATCGAGTTTTAGCCGAAAGTTATGAAGCCAA
CAGCTTAGAAGAAGCATTGGCTTCTACCCCAACCAACCTCGAGTTTGAGAAGCCTGAAAAGTGGGTGGCTCCCTATTCCAAGTATGAATATGGATGGTGGGAGGTTTTCT
TGCCGCCAGTAACAAAAGCAGAAGCAAAAGTATAAGCTGTATGTAGCTTTAATTTGTTTATTTACTAGCTTTTTTTACCCTCTCTGATCAAGTATGAATTTTGATGGTGT
GAGGATTTCGTAAAACAATTGTAGTTGAAGGCAGTTTTTTGGGTAATCTAGTTTGGTAAAAGTAAATCTTATTTCAAAGGCTCTAGATCCTGGCCACTTCTCTCTATATT
GTAAAGCTATCTTAAGGATAAATTTTATTTGTGTGTGTGTTTTTCATTTGAGTTGCATGCCCA
Protein sequence
MLGTVNLVMGSSSAAMATPTHCAAVKSLASSRIGHHNYCRTISLPLGLPSASISSSCFLSPPGSSLSSPFNTAIAAVNSDSANKQESNKYYFLVANAKFMLDEEEHFKEL
LFERLRNYDERNKEQDFWLVIEPKFLDKFPNITKRLQRPAVALVSTNSTWITFMKLRLDRVLAESYEANSLEEALASTPTNLEFEKPEKWVAPYSKYEYGWWEVFLPPVT
KAEAKV
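
The CDS and protein sequences in this record can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly; the names CODON_TABLE and translate are hypothetical. It translates only the first 30 nucleotides and the final line of the CDS as listed in this record.

```python
# Standard genetic code (NCBI table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame DNA string codon by codon; '*' marks a stop."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt of the CDS and the last line of the CDS from this record.
cds_start = "ATGTTAGGCACTGTGAATCTAGTTATGGGC"
cds_end = "AAAGCAGAAGCAAAAGTATAA"

print(translate(cds_start))  # MLGTVNLVMG
print(translate(cds_end))    # KAEAKV*
```

Both fragments agree with the start (MLGTVNLVMG) and end (KAEAKV) of the protein sequence, with the trailing * corresponding to the TAA stop codon. Passing the full concatenated CDS to translate would extend the same check to the whole protein.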