CuGenDBv2

HG10001636 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10001636
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Filamentous hemagglutinin transporter
Genome location: Chr09:18936261..18937176
RNA-Seq Expression: HG10001636
Synteny: HG10001636
Gene Ontology terms: NA
InterPro domains: NA
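
The genome location in this record has the form `chromosome:start..end`. A minimal Python sketch for splitting such a string and computing the span it covers is given here. It assumes the coordinates are 1-based and inclusive, which the record itself does not state, and the names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(location):
    """Split a string like 'Chr09:18936261..18937176' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1


chrom, start, end = parse_location("Chr09:18936261..18937176")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene is 916 bases.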


Homology
GenBank top hits (e value, % identity, alignment)
XP_004140015.1 uncharacterized protein LOC101202760 [Cucumis sativus] | e value: 2.0e-124 | % identity: 93.15
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLT YNK+DRHRT G+ES AEKLTPLITRDLL+GGYSKFTESQELDLDLHVPSGWE+RLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEFREEDNLKY
        FPPSPN SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPSPSYSSSSSS AAEKEFREE+NLK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEFREEDNLKY

Query:  LSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
        LSSPIAAGCPGCLSYVLVMKNNPTCPRCSS+VPLPA KKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_008456306.1 PREDICTED: uncharacterized protein LOC103496294 [Cucumis melo] | e value: 2.9e-123 | % identity: 91.94
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLT YNK+DRH T GNES AEKL PLITRDLL+GGYSKFTESQELDLDLHVPSGWE+RLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEFREEDNLKY
        FPPSPN+SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPSPSYSSSSSS AA+KEFREE+ LK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEFREEDNLKY

Query:  LSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
         SSPIAAGCPGCLSYVLVMKNNPTCPRCSS+VPLPAAKKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_022943495.1 uncharacterized protein LOC111448249 [Cucurbita moschata] | e value: 4.9e-99 | % identity: 81.4
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNN----NQTVPKL
        MAAEVSSLVRVLT YNK       GN+S  EKLT LITRDLLSGG     ESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNN    NQTVPKL
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNN----NQTVPKL

Query:  QDLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKS----SPSPSYSSSSSSVAAEKE
        QDLNFPPS PN+ KF+ S HL+DET+LDLKLVSS S SP   S R+NYQSVCTLDKVKSALERAERNPI+KRSSLWKS    SPSPSYSSSSSS AA KE
Subjt:  QDLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKS----SPSPSYSSSSSSVAAEKE

Query:  FREEDNLKYLSS-PIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
        F+EE NLK+LSS PI AGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
Subjt:  FREEDNLKYLSS-PIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_023512147.1 uncharacterized protein LOC111776954 [Cucurbita pepo subsp. pepo] | e value: 4.9e-99 | % identity: 81.71
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNN---NQTVPKLQ
        MAAEVSSLVRVLT YNK       GN+S  EKLT LITRDLLSGG     ESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNN   NQTVPKLQ
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNN---NQTVPKLQ

Query:  DLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKS----SPSPSYSSSSSSVAAEKEF
        DLNFPPS PN+ KF  S HL+DET+LDLKLVSS S SP   S R+NYQSVCTLDKVKSALERAERNPI+KRSSLWKS    SPSPSYSSSSSS AA KEF
Subjt:  DLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKS----SPSPSYSSSSSSVAAEKEF

Query:  REEDNLKYLSS-PIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
        +EE NLK+LSS PI AGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
Subjt:  REEDNLKYLSS-PIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_038901828.1 uncharacterized protein LOC120088523 [Benincasa hispida] | e value: 2.8e-126 | % identity: 94.76
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLTGYNKDDRHRTVGN+SAAEKLTPLITRDLLSGGYSK+TESQELDLDLHVPSGWE+RLDLKSGK FIQRCNVQDF NNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEFREEDNLKY
        FPPSPNFSKFQ SNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPSPSYSSSSSS  AEKEFR+EDNLK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEFREEDNLKY

Query:  LSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
        LSSPIAAGCPGCLSYVLVMKNNPTCPRC+SVVPLPA KKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KAN5 Uncharacterized protein | e value: 9.6e-125 | % identity: 93.15
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLT YNK+DRHRT G+ES AEKLTPLITRDLL+GGYSKFTESQELDLDLHVPSGWE+RLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEFREEDNLKY
        FPPSPN SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPSPSYSSSSSS AAEKEFREE+NLK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEFREEDNLKY

Query:  LSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
        LSSPIAAGCPGCLSYVLVMKNNPTCPRCSS+VPLPA KKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A1S3C2I3 uncharacterized protein LOC103496294 | e value: 1.4e-123 | % identity: 91.94
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLT YNK+DRH T GNES AEKL PLITRDLL+GGYSKFTESQELDLDLHVPSGWE+RLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEFREEDNLKY
        FPPSPN+SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPSPSYSSSSSS AA+KEFREE+ LK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEFREEDNLKY

Query:  LSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
         SSPIAAGCPGCLSYVLVMKNNPTCPRCSS+VPLPAAKKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A5D3CB81 Putative YUP8H12R.23 protein | e value: 1.4e-123 | % identity: 91.94
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLT YNK+DRH T GNES AEKL PLITRDLL+GGYSKFTESQELDLDLHVPSGWE+RLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEFREEDNLKY
        FPPSPN+SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPSPSYSSSSSS AA+KEFREE+ LK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEFREEDNLKY

Query:  LSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
         SSPIAAGCPGCLSYVLVMKNNPTCPRCSS+VPLPAAKKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A6J1FXV9 uncharacterized protein LOC111448249 | e value: 2.4e-99 | % identity: 81.4
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNN----NQTVPKL
        MAAEVSSLVRVLT YNK       GN+S  EKLT LITRDLLSGG     ESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNN    NQTVPKL
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNN----NQTVPKL

Query:  QDLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKS----SPSPSYSSSSSSVAAEKE
        QDLNFPPS PN+ KF+ S HL+DET+LDLKLVSS S SP   S R+NYQSVCTLDKVKSALERAERNPI+KRSSLWKS    SPSPSYSSSSSS AA KE
Subjt:  QDLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKS----SPSPSYSSSSSSVAAEKE

Query:  FREEDNLKYLSS-PIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
        F+EE NLK+LSS PI AGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
Subjt:  FREEDNLKYLSS-PIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A6J1I433 uncharacterized protein LOC111469711 | e value: 4.1e-99 | % identity: 78.21
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQD
        MAAEV+S VRVLTGYNKDD H TV NES  + LTPLITRDLL+GG SKFT+ QELDLDL +PSGWEKRLDLKSGKMFIQR NVQDFNN+  NQTV KLQD
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQD

Query:  LNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEFREEDNL
        LNFPPS N+SKF+LSNHLV ETSL+LKL SS S  P   SPRSNYQSVCTLDKVKSALERA++NPI+KRSSLWKSSPSPSYSSSSSS    +EF+EEDN 
Subjt:  LNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEFREEDNL

Query:  -KYLSS------PIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
         K LSS       IA GCPGCLSYVLVMKNNPTCPRC SVV LPA KKPR+DLNISI
Subjt:  -KYLSS------PIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G16500.1 unknown protein | e value: 2.1e-47 | % identity: 48.39
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLT-PLITRDLL------SGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFI-QRCNVQDFNNN---
        MAA+VSSLVR+L+ + KDD  RTV  +S   + T  L+TRDLL       GG     +S ELDLD+ VP+GWEKRLDLKSGK+++ Q+CN    +++   
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLT-PLITRDLL------SGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFI-QRCNVQDFNNN---

Query:  ---------NQTVPKLQDLNFPP-SPNFSKFQLSNHL--VDETSLDLKLV-SSLS---PSP----SPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSS
                 NQTVP+ QDLN PP S  F    L +     D+TSL+LKLV SS+S   P P    SP+   S   SVCTLDKVK ALERAE++  K++S 
Subjt:  ---------NQTVPKLQDLNFPP-SPNFSKFQLSNHL--VDETSLDLKLV-SSLS---PSP----SPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSS

Query:  LWKSSPSPSYSSSSSSVAAEKEFREEDNLKYLSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
                 Y  ++S+  A             +S +AAGCPGCLSYV V KNNP CPRC S VPLPA KKP+IDLNIS+
Subjt:  LWKSSPSPSYSSSSSSVAAEKEFREEDNLKYLSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

AT1G79160.1 unknown protein | e value: 6.1e-55 | % identity: 51.74
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTE-SQELDLDLHVPSGWEKRLDLKSGKMFIQRCN------VQDFNNNNQTV
        MAA+VSSLVR+L+GY KDDR   V + + A+    L+TRDLL  G     + S ELDLDL VP+G+EKRLDLKSGK+++QRCN      + + +  NQTV
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTE-SQELDLDLHVPSGWEKRLDLKSGKMFIQRCN------VQDFNNNNQTV

Query:  PKLQDLNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPR-SNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEF
        P  QDLNFPP P  +   L N L D+T+ +LKL+    PS   S P  SN QSVCTLDKVKSALERAER+P     +++K   SP           +   
Subjt:  PKLQDLNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPR-SNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEF

Query:  REEDNLKYLSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPA---AKKPRIDLNISI
         +    + ++SP+ AGCPGCLSYVLVM NNP CPRC ++VPLP     KKP+IDLNISI
Subjt:  REEDNLKYLSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPA---AKKPRIDLNISI

AT3G11600.1 unknown protein | e value: 1.3e-04 | % identity: 37.7
Query:  SPSPSYSSSSSSVAAEKEFREEDNLKYLSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPL
        SP+ S ++S SS  + +  +EE+  + ++S +  GCP CL YV++  ++P CP+C S V L
Subjt:  SPSPSYSSSSSSVAAEKEFREEDNLKYLSSPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPL

AT5G06270.1 unknown protein | e value: 2.9e-04 | % identity: 33.33
Query:  AERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEFREEDNLKYLSSP-----IAAGCPGCLSYVLVMKNNPTCPRCSSVVPL
        ++R  ++  S    +SP+   S  SS V++E   ++E +++Y +SP     +  GCP CL YV++ +++P CP+C S V L
Subjt:  AERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEFREEDNLKYLSSP-----IAAGCPGCLSYVLVMKNNPTCPRCSSVVPL
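
In BLAST-style output such as the alignments in this section, the line between `Query:` and `Subjt:` is the match line: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a rough sketch, the % identity value could be recomputed from the match-line segments as shown here. This assumes % identity means identical columns divided by alignment length, which the page does not state, and the helper name `percent_identity` is hypothetical.

```python
def percent_identity(match_segments):
    """Percent identity from the match-line segments of one alignment.

    Each segment is the text between a Query line and its Subjt line,
    with the leading indentation removed but inner and trailing spaces
    kept, because a space counts as a non-identical column.
    """
    match_line = "".join(match_segments)
    if not match_line:
        raise ValueError("empty alignment")
    identical = sum(1 for ch in match_line if ch.isalpha())
    return round(100.0 * identical / len(match_line), 2)


# Toy example (not taken from the record): 4 identical columns out of 5.
print(percent_identity(["MA+EV"]))
```

Trailing spaces in a match-line segment are easily lost when text is copied, so it may be safer to pad each segment to the length of its `Query:` sequence before calling the function.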


Sequences
CDS sequence
ATGGCCGCCGAGGTGAGCTCGCTCGTGCGAGTTCTTACAGGCTATAACAAAGACGATCGTCATCGGACGGTCGGAAACGAATCCGCCGCTGAGAAATTAACGCCTCTGAT
AACTCGAGACTTACTCAGCGGCGGCTATTCCAAATTTACAGAGTCCCAAGAACTTGACCTCGATCTCCATGTCCCTTCCGGCTGGGAAAAACGACTCGATTTGAAGTCAG
GGAAAATGTTTATTCAAAGATGCAATGTTCAAGATTTCAACAATAACAATCAAACAGTGCCAAAGCTTCAAGATTTGAATTTTCCACCGTCGCCCAATTTCTCTAAATTT
CAATTGTCGAATCATTTGGTCGATGAAACCAGTTTGGATTTGAAATTGGTTTCATCGTTATCGCCGTCACCGTCGCCGTCGTCGCCGAGGAGTAATTATCAGAGCGTTTG
TACTTTGGATAAGGTCAAATCGGCGCTCGAACGGGCTGAGAGAAATCCCATTAAGAAACGATCGTCGTTGTGGAAATCGTCTCCATCGCCGTCGTATTCGTCGTCGTCGT
CGTCAGTGGCGGCGGAGAAAGAGTTTCGAGAAGAAGACAACTTGAAATATTTGTCATCGCCAATCGCTGCCGGCTGCCCTGGGTGTTTGTCGTATGTGTTGGTGATGAAG
AACAATCCGACGTGTCCACGGTGTAGCTCCGTCGTGCCGTTGCCGGCGGCGAAGAAACCTCGGATTGATCTCAACATTTCGATTTGA
mRNA sequence
ATGGCCGCCGAGGTGAGCTCGCTCGTGCGAGTTCTTACAGGCTATAACAAAGACGATCGTCATCGGACGGTCGGAAACGAATCCGCCGCTGAGAAATTAACGCCTCTGAT
AACTCGAGACTTACTCAGCGGCGGCTATTCCAAATTTACAGAGTCCCAAGAACTTGACCTCGATCTCCATGTCCCTTCCGGCTGGGAAAAACGACTCGATTTGAAGTCAG
GGAAAATGTTTATTCAAAGATGCAATGTTCAAGATTTCAACAATAACAATCAAACAGTGCCAAAGCTTCAAGATTTGAATTTTCCACCGTCGCCCAATTTCTCTAAATTT
CAATTGTCGAATCATTTGGTCGATGAAACCAGTTTGGATTTGAAATTGGTTTCATCGTTATCGCCGTCACCGTCGCCGTCGTCGCCGAGGAGTAATTATCAGAGCGTTTG
TACTTTGGATAAGGTCAAATCGGCGCTCGAACGGGCTGAGAGAAATCCCATTAAGAAACGATCGTCGTTGTGGAAATCGTCTCCATCGCCGTCGTATTCGTCGTCGTCGT
CGTCAGTGGCGGCGGAGAAAGAGTTTCGAGAAGAAGACAACTTGAAATATTTGTCATCGCCAATCGCTGCCGGCTGCCCTGGGTGTTTGTCGTATGTGTTGGTGATGAAG
AACAATCCGACGTGTCCACGGTGTAGCTCCGTCGTGCCGTTGCCGGCGGCGAAGAAACCTCGGATTGATCTCAACATTTCGATTTGA
Protein sequence
MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWEKRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLNFPPSPNFSKF
QLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSPSYSSSSSSVAAEKEFREEDNLKYLSSPIAAGCPGCLSYVLVMK
NNPTCPRCSSVVPLPAAKKPRIDLNISI
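
As a consistency check, the CDS can be translated and compared with the protein sequence. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical, and the wrapped sequence lines would need to be joined into one string first (the function strips whitespace for that reason).

```python
BASES = "TCAG"
# Amino acids of the standard genetic code, in the order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS, stopping at the first stop codon.

    Raises KeyError on codons with characters other than A, C, G, T.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino = CODON_TABLE[cds[pos:pos + 3]]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First six codons of the CDS in this record.
print(translate("ATGGCCGCCGAGGTGAGC"))  # MAAEVS
```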