; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC02G044170 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC02G044170
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionFilamentous hemagglutinin transporter
Genome locationCmU531Chr02:32195316..32196232
RNA-Seq ExpressionCmUC02G044170
SyntenyCmUC02G044170
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140015.1 uncharacterized protein LOC101202760 [Cucumis sativus]4.9e-12393.15Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLT YNK+DRHRT G+ES AEKLTPLITRDLL+GGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS
        FPPSPN SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSSAAAEKEFREE+NLK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
        LSSPIAAGCPGCLSYVLVM NNPTCPRCSS+VPLPA KKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_008456306.1 PREDICTED: uncharacterized protein LOC103496294 [Cucumis melo]7.1e-12291.94Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLT YNK+DRH T GNES AEKL PLITRDLL+GGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS
        FPPSPN+SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSSAAA+KEFREE+ LK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
         SSPIAAGCPGCLSYVLVM NNPTCPRCSS+VPLPAAKKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_022943495.1 uncharacterized protein LOC111448249 [Cucurbita moschata]1.3e-9680.16Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN----NQTVPKL
        MAAEVSSLVRVLT YNK       GN+S  EKLT LITRDLLSGG     ESQELDLDLHVPSGWE+RLDLKSGKMFIQRCNVQDFNNN    NQTVPKL
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN----NQTVPKL

Query:  QDLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYS---SSSSSAAAEKEF
        QDLNFPPS PN+ KF+ S HL+DET+LDLKLVSS S SP   S R+NYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS S S   SSSSS+AA KEF
Subjt:  QDLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYS---SSSSSAAAEKEF

Query:  REEDNLKSLSS-PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
        +EE NLK LSS PI AGCPGCLSYVLVM NNPTCPRCSSVVPLPAAKKPRIDLNISI
Subjt:  REEDNLKSLSS-PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_022970865.1 uncharacterized protein LOC111469711 [Cucurbita maxima]2.1e-9777.82Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQD
        MAAEV+S VRVLTGYNKDD H TV NES  + LTPLITRDLL+GG SKFT+ QELDLDL +PSGWE+RLDLKSGKMFIQR NVQDFNN+  NQTV KLQD
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQD

Query:  LNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNL
        LNFPPS N+SKF+LSNHLV ETSL+LKL SS S  P   SPRSNYQSVCTLDKVKSALERA++NPI+KRSSLWKSSPS SYSSSSSSA   +EF+EEDN 
Subjt:  LNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNL

Query:  -KSLSS------PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
         KSLSS       IA GCPGCLSYVLVM NNPTCPRC SVV LPA KKPR+DLNISI
Subjt:  -KSLSS------PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_038901828.1 uncharacterized protein LOC120088523 [Benincasa hispida]1.8e-12595.16Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLTGYNKDDRHRTVGN+SAAEKLTPLITRDLLSGGYSK+TESQELDLDLHVPSGWERRLDLKSGK FIQRCNVQDF NNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS
        FPPSPNFSKFQ SNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSSA AEKEFR+EDNLKS
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
        LSSPIAAGCPGCLSYVLVM NNPTCPRC+SVVPLPA KKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

TrEMBL top hitse value%identityAlignment
A0A0A0KAN5 Uncharacterized protein2.4e-12393.15Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLT YNK+DRHRT G+ES AEKLTPLITRDLL+GGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS
        FPPSPN SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSSAAAEKEFREE+NLK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
        LSSPIAAGCPGCLSYVLVM NNPTCPRCSS+VPLPA KKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A1S3C2I3 uncharacterized protein LOC1034962943.4e-12291.94Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLT YNK+DRH T GNES AEKL PLITRDLL+GGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS
        FPPSPN+SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSSAAA+KEFREE+ LK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
         SSPIAAGCPGCLSYVLVM NNPTCPRCSS+VPLPAAKKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A5D3CB81 Putative YUP8H12R.23 protein3.4e-12291.94Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLT YNK+DRH T GNES AEKL PLITRDLL+GGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS
        FPPSPN+SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSSAAA+KEFREE+ LK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
         SSPIAAGCPGCLSYVLVM NNPTCPRCSS+VPLPAAKKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A6J1FXV9 uncharacterized protein LOC1114482496.5e-9780.16Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN----NQTVPKL
        MAAEVSSLVRVLT YNK       GN+S  EKLT LITRDLLSGG     ESQELDLDLHVPSGWE+RLDLKSGKMFIQRCNVQDFNNN    NQTVPKL
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN----NQTVPKL

Query:  QDLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYS---SSSSSAAAEKEF
        QDLNFPPS PN+ KF+ S HL+DET+LDLKLVSS S SP   S R+NYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS S S   SSSSS+AA KEF
Subjt:  QDLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYS---SSSSSAAAEKEF

Query:  REEDNLKSLSS-PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
        +EE NLK LSS PI AGCPGCLSYVLVM NNPTCPRCSSVVPLPAAKKPRIDLNISI
Subjt:  REEDNLKSLSS-PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A6J1I433 uncharacterized protein LOC1114697111.0e-9777.82Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQD
        MAAEV+S VRVLTGYNKDD H TV NES  + LTPLITRDLL+GG SKFT+ QELDLDL +PSGWE+RLDLKSGKMFIQR NVQDFNN+  NQTV KLQD
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQD

Query:  LNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNL
        LNFPPS N+SKF+LSNHLV ETSL+LKL SS S  P   SPRSNYQSVCTLDKVKSALERA++NPI+KRSSLWKSSPS SYSSSSSSA   +EF+EEDN 
Subjt:  LNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNL

Query:  -KSLSS------PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
         KSLSS       IA GCPGCLSYVLVM NNPTCPRC SVV LPA KKPR+DLNISI
Subjt:  -KSLSS------PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G16500.1 unknown protein6.7e-4647.67Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLT-PLITRDLL------SGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFI-QRCNVQDFNNN---
        MAA+VSSLVR+L+ + KDD  RTV  +S   + T  L+TRDLL       GG     +S ELDLD+ VP+GWE+RLDLKSGK+++ Q+CN    +++   
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLT-PLITRDLL------SGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFI-QRCNVQDFNNN---

Query:  ---------NQTVPKLQDLNFPP-SPNFSKFQLSNHL--VDETSLDLKLV-SSLS---PSP----SPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSS
                 NQTVP+ QDLN PP S  F    L +     D+TSL+LKLV SS+S   P P    SP+   S   SVCTLDKVK ALERAE++  K++S 
Subjt:  ---------NQTVPKLQDLNFPP-SPNFSKFQLSNHL--VDETSLDLKLV-SSLS---PSP----SPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSS

Query:  LWKSSPSTSYSSSSSSAAAEKEFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
                 Y  ++S+  A             +S +AAGCPGCLSYV V  NNP CPRC S VPLPA KKP+IDLNIS+
Subjt:  LWKSSPSTSYSSSSSSAAAEKEFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

AT1G79160.1 unknown protein2.3e-5453.26Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTE-SQELDLDLHVPSGWERRLDLKSGKMFIQRCN------VQDFNNNNQTV
        MAA+VSSLVR+L+GY KDDR   V + + A+    L+TRDLL  G     + S ELDLDL VP+G+E+RLDLKSGK+++QRCN      + + +  NQTV
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTE-SQELDLDLHVPSGWERRLDLKSGKMFIQRCN------VQDFNNNNQTV

Query:  PKLQDLNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPR-SNYQSVCTLDKVKSALERAERNP--IKKRSSLWKSSPSTSYSSSSSSAAAEK
        P  QDLNFPP P  +   L N L D+T+ +LKL+    PS   S P  SN QSVCTLDKVKSALERAER+P   KKR    +S   T Y    + A A  
Subjt:  PKLQDLNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPR-SNYQSVCTLDKVKSALERAERNP--IKKRSSLWKSSPSTSYSSSSSSAAAEK

Query:  EFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPA---AKKPRIDLNISI
                    SP+ AGCPGCLSYVLVMMNNP CPRC ++VPLP     KKP+IDLNISI
Subjt:  EFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPA---AKKPRIDLNISI

AT3G11600.1 unknown protein2.9e-0437.7Show/hide
Query:  SPSTSYSSSSSSAAAEKEFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPL
        SP+ S ++S SS  + +  +EE+  ++++S +  GCP CL YV++  ++P CP+C S V L
Subjt:  SPSTSYSSSSSSAAAEKEFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCCGAAGTCAGCTCGCTCGTGCGAGTTCTTACAGGCTACAACAAAGACGACCGTCATCGGACGGTCGGAAACGAATCCGCCGCCGAGAAATTAACGCCTCTGAT
CACCCGAGACTTACTCAGCGGCGGCTATTCCAAGTTTACAGAGTCCCAAGAACTTGACCTCGATCTCCATGTCCCTTCCGGCTGGGAAAGAAGACTCGATTTGAAGTCAG
GGAAAATGTTTATACAAAGATGCAATGTTCAAGATTTCAACAATAACAATCAAACAGTGCCAAAGCTTCAAGATTTGAATTTTCCACCGTCGCCCAATTTCTCTAAATTT
CAATTGTCGAATCATTTGGTCGATGAAACCAGTTTGGATTTGAAATTGGTTTCGTCGTTATCGCCTTCGCCATCGCCATCATCGCCGAGGAGTAATTATCAGAGCGTCTG
TACTTTGGATAAGGTCAAATCGGCGCTCGAACGGGCTGAGAGAAATCCCATAAAAAAGCGGTCGTCGTTGTGGAAATCGTCTCCGTCGACGTCGTATTCGTCGTCGTCGT
CGTCAGCGGCGGCGGAGAAAGAGTTTCGAGAAGAAGACAACTTGAAATCTTTGTCGTCGCCGATCGCCGCCGGCTGCCCTGGGTGTTTGTCGTATGTGTTGGTGATGATG
AACAATCCGACGTGTCCACGGTGTAGCTCCGTCGTGCCGTTGCCGGCGGCTAAGAAACCTCGGATTGATCTCAATATTTCGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCCGAAGTCAGCTCGCTCGTGCGAGTTCTTACAGGCTACAACAAAGACGACCGTCATCGGACGGTCGGAAACGAATCCGCCGCCGAGAAATTAACGCCTCTGAT
CACCCGAGACTTACTCAGCGGCGGCTATTCCAAGTTTACAGAGTCCCAAGAACTTGACCTCGATCTCCATGTCCCTTCCGGCTGGGAAAGAAGACTCGATTTGAAGTCAG
GGAAAATGTTTATACAAAGATGCAATGTTCAAGATTTCAACAATAACAATCAAACAGTGCCAAAGCTTCAAGATTTGAATTTTCCACCGTCGCCCAATTTCTCTAAATTT
CAATTGTCGAATCATTTGGTCGATGAAACCAGTTTGGATTTGAAATTGGTTTCGTCGTTATCGCCTTCGCCATCGCCATCATCGCCGAGGAGTAATTATCAGAGCGTCTG
TACTTTGGATAAGGTCAAATCGGCGCTCGAACGGGCTGAGAGAAATCCCATAAAAAAGCGGTCGTCGTTGTGGAAATCGTCTCCGTCGACGTCGTATTCGTCGTCGTCGT
CGTCAGCGGCGGCGGAGAAAGAGTTTCGAGAAGAAGACAACTTGAAATCTTTGTCGTCGCCGATCGCCGCCGGCTGCCCTGGGTGTTTGTCGTATGTGTTGGTGATGATG
AACAATCCGACGTGTCCACGGTGTAGCTCCGTCGTGCCGTTGCCGGCGGCTAAGAAACCTCGGATTGATCTCAATATTTCGATTTGA
Protein sequenceShow/hide protein sequence
MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLNFPPSPNFSKF
QLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKSLSSPIAAGCPGCLSYVLVMM
NNPTCPRCSSVVPLPAAKKPRIDLNISI