; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G036000 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G036000
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionFilamentous hemagglutinin transporter
Genome locationCicolChr02:31846077..31847214
RNA-Seq ExpressionCcUC02G036000
SyntenyCcUC02G036000
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140015.1 uncharacterized protein LOC101202760 [Cucumis sativus]8.3e-12392.74Show/hide
Query:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSS+VRVLT YNK+DRHRT G+ES AEKLTPLITRDLL+GGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS
        FPPSPN SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSSAAAEKEFREE+NLK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
        LSSPIAAGCPGCLSYVLVM NNPTCPRCSS+VPLPA KKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_008456306.1 PREDICTED: uncharacterized protein LOC103496294 [Cucumis melo]1.2e-12191.53Show/hide
Query:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSS+VRVLT YNK+DRH T GNES AEKL PLITRDLL+GGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS
        FPPSPN+SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSSAAA+KEFREE+ LK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
         SSPIAAGCPGCLSYVLVM NNPTCPRCSS+VPLPAAKKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_022943495.1 uncharacterized protein LOC111448249 [Cucurbita moschata]2.3e-9679.77Show/hide
Query:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN----NQTVPKL
        MAAEVSS+VRVLT YNK       GN+S  EKLT LITRDLLSGG     ESQELDLDLHVPSGWE+RLDLKSGKMFIQRCNVQDFNNN    NQTVPKL
Subjt:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN----NQTVPKL

Query:  QDLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYS---SSSSSAAAEKEF
        QDLNFPPS PN+ KF+ S HL+DET+LDLKLVSS S SP   S R+NYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS S S   SSSSS+AA KEF
Subjt:  QDLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYS---SSSSSAAAEKEF

Query:  REEDNLKSLSS-PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
        +EE NLK LSS PI AGCPGCLSYVLVM NNPTCPRCSSVVPLPAAKKPRIDLNISI
Subjt:  REEDNLKSLSS-PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_022970865.1 uncharacterized protein LOC111469711 [Cucurbita maxima]2.1e-9777.82Show/hide
Query:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQD
        MAAEV+S VRVLTGYNKDD H TV NES  + LTPLITRDLL+GG SKFT+ QELDLDL +PSGWE+RLDLKSGKMFIQR NVQDFNN+  NQTV KLQD
Subjt:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQD

Query:  LNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNL
        LNFPPS N+SKF+LSNHLV ETSL+LKL SS S  P   SPRSNYQSVCTLDKVKSALERA++NPI+KRSSLWKSSPS SYSSSSSSA   +EF+EEDN 
Subjt:  LNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNL

Query:  -KSLSS------PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
         KSLSS       IA GCPGCLSYVLVM NNPTCPRC SVV LPA KKPR+DLNISI
Subjt:  -KSLSS------PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_038901828.1 uncharacterized protein LOC120088523 [Benincasa hispida]3.1e-12594.76Show/hide
Query:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSS+VRVLTGYNKDDRHRTVGN+SAAEKLTPLITRDLLSGGYSK+TESQELDLDLHVPSGWERRLDLKSGK FIQRCNVQDF NNNQTVPKLQDLN
Subjt:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS
        FPPSPNFSKFQ SNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSSA AEKEFR+EDNLKS
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
        LSSPIAAGCPGCLSYVLVM NNPTCPRC+SVVPLPA KKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

TrEMBL top hitse value%identityAlignment
A0A0A0KAN5 Uncharacterized protein4.0e-12392.74Show/hide
Query:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSS+VRVLT YNK+DRHRT G+ES AEKLTPLITRDLL+GGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS
        FPPSPN SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSSAAAEKEFREE+NLK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
        LSSPIAAGCPGCLSYVLVM NNPTCPRCSS+VPLPA KKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A1S3C2I3 uncharacterized protein LOC1034962945.8e-12291.53Show/hide
Query:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSS+VRVLT YNK+DRH T GNES AEKL PLITRDLL+GGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS
        FPPSPN+SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSSAAA+KEFREE+ LK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
         SSPIAAGCPGCLSYVLVM NNPTCPRCSS+VPLPAAKKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A5D3CB81 Putative YUP8H12R.23 protein5.8e-12291.53Show/hide
Query:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSS+VRVLT YNK+DRH T GNES AEKL PLITRDLL+GGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS
        FPPSPN+SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSSAAA+KEFREE+ LK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
         SSPIAAGCPGCLSYVLVM NNPTCPRCSS+VPLPAAKKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A6J1FXV9 uncharacterized protein LOC1114482491.1e-9679.77Show/hide
Query:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN----NQTVPKL
        MAAEVSS+VRVLT YNK       GN+S  EKLT LITRDLLSGG     ESQELDLDLHVPSGWE+RLDLKSGKMFIQRCNVQDFNNN    NQTVPKL
Subjt:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN----NQTVPKL

Query:  QDLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYS---SSSSSAAAEKEF
        QDLNFPPS PN+ KF+ S HL+DET+LDLKLVSS S SP   S R+NYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS S S   SSSSS+AA KEF
Subjt:  QDLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYS---SSSSSAAAEKEF

Query:  REEDNLKSLSS-PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
        +EE NLK LSS PI AGCPGCLSYVLVM NNPTCPRCSSVVPLPAAKKPRIDLNISI
Subjt:  REEDNLKSLSS-PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A6J1I433 uncharacterized protein LOC1114697111.0e-9777.82Show/hide
Query:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQD
        MAAEV+S VRVLTGYNKDD H TV NES  + LTPLITRDLL+GG SKFT+ QELDLDL +PSGWE+RLDLKSGKMFIQR NVQDFNN+  NQTV KLQD
Subjt:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQD

Query:  LNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNL
        LNFPPS N+SKF+LSNHLV ETSL+LKL SS S  P   SPRSNYQSVCTLDKVKSALERA++NPI+KRSSLWKSSPS SYSSSSSSA   +EF+EEDN 
Subjt:  LNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNL

Query:  -KSLSS------PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
         KSLSS       IA GCPGCLSYVLVM NNPTCPRC SVV LPA KKPR+DLNISI
Subjt:  -KSLSS------PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G16500.1 unknown protein1.1e-4547.31Show/hide
Query:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLT-PLITRDLL------SGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFI-QRCNVQDFNNN---
        MAA+VSS+VR+L+ + KDD  RTV  +S   + T  L+TRDLL       GG     +S ELDLD+ VP+GWE+RLDLKSGK+++ Q+CN    +++   
Subjt:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLT-PLITRDLL------SGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFI-QRCNVQDFNNN---

Query:  ---------NQTVPKLQDLNFPP-SPNFSKFQLSNHL--VDETSLDLKLV-SSLS---PSP----SPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSS
                 NQTVP+ QDLN PP S  F    L +     D+TSL+LKLV SS+S   P P    SP+   S   SVCTLDKVK ALERAE++  K++S 
Subjt:  ---------NQTVPKLQDLNFPP-SPNFSKFQLSNHL--VDETSLDLKLV-SSLS---PSP----SPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSS

Query:  LWKSSPSTSYSSSSSSAAAEKEFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
                 Y  ++S+  A             +S +AAGCPGCLSYV V  NNP CPRC S VPLPA KKP+IDLNIS+
Subjt:  LWKSSPSTSYSSSSSSAAAEKEFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

AT1G79160.1 unknown protein5.1e-5452.87Show/hide
Query:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTE-SQELDLDLHVPSGWERRLDLKSGKMFIQRCN------VQDFNNNNQTV
        MAA+VSS+VR+L+GY KDDR   V + + A+    L+TRDLL  G     + S ELDLDL VP+G+E+RLDLKSGK+++QRCN      + + +  NQTV
Subjt:  MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTE-SQELDLDLHVPSGWERRLDLKSGKMFIQRCN------VQDFNNNNQTV

Query:  PKLQDLNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPR-SNYQSVCTLDKVKSALERAERNP--IKKRSSLWKSSPSTSYSSSSSSAAAEK
        P  QDLNFPP P  +   L N L D+T+ +LKL+    PS   S P  SN QSVCTLDKVKSALERAER+P   KKR    +S   T Y    + A A  
Subjt:  PKLQDLNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPR-SNYQSVCTLDKVKSALERAERNP--IKKRSSLWKSSPSTSYSSSSSSAAAEK

Query:  EFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPA---AKKPRIDLNISI
                    SP+ AGCPGCLSYVLVMMNNP CPRC ++VPLP     KKP+IDLNISI
Subjt:  EFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPA---AKKPRIDLNISI

AT3G11600.1 unknown protein2.9e-0437.7Show/hide
Query:  SPSTSYSSSSSSAAAEKEFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPL
        SP+ S ++S SS  + +  +EE+  ++++S +  GCP CL YV++  ++P CP+C S V L
Subjt:  SPSTSYSSSSSSAAAEKEFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCCGAAGTCAGCTCGATCGTGCGAGTTCTTACAGGCTACAACAAAGACGACCGTCATCGGACGGTCGGAAACGAATCCGCCGCCGAGAAATTAACGCCTCTGAT
CACCCGAGACTTACTCAGCGGCGGCTATTCCAAGTTTACAGAGTCCCAAGAACTTGACCTCGATCTCCATGTCCCTTCCGGCTGGGAAAGAAGACTCGATTTGAAGTCAG
GGAAAATGTTTATACAAAGATGCAATGTTCAAGATTTCAACAATAACAATCAAACAGTGCCAAAGCTTCAAGATTTGAATTTTCCACCATCGCCCAATTTCTCTAAATTT
CAATTGTCGAATCATTTGGTCGATGAAACCAGTTTGGATTTGAAATTGGTTTCGTCGTTATCGCCTTCGCCATCGCCGTCGTCGCCGAGGAGTAATTATCAGAGCGTCTG
TACTTTGGATAAGGTCAAATCGGCGCTCGAACGGGCTGAGAGAAATCCCATAAAAAAACGGTCGTCGTTGTGGAAATCGTCTCCGTCGACGTCGTATTCGTCGTCGTCGT
CGTCAGCGGCGGCGGAGAAAGAGTTTCGAGAAGAAGACAACTTGAAATCCTTGTCGTCGCCGATCGCCGCCGGCTGCCCTGGGTGTTTGTCGTATGTGTTGGTGATGATG
AACAATCCGACGTGTCCACGGTGTAGCTCCGTCGTGCCGTTGCCGGCGGCGAAGAAACCTCGGATTGATCTCAACATTTCGATTTGA
mRNA sequenceShow/hide mRNA sequence
CATATATAGAGAGGTGTGTGCAGCCATTGTACAAGAAAACCGACACTCCCATTTCTCTCTCTCTCTACTTTCTCTCTCTTCCTCTCCGTAAACAAAAATGGCCGCCGAAG
TCAGCTCGATCGTGCGAGTTCTTACAGGCTACAACAAAGACGACCGTCATCGGACGGTCGGAAACGAATCCGCCGCCGAGAAATTAACGCCTCTGATCACCCGAGACTTA
CTCAGCGGCGGCTATTCCAAGTTTACAGAGTCCCAAGAACTTGACCTCGATCTCCATGTCCCTTCCGGCTGGGAAAGAAGACTCGATTTGAAGTCAGGGAAAATGTTTAT
ACAAAGATGCAATGTTCAAGATTTCAACAATAACAATCAAACAGTGCCAAAGCTTCAAGATTTGAATTTTCCACCATCGCCCAATTTCTCTAAATTTCAATTGTCGAATC
ATTTGGTCGATGAAACCAGTTTGGATTTGAAATTGGTTTCGTCGTTATCGCCTTCGCCATCGCCGTCGTCGCCGAGGAGTAATTATCAGAGCGTCTGTACTTTGGATAAG
GTCAAATCGGCGCTCGAACGGGCTGAGAGAAATCCCATAAAAAAACGGTCGTCGTTGTGGAAATCGTCTCCGTCGACGTCGTATTCGTCGTCGTCGTCGTCAGCGGCGGC
GGAGAAAGAGTTTCGAGAAGAAGACAACTTGAAATCCTTGTCGTCGCCGATCGCCGCCGGCTGCCCTGGGTGTTTGTCGTATGTGTTGGTGATGATGAACAATCCGACGT
GTCCACGGTGTAGCTCCGTCGTGCCGTTGCCGGCGGCGAAGAAACCTCGGATTGATCTCAACATTTCGATTTGATTTTCAAAGAGATGGAAAAGTATTGTTATTGTAGAT
GGAGGGACAGAAATTATCGTCTTCAAGACATAGATTTTTTTTTTTTTTTTTTTGTTAGCTGTACTTAGAAACTATATGGAAAAAAAAAATGCAACAGGGAAGGTGGTG
Protein sequenceShow/hide protein sequence
MAAEVSSIVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLNFPPSPNFSKF
QLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSAAAEKEFREEDNLKSLSSPIAAGCPGCLSYVLVMM
NNPTCPRCSSVVPLPAAKKPRIDLNISI