; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC02G044070 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC02G044070
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionFilamentous hemagglutinin transporter
Genome locationCiama_Chr02:31846200..31847277
RNA-Seq ExpressionCaUC02G044070
SyntenyCaUC02G044070
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140015.1 uncharacterized protein LOC101202760 [Cucumis sativus]6.4e-12392.74Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLT YNK+DRHRT G+ES AEKLTPLITRDLL+GGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSVAAEKEFREEDNLKS
        FPPSPN SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSS AAEKEFREE+NLK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSVAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
        LSSPIAAGCPGCLSYVLVM NNPTCPRCSS+VPLPA KKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_008456306.1 PREDICTED: uncharacterized protein LOC103496294 [Cucumis melo]9.2e-12291.53Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLT YNK+DRH T GNES AEKL PLITRDLL+GGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSVAAEKEFREEDNLKS
        FPPSPN+SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSS AA+KEFREE+ LK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSVAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
         SSPIAAGCPGCLSYVLVM NNPTCPRCSS+VPLPAAKKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_022943495.1 uncharacterized protein LOC111448249 [Cucurbita moschata]1.3e-9680.16Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN----NQTVPKL
        MAAEVSSLVRVLT YNK       GN+S  EKLT LITRDLLSGG     ESQELDLDLHVPSGWE+RLDLKSGKMFIQRCNVQDFNNN    NQTVPKL
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN----NQTVPKL

Query:  QDLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYS---SSSSSVAAEKEF
        QDLNFPPS PN+ KF+ S HL+DET+LDLKLVSS S SP   S R+NYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS S S   SSSSS AA KEF
Subjt:  QDLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYS---SSSSSVAAEKEF

Query:  REEDNLKSLSS-PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
        +EE NLK LSS PI AGCPGCLSYVLVM NNPTCPRCSSVVPLPAAKKPRIDLNISI
Subjt:  REEDNLKSLSS-PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_022970865.1 uncharacterized protein LOC111469711 [Cucurbita maxima]3.5e-9777.43Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQD
        MAAEV+S VRVLTGYNKDD H TV NES  + LTPLITRDLL+GG SKFT+ QELDLDL +PSGWE+RLDLKSGKMFIQR NVQDFNN+  NQTV KLQD
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQD

Query:  LNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSVAAEKEFREEDNL
        LNFPPS N+SKF+LSNHLV ETSL+LKL SS S  P   SPRSNYQSVCTLDKVKSALERA++NPI+KRSSLWKSSPS SYSSSSSS    +EF+EEDN 
Subjt:  LNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSVAAEKEFREEDNL

Query:  -KSLSS------PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
         KSLSS       IA GCPGCLSYVLVM NNPTCPRC SVV LPA KKPR+DLNISI
Subjt:  -KSLSS------PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_038901828.1 uncharacterized protein LOC120088523 [Benincasa hispida]2.3e-12594.76Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLTGYNKDDRHRTVGN+SAAEKLTPLITRDLLSGGYSK+TESQELDLDLHVPSGWERRLDLKSGK FIQRCNVQDF NNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSVAAEKEFREEDNLKS
        FPPSPNFSKFQ SNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSS  AEKEFR+EDNLKS
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSVAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
        LSSPIAAGCPGCLSYVLVM NNPTCPRC+SVVPLPA KKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

TrEMBL top hitse value%identityAlignment
A0A0A0KAN5 Uncharacterized protein3.1e-12392.74Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLT YNK+DRHRT G+ES AEKLTPLITRDLL+GGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSVAAEKEFREEDNLKS
        FPPSPN SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSS AAEKEFREE+NLK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSVAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
        LSSPIAAGCPGCLSYVLVM NNPTCPRCSS+VPLPA KKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A1S3C2I3 uncharacterized protein LOC1034962944.5e-12291.53Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLT YNK+DRH T GNES AEKL PLITRDLL+GGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSVAAEKEFREEDNLKS
        FPPSPN+SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSS AA+KEFREE+ LK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSVAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
         SSPIAAGCPGCLSYVLVM NNPTCPRCSS+VPLPAAKKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A5D3CB81 Putative YUP8H12R.23 protein4.5e-12291.53Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
        MAAEVSSLVRVLT YNK+DRH T GNES AEKL PLITRDLL+GGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLN

Query:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSVAAEKEFREEDNLKS
        FPPSPN+SKFQL+NHLVDETSLDLKLVSSLS SPS SSPRSNYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS SYSSSSSS AA+KEFREE+ LK 
Subjt:  FPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSVAAEKEFREEDNLKS

Query:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
         SSPIAAGCPGCLSYVLVM NNPTCPRCSS+VPLPAAKKPRIDLNISI
Subjt:  LSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A6J1FXV9 uncharacterized protein LOC1114482496.5e-9780.16Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN----NQTVPKL
        MAAEVSSLVRVLT YNK       GN+S  EKLT LITRDLLSGG     ESQELDLDLHVPSGWE+RLDLKSGKMFIQRCNVQDFNNN    NQTVPKL
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN----NQTVPKL

Query:  QDLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYS---SSSSSVAAEKEF
        QDLNFPPS PN+ KF+ S HL+DET+LDLKLVSS S SP   S R+NYQSVCTLDKVKSALERAERNPI+KRSSLWKSSPS S S   SSSSS AA KEF
Subjt:  QDLNFPPS-PNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYS---SSSSSVAAEKEF

Query:  REEDNLKSLSS-PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
        +EE NLK LSS PI AGCPGCLSYVLVM NNPTCPRCSSVVPLPAAKKPRIDLNISI
Subjt:  REEDNLKSLSS-PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A6J1I433 uncharacterized protein LOC1114697111.7e-9777.43Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQD
        MAAEV+S VRVLTGYNKDD H TV NES  + LTPLITRDLL+GG SKFT+ QELDLDL +PSGWE+RLDLKSGKMFIQR NVQDFNN+  NQTV KLQD
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQD

Query:  LNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSVAAEKEFREEDNL
        LNFPPS N+SKF+LSNHLV ETSL+LKL SS S  P   SPRSNYQSVCTLDKVKSALERA++NPI+KRSSLWKSSPS SYSSSSSS    +EF+EEDN 
Subjt:  LNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSVAAEKEFREEDNL

Query:  -KSLSS------PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
         KSLSS       IA GCPGCLSYVLVM NNPTCPRC SVV LPA KKPR+DLNISI
Subjt:  -KSLSS------PIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G16500.1 unknown protein3.9e-4647.67Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLT-PLITRDLL------SGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFI-QRCNVQDFNNN---
        MAA+VSSLVR+L+ + KDD  RTV  +S   + T  L+TRDLL       GG     +S ELDLD+ VP+GWE+RLDLKSGK+++ Q+CN    +++   
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLT-PLITRDLL------SGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFI-QRCNVQDFNNN---

Query:  ---------NQTVPKLQDLNFPP-SPNFSKFQLSNHL--VDETSLDLKLV-SSLS---PSP----SPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSS
                 NQTVP+ QDLN PP S  F    L +     D+TSL+LKLV SS+S   P P    SP+   S   SVCTLDKVK ALERAE++  K++S 
Subjt:  ---------NQTVPKLQDLNFPP-SPNFSKFQLSNHL--VDETSLDLKLV-SSLS---PSP----SPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSS

Query:  LWKSSPSTSYSSSSSSVAAEKEFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI
                 Y  ++S+  A             +S +AAGCPGCLSYV V  NNP CPRC S VPLPA KKP+IDLNIS+
Subjt:  LWKSSPSTSYSSSSSSVAAEKEFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI

AT1G79160.1 unknown protein4.7e-5553.26Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTE-SQELDLDLHVPSGWERRLDLKSGKMFIQRCN------VQDFNNNNQTV
        MAA+VSSLVR+L+GY KDDR   V + + A+    L+TRDLL  G     + S ELDLDL VP+G+E+RLDLKSGK+++QRCN      + + +  NQTV
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTE-SQELDLDLHVPSGWERRLDLKSGKMFIQRCN------VQDFNNNNQTV

Query:  PKLQDLNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPR-SNYQSVCTLDKVKSALERAERNP--IKKRSSLWKSSPSTSYSSSSSSVAAEK
        P  QDLNFPP P  +   L N L D+T+ +LKL+    PS   S P  SN QSVCTLDKVKSALERAER+P   KKR    +S   T Y           
Subjt:  PKLQDLNFPPSPNFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPR-SNYQSVCTLDKVKSALERAERNP--IKKRSSLWKSSPSTSYSSSSSSVAAEK

Query:  EFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPA---AKKPRIDLNISI
         +R E    +++SP+ AGCPGCLSYVLVMMNNP CPRC ++VPLP     KKP+IDLNISI
Subjt:  EFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPLPA---AKKPRIDLNISI

AT3G11600.1 unknown protein2.2e-0437.7Show/hide
Query:  SPSTSYSSSSSSVAAEKEFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPL
        SP+ S ++S SS  + +  +EE+  ++++S +  GCP CL YV++  ++P CP+C S V L
Subjt:  SPSTSYSSSSSSVAAEKEFREEDNLKSLSSPIAAGCPGCLSYVLVMMNNPTCPRCSSVVPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCCGAAGTCAGCTCGCTCGTGCGAGTTCTTACAGGCTACAACAAAGACGACCGTCATCGGACGGTCGGAAACGAATCCGCCGCCGAGAAATTAACGCCT
CTGATCACCCGAGACTTACTCAGCGGCGGCTATTCCAAGTTTACAGAGTCCCAAGAACTTGACCTCGATCTCCATGTCCCTTCCGGCTGGGAAAGAAGACTCGAT
TTGAAGTCAGGGAAAATGTTTATACAAAGATGCAATGTTCAAGATTTCAACAATAACAATCAAACAGTGCCAAAGCTTCAAGATTTGAATTTTCCACCGTCGCCC
AATTTCTCTAAATTTCAATTGTCGAATCATTTGGTCGATGAAACCAGTTTGGATTTGAAATTGGTTTCGTCGTTATCGCCTTCGCCATCGCCGTCGTCGCCGAGG
AGTAATTATCAGAGCGTCTGTACTTTGGATAAGGTCAAATCGGCGCTCGAACGGGCTGAGAGAAATCCCATAAAAAAACGGTCGTCGTTGTGGAAATCGTCTCCG
TCGACGTCGTATTCGTCGTCGTCGTCGTCAGTGGCGGCGGAGAAAGAGTTTCGAGAAGAAGACAACTTGAAATCTTTGTCGTCGCCGATCGCCGCCGGCTGCCCT
GGGTGTTTGTCGTATGTGTTGGTGATGATGAACAATCCGACGTGTCCACGGTGTAGCTCCGTCGTGCCGTTGCCGGCGGCTAAGAAACCTCGGATTGATCTCAAC
ATTTCGATTTGA
mRNA sequenceShow/hide mRNA sequence
CTCCCATTTCTCTCTCTCTACTTTCTCTCTCTTCCTCTCCGTAAACAAAAATGGCCGCCGAAGTCAGCTCGCTCGTGCGAGTTCTTACAGGCTACAACAAAGACG
ACCGTCATCGGACGGTCGGAAACGAATCCGCCGCCGAGAAATTAACGCCTCTGATCACCCGAGACTTACTCAGCGGCGGCTATTCCAAGTTTACAGAGTCCCAAG
AACTTGACCTCGATCTCCATGTCCCTTCCGGCTGGGAAAGAAGACTCGATTTGAAGTCAGGGAAAATGTTTATACAAAGATGCAATGTTCAAGATTTCAACAATA
ACAATCAAACAGTGCCAAAGCTTCAAGATTTGAATTTTCCACCGTCGCCCAATTTCTCTAAATTTCAATTGTCGAATCATTTGGTCGATGAAACCAGTTTGGATT
TGAAATTGGTTTCGTCGTTATCGCCTTCGCCATCGCCGTCGTCGCCGAGGAGTAATTATCAGAGCGTCTGTACTTTGGATAAGGTCAAATCGGCGCTCGAACGGG
CTGAGAGAAATCCCATAAAAAAACGGTCGTCGTTGTGGAAATCGTCTCCGTCGACGTCGTATTCGTCGTCGTCGTCGTCAGTGGCGGCGGAGAAAGAGTTTCGAG
AAGAAGACAACTTGAAATCTTTGTCGTCGCCGATCGCCGCCGGCTGCCCTGGGTGTTTGTCGTATGTGTTGGTGATGATGAACAATCCGACGTGTCCACGGTGTA
GCTCCGTCGTGCCGTTGCCGGCGGCTAAGAAACCTCGGATTGATCTCAACATTTCGATTTGATTTTCAAAGAGATGGAAAAGTATTGTTACTGTAGATGGAGGGA
CAGAAATTATCGTCTTCAAGACATAGATTTTATTTTATTTTTTTCATTTTATTTTTTTTGTTTTTTTTTTTGTTTTTTTTTTT
Protein sequenceShow/hide protein sequence
MAAEVSSLVRVLTGYNKDDRHRTVGNESAAEKLTPLITRDLLSGGYSKFTESQELDLDLHVPSGWERRLDLKSGKMFIQRCNVQDFNNNNQTVPKLQDLNFPPSP
NFSKFQLSNHLVDETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIKKRSSLWKSSPSTSYSSSSSSVAAEKEFREEDNLKSLSSPIAAGCP
GCLSYVLVMMNNPTCPRCSSVVPLPAAKKPRIDLNISI