; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg000823 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg000823
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionFilamentous hemagglutinin transporter
Genome locationscaffold8:43047444..43048505
RNA-Seq ExpressionSpg000823
SyntenySpg000823
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140015.1 uncharacterized protein LOC101202760 [Cucumis sativus]3.5e-10583.59Show/hide
Query:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD
        MAAEVSSLVRVL +YNK+DRHRT G+ES  EKLTPLITRDLL GGYSKFTE QELDLDL VPSGWE+RLDLKSGKMF+QRCNVQDF  +  NQTVPKLQD
Subjt:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD

Query:  LNFPPSPNFSKF----HLVDETTNLDLKLVSSSSPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN
        LNFPPSPN SKF    HLVDE T+LDLKLVSS S SP S SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPSYSSSSSSAAA +EF+EE+N
Subjt:  LNFPPSPNFSKF----HLVDETTNLDLKLVSSSSPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN

Query:  LKSLSSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI
        LK LSS     PIAAGCPGCLSYVLVMKNNPTCPRCSS+VPLPA KKPRIDLN+SI
Subjt:  LKSLSSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI

XP_008456306.1 PREDICTED: uncharacterized protein LOC103496294 [Cucumis melo]3.0e-10482.81Show/hide
Query:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD
        MAAEVSSLVRVL +YNK+DRH T GNES  EKL PLITRDLL GGYSKFTE QELDLDL VPSGWE+RLDLKSGKMF+QRCNVQDF  +  NQTVPKLQD
Subjt:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD

Query:  LNFPPSPNFSKF----HLVDETTNLDLKLVSSSSPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN
        LNFPPSPN+SKF    HLVDE T+LDLKLVSS S SP S SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPSYSSSSSSAAA +EF+EE+ 
Subjt:  LNFPPSPNFSKF----HLVDETTNLDLKLVSSSSPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN

Query:  LKSLSSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI
        LK  SS     PIAAGCPGCLSYVLVMKNNPTCPRCSS+VPLPAAKKPRIDLN+SI
Subjt:  LKSLSSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI

XP_022947770.1 uncharacterized protein LOC111451530 isoform X1 [Cucurbita moschata]4.5e-10081.08Show/hide
Query:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD
        MAA+V+S VRVL  YNKDD HRTV N+SGP+ LTPLITRDLL GG SKFT+PQELDLDLQ+PSGWEK LDLKSGKMF+QR NVQDF  HQTNQTV KLQD
Subjt:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD

Query:  LNFPPSPNFSKF----HLVDETTNLDLKLVSSSSPS--PSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEED
        LNFPPS N+SKF    HLV E T+LDLKL SSSS S  PSPSPRSNYQSVCTLDKVKSALERA+KNPIRKRSSLWK SPSPSYSSSSSS   AREFQEED
Subjt:  LNFPPSPNFSKF----HLVDETTNLDLKLVSSSSPS--PSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEED

Query:  NL-KSL-SSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI
        N  KSL SSSSA A IA GCPGCLSYVLVMKNNPTCPRC SVV LPA KKPR+DLN+SI
Subjt:  NL-KSL-SSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI

XP_022970865.1 uncharacterized protein LOC111469711 [Cucurbita maxima]6.3e-10282.17Show/hide
Query:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD
        MAAEV+S VRVL  YNKDD H TV NESGP+ LTPLITRDLL GG SKFT+PQELDLDLQ+PSGWEKRLDLKSGKMF+QR NVQDF  HQTNQTV KLQD
Subjt:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD

Query:  LNFPPSPNFSKF----HLVDETTNLDLKL-VSSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN
        LNFPPS N+SKF    HLV E T+L+LKL  SSSSP PSPSPRSNYQSVCTLDKVKSALERA+KNPIRKRSSLWKSSPSPSYSSSSSS   AREFQEEDN
Subjt:  LNFPPSPNFSKF----HLVDETTNLDLKL-VSSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN

Query:  L-KSL-SSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI
          KSL SSSSA A IA GCPGCLSYVLVMKNNPTCPRC SVV LPA KKPR+DLN+SI
Subjt:  L-KSL-SSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI

XP_038901828.1 uncharacterized protein LOC120088523 [Benincasa hispida]4.5e-10884.77Show/hide
Query:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD
        MAAEVSSLVRVL  YNKDDRHRTVGN+S  EKLTPLITRDLL GGYSK+TE QELDLDL VPSGWE+RLDLKSGK F+QRCNVQDF     NQTVPKLQD
Subjt:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD

Query:  LNFPPSPNFSKF----HLVDETTNLDLKLVSSSSPSPSP-SPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN
        LNFPPSPNFSKF    HLVDE T+LDLKLVSS SPSPSP SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPSYSSSSSSA A +EF++EDN
Subjt:  LNFPPSPNFSKF----HLVDETTNLDLKLVSSSSPSPSP-SPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN

Query:  LKSLSSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI
        LKSLSS     PIAAGCPGCLSYVLVMKNNPTCPRC+SVVPLPA KKPRIDLN+SI
Subjt:  LKSLSSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI

TrEMBL top hitse value%identityAlignment
A0A0A0KAN5 Uncharacterized protein1.7e-10583.59Show/hide
Query:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD
        MAAEVSSLVRVL +YNK+DRHRT G+ES  EKLTPLITRDLL GGYSKFTE QELDLDL VPSGWE+RLDLKSGKMF+QRCNVQDF  +  NQTVPKLQD
Subjt:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD

Query:  LNFPPSPNFSKF----HLVDETTNLDLKLVSSSSPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN
        LNFPPSPN SKF    HLVDE T+LDLKLVSS S SP S SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPSYSSSSSSAAA +EF+EE+N
Subjt:  LNFPPSPNFSKF----HLVDETTNLDLKLVSSSSPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN

Query:  LKSLSSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI
        LK LSS     PIAAGCPGCLSYVLVMKNNPTCPRCSS+VPLPA KKPRIDLN+SI
Subjt:  LKSLSSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI

A0A1S3C2I3 uncharacterized protein LOC1034962941.5e-10482.81Show/hide
Query:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD
        MAAEVSSLVRVL +YNK+DRH T GNES  EKL PLITRDLL GGYSKFTE QELDLDL VPSGWE+RLDLKSGKMF+QRCNVQDF  +  NQTVPKLQD
Subjt:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD

Query:  LNFPPSPNFSKF----HLVDETTNLDLKLVSSSSPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN
        LNFPPSPN+SKF    HLVDE T+LDLKLVSS S SP S SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPSYSSSSSSAAA +EF+EE+ 
Subjt:  LNFPPSPNFSKF----HLVDETTNLDLKLVSSSSPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN

Query:  LKSLSSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI
        LK  SS     PIAAGCPGCLSYVLVMKNNPTCPRCSS+VPLPAAKKPRIDLN+SI
Subjt:  LKSLSSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI

A0A5D3CB81 Putative YUP8H12R.23 protein1.5e-10482.81Show/hide
Query:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD
        MAAEVSSLVRVL +YNK+DRH T GNES  EKL PLITRDLL GGYSKFTE QELDLDL VPSGWE+RLDLKSGKMF+QRCNVQDF  +  NQTVPKLQD
Subjt:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD

Query:  LNFPPSPNFSKF----HLVDETTNLDLKLVSSSSPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN
        LNFPPSPN+SKF    HLVDE T+LDLKLVSS S SP S SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPSYSSSSSSAAA +EF+EE+ 
Subjt:  LNFPPSPNFSKF----HLVDETTNLDLKLVSSSSPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN

Query:  LKSLSSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI
        LK  SS     PIAAGCPGCLSYVLVMKNNPTCPRCSS+VPLPAAKKPRIDLN+SI
Subjt:  LKSLSSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI

A0A6J1G7U1 uncharacterized protein LOC111451530 isoform X12.2e-10081.08Show/hide
Query:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD
        MAA+V+S VRVL  YNKDD HRTV N+SGP+ LTPLITRDLL GG SKFT+PQELDLDLQ+PSGWEK LDLKSGKMF+QR NVQDF  HQTNQTV KLQD
Subjt:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD

Query:  LNFPPSPNFSKF----HLVDETTNLDLKLVSSSSPS--PSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEED
        LNFPPS N+SKF    HLV E T+LDLKL SSSS S  PSPSPRSNYQSVCTLDKVKSALERA+KNPIRKRSSLWK SPSPSYSSSSSS   AREFQEED
Subjt:  LNFPPSPNFSKF----HLVDETTNLDLKLVSSSSPS--PSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEED

Query:  NL-KSL-SSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI
        N  KSL SSSSA A IA GCPGCLSYVLVMKNNPTCPRC SVV LPA KKPR+DLN+SI
Subjt:  NL-KSL-SSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI

A0A6J1I433 uncharacterized protein LOC1114697113.0e-10282.17Show/hide
Query:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD
        MAAEV+S VRVL  YNKDD H TV NESGP+ LTPLITRDLL GG SKFT+PQELDLDLQ+PSGWEKRLDLKSGKMF+QR NVQDF  HQTNQTV KLQD
Subjt:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQD

Query:  LNFPPSPNFSKF----HLVDETTNLDLKL-VSSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN
        LNFPPS N+SKF    HLV E T+L+LKL  SSSSP PSPSPRSNYQSVCTLDKVKSALERA+KNPIRKRSSLWKSSPSPSYSSSSSS   AREFQEEDN
Subjt:  LNFPPSPNFSKF----HLVDETTNLDLKL-VSSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN

Query:  L-KSL-SSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI
          KSL SSSSA A IA GCPGCLSYVLVMKNNPTCPRC SVV LPA KKPR+DLN+SI
Subjt:  L-KSL-SSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G16500.1 unknown protein7.7e-5046.64Show/hide
Query:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-------GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFV-QRCNVQDFGT-----
        MAA+VSSLVR+L  + KDDR   V + +GP     L+TRDLL       GG     +  ELDLD+QVP+GWEKRLDLKSGK+++ Q+CN     +     
Subjt:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLL-------GGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFV-QRCNVQDFGT-----

Query:  -----HQTNQTVPKLQDLNFPP-SPNFSKFHLV-----DETTNLDLKLVSSSSPSPSPSPRSNY---------QSVCTLDKVKSALERAEKNPIRKRSSL
              QTNQTVP+ QDLN PP S  F    L+     D+ T+L+LKLV SS   P P P S++          SVCTLDKVK ALERAEK+  +++S  
Subjt:  -----HQTNQTVPKLQDLNFPP-SPNFSKFHLV-----DETTNLDLKLVSSSSPSPSPSPRSNY---------QSVCTLDKVKSALERAEKNPIRKRSSL

Query:  WKSSPSPSYSSSSSSAAAAREFQEEDNLKSLSSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI
                Y  ++S+  AA +                  +AAGCPGCLSYV V KNNP CPRC S VPLPA KKP+IDLN+S+
Subjt:  WKSSPSPSYSSSSSSAAAAREFQEEDNLKSLSSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNMSI

AT1G79160.1 unknown protein2.4e-5149.81Show/hide
Query:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLLGG--YSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGT----HQTNQTV
        MAA+VSSLVR+L  Y KDDR   V + +G +    L+TRDLLG           ELDLDLQVP+G+EKRLDLKSGK+++QRCN     +     QTNQTV
Subjt:  MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLLGG--YSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGT----HQTNQTV

Query:  PKLQDLNFPPSP--NFSKFHLVDETTNLDLKLVSSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEE
        P  QDLNFPP    N    +L D+TT  +LKL+ SS  S   +  SN QSVCTLDKVKSALERAE++P     +++K   SP                ++
Subjt:  PKLQDLNFPPSP--NFSKFHLVDETTNLDLKLVSSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEE

Query:  DNLKSLSSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPA---AKKPRIDLNMSI
               + +  +P+ AGCPGCLSYVLVM NNP CPRC ++VPLP     KKP+IDLN+SI
Subjt:  DNLKSLSSSSAGAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPA---AKKPRIDLNMSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCCGAAGTGAGCTCGCTCGTGCGAGTTCTCATGAGCTACAACAAAGACGACCGTCATCGGACGGTCGGAAACGAATCCGGCCCGGAGAAATTAACGCCTCTGAT
CACCCGAGACTTGCTCGGCGGCTACTCCAAATTCACAGAGCCCCAAGAATTGGACCTCGACCTCCAGGTTCCTTCCGGCTGGGAAAAAAGACTCGACTTGAAGTCGGGGA
AAATGTTCGTTCAAAGATGCAATGTTCAAGATTTCGGCACCCATCAAACGAATCAAACAGTGCCAAAGCTTCAAGATTTGAACTTTCCGCCGTCGCCCAATTTCTCGAAA
TTCCATTTGGTCGACGAGACGACGAATTTGGATTTGAAATTGGTGTCGTCGTCGTCGCCGTCGCCGTCGCCGTCGCCGAGGAGTAATTATCAGAGCGTTTGTACTTTGGA
TAAGGTCAAATCAGCGCTCGAACGGGCGGAGAAAAATCCCATCAGAAAACGCTCGTCGCTTTGGAAATCGTCGCCGTCGCCGTCGTATTCGTCGTCGTCGTCGTCAGCGG
CCGCCGCGAGAGAGTTTCAAGAAGAAGACAATTTGAAATCTCTGTCCTCGTCGTCGGCGGGGGCGCCGATCGCCGCTGGCTGTCCTGGGTGTCTGTCGTACGTGTTGGTG
ATGAAGAACAATCCGACGTGTCCACGGTGTAGCTCCGTCGTGCCGTTGCCGGCGGCGAAGAAACCTCGGATTGATCTGAACATGTCGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCCGAAGTGAGCTCGCTCGTGCGAGTTCTCATGAGCTACAACAAAGACGACCGTCATCGGACGGTCGGAAACGAATCCGGCCCGGAGAAATTAACGCCTCTGAT
CACCCGAGACTTGCTCGGCGGCTACTCCAAATTCACAGAGCCCCAAGAATTGGACCTCGACCTCCAGGTTCCTTCCGGCTGGGAAAAAAGACTCGACTTGAAGTCGGGGA
AAATGTTCGTTCAAAGATGCAATGTTCAAGATTTCGGCACCCATCAAACGAATCAAACAGTGCCAAAGCTTCAAGATTTGAACTTTCCGCCGTCGCCCAATTTCTCGAAA
TTCCATTTGGTCGACGAGACGACGAATTTGGATTTGAAATTGGTGTCGTCGTCGTCGCCGTCGCCGTCGCCGTCGCCGAGGAGTAATTATCAGAGCGTTTGTACTTTGGA
TAAGGTCAAATCAGCGCTCGAACGGGCGGAGAAAAATCCCATCAGAAAACGCTCGTCGCTTTGGAAATCGTCGCCGTCGCCGTCGTATTCGTCGTCGTCGTCGTCAGCGG
CCGCCGCGAGAGAGTTTCAAGAAGAAGACAATTTGAAATCTCTGTCCTCGTCGTCGGCGGGGGCGCCGATCGCCGCTGGCTGTCCTGGGTGTCTGTCGTACGTGTTGGTG
ATGAAGAACAATCCGACGTGTCCACGGTGTAGCTCCGTCGTGCCGTTGCCGGCGGCGAAGAAACCTCGGATTGATCTGAACATGTCGATTTGA
Protein sequenceShow/hide protein sequence
MAAEVSSLVRVLMSYNKDDRHRTVGNESGPEKLTPLITRDLLGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFVQRCNVQDFGTHQTNQTVPKLQDLNFPPSPNFSK
FHLVDETTNLDLKLVSSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNLKSLSSSSAGAPIAAGCPGCLSYVLV
MKNNPTCPRCSSVVPLPAAKKPRIDLNMSI