; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh16G010980 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh16G010980
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionBifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein
Genome locationCmo_Chr16:7739863..7748019
RNA-Seq ExpressionCmoCh16G010980
SyntenyCmoCh16G010980
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR016140 - Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain
IPR036312 - Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577605.1 hypothetical protein SDJN03_25179, partial [Cucurbita argyrosperma subsp. sororia]5.7e-15879.49Show/hide
Query:  MAGSLVSV----VAVVVVVVAVGFVEAQQPDCASKLASCAEFLKSNNPPATCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDL
        MAGSLVSV    VAVVVVVVAVGFVEAQQPDCASKLA+CAEFLKSNNPPATCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDL
Subjt:  MAGSLVSV----VAVVVVVVAVGFVEAQQPDCASKLASCAEFLKSNNPPATCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDL

Query:  SKCKSTTLVLRLRHTVLRLRRTVLHLYRQWVDLHFCCCSSSPSSFISISDIVHNQLNVFGYLASTNFVCSLGVKSKGKAAYLFDLSRRFDFTLGLTFSTS
        SKCK+                                 + +PS            L   G      F+    V        L++   RFDFTLGLTFSTS
Subjt:  SKCKSTTLVLRLRHTVLRLRRTVLHLYRQWVDLHFCCCSSSPSSFISISDIVHNQLNVFGYLASTNFVCSLGVKSKGKAAYLFDLSRRFDFTLGLTFSTS

Query:  ESAADGAVIVTGSLPRSIMGDFSIQISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELD
        ESAADGAVIVTGSLPRSIMGDFSIQISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELD
Subjt:  ESAADGAVIVTGSLPRSIMGDFSIQISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELD

Query:  AIRSILQESERVVEKLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSSSVEK
        AIRSILQESERVVEKLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSSSVEK
Subjt:  AIRSILQESERVVEKLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSSSVEK

XP_008448899.1 PREDICTED: uncharacterized protein LOC103490925 [Cucumis melo]3.1e-7985.47Show/hide
Query:  MGDFSIQISSNLVNMLIDDTEKPKRKPRRNR------TKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERV
        MGDFSIQISSNLVNMLIDDT+KPKRKPRRNR      +K+PQVKVDQK+ SDDSG LKGSTSDGWPHQ PPLFLPIIPPVH AN ELDAIRS+LQESERV
Subjt:  MGDFSIQISSNLVNMLIDDTEKPKRKPRRNR------TKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERV

Query:  VEKLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSS
        VEKLQKQEDNM++EVTQRAK+LHDKEFKLPYQKPMPC+AES+ACFQCYKDHPND LKCA LVK+FENCNRQARQK+SS+
Subjt:  VEKLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSS

XP_022923446.1 uncharacterized protein LOC111431139 [Cucurbita moschata]4.9e-93100Show/hide
Query:  MGDFSIQISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERVVEKLQK
        MGDFSIQISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERVVEKLQK
Subjt:  MGDFSIQISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERVVEKLQK

Query:  QEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSSSVEK
        QEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSSSVEK
Subjt:  QEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSSSVEK

XP_022965185.1 uncharacterized protein LOC111465118 [Cucurbita maxima]7.5e-9497.3Show/hide
Query:  TGSLPRSIMGDFSIQISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESE
        +GSLPRSIMGDFSIQISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSG+LKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESE
Subjt:  TGSLPRSIMGDFSIQISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESE

Query:  RVVEKLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSSSVEK
        RVV+KLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKL  SSVEK
Subjt:  RVVEKLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSSSVEK

XP_038876833.1 uncharacterized protein LOC120069208 [Benincasa hispida]9.5e-8185.79Show/hide
Query:  MGDFSIQISSNLVNMLIDDTEKPKRKPRRNR------TKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERV
        MGDFSIQISSNLVNMLIDDTEKPKRKPRRNR      +K+PQ+KVDQKH SDDSGMLKGSTSDGWP+Q PPLFLPIIPPVHSANPELDAIRS+LQESERV
Subjt:  MGDFSIQISSNLVNMLIDDTEKPKRKPRRNR------TKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERV

Query:  VEKLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSSSVEK
        VEKLQKQEDNM++EVTQRAK+LHDKEFKLPYQKPMPC+AES+ACFQCYKDHPN+ LKCA LVK+FENCNRQARQ++  SSVEK
Subjt:  VEKLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSSSVEK

TrEMBL top hitse value%identityAlignment
A0A0A0L5M8 Uncharacterized protein1.6e-7884.36Show/hide
Query:  MGDFSIQISSNLVNMLIDDTEKPKRKPRRNRT------KEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERV
        MGDFSIQISSNLVNMLIDDTEKPKRKPRRN+       K+PQVKVDQKH SDDSG LKGSTSDGWPHQ  P+FLPIIPPVH AN ELDAIRS+LQ+SERV
Subjt:  MGDFSIQISSNLVNMLIDDTEKPKRKPRRNRT------KEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERV

Query:  VEKLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSS
        V+KLQKQEDNM++EVTQRAK+LHDKEFKLPYQKPMPCVAES+ACFQCYKDHPND LKCA LVK+FENCNRQARQK+SS+
Subjt:  VEKLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSS

A0A1S3BKT1 uncharacterized protein LOC1034909251.5e-7985.47Show/hide
Query:  MGDFSIQISSNLVNMLIDDTEKPKRKPRRNR------TKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERV
        MGDFSIQISSNLVNMLIDDT+KPKRKPRRNR      +K+PQVKVDQK+ SDDSG LKGSTSDGWPHQ PPLFLPIIPPVH AN ELDAIRS+LQESERV
Subjt:  MGDFSIQISSNLVNMLIDDTEKPKRKPRRNR------TKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERV

Query:  VEKLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSS
        VEKLQKQEDNM++EVTQRAK+LHDKEFKLPYQKPMPC+AES+ACFQCYKDHPND LKCA LVK+FENCNRQARQK+SS+
Subjt:  VEKLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSS

A0A5D3D7J8 Uncharacterized protein1.5e-7985.47Show/hide
Query:  MGDFSIQISSNLVNMLIDDTEKPKRKPRRNR------TKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERV
        MGDFSIQISSNLVNMLIDDT+KPKRKPRRNR      +K+PQVKVDQK+ SDDSG LKGSTSDGWPHQ PPLFLPIIPPVH AN ELDAIRS+LQESERV
Subjt:  MGDFSIQISSNLVNMLIDDTEKPKRKPRRNR------TKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERV

Query:  VEKLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSS
        VEKLQKQEDNM++EVTQRAK+LHDKEFKLPYQKPMPC+AES+ACFQCYKDHPND LKCA LVK+FENCNRQARQK+SS+
Subjt:  VEKLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSS

A0A6J1E6U2 uncharacterized protein LOC1114311392.4e-93100Show/hide
Query:  MGDFSIQISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERVVEKLQK
        MGDFSIQISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERVVEKLQK
Subjt:  MGDFSIQISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERVVEKLQK

Query:  QEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSSSVEK
        QEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSSSVEK
Subjt:  QEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSSSVEK

A0A6J1HN12 uncharacterized protein LOC1114651183.6e-9497.3Show/hide
Query:  TGSLPRSIMGDFSIQISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESE
        +GSLPRSIMGDFSIQISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSG+LKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESE
Subjt:  TGSLPRSIMGDFSIQISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESE

Query:  RVVEKLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSSSVEK
        RVV+KLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKL  SSVEK
Subjt:  RVVEKLQKQEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSSSVEK

SwissProt top hitse value%identityAlignment
B3H587 Non-specific lipid transfer protein GPI-anchored 84.7e-0631.43Show/hide
Query:  LVSVVAVVVV-----VVAVGFVEAQQP-DCASKLASCAEFLK-SNNPPATCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLS
        ++ VV  VV+     V A  F + QQ   C +KL  C  ++  S  PP  CCNP+K+     + CLC  +  PD    + +   +A+ +  +CG+  D S
Subjt:  LVSVVAVVVV-----VVAVGFVEAQQP-DCASKLASCAEFLK-SNNPPATCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLS

Query:  KCKST
         C  T
Subjt:  KCKST

Q1G2Y5 Non-specific lipid transfer protein GPI-anchored 213.4e-0432.53Show/hide
Query:  PDCASKLASCAEFLKSNN--PPATCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLSKCKSTTLVL
        P   S +  C  FL      P + CC  +K    T +DCLC + T+     SI +N + A+ L +ACG+P    +CK++   L
Subjt:  PDCASKLASCAEFLKSNN--PPATCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLSKCKSTTLVL

Q1PFD8 Non-specific lipid transfer protein GPI-anchored 94.5e-0939Show/hide
Query:  VAVVVVVVAVGFVEA-------QQPDCASKLASCAEFLKSNN--PPATCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLSKC
        +AVVV VV +  VEA       Q   C  KL  C  ++ S N  PP +CC P+KE V     CLC  + +P+   ++ +   +AL L KACGV  D+S C
Subjt:  VAVVVVVVAVGFVEA-------QQPDCASKLASCAEFLKSNN--PPATCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLSKC

Q6NLF7 Non-specific lipid transfer protein GPI-anchored 72.0e-0938.54Show/hide
Query:  LVSVVAVVVVVVAVGFVEAQQPDCASKLASCAEFLKSNNPPA-TCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLSKC
        + +    V+ ++ V  +EA Q +C SKL  C   L +   P   CC+ IKEAV  +L CLC +YTSP   +   V    AL L++ C V  DLS C
Subjt:  LVSVVAVVVVVVAVGFVEAQQPDCASKLASCAEFLKSNNPPA-TCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLSKC

Q9LE56 Non-specific lipid transfer protein GPI-anchored 31.8e-1039.02Show/hide
Query:  EAQQPDCASKLASCAEFLK-SNNPPATCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLSKCKST
        +A    C  KL  C  +L  +  PPATCC P+ E VA    CLC ++ + D   S+ +   +AL L KACG   D+S CK++
Subjt:  EAQQPDCASKLASCAEFLK-SNNPPATCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLSKCKST

Arabidopsis top hitse value%identityAlignment
AT1G18280.1 Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein1.3e-1139.02Show/hide
Query:  EAQQPDCASKLASCAEFLK-SNNPPATCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLSKCKST
        +A    C  KL  C  +L  +  PPATCC P+ E VA    CLC ++ + D   S+ +   +AL L KACG   D+S CK++
Subjt:  EAQQPDCASKLASCAEFLK-SNNPPATCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLSKCKST

AT1G62790.1 Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein1.4e-1038.54Show/hide
Query:  LVSVVAVVVVVVAVGFVEAQQPDCASKLASCAEFLKSNNPPA-TCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLSKC
        + +    V+ ++ V  +EA Q +C SKL  C   L +   P   CC+ IKEAV  +L CLC +YTSP   +   V    AL L++ C V  DLS C
Subjt:  LVSVVAVVVVVVAVGFVEAQQPDCASKLASCAEFLKSNNPPA-TCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLSKC

AT1G62790.2 Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein1.4e-1038.54Show/hide
Query:  LVSVVAVVVVVVAVGFVEAQQPDCASKLASCAEFLKSNNPPA-TCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLSKC
        + +    V+ ++ V  +EA Q +C SKL  C   L +   P   CC+ IKEAV  +L CLC +YTSP   +   V    AL L++ C V  DLS C
Subjt:  LVSVVAVVVVVVAVGFVEAQQPDCASKLASCAEFLKSNNPPA-TCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLSKC

AT1G73560.1 Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein3.2e-1039Show/hide
Query:  VAVVVVVVAVGFVEA-------QQPDCASKLASCAEFLKSNN--PPATCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLSKC
        +AVVV VV +  VEA       Q   C  KL  C  ++ S N  PP +CC P+KE V     CLC  + +P+   ++ +   +AL L KACGV  D+S C
Subjt:  VAVVVVVVAVGFVEA-------QQPDCASKLASCAEFLKSNN--PPATCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLSKC

AT4G12340.1 copper ion binding1.9e-3445.61Show/hide
Query:  MGDFSIQISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERVVEKLQK
        MGDFSIQISS L+N L +  ++PKR+ ++ +   P+V    K  ++     K +     P Q PP F P IPP  +A+ EL++I+S+++ESE+V+EKL+ 
Subjt:  MGDFSIQISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERVVEKLQK

Query:  QEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLS
        QE N+V+EVT+RAK+L +KEFK+P  KPMPC ++ EA  +CYK++    LKC+  VKSF++C R++RQ+++
Subjt:  QEDNMVQEVTQRAKELHDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGGAAGTTTGGTTTCCGTCGTGGCGGTGGTGGTGGTGGTGGTGGCCGTCGGATTCGTGGAGGCGCAGCAGCCGGACTGTGCTTCTAAGCTAGCGAGTTGTGCGGA
GTTCCTGAAATCCAATAATCCGCCGGCGACGTGCTGCAATCCGATAAAGGAAGCGGTGGCGACGCAGCTCGATTGCCTCTGCAACCTCTACACCTCGCCGGATTTCTTTT
CGTCGATCGGCGTTAATGTCTCCGACGCTCTCCATCTCACTAAGGCCTGCGGCGTTCCTGTCGATCTCTCCAAGTGCAAAAGTACGACTCTGGTGCTCCGGCTCCGTCAC
ACGGTTCTCCGGCTCCGTCGCACGGTGCTCCATCTCTACCGGCAGTGGGTGGATTTACATTTCTGCTGTTGTTCTTCGTCTCCCTCGTCTTTTATTAGCATCTCAGATAT
CGTCCATAATCAATTGAACGTGTTCGGGTACCTAGCATCGACCAACTTTGTATGTAGTTTAGGGGTTAAAAGTAAGGGAAAAGCAGCTTATTTGTTTGATTTGAGTAGAA
GATTTGATTTTACGCTAGGGCTTACCTTTTCCACCTCTGAATCTGCTGCAGACGGAGCCGTGATCGTCACAGGTAGTCTGCCTAGGAGCATAATGGGTGATTTTTCCATT
CAGATTAGTTCAAACCTTGTTAATATGTTAATTGATGATACCGAAAAACCGAAAAGAAAACCGAGAAGAAACAGAACCAAGGAACCCCAAGTAAAGGTTGATCAGAAACA
TGTATCTGATGATTCTGGGATGCTCAAGGGAAGCACAAGTGATGGATGGCCTCACCAAGCTCCTCCACTCTTTCTTCCCATAATTCCGCCCGTGCATTCTGCCAATCCAG
AGCTAGACGCAATCCGATCCATCCTGCAAGAGAGCGAAAGGGTCGTCGAGAAGTTGCAGAAGCAGGAGGACAACATGGTGCAGGAAGTTACTCAAAGGGCAAAGGAACTC
CATGACAAGGAGTTCAAGCTCCCTTACCAAAAGCCCATGCCATGCGTGGCTGAGAGTGAAGCTTGCTTTCAATGCTACAAAGATCATCCCAATGACTCTTTGAAATGTGC
TCATCTCGTAAAAAGTTTCGAAAACTGTAACCGTCAAGCTCGGCAGAAATTGAGCTCGAGCTCGGTCGAGAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGGAAGTTTGGTTTCCGTCGTGGCGGTGGTGGTGGTGGTGGTGGCCGTCGGATTCGTGGAGGCGCAGCAGCCGGACTGTGCTTCTAAGCTAGCGAGTTGTGCGGA
GTTCCTGAAATCCAATAATCCGCCGGCGACGTGCTGCAATCCGATAAAGGAAGCGGTGGCGACGCAGCTCGATTGCCTCTGCAACCTCTACACCTCGCCGGATTTCTTTT
CGTCGATCGGCGTTAATGTCTCCGACGCTCTCCATCTCACTAAGGCCTGCGGCGTTCCTGTCGATCTCTCCAAGTGCAAAAGTACGACTCTGGTGCTCCGGCTCCGTCAC
ACGGTTCTCCGGCTCCGTCGCACGGTGCTCCATCTCTACCGGCAGTGGGTGGATTTACATTTCTGCTGTTGTTCTTCGTCTCCCTCGTCTTTTATTAGCATCTCAGATAT
CGTCCATAATCAATTGAACGTGTTCGGGTACCTAGCATCGACCAACTTTGTATGTAGTTTAGGGGTTAAAAGTAAGGGAAAAGCAGCTTATTTGTTTGATTTGAGTAGAA
GATTTGATTTTACGCTAGGGCTTACCTTTTCCACCTCTGAATCTGCTGCAGACGGAGCCGTGATCGTCACAGGTAGTCTGCCTAGGAGCATAATGGGTGATTTTTCCATT
CAGATTAGTTCAAACCTTGTTAATATGTTAATTGATGATACCGAAAAACCGAAAAGAAAACCGAGAAGAAACAGAACCAAGGAACCCCAAGTAAAGGTTGATCAGAAACA
TGTATCTGATGATTCTGGGATGCTCAAGGGAAGCACAAGTGATGGATGGCCTCACCAAGCTCCTCCACTCTTTCTTCCCATAATTCCGCCCGTGCATTCTGCCAATCCAG
AGCTAGACGCAATCCGATCCATCCTGCAAGAGAGCGAAAGGGTCGTCGAGAAGTTGCAGAAGCAGGAGGACAACATGGTGCAGGAAGTTACTCAAAGGGCAAAGGAACTC
CATGACAAGGAGTTCAAGCTCCCTTACCAAAAGCCCATGCCATGCGTGGCTGAGAGTGAAGCTTGCTTTCAATGCTACAAAGATCATCCCAATGACTCTTTGAAATGTGC
TCATCTCGTAAAAAGTTTCGAAAACTGTAACCGTCAAGCTCGGCAGAAATTGAGCTCGAGCTCGGTCGAGAAGTAGGATTGCCATCTGCAATGAATCTTCTCTTGTACTG
AGAATATTGTGATTCAAGTTCTCACTGAGAAAAGGAAAGAAAAAAACTGATTTAGCCAACAGAAATAACCCAAGACATGACTGTTGGGGGTTCCTTTGATTCAATTTTGT
GTGTTGTTTCTTGAATCGTATTGGCCAGCCTTTCTTTTAGGCTCTATGGGTGTACTTATATGTCATTCACTACCGTTCAAATCGGTTCGACTAGATGAGTGAACTCTTCC
TCGTATCCTTCATAAAGATGGTTCATGGGGTTTCTTTTGCTTTTGGCATAGTGCCCCTAACGAGAGGCTTGCCAACGTGCGTCTATTATTGAAATCACCCGACTAGAGAG
TATCCCATCTTCTGAAAAATCGTGTGATATGGCTTGAACTTGTTGTGTGCGGATTGAATTAGAGCTCATATCTACTTTGAACAAATATTATGTATGTGTAACAGATTTCG
CAACGAG
Protein sequenceShow/hide protein sequence
MAGSLVSVVAVVVVVVAVGFVEAQQPDCASKLASCAEFLKSNNPPATCCNPIKEAVATQLDCLCNLYTSPDFFSSIGVNVSDALHLTKACGVPVDLSKCKSTTLVLRLRH
TVLRLRRTVLHLYRQWVDLHFCCCSSSPSSFISISDIVHNQLNVFGYLASTNFVCSLGVKSKGKAAYLFDLSRRFDFTLGLTFSTSESAADGAVIVTGSLPRSIMGDFSI
QISSNLVNMLIDDTEKPKRKPRRNRTKEPQVKVDQKHVSDDSGMLKGSTSDGWPHQAPPLFLPIIPPVHSANPELDAIRSILQESERVVEKLQKQEDNMVQEVTQRAKEL
HDKEFKLPYQKPMPCVAESEACFQCYKDHPNDSLKCAHLVKSFENCNRQARQKLSSSSVEK