; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg21957 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg21957
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionDUF4408 domain-containing protein
Genome locationCarg_Chr11:3153977..3154663
RNA-Seq ExpressionCarg21957
SyntenyCarg21957
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant
IPR025520 - Domain of unknown function DUF4408


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588047.1 Pathogen-associated molecular patterns-induced protein A70, partial [Cucurbita argyrosperma subsp. sororia]1.6e-120100Show/hide
Query:  MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHEPETPQP
        MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHEPETPQP
Subjt:  MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHEPETPQP

Query:  VAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDEEEVDVRADDF
        VAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDEEEVDVRADDF
Subjt:  VAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDEEEVDVRADDF

Query:  INKFKQQLKLQRLESLLRYRDMIKGKKH
        INKFKQQLKLQRLESLLRYRDMIKGKKH
Subjt:  INKFKQQLKLQRLESLLRYRDMIKGKKH

KAG6590003.1 Pathogen-associated molecular patterns-induced protein A70, partial [Cucurbita argyrosperma subsp. sororia]1.9e-7369.36Show/hide
Query:  MWTSLATWVTPTSLFILINVVIATIVITSR------RHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHE
        MWTS+ TW TP+ LFIL+N VIATI ITSR       HLH GP LL  PSFLDRVKSFN  PY+SDH+PNP P       P++L+RLKSITL RSDS  E
Subjt:  MWTSLATWVTPTSLFILINVVIATIVITSR------RHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHE

Query:  PETPQPVAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAA--DDEEE
        PET  P AEQSPEKTHHDHS+SRSKS T+   P TS R +LQKSLSEKL W SF    T AQ ET E VSEIERRRPAT  AEIGEP+T E    DD++E
Subjt:  PETPQPVAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAA--DDEEE

Query:  VDVRADDFINKFKQQLKLQRLESLLRYRDMIKGKK
        VDVRADDFINKFKQQLKLQRLESLLRYRDM+KG+K
Subjt:  VDVRADDFINKFKQQLKLQRLESLLRYRDMIKGKK

XP_022925526.1 uncharacterized protein LOC111432826 [Cucurbita moschata]4.8e-11798.25Show/hide
Query:  MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHEPETPQP
        MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALL HPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHE ETPQP
Subjt:  MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHEPETPQP

Query:  VAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDEEEVDVRADDF
        VAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQ+ETEESVSEIERRRPATTMAEIGEPKTPEAAD+EEEVDVRADDF
Subjt:  VAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDEEEVDVRADDF

Query:  INKFKQQLKLQRLESLLRYRDMIKGKKH
        INKFKQQLKLQRLESLLRYRDMIKGKKH
Subjt:  INKFKQQLKLQRLESLLRYRDMIKGKKH

XP_023003301.1 uncharacterized protein LOC111496949 [Cucurbita maxima]6.7e-11194.3Show/hide
Query:  MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHEPETPQP
        MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALL HPS LDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSN EPETPQ 
Subjt:  MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHEPETPQP

Query:  VAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDEEEVDVRADDF
         AEQS EKTHHDHS+SR KSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTR DTAAQDETEESVSEIERRRPATTMAEIGEPK  E AD+EEEVDVRADDF
Subjt:  VAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDEEEVDVRADDF

Query:  INKFKQQLKLQRLESLLRYRDMIKGKKH
        INKFKQQLKLQRLESLLRYRDMIKGKKH
Subjt:  INKFKQQLKLQRLESLLRYRDMIKGKKH

XP_023530369.1 uncharacterized protein LOC111792966 [Cucurbita pepo subsp. pepo]6.9e-11696.93Show/hide
Query:  MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHEPETPQP
        MWT LATW+TPTSLFILINVVIATIVITSRRHLHDGPALL HPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHEPETPQP
Subjt:  MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHEPETPQP

Query:  VAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDEEEVDVRADDF
        VAEQSPEKTHHDHSVSRSKS+T+PHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIE RRPATTMAEIGEPKTPEAAD+EEEVDVRADDF
Subjt:  VAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDEEEVDVRADDF

Query:  INKFKQQLKLQRLESLLRYRDMIKGKKH
        INKFKQQLKLQRLESLLRYRDMIKGKKH
Subjt:  INKFKQQLKLQRLESLLRYRDMIKGKKH

TrEMBL top hitse value%identityAlignment
A0A5A7UQD5 DUF761 domain-containing protein/DUF4408 domain-containing protein9.6e-6365.4Show/hide
Query:  MWTSLATWVTPTSLFILINVVIATIVIT-------SRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNH
        M TSL TW+TPTSLFI IN+VIATI IT       SR HLH G  LL  PSFLDRVKSFN   + S+++PNPDP     R PS+LDRLKSI++ RSDS  
Subjt:  MWTSLATWVTPTSLFILINVVIATIVIT-------SRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNH

Query:  EPETPQPVAE---QSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDE
        +PE PQP AE   Q+PE+ H DHSVSRSKS T    P TS R +LQKSLSEKLSW S     T  Q ETEE V+EIERRRPAT  AE  EP+T E    E
Subjt:  EPETPQPVAE---QSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDE

Query:  EEVDVRADDFINKFKQQLKLQRLESLLRYRDMIKGKK
        EEVD RADDFINKFKQQLKLQRLESLLRYRDM+ GKK
Subjt:  EEVDVRADDFINKFKQQLKLQRLESLLRYRDMIKGKK

A0A6J1ECG2 uncharacterized protein LOC1114328262.3e-11798.25Show/hide
Query:  MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHEPETPQP
        MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALL HPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHE ETPQP
Subjt:  MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHEPETPQP

Query:  VAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDEEEVDVRADDF
        VAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQ+ETEESVSEIERRRPATTMAEIGEPKTPEAAD+EEEVDVRADDF
Subjt:  VAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDEEEVDVRADDF

Query:  INKFKQQLKLQRLESLLRYRDMIKGKKH
        INKFKQQLKLQRLESLLRYRDMIKGKKH
Subjt:  INKFKQQLKLQRLESLLRYRDMIKGKKH

A0A6J1H824 uncharacterized protein LOC1114613354.3e-7167.66Show/hide
Query:  MWTSLATWVTPTSLFILINVVIATIVITSR------RHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHE
        MWTS+ TW TP+SLFIL+N VIATI ITSR       HLH GP LL  PSFL+RV+SFN  PY++DH+P+P P       PS+L+RLKSITL RSDS  E
Subjt:  MWTSLATWVTPTSLFILINVVIATIVITSR------RHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHE

Query:  PETPQPVAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAA--DDEEE
        PE   P AEQSPEKTH DHS+SRSKS T+   P TS R +LQKSLSEKL W SF    T AQ ET E VSEIERRRPAT  AEIGEP+T E    DD++E
Subjt:  PETPQPVAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAA--DDEEE

Query:  VDVRADDFINKFKQQLKLQRLESLLRYRDMIKGKK
        VDVRADDFINKFKQQLKLQRLESLLRYRDM+KG+K
Subjt:  VDVRADDFINKFKQQLKLQRLESLLRYRDMIKGKK

A0A6J1JED0 uncharacterized protein LOC1114850235.6e-7166.81Show/hide
Query:  MWTSLATWVTPTSLFILINVVIATIVITSR------RHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHE
        MWTS+ TW TP+SLFIL+N VIATI ITSR       HLH GP LL  PSFL+RV+SFN  PY+ DH+P+P P       PS+L+RLKSITL RSDS  E
Subjt:  MWTSLATWVTPTSLFILINVVIATIVITSR------RHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHE

Query:  PETPQPVAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAA-----DD
        PET  P AEQ PEKTHHDHS+SRSKS T+   P TS R +LQKSLSEKL W SF    T AQ ET E VSEIERRRP+T  AEIGEP+T E       +D
Subjt:  PETPQPVAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAA-----DD

Query:  EEEVDVRADDFINKFKQQLKLQRLESLLRYRDMIKGKK
        ++EVDVRADDFINKFKQQLKLQRLESLLRYRDM+KGKK
Subjt:  EEEVDVRADDFINKFKQQLKLQRLESLLRYRDMIKGKK

A0A6J1KNT3 uncharacterized protein LOC1114969493.3e-11194.3Show/hide
Query:  MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHEPETPQP
        MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALL HPS LDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSN EPETPQ 
Subjt:  MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHEPETPQP

Query:  VAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDEEEVDVRADDF
         AEQS EKTHHDHS+SR KSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTR DTAAQDETEESVSEIERRRPATTMAEIGEPK  E AD+EEEVDVRADDF
Subjt:  VAEQSPEKTHHDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDEEEVDVRADDF

Query:  INKFKQQLKLQRLESLLRYRDMIKGKKH
        INKFKQQLKLQRLESLLRYRDMIKGKKH
Subjt:  INKFKQQLKLQRLESLLRYRDMIKGKKH

SwissProt top hitse value%identityAlignment
F4K956 Pathogen-associated molecular patterns-induced protein A703.1e-1837.45Show/hide
Query:  SRRHLHDGPA---LLCHPSFLDRVKSFNL--------YPYHSD--HHPNP-------------DP------------------------PTRLDRAPSVL
        S   LH  PA   L   PS LDRVKS N+         P  +D  HH  P             DP                        P  L RAPS+L
Subjt:  SRRHLHDGPA---LLCHPSFLDRVKSFNL--------YPYHSD--HHPNP-------------DP------------------------PTRLDRAPSVL

Query:  DRLKSITL---YRSDSN-HEPETPQPVAEQSPEKTHHDHSVSRSKSNT-EPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPAT
        +R+KSI L   YRSD +  + + P PV        H +H   RSKS + +P         K+ KS SEK   + F  A + A  E  E+V  +ERRRP T
Subjt:  DRLKSITL---YRSDSN-HEPETPQPVAEQSPEKTHHDHSVSRSKSNT-EPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPAT

Query:  TMAEIGEPKTPEAADDEEEVDVRADDFINKFKQQLKLQRLESLLRYRDMIK
        T  E    ++    D E+ VD +A DFINKFKQQLKLQRL+S+LRY++M+K
Subjt:  TMAEIGEPKTPEAADDEEEVDVRADDFINKFKQQLKLQRLESLLRYRDMIK

Arabidopsis top hitse value%identityAlignment
AT2G26110.1 Protein of unknown function (DUF761)4.6e-1730.32Show/hide
Query:  TSLATWVTPTSLFILINVVIATIVI----TSRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPD-PPTR---------------------LDRAPS
        T++ +W TPT LF+ +N++I TI I    +S+ +  +   +   PS + R+KS N   + S    + + PP+                      L R+PS
Subjt:  TSLATWVTPTSLFILINVVIATIVI----TSRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPD-PPTR---------------------LDRAPS

Query:  VLDRLKSITLYRSDS------------------------NHEPETPQPVAEQSPEKTHHD---HSVSRSKSNTEPHA--PTTSFRAKLQKSLSEKLSWTS
        VL R+KS  LY   S                          E +  Q   EQS E+ +     + V+R+KS+TEP A         K++KS S K  ++ 
Subjt:  VLDRLKSITLYRSDS------------------------NHEPETPQPVAEQSPEKTHHD---HSVSRSKSNTEPHA--PTTSFRAKLQKSLSEKLSWTS

Query:  FTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDEEEVDVRADDFINKFKQQLKLQRLESLLRYRDMIK
        F           +E    +E RRPAT    +  P+     + +EEVD +ADDFIN+FK QLKLQR++S+ +Y++M+K
Subjt:  FTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDEEEVDVRADDFINKFKQQLKLQRLESLLRYRDMIK

AT4G26130.1 unknown protein4.5e-2033.33Show/hide
Query:  TSLATWVTPTSLFILINVVIATIVIT------SRRH--LHDGPALLCH----------PSFLDRVKSFNLYPYH------------SDHHPNPDPPTRLD
        TSL  W+TPT+LF+L+N  IATI IT      SR+H    DG     H          PS +DRVKS N + Y+            SD +PNP PP+ L 
Subjt:  TSLATWVTPTSLFILINVVIATIVIT------SRRH--LHDGPALLCH----------PSFLDRVKSFNLYPYH------------SDHHPNPDPPTRLD

Query:  R-----------------------------------------------------APSVLDRLKSI---TLYRSDSNHEPETPQPVAEQSPEKTHHDHSVS
        R                                                     APS+L R+KSI   +LYRSD +  PE           +TH      
Subjt:  R-----------------------------------------------------APSVLDRLKSI---TLYRSDSNHEPETPQPVAEQSPEKTHHDHSVS

Query:  RSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAAD-DEEEVDVRADDFINKFKQQLKLQRLES
         +++ +E   P T  + K  K + +  S     R       E EE+V  +E+RRP T   E    +T    D  EE VD +A +FINKFKQQLKLQRL+S
Subjt:  RSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAAD-DEEEVDVRADDFINKFKQQLKLQRLES

Query:  LLRYRDMIK
         LRYR+M+K
Subjt:  LLRYRDMIK

AT5G56980.1 unknown protein2.2e-1937.45Show/hide
Query:  SRRHLHDGPA---LLCHPSFLDRVKSFNL--------YPYHSD--HHPNP-------------DP------------------------PTRLDRAPSVL
        S   LH  PA   L   PS LDRVKS N+         P  +D  HH  P             DP                        P  L RAPS+L
Subjt:  SRRHLHDGPA---LLCHPSFLDRVKSFNL--------YPYHSD--HHPNP-------------DP------------------------PTRLDRAPSVL

Query:  DRLKSITL---YRSDSN-HEPETPQPVAEQSPEKTHHDHSVSRSKSNT-EPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPAT
        +R+KSI L   YRSD +  + + P PV        H +H   RSKS + +P         K+ KS SEK   + F  A + A  E  E+V  +ERRRP T
Subjt:  DRLKSITL---YRSDSN-HEPETPQPVAEQSPEKTHHDHSVSRSKSNT-EPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPAT

Query:  TMAEIGEPKTPEAADDEEEVDVRADDFINKFKQQLKLQRLESLLRYRDMIK
        T  E    ++    D E+ VD +A DFINKFKQQLKLQRL+S+LRY++M+K
Subjt:  TMAEIGEPKTPEAADDEEEVDVRADDFINKFKQQLKLQRLESLLRYRDMIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGACTTCTTTAGCCACTTGGGTCACCCCCACTTCCCTCTTCATCCTCATCAACGTCGTCATCGCCACCATCGTCATCACTTCCCGCCGCCATCTCCACGACGGCCC
CGCCCTCCTATGCCACCCTTCTTTCCTAGACAGAGTCAAGTCCTTCAACCTCTACCCCTACCACTCCGACCACCACCCTAATCCAGACCCTCCAACCCGACTCGACCGAG
CTCCATCGGTGTTGGATCGCCTCAAATCCATCACCCTCTACAGATCCGATTCAAACCACGAACCGGAAACGCCACAGCCGGTAGCAGAACAGAGCCCGGAAAAGACCCAC
CACGACCATTCCGTCAGCCGGAGCAAATCCAACACCGAACCCCACGCTCCGACGACGAGCTTCCGGGCGAAATTGCAGAAATCGCTGAGCGAGAAGCTGTCATGGACGTC
GTTCACTAGGGCGGATACAGCGGCGCAGGACGAAACAGAGGAATCAGTAAGCGAAATTGAACGGCGTCGTCCGGCGACAACGATGGCGGAGATTGGAGAACCAAAAACGC
CGGAAGCTGCCGACGACGAGGAGGAGGTGGATGTGAGAGCTGACGATTTCATCAACAAATTCAAGCAGCAGCTGAAATTGCAGAGGCTGGAATCTCTGTTGCGTTACAGA
GACATGATTAAAGGCAAAAAACATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGGACTTCTTTAGCCACTTGGGTCACCCCCACTTCCCTCTTCATCCTCATCAACGTCGTCATCGCCACCATCGTCATCACTTCCCGCCGCCATCTCCACGACGGCCC
CGCCCTCCTATGCCACCCTTCTTTCCTAGACAGAGTCAAGTCCTTCAACCTCTACCCCTACCACTCCGACCACCACCCTAATCCAGACCCTCCAACCCGACTCGACCGAG
CTCCATCGGTGTTGGATCGCCTCAAATCCATCACCCTCTACAGATCCGATTCAAACCACGAACCGGAAACGCCACAGCCGGTAGCAGAACAGAGCCCGGAAAAGACCCAC
CACGACCATTCCGTCAGCCGGAGCAAATCCAACACCGAACCCCACGCTCCGACGACGAGCTTCCGGGCGAAATTGCAGAAATCGCTGAGCGAGAAGCTGTCATGGACGTC
GTTCACTAGGGCGGATACAGCGGCGCAGGACGAAACAGAGGAATCAGTAAGCGAAATTGAACGGCGTCGTCCGGCGACAACGATGGCGGAGATTGGAGAACCAAAAACGC
CGGAAGCTGCCGACGACGAGGAGGAGGTGGATGTGAGAGCTGACGATTTCATCAACAAATTCAAGCAGCAGCTGAAATTGCAGAGGCTGGAATCTCTGTTGCGTTACAGA
GACATGATTAAAGGCAAAAAACATTAA
Protein sequenceShow/hide protein sequence
MWTSLATWVTPTSLFILINVVIATIVITSRRHLHDGPALLCHPSFLDRVKSFNLYPYHSDHHPNPDPPTRLDRAPSVLDRLKSITLYRSDSNHEPETPQPVAEQSPEKTH
HDHSVSRSKSNTEPHAPTTSFRAKLQKSLSEKLSWTSFTRADTAAQDETEESVSEIERRRPATTMAEIGEPKTPEAADDEEEVDVRADDFINKFKQQLKLQRLESLLRYR
DMIKGKKH