; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg022130 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg022130
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF4408 domain-containing protein
Genome locationscaffold2:8544723..8545424
RNA-Seq ExpressionSpg022130
SyntenySpg022130
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant
IPR025520 - Domain of unknown function DUF4408


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588047.1 Pathogen-associated molecular patterns-induced protein A70, partial [Cucurbita argyrosperma subsp. sororia]2.1e-8374.26Show/hide
Query:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD
        MWTS   WVTPTSLFIL+N V+ATI ITS          R HLH GPALL  PSFLDRVKSFN   YHSDHHPNPDPPT+L RAPS+LDRLKSITL RSD
Subjt:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD

Query:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPF----TAAEDETEESASEVERRRPATTKAEIAELETAAAGDDD
        S  EPETPQP AEQSPE THHDHSV RSKS+T    P TS R +LQKSLSEKL W  F    TAA+DETEES SE+ERRRPATT AEI E +T  A DD+
Subjt:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPF----TAAEDETEESASEVERRRPATTKAEIAELETAAAGDDD

Query:  EEVDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK
        EEVDVRADDFINKFKQQLKLQRLESLLRYRDM KGKK
Subjt:  EEVDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK

KAG6590003.1 Pathogen-associated molecular patterns-induced protein A70, partial [Cucurbita argyrosperma subsp. sororia]1.3e-8575.32Show/hide
Query:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD
        MWTS T W TP+ LFILVNAV+ATIAITSRF ADKS    HHLHGGP LLR PSFLDRVKSFNFS Y+SDH+PNP P       P+ML+RLKSITL+RSD
Subjt:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD

Query:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPATTKAEIAELETAAAG--DDDEE
        S+REPET  P AEQSPE THHDHS+ RSKS+T+T TPATS+RRRLQKSLSEKLPWA FT A+ ET E  SE+ERRRPAT +AEI E ET   G  DDD+E
Subjt:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPATTKAEIAELETAAAG--DDDEE

Query:  VDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK
        VDVRADDFINKFKQQLKLQRLESLLRYRDM KG+K
Subjt:  VDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK

KAG7023665.1 hypothetical protein SDJN02_14691, partial [Cucurbita argyrosperma subsp. argyrosperma]5.9e-8675.74Show/hide
Query:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD
        MWTS T W TP+SLFILVNAV+ATIAITSRF ADKS    HHLHGGP LLRPPSFL+RV+SFNFS Y++DH+P+P P       PSML RLKSITL+RSD
Subjt:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD

Query:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPATTKAEIAELETAAAG--DDDEE
        S+REPET  PAAEQSPE THHDHS+ RSKS+T+T TPATSLRRRLQKSLSEKLPWA FT A+ ET E  SE+ERRRPAT +AEI E ET   G  DDD+E
Subjt:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPATTKAEIAELETAAAG--DDDEE

Query:  VDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK
        VDVRADDFINKFKQQLKLQRLESLLRYRDM KG+K
Subjt:  VDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK

XP_022960601.1 uncharacterized protein LOC111461335 [Cucurbita moschata]1.1e-8474.89Show/hide
Query:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD
        MWTS T W TP+SLFILVNAV+ATIAITSRF ADKS    HHLHGGP LLRPPSFL+RV+SFNFS Y++DH+P+P P       PSML+RLKSITL+RSD
Subjt:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD

Query:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPATTKAEIAELETAAAG--DDDEE
        S+REPE   PAAEQSPE TH DHS+ RSKS+T+T TPATSLRRRLQKSLSEKLPWA FT A+ ET E  SE+ERRRPAT +AEI E ET   G  DDD+E
Subjt:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPATTKAEIAELETAAAG--DDDEE

Query:  VDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK
        VDVRADDFINKFKQQLKLQRLESLLRYRDM KG+K
Subjt:  VDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK

XP_022987481.1 uncharacterized protein LOC111485023 [Cucurbita maxima]1.4e-8473.95Show/hide
Query:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD
        MWTS T W TP+SLFILVNAV+ATIAITSRF ADKS    HHLHGGP LLRPPSFL+RV+SFNFS Y+ DH+P+P P       PSML+RLKSITL+RSD
Subjt:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD

Query:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPATTKAEIAELETAAAG-----DD
        S+REPET  PAAEQ PE THHDHS+ RSKS+T+T TPATSLRRRLQKSLSEKLPWA FT A+ ET E  SE+ERRRP+T +AEI E ET   G     +D
Subjt:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPATTKAEIAELETAAAG-----DD

Query:  DEEVDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK
        D+EVDVRADDFINKFKQQLKLQRLESLLRYRDM KGKK
Subjt:  DEEVDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK

TrEMBL top hitse value%identityAlignment
A0A5A7UQD5 DUF761 domain-containing protein/DUF4408 domain-containing protein1.7e-7570.76Show/hide
Query:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD
        M TS T W+TPTSLFI +N V+ATIAITSRF ADKS   RHHLH G  LLRPPSFLDRVKSFNF+ + S+++PNPDP     R PSMLDRLKSI+++RSD
Subjt:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD

Query:  SIREPETPQPAAE---QSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPATTKAEIAELETAAAGDDDE
        SIR+PE PQPAAE   Q+PE  H DHSV RSKS+T TLTPATSLRRRLQKSLSEKL W   T  + ETEE  +E+ERRRPAT +AE  E ET   G  +E
Subjt:  SIREPETPQPAAE---QSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPATTKAEIAELETAAAGDDDE

Query:  EVDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK
        EVD RADDFINKFKQQLKLQRLESLLRYRDM  GKK
Subjt:  EVDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK

A0A6J1ECG2 uncharacterized protein LOC1114328261.5e-8273.42Show/hide
Query:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD
        MWTS   WVTPTSLFIL+N V+ATI ITS          R HLH GPALLR PSFLDRVKSFN   YHSDHHPNPDPPT+L RAPS+LDRLKSITL RSD
Subjt:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD

Query:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPF----TAAEDETEESASEVERRRPATTKAEIAELETAAAGDDD
        S  E ETPQP AEQSPE THHDHSV RSKS+T    P TS R +LQKSLSEKL W  F    TAA++ETEES SE+ERRRPATT AEI E +T  A D++
Subjt:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPF----TAAEDETEESASEVERRRPATTKAEIAELETAAAGDDD

Query:  EEVDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK
        EEVDVRADDFINKFKQQLKLQRLESLLRYRDM KGKK
Subjt:  EEVDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK

A0A6J1H824 uncharacterized protein LOC1114613355.4e-8574.89Show/hide
Query:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD
        MWTS T W TP+SLFILVNAV+ATIAITSRF ADKS    HHLHGGP LLRPPSFL+RV+SFNFS Y++DH+P+P P       PSML+RLKSITL+RSD
Subjt:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD

Query:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPATTKAEIAELETAAAG--DDDEE
        S+REPE   PAAEQSPE TH DHS+ RSKS+T+T TPATSLRRRLQKSLSEKLPWA FT A+ ET E  SE+ERRRPAT +AEI E ET   G  DDD+E
Subjt:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPATTKAEIAELETAAAG--DDDEE

Query:  VDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK
        VDVRADDFINKFKQQLKLQRLESLLRYRDM KG+K
Subjt:  VDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK

A0A6J1JED0 uncharacterized protein LOC1114850237.0e-8573.95Show/hide
Query:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD
        MWTS T W TP+SLFILVNAV+ATIAITSRF ADKS    HHLHGGP LLRPPSFL+RV+SFNFS Y+ DH+P+P P       PSML+RLKSITL+RSD
Subjt:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD

Query:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPATTKAEIAELETAAAG-----DD
        S+REPET  PAAEQ PE THHDHS+ RSKS+T+T TPATSLRRRLQKSLSEKLPWA FT A+ ET E  SE+ERRRP+T +AEI E ET   G     +D
Subjt:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPATTKAEIAELETAAAG-----DD

Query:  DEEVDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK
        D+EVDVRADDFINKFKQQLKLQRLESLLRYRDM KGKK
Subjt:  DEEVDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK

A0A6J1KNT3 uncharacterized protein LOC1114969498.8e-8071.73Show/hide
Query:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD
        MWTS   WVTPTSLFIL+N V+ATI ITS          R HLH GPALLR PS LDRVKSFN   YHSDHHPNPDPPT+L RAPS+LDRLKSITL RSD
Subjt:  MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSD

Query:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPF----TAAEDETEESASEVERRRPATTKAEIAELETAAAGDDD
        S  EPETPQ AAEQS E THHDHS+ R KS+T    P TS R +LQKSLSEKL W  F    TAA+DETEES SE+ERRRPATT AEI E +     D++
Subjt:  SIREPETPQPAAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPF----TAAEDETEESASEVERRRPATTKAEIAELETAAAGDDD

Query:  EEVDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK
        EEVDVRADDFINKFKQQLKLQRLESLLRYRDM KGKK
Subjt:  EEVDVRADDFINKFKQQLKLQRLESLLRYRDMTKGKK

SwissProt top hitse value%identityAlignment
F4K956 Pathogen-associated molecular patterns-induced protein A707.6e-2036.65Show/hide
Query:  SHAHRH-HLHGGPA---LLRPPSFLDRVKS-----FNFSSYHSD-----HHPNP-------------DP------------------------PTQLARA
        SH+H H  LH  PA   L R PS LDRVKS     F F  Y+ +     HH  P             DP                        P  L RA
Subjt:  SHAHRH-HLHGGPA---LLRPPSFLDRVKS-----FNFSSYHSD-----HHPNP-------------DP------------------------PTQLARA

Query:  PSMLDRLKSITLD---RSDSIREPETPQPAAEQSPE-TTHHDHSVGRSKSDT-RTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPAT
        PS+L+R+KSI L    RSD       P    +Q+P+   H +H   RSKS++ + +        ++ KS SEK  +  F  +  E  E+   +ERRRP T
Subjt:  PSMLDRLKSITLD---RSDSIREPETPQPAAEQSPE-TTHHDHSVGRSKSDT-RTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPAT

Query:  TKAEIAELETAAAGDDDEEVDVRADDFINKFKQQLKLQRLESLLRYRDMTK
        T+ E     + + GD ++ VD +A DFINKFKQQLKLQRL+S+LRY++M K
Subjt:  TKAEIAELETAAAGDDDEEVDVRADDFINKFKQQLKLQRLESLLRYRDMTK

Arabidopsis top hitse value%identityAlignment
AT2G26110.1 Protein of unknown function (DUF761)6.4e-2232.86Show/hide
Query:  TSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSD--------------------HHPNPDPPTQ--
        T+  +W TPT LF+ +N ++ TIAI+S F++  +  ++  +       R PS + R+KS NFSS+ S                     H P      Q  
Subjt:  TSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSD--------------------HHPNPDPPTQ--

Query:  LARAPSMLDRLKSITL----------------------DRSDSIREPET--PQPAAEQSPETTHHD---HSVGRSKSDTRTLTPATSLR-----RRLQKS
        L+R+PS+L R+KS  L                       + + ++E E    Q   EQS E  +     + V R+KSDT    PA  +R     ++++KS
Subjt:  LARAPSMLDRLKSITL----------------------DRSDSIREPET--PQPAAEQSPETTHHD---HSVGRSKSDTRTLTPATSLR-----RRLQKS

Query:  LSEKLPWAPFTAAEDETEESASEVERRRPATTKA-EIAELETAAAGDDDEEVDVRADDFINKFKQQLKLQRLESLLRYRDMTK
         S K P++ F       +E    VE RRPAT K   +  +E A     DEEVD +ADDFIN+FK QLKLQR++S+ +Y++M K
Subjt:  LSEKLPWAPFTAAEDETEESASEVERRRPATTKA-EIAELETAAAGDDDEEVDVRADDFINKFKQQLKLQRLESLLRYRDMTK

AT4G26130.1 unknown protein2.6e-2333.56Show/hide
Query:  TSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRH--------HLHGGPALLRPPSFLDRVKSFNFSSY----------HSDHHPNPDPPTQLARA
        TS T W+TPT+LF+L+N  +ATI IT+RF++     ++H        H        RPPS +DRVKS NF  Y          +S   PNP+PP      
Subjt:  TSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRH--------HLHGGPALLRPPSFLDRVKSFNFSSY----------HSDHHPNPDPPTQLARA

Query:  PSMLDRLKSIT-----------------------------------LDRSDSIREPETPQPAAEQ-------------SPETTHHDHSVGRSKSDTRTLT
        PS+L R+KSI                                    +   D   EP    P+  Q              P+ T    +  R+KS++    
Subjt:  PSMLDRLKSIT-----------------------------------LDRSDSIREPETPQPAAEQ-------------SPETTHHDHSVGRSKSDTRTLT

Query:  PATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPATTKAEIAELETAAAGDDDEE-VDVRADDFINKFKQQLKLQRLESLLRYRDMTK
        PAT  +++  K + +          E+ET E+   VE+RRP T + E     T + GD  EE VD +A +FINKFKQQLKLQRL+S LRYR+M K
Subjt:  PATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPATTKAEIAELETAAAGDDDEE-VDVRADDFINKFKQQLKLQRLESLLRYRDMTK

AT5G56980.1 unknown protein5.4e-2136.65Show/hide
Query:  SHAHRH-HLHGGPA---LLRPPSFLDRVKS-----FNFSSYHSD-----HHPNP-------------DP------------------------PTQLARA
        SH+H H  LH  PA   L R PS LDRVKS     F F  Y+ +     HH  P             DP                        P  L RA
Subjt:  SHAHRH-HLHGGPA---LLRPPSFLDRVKS-----FNFSSYHSD-----HHPNP-------------DP------------------------PTQLARA

Query:  PSMLDRLKSITLD---RSDSIREPETPQPAAEQSPE-TTHHDHSVGRSKSDT-RTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPAT
        PS+L+R+KSI L    RSD       P    +Q+P+   H +H   RSKS++ + +        ++ KS SEK  +  F  +  E  E+   +ERRRP T
Subjt:  PSMLDRLKSITLD---RSDSIREPETPQPAAEQSPE-TTHHDHSVGRSKSDT-RTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPAT

Query:  TKAEIAELETAAAGDDDEEVDVRADDFINKFKQQLKLQRLESLLRYRDMTK
        T+ E     + + GD ++ VD +A DFINKFKQQLKLQRL+S+LRY++M K
Subjt:  TKAEIAELETAAAGDDDEEVDVRADDFINKFKQQLKLQRLESLLRYRDMTK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGACTTCCTTCACCGCTTGGGTCACCCCCACCTCCCTCTTCATCCTCGTCAACGCCGTCGTCGCCACCATCGCCATCACCTCCCGCTTCGCCGCCGACAAATCCCA
CGCCCACCGCCACCATCTCCACGGCGGCCCCGCCCTCCTCCGCCCCCCTTCTTTCCTAGACAGAGTCAAGTCCTTCAACTTCTCCTCCTACCACTCCGACCACCACCCCA
ATCCGGACCCGCCGACCCAACTCGCCCGAGCTCCGTCGATGTTGGATCGGCTCAAATCCATCACCCTCGACAGATCCGATTCAATTCGAGAACCGGAAACACCACAGCCG
GCGGCAGAACAGAGTCCGGAGACGACCCACCACGACCATTCCGTCGGCCGGAGCAAGTCCGACACCAGAACCCTCACTCCGGCTACGAGCTTGCGGCGGCGATTGCAGAA
ATCGCTGAGCGAGAAGCTGCCGTGGGCGCCGTTCACGGCGGCGGAGGACGAAACAGAGGAATCAGCTAGCGAAGTCGAACGGCGTCGTCCGGCGACGACGAAAGCGGAGA
TTGCGGAACTGGAAACGGCGGCGGCCGGCGACGACGACGAGGAGGTTGACGTGAGAGCCGACGATTTCATTAACAAGTTCAAGCAGCAGCTGAAGTTGCAGAGGCTGGAA
TCTCTGTTGCGTTACAGAGACATGACTAAAGGCAAAAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTGGACTTCCTTCACCGCTTGGGTCACCCCCACCTCCCTCTTCATCCTCGTCAACGCCGTCGTCGCCACCATCGCCATCACCTCCCGCTTCGCCGCCGACAAATCCCA
CGCCCACCGCCACCATCTCCACGGCGGCCCCGCCCTCCTCCGCCCCCCTTCTTTCCTAGACAGAGTCAAGTCCTTCAACTTCTCCTCCTACCACTCCGACCACCACCCCA
ATCCGGACCCGCCGACCCAACTCGCCCGAGCTCCGTCGATGTTGGATCGGCTCAAATCCATCACCCTCGACAGATCCGATTCAATTCGAGAACCGGAAACACCACAGCCG
GCGGCAGAACAGAGTCCGGAGACGACCCACCACGACCATTCCGTCGGCCGGAGCAAGTCCGACACCAGAACCCTCACTCCGGCTACGAGCTTGCGGCGGCGATTGCAGAA
ATCGCTGAGCGAGAAGCTGCCGTGGGCGCCGTTCACGGCGGCGGAGGACGAAACAGAGGAATCAGCTAGCGAAGTCGAACGGCGTCGTCCGGCGACGACGAAAGCGGAGA
TTGCGGAACTGGAAACGGCGGCGGCCGGCGACGACGACGAGGAGGTTGACGTGAGAGCCGACGATTTCATTAACAAGTTCAAGCAGCAGCTGAAGTTGCAGAGGCTGGAA
TCTCTGTTGCGTTACAGAGACATGACTAAAGGCAAAAAATAA
Protein sequenceShow/hide protein sequence
MWTSFTAWVTPTSLFILVNAVVATIAITSRFAADKSHAHRHHLHGGPALLRPPSFLDRVKSFNFSSYHSDHHPNPDPPTQLARAPSMLDRLKSITLDRSDSIREPETPQP
AAEQSPETTHHDHSVGRSKSDTRTLTPATSLRRRLQKSLSEKLPWAPFTAAEDETEESASEVERRRPATTKAEIAELETAAAGDDDEEVDVRADDFINKFKQQLKLQRLE
SLLRYRDMTKGKK