; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg22984 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg22984
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionextensin-like isoform X3
Genome locationCarg_Chr02:2004588..2005407
RNA-Seq ExpressionCarg22984
SyntenyCarg22984
Gene Ontology termsGO:0022900 - electron transport chain (biological process)
GO:0009055 - electron transfer activity (molecular function)
InterPro domainsIPR003245 - Phytocyanin domain
IPR008972 - Cupredoxin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605125.1 hypothetical protein SDJN03_02442, partial [Cucurbita argyrosperma subsp. sororia]1.0e-11797.82Show/hide
Query:  MSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQPPSRPRQPRTPPPSRPRQPRPPPPPRRRRRLPPSPPPPRRPRIPPSPSSPTPQNPRKIIV
        MSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQPPSRPRQPRTPPPSRPRQPR PPPP   RRLPPSPPPPRRPRIPPSPSSPTPQNPRKIIV
Subjt:  MSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQPPSRPRQPRTPPPSRPRQPRPPPPPRRRRRLPPSPPPPRRPRIPPSPSSPTPQNPRKIIV

Query:  GGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLG
        GGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLG
Subjt:  GGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLG

Query:  SMKFSITPQSRRLPSPPPPSRPPRIPPPR
        SMKFSITPQSRRLPSPPPPSR PRIPPPR
Subjt:  SMKFSITPQSRRLPSPPPPSRPPRIPPPR

KAG7023921.1 hypothetical protein SDJN02_14949, partial [Cucurbita argyrosperma subsp. argyrosperma]3.4e-132100Show/hide
Query:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQPPSRPRQPRTPPPSRPRQPRPPPPPRRRRRLPPSPPPPRRPRIP
        MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQPPSRPRQPRTPPPSRPRQPRPPPPPRRRRRLPPSPPPPRRPRIP
Subjt:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQPPSRPRQPRTPPPSRPRQPRPPPPPRRRRRLPPSPPPPRRPRIP

Query:  PSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVLKQQKA
        PSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVLKQQKA
Subjt:  PSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVLKQQKA

Query:  YYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH
        YYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH
Subjt:  YYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH

KAG7035126.1 hypothetical protein SDJN02_01921, partial [Cucurbita argyrosperma subsp. argyrosperma]3.4e-132100Show/hide
Query:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQPPSRPRQPRTPPPSRPRQPRPPPPPRRRRRLPPSPPPPRRPRIP
        MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQPPSRPRQPRTPPPSRPRQPRPPPPPRRRRRLPPSPPPPRRPRIP
Subjt:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQPPSRPRQPRTPPPSRPRQPRPPPPPRRRRRLPPSPPPPRRPRIP

Query:  PSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVLKQQKA
        PSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVLKQQKA
Subjt:  PSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVLKQQKA

Query:  YYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH
        YYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH
Subjt:  YYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH

XP_022947108.1 extensin-like isoform X3 [Cucurbita moschata]4.3e-11184.64Show/hide
Query:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQ----------PPSRPRQPRTPPPSRPRQPRPPP--------PPR
        MASIFSNVIVLFVLACMSTLSSADP WFRARNTTF FSSRHKRPRF PPPSRCLPQ          PPSRPRQPRTPPPSRP++PR PP        PP 
Subjt:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQ----------PPSRPRQPRTPPPSRPRQPRPPP--------PPR

Query:  RRRRLPPS---PPPPRRPRIPPSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL
        RR R PP    PP P RPR PPS   PTPQNPRKIIVGGSKYWRLGFDYNDWI KNGPFY+NDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL
Subjt:  RRRRLPPS---PPPPRRPRIPPSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL

Query:  VANLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH
        VANLTQGSGNGFNFVLKQQKAYYFAC EGNGFHCNLGSMKFSITPQSR LPSPPPPSRPPRIPPPRH
Subjt:  VANLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH

XP_022947109.1 extensin-like isoform X4 [Cucurbita moschata]4.3e-11184.64Show/hide
Query:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQ----------PPSRPRQPRTPPPSRPRQPRPPP--------PPR
        MASIFSNVIVLFVLACMSTLSSADP WFRARNTTF FSSRHKRPRF PPPSRCLPQ          PPSRPRQPRTPPPSRP++PR PP        PP 
Subjt:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQ----------PPSRPRQPRTPPPSRPRQPRPPP--------PPR

Query:  RRRRLPPS---PPPPRRPRIPPSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL
        RR R PP    PP P RPR PPS   PTPQNPRKIIVGGSKYWRLGFDYNDWI KNGPFY+NDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL
Subjt:  RRRRLPPS---PPPPRRPRIPPSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL

Query:  VANLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH
        VANLTQGSGNGFNFVLKQQKAYYFAC EGNGFHCNLGSMKFSITPQSR LPSPPPPSRPPRIPPPRH
Subjt:  VANLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH

TrEMBL top hitse value%identityAlignment
A0A6J1G5G3 extensin-like isoform X22.1e-11184.64Show/hide
Query:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQ----------PPSRPRQPRTPPPSRPRQPRPPP--------PPR
        MASIFSNVIVLFVLACMSTLSSADP WFRARNTTF FSSRHKRPRF PPPSRCLPQ          PPSRPRQPRTPPPSRP++PR PP        PP 
Subjt:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQ----------PPSRPRQPRTPPPSRPRQPRPPP--------PPR

Query:  RRRRLPPS---PPPPRRPRIPPSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL
        RR R PP    PP P RPR PPS   PTPQNPRKIIVGGSKYWRLGFDYNDWI KNGPFY+NDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL
Subjt:  RRRRLPPS---PPPPRRPRIPPSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL

Query:  VANLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH
        VANLTQGSGNGFNFVLKQQKAYYFAC EGNGFHCNLGSMKFSITPQSR LPSPPPPSRPPRIPPPRH
Subjt:  VANLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH

A0A6J1G5Q0 extensin-like isoform X32.1e-11184.64Show/hide
Query:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQ----------PPSRPRQPRTPPPSRPRQPRPPP--------PPR
        MASIFSNVIVLFVLACMSTLSSADP WFRARNTTF FSSRHKRPRF PPPSRCLPQ          PPSRPRQPRTPPPSRP++PR PP        PP 
Subjt:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQ----------PPSRPRQPRTPPPSRPRQPRPPP--------PPR

Query:  RRRRLPPS---PPPPRRPRIPPSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL
        RR R PP    PP P RPR PPS   PTPQNPRKIIVGGSKYWRLGFDYNDWI KNGPFY+NDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL
Subjt:  RRRRLPPS---PPPPRRPRIPPSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL

Query:  VANLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH
        VANLTQGSGNGFNFVLKQQKAYYFAC EGNGFHCNLGSMKFSITPQSR LPSPPPPSRPPRIPPPRH
Subjt:  VANLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH

A0A6J1G5T9 extensin-like isoform X42.1e-11184.64Show/hide
Query:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQ----------PPSRPRQPRTPPPSRPRQPRPPP--------PPR
        MASIFSNVIVLFVLACMSTLSSADP WFRARNTTF FSSRHKRPRF PPPSRCLPQ          PPSRPRQPRTPPPSRP++PR PP        PP 
Subjt:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQ----------PPSRPRQPRTPPPSRPRQPRPPP--------PPR

Query:  RRRRLPPS---PPPPRRPRIPPSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL
        RR R PP    PP P RPR PPS   PTPQNPRKIIVGGSKYWRLGFDYNDWI KNGPFY+NDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL
Subjt:  RRRRLPPS---PPPPRRPRIPPSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL

Query:  VANLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH
        VANLTQGSGNGFNFVLKQQKAYYFAC EGNGFHCNLGSMKFSITPQSR LPSPPPPSRPPRIPPPRH
Subjt:  VANLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH

A0A6J1G5W9 extensin-like isoform X12.1e-11184.64Show/hide
Query:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQ----------PPSRPRQPRTPPPSRPRQPRPPP--------PPR
        MASIFSNVIVLFVLACMSTLSSADP WFRARNTTF FSSRHKRPRF PPPSRCLPQ          PPSRPRQPRTPPPSRP++PR PP        PP 
Subjt:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQ----------PPSRPRQPRTPPPSRPRQPRPPP--------PPR

Query:  RRRRLPPS---PPPPRRPRIPPSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL
        RR R PP    PP P RPR PPS   PTPQNPRKIIVGGSKYWRLGFDYNDWI KNGPFY+NDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL
Subjt:  RRRRLPPS---PPPPRRPRIPPSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKL

Query:  VANLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH
        VANLTQGSGNGFNFVLKQQKAYYFAC EGNGFHCNLGSMKFSITPQSR LPSPPPPSRPPRIPPPRH
Subjt:  VANLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH

A0A6J1L0M6 extensin-like isoform X11.4e-10783.4Show/hide
Query:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQ----------PPSRPRQPRTP---------PPSRPRQPRPPPPP
        MASIFSNVIVL+VLACMSTLSSADPGWFRARNTTF FSSRHKRP F PP SRCLPQ          PPS+PR+PRTP         PPSRPRQPR PPP 
Subjt:  MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQ----------PPSRPRQPRTP---------PPSRPRQPRPPPPP

Query:  RRRRRLPPSPPPPRRPRIPPSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKLVA
           RRLPPSPPP RRPR PP    PTPQNPRKIIVGGSKYWRLGFDYNDWI KNGPFYLNDILVFKYD PNSSTPPHNVYLLPNMQS NKCDFRRAKLVA
Subjt:  RRRRRLPPSPPPPRRPRIPPSPSSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKLVA

Query:  NLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH
        NLTQGSGNGFNFVLKQQKAYYFAC EG GFHCN+GSMKFSITPQSRRLPSP PPSRPPRIPPPRH
Subjt:  NLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLGSMKFSITPQSRRLPSPPPPSRPPRIPPPRH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15770.1 Cupredoxin superfamily protein3.4e-2646.55Show/hide
Query:  QNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVLKQQKAYYFACAEG
        + P+KIIVGGS  W+ G DY DW  KN PFY+ND+LVFKYD   S+   +NVYL  +  S   CD + A+ + +  +GS   FNF LK+ + Y+FA  E 
Subjt:  QNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVLKQQKAYYFACAEG

Query:  NGFHCNLGSMKFSITP
        +G +C   +MKF+I P
Subjt:  NGFHCNLGSMKFSITP

AT2G15780.1 Cupredoxin superfamily protein8.2e-3655.74Show/hide
Query:  TPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVLKQQKAYYFACA
        T   PRKIIVGG K W  GF+Y DW  K  PF+LNDILVFKY+PP   T  H+VYLLPN  S  KCD ++ K++A+  QG+G GF FVLKQ K YY +C 
Subjt:  TPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVLKQQKAYYFACA

Query:  EGNGFHCNLGSMKFSITPQSRR
        E +G HC+ G+MKF++ P   R
Subjt:  EGNGFHCNLGSMKFSITPQSRR

AT4G33930.1 Cupredoxin superfamily protein2.5e-1638.64Show/hide
Query:  SSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKY---DPPNSSTPPHN-----VYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVL
        ++P P   RKI V     W+ G+ Y +W  K+ PFY++D+LVFKY   D   S T   N     VYLLP+M+S  +C+  R K +      S  GF  +L
Subjt:  SSPTPQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKY---DPPNSSTPPHN-----VYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVL

Query:  KQQKAYYFACAEGNGFHCNLGSMKFSITPQSR
        ++ + YYFA  + N   CN  +MKFS+ P  R
Subjt:  KQQKAYYFACAEGNGFHCNLGSMKFSITPQSR

AT4G34300.1 Cupredoxin superfamily protein1.6e-1538.1Show/hide
Query:  PQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKY---DPPNSSTPPH------NVYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVLKQQ
        P   RKI V     W+ G+ Y +W  K+ PFY+ND+LVF Y   D   S T  H      +VYLLP+M+S  +C+  R K +      S  GF  +L++ 
Subjt:  PQNPRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKY---DPPNSSTPPH------NVYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVLKQQ

Query:  KAYYFACAEGNGFHCNLGSMKFSITP
          YYF   + N   CN  +MKFS+ P
Subjt:  KAYYFACAEGNGFHCNLGSMKFSITP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCATATTTTCTAACGTTATCGTTCTCTTCGTGTTAGCGTGCATGTCAACTCTAAGCTCGGCCGACCCGGGTTGGTTTAGGGCTCGTAATACCACTTTTCCTTT
CAGTTCTAGGCATAAACGTCCCCGGTTCTCGCCACCACCATCACGATGTTTGCCACAACCACCATCACGACCACGACAACCTCGAACACCACCACCATCACGACCACGAC
AACCTCGACCACCACCACCACCACGACGACGACGACGTTTGCCACCGTCCCCACCACCACCACGACGGCCTCGAATACCGCCATCACCATCATCGCCGACCCCACAAAAT
CCAAGAAAGATTATAGTGGGTGGTTCTAAGTATTGGCGTCTTGGGTTTGACTATAATGATTGGATACATAAGAACGGTCCTTTTTATCTAAACGATATTCTAGTGTTCAA
ATACGATCCTCCCAACAGCTCAACTCCTCCTCATAATGTTTATTTGCTACCAAACATGCAAAGCTTGAACAAGTGTGATTTTAGAAGGGCTAAATTGGTGGCAAATTTAA
CACAAGGAAGTGGAAATGGGTTTAATTTTGTTCTTAAACAACAAAAAGCTTACTACTTTGCTTGTGCTGAAGGCAATGGCTTCCATTGCAACCTTGGATCCATGAAGTTC
TCTATAACCCCACAATCAAGGCGTCTACCATCACCGCCACCACCATCACGACCGCCACGGATACCTCCACCACGACAT
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCATATTTTCTAACGTTATCGTTCTCTTCGTGTTAGCGTGCATGTCAACTCTAAGCTCGGCCGACCCGGGTTGGTTTAGGGCTCGTAATACCACTTTTCCTTT
CAGTTCTAGGCATAAACGTCCCCGGTTCTCGCCACCACCATCACGATGTTTGCCACAACCACCATCACGACCACGACAACCTCGAACACCACCACCATCACGACCACGAC
AACCTCGACCACCACCACCACCACGACGACGACGACGTTTGCCACCGTCCCCACCACCACCACGACGGCCTCGAATACCGCCATCACCATCATCGCCGACCCCACAAAAT
CCAAGAAAGATTATAGTGGGTGGTTCTAAGTATTGGCGTCTTGGGTTTGACTATAATGATTGGATACATAAGAACGGTCCTTTTTATCTAAACGATATTCTAGTGTTCAA
ATACGATCCTCCCAACAGCTCAACTCCTCCTCATAATGTTTATTTGCTACCAAACATGCAAAGCTTGAACAAGTGTGATTTTAGAAGGGCTAAATTGGTGGCAAATTTAA
CACAAGGAAGTGGAAATGGGTTTAATTTTGTTCTTAAACAACAAAAAGCTTACTACTTTGCTTGTGCTGAAGGCAATGGCTTCCATTGCAACCTTGGATCCATGAAGTTC
TCTATAACCCCACAATCAAGGCGTCTACCATCACCGCCACCACCATCACGACCGCCACGGATACCTCCACCACGACAT
Protein sequenceShow/hide protein sequence
MASIFSNVIVLFVLACMSTLSSADPGWFRARNTTFPFSSRHKRPRFSPPPSRCLPQPPSRPRQPRTPPPSRPRQPRPPPPPRRRRRLPPSPPPPRRPRIPPSPSSPTPQN
PRKIIVGGSKYWRLGFDYNDWIHKNGPFYLNDILVFKYDPPNSSTPPHNVYLLPNMQSLNKCDFRRAKLVANLTQGSGNGFNFVLKQQKAYYFACAEGNGFHCNLGSMKF
SITPQSRRLPSPPPPSRPPRIPPPRH