CuGenDBv2

CmaCh04G009980 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh04G009980
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: Encodes a protein whose expression is responsive to nematode infection, putative
Genome location: Cma_Chr04:5162523..5164562
RNA-Seq Expression: CmaCh04G009980
Synteny: CmaCh04G009980
Gene Ontology terms: NA
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant
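
The genome location field above can be split into its parts with a few lines of Python. This is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return seqid, start, end


def span_length(location: str) -> int:
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


print(parse_location("Cma_Chr04:5162523..5164562"))
print(span_length("Cma_Chr04:5162523..5164562"))
```

The span covers the whole gene model on the chromosome, so it is expected to be longer than the mRNA and CDS sequences listed further down.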


Homology
GenBank top hits | e value | % identity | Alignment
XP_022942788.1 uncharacterized protein At1g66480 isoform X1 [Cucurbita moschata] | e value: 1.3e-97 | % identity: 92.09
Query:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES
        MGN FGLKKT KVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPK+PKEQAPRRVRS INMSAKDRLES
Subjt:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR
        LMLSRRSASDLTIMKPKSVLAEEG  ++EGSEATRVKVRLPKAEVERVLKES+DEAEAAERIMGLYMAKA E+VCQN +KMEKEKDVIIKPREKM+K RR
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR

Query:  VSFMATIEAMPQIEV
        VSFM TIEAMPQI V
Subjt:  VSFMATIEAMPQIEV

XP_022942789.1 uncharacterized protein At1g66480 isoform X2 [Cucurbita moschata] | e value: 5.4e-96 | % identity: 91.16
Query:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES
        MGN FGLKKT KVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPK+PKEQAPRRVRS INMSAKDRLES
Subjt:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR
        LMLSRRSASDLTIMKPKSVLAEEG  ++EGSEATRVKVRLPKAEVERVLKES+DEAEAAERIMGLYMAKA E+VCQN +KMEKEKDVIIKPREK    RR
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR

Query:  VSFMATIEAMPQIEV
        VSFM TIEAMPQI V
Subjt:  VSFMATIEAMPQIEV

XP_022986400.1 uncharacterized protein At1g66480 isoform X1 [Cucurbita maxima] | e value: 6.2e-108 | % identity: 100
Query:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES
        MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES
Subjt:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR
        LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR

Query:  VSFMATIEAMPQIEV
        VSFMATIEAMPQIEV
Subjt:  VSFMATIEAMPQIEV

XP_022986406.1 uncharacterized protein At1g66480 isoform X2 [Cucurbita maxima] | e value: 8.4e-105 | % identity: 98.6
Query:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES
        MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES
Subjt:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR
        LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPR   EKRRR
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR

Query:  VSFMATIEAMPQIEV
        VSFMATIEAMPQIEV
Subjt:  VSFMATIEAMPQIEV

XP_023515798.1 uncharacterized protein At1g66480 [Cucurbita pepo subsp. pepo] | e value: 1.2e-95 | % identity: 91.16
Query:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES
        MGNTFGLKKT KVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRS INMSAKDRLES
Subjt:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR
        LMLSRRSASDLTIMKPKSVLAEEG  ++EGSEATRVKVRLPKAEVERVLKES+DEAEAAERIMGLYMAKA E+VCQN + MEKEKDV+IKPREK    RR
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR

Query:  VSFMATIEAMPQIEV
        VSFM TIEAMPQI V
Subjt:  VSFMATIEAMPQIEV

TrEMBL top hits | e value | % identity | Alignment
A0A1S3BZC7 uncharacterized protein At1g66480 isoform X1 | e value: 7.7e-80 | % identity: 80.18
Query:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES
        MGNTFG+KKT KVM +SG+T+KL  PVQ  DVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LPK+PKEQAPRRVRS INMSAKDRLES
Subjt:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGSGEKE--GSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKR
        LML+RRSASDLTIMKPKSVL EEG GE E  GS ATRVKVRLPKAEVER+LKE +DEAEAAERIMGLY  K  E+VC+N  K EKEK  IIKPR   EK+
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGSGEKE--GSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKR

Query:  RRVSFMATIEAMPQIEV
        RRVSFM T+EA  QI V
Subjt:  RRVSFMATIEAMPQIEV

A0A6J1FSC9 uncharacterized protein At1g66480 isoform X2 | e value: 2.6e-96 | % identity: 91.16
Query:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES
        MGN FGLKKT KVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPK+PKEQAPRRVRS INMSAKDRLES
Subjt:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR
        LMLSRRSASDLTIMKPKSVLAEEG  ++EGSEATRVKVRLPKAEVERVLKES+DEAEAAERIMGLYMAKA E+VCQN +KMEKEKDVIIKPREK    RR
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR

Query:  VSFMATIEAMPQIEV
        VSFM TIEAMPQI V
Subjt:  VSFMATIEAMPQIEV

A0A6J1FVP5 uncharacterized protein At1g66480 isoform X1 | e value: 6.3e-98 | % identity: 92.09
Query:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES
        MGN FGLKKT KVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPK+PKEQAPRRVRS INMSAKDRLES
Subjt:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR
        LMLSRRSASDLTIMKPKSVLAEEG  ++EGSEATRVKVRLPKAEVERVLKES+DEAEAAERIMGLYMAKA E+VCQN +KMEKEKDVIIKPREKM+K RR
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR

Query:  VSFMATIEAMPQIEV
        VSFM TIEAMPQI V
Subjt:  VSFMATIEAMPQIEV

A0A6J1JB09 uncharacterized protein At1g66480 isoform X2 | e value: 4.1e-105 | % identity: 98.6
Query:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES
        MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES
Subjt:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR
        LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPR   EKRRR
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR

Query:  VSFMATIEAMPQIEV
        VSFMATIEAMPQIEV
Subjt:  VSFMATIEAMPQIEV

A0A6J1JDY8 uncharacterized protein At1g66480 isoform X1 | e value: 3.0e-108 | % identity: 100
Query:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES
        MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES
Subjt:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR
        LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRR

Query:  VSFMATIEAMPQIEV
        VSFMATIEAMPQIEV
Subjt:  VSFMATIEAMPQIEV

SwissProt top hits | e value | % identity | Alignment
Q6NLC8 Uncharacterized protein At1g66480 | e value: 2.8e-34 | % identity: 46.45
Query:  MGNTFGLK-KTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAP---------RRVRSEI
        MGN+  +K K AKVM + G+T ++  PV AR+V  DYPG+VLL+S+AVKH+GVR+KPLE +Q L  K+ YFLVELPK+P E            RRV S I
Subjt:  MGNTFGLK-KTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAP---------RRVRSEI

Query:  NMSAKDRLESLMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEA-AERIMGLYMAKATE
        ++ AK+RL+ LMLSRR+ SD+TI +       +G G + G   T V++RLP++++ ++++E+ ++A A AE+I+G+YM ++ E
Subjt:  NMSAKDRLESLMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEA-AERIMGLYMAKATE

Arabidopsis top hits | e value | % identity | Alignment
AT1G66480.1 plastid movement impaired 2 | e value: 2.0e-35 | % identity: 46.45
Query:  MGNTFGLK-KTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAP---------RRVRSEI
        MGN+  +K K AKVM + G+T ++  PV AR+V  DYPG+VLL+S+AVKH+GVR+KPLE +Q L  K+ YFLVELPK+P E            RRV S I
Subjt:  MGNTFGLK-KTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAP---------RRVRSEI

Query:  NMSAKDRLESLMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEA-AERIMGLYMAKATE
        ++ AK+RL+ LMLSRR+ SD+TI +       +G G + G   T V++RLP++++ ++++E+ ++A A AE+I+G+YM ++ E
Subjt:  NMSAKDRLESLMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEA-AERIMGLYMAKATE

AT1G71015.1 unknown protein | e value: 1.6e-40 | % identity: 53.94
Query:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES
        MGN+ G KKTA +M ++G++ KL  PV+A  VVKD+PG VLLESEAVK  G+RAKPLE HQ L +KR+YF+VELP+  KE+ PRRVRS I MSAK+RLE+
Subjt:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGL
        L LSRRS+SDL++MK K+ + +      E  E + VK++LPK ++E++ KES   ++ + +I  L
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGL

AT2G01340.1 Encodes a protein whose expression is responsive to nematode infection. | e value: 1.1e-43 | % identity: 52.61
Query:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES
        MGN+ G KKT KVM + G+T KL  PV A +V+KD+PG VLL+SE+VKHYG RAKPLE  Q+L  KRLYF+VE     KE  PRRVRS I++SAK+RLES
Subjt:  MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAK-ATENVCQNGQKMEKEKD-----VIIKPREK
        LML+RRS+SDL+I+KP      E   E+EG+   RVKVR+PKAE+E+++KE   EAEA ++I  L+MAK   E   QN ++ E              R  
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAK-ATENVCQNGQKMEKEKD-----VIIKPREK

Query:  MEKRRRVSFMA
          + +RVSFMA
Subjt:  MEKRRRVSFMA

AT5G37840.1 BEST Arabidopsis thaliana protein match is: plastid movement impaired 2 (TAIR:AT1G66480.1) | e value: 3.6e-29 | % identity: 46.78
Query:  MGNTFGLKKT-AKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQ--APRRVRS-EINMSAKD
        MGNT  +++   KVM + GD  +L  PV A D  K+YPGFVLL+SE VK  GVRAKPLE +Q L     YFLV+LP V K      RRV S  I++ AK+
Subjt:  MGNTFGLKKT-AKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQ--APRRVRS-EINMSAKD

Query:  RLESLMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYM
        RLE LMLSRR+ SD+   +   V    G G + G   TRV++RLP++++ ++++ES D +E A +I+  YM
Subjt:  RLESLMLSRRSASDLTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYM


Sequences
CDS sequence
ATGGGAAACACCTTTGGACTCAAGAAGACGGCTAAGGTGATGACGGTCTCCGGCGACACGATCAAACTAACCCCCCCGGTTCAAGCTCGGGATGTCGTCAAGGATTATCC
CGGCTTTGTTTTACTCGAATCCGAGGCTGTGAAACACTACGGAGTTCGAGCAAAGCCATTGGAACTCCACCAAAAGCTTAGCACGAAAAGACTCTATTTTCTTGTGGAGT
TGCCTAAAGTTCCAAAAGAACAGGCTCCACGGCGAGTACGGTCGGAGATCAACATGAGTGCGAAGGATAGGCTAGAGAGCTTGATGTTGTCACGACGGTCAGCATCGGAC
CTAACTATCATGAAACCGAAGAGCGTGTTGGCGGAGGAGGGCAGCGGGGAGAAAGAGGGATCGGAAGCGACACGAGTGAAGGTACGGCTGCCGAAGGCGGAGGTGGAAAG
GGTGTTGAAGGAGAGCAGAGATGAGGCAGAGGCAGCGGAGAGGATTATGGGGTTGTACATGGCCAAAGCCACAGAAAATGTTTGTCAAAATGGACAGAAGATGGAGAAGG
AGAAGGATGTCATCATCAAGCCACGTGAGAAAATGGAGAAGCGAAGACGTGTAAGTTTCATGGCGACAATAGAAGCAATGCCTCAAATTGAGGTGTTGGGATGGGTTTGT
GGGATGAGTTTCGACTCGTCCCCGATCCAACCCGACAGATCTCGTCTCCCTTACCCGAAACAAACTCAAATCTGCACTTTCACCTCTCCTCCAGCGATCTTCCATGACAG
CATTTTGGTTGTGGATTCGTGGTGCTAA
mRNA sequence
TCTTCGACTACGCCCACCATCTTCCTCTTCCCTCTTTCCAACTCTTTCCCTTTTGGCTGCCATGGGAAACACCTTTGGACTCAAGAAGACGGCTAAGGTGATGACGGTCT
CCGGCGACACGATCAAACTAACCCCCCCGGTTCAAGCTCGGGATGTCGTCAAGGATTATCCCGGCTTTGTTTTACTCGAATCCGAGGCTGTGAAACACTACGGAGTTCGA
GCAAAGCCATTGGAACTCCACCAAAAGCTTAGCACGAAAAGACTCTATTTTCTTGTGGAGTTGCCTAAAGTTCCAAAAGAACAGGCTCCACGGCGAGTACGGTCGGAGAT
CAACATGAGTGCGAAGGATAGGCTAGAGAGCTTGATGTTGTCACGACGGTCAGCATCGGACCTAACTATCATGAAACCGAAGAGCGTGTTGGCGGAGGAGGGCAGCGGGG
AGAAAGAGGGATCGGAAGCGACACGAGTGAAGGTACGGCTGCCGAAGGCGGAGGTGGAAAGGGTGTTGAAGGAGAGCAGAGATGAGGCAGAGGCAGCGGAGAGGATTATG
GGGTTGTACATGGCCAAAGCCACAGAAAATGTTTGTCAAAATGGACAGAAGATGGAGAAGGAGAAGGATGTCATCATCAAGCCACGTGAGAAAATGGAGAAGCGAAGACG
TGTAAGTTTCATGGCGACAATAGAAGCAATGCCTCAAATTGAGGTGTTGGGATGGGTTTGTGGGATGAGTTTCGACTCGTCCCCGATCCAACCCGACAGATCTCGTCTCC
CTTACCCGAAACAAACTCAAATCTGCACTTTCACCTCTCCTCCAGCGATCTTCCATGACAGCATTTTGGTTGTGGATTCGTGGTGCTAA
Protein sequence
MGNTFGLKKTAKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSEINMSAKDRLESLMLSRRSASD
LTIMKPKSVLAEEGSGEKEGSEATRVKVRLPKAEVERVLKESRDEAEAAERIMGLYMAKATENVCQNGQKMEKEKDVIIKPREKMEKRRRVSFMATIEAMPQIEVLGWVC
GMSFDSSPIQPDRSRLPYPKQTQICTFTSPPAIFHDSILVVDSWC
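
The relationship between the CDS and protein sequences listed above can be spot-checked by translating a stretch of the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code; the `translate` helper and the compact codon-table construction are illustrative and not taken from the database. Only the first ten codons and the last four codons of the CDS are used here.

```python
# Standard genetic code, laid out in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence wrapped over
    several lines can be pasted in directly. Codons containing characters
    other than A, C, G, T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS sequence listed above.
print(translate("ATGGGAAACACCTTTGGACTCAAGAAGACG"))
# Last four codons of the CDS sequence listed above (ends in a stop codon).
print(translate("TCGTGGTGCTAA"))
```

Passing the full CDS, with its line breaks left in, should let the result be compared against the complete protein sequence in the same way.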