; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh09G005010 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh09G005010
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionProtein of unknown function (DUF1218)
Genome locationCma_Chr09:2264374..2269817
RNA-Seq ExpressionCmaCh09G005010
SyntenyCmaCh09G005010
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CBI20822.3 unnamed protein product, partial [Vitis vinifera]9.9e-12155.61Show/hide
Query:  MAVSVKIMSLTVATLGVISFIFGVVAENKKPASGTPIPGKGIVICKYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSAPRGAMFKSSSFSIFFNI
        MAV++K MS+ V  LGV SFIFG+VAENKKP SGTP  G G++ICKY +D T+ LG+LS  FL ASS AG  S+FYP++GKS P  A+F++++F+IFFNI
Subjt:  MAVSVKIMSLTVATLGVISFIFGVVAENKKPASGTPIPGKGIVICKYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSAPRGAMFKSSSFSIFFNI

Query:  ALFTTGLAITLLVWPTVTEQIHLTRNVHQNLETACPTAKTGLLGGGAFLSLDSSLFWLVALMLAENAREDYFDEIEEKGASVFAEIRSGSGSGNESDFPV
         L   G+A   ++WPT+TE  H   N+H NL+  CPTAKTGL+GGG  LS  SSLFWL++LML +N REDYF+E++ +   + +   S S          
Subjt:  ALFTTGLAITLLVWPTVTEQIHLTRNVHQNLETACPTAKTGLLGGGAFLSLDSSLFWLVALMLAENAREDYFDEIEEKGASVFAEIRSGSGSGNESDFPV

Query:  LLHIFLRRSILCLVKFI-LGSMAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQ
                    + +F+  G MA ++K MS+ V TLG +SF+FGV+AENKKP SGT    +G+VIC+Y +DPTV LGYLSVAFL+AS++AGY SLFYPY+
Subjt:  LLHIFLRRSILCLVKFI-LGSMAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQ

Query:  GKSVPRGAMFKSSTFSIFFNIALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLVTLMLAGNAREDYFDEIEEN-
        G+S+P+ A+F+S++F +FFNIAL   GLA  +LLWPT+ EQ+HL RNVHHNL   CPTAKTGLLGGGAF++LD+SLFWLV+LMLA N REDYF+ +E++ 
Subjt:  GKSVPRGAMFKSSTFSIFFNIALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLVTLMLAGNAREDYFDEIEEN-

Query:  -GDSAEALKN
         G+ +E L N
Subjt:  -GDSAEALKN

EXB60451.1 hypothetical protein L484_014904 [Morus notabilis]1.2e-12358.19Show/hide
Query:  MAVSVKIMSLTVATLGVISFIFGVVAENKKPASGTPIPGKGIVICKYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSAPRGAMFKSSSFSIFFNI
        MA +   M+L V  LG ISF+FG++AENKK  SGTPI GK +VIC YP++P VALGYLS AFL+AS++ GY SLFYPY+GKS P+  +FK++SF +FFNI
Subjt:  MAVSVKIMSLTVATLGVISFIFGVVAENKKPASGTPIPGKGIVICKYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSAPRGAMFKSSSFSIFFNI

Query:  ALFTTGLAITLLVWPTVTEQIHLTRNVHQNLETACPTAKTGLLGGGAFLSLDSSLFWLVALMLAENAREDYFDEIEE-----------------------
        ALFT GL   L +WPT  EQ H    V+ +  + CPTAKTGLLGG AF+SLDSSLFWLVALMLA N R+DYF + E                        
Subjt:  ALFTTGLAITLLVWPTVTEQIHLTRNVHQNLETACPTAKTGLLGGGAFLSLDSSLFWLVALMLAENAREDYFDEIEE-----------------------

Query:  KGASVFAEIRSGSGSGNESDFPVLLHIFLRRSILCLVKFILGSMAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGY
        + A+ + E    S      D P  L +  +   LCL    L +MAV+VK M+L VT LGV+SF+FGV+AENKKPAS TP+ GK +VIC+Y +D TV  GY
Subjt:  KGASVFAEIRSGSGSGNESDFPVLLHIFLRRSILCLVKFILGSMAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGY

Query:  LSVAFLLASSIAGYFSLFYPYQGKSVPRGAMFKSSTFSIFFNIALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFW
        LSV FL ASS  G  SLFYPY+GK VP GA F+SS F +FFNIALFT+GLA  LLLWPT+ EQ H+ RNVH NL+  CPTAKTGLLGGGAF+SLD+SLFW
Subjt:  LSVAFLLASSIAGYFSLFYPYQGKSVPRGAMFKSSTFSIFFNIALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFW

Query:  LVTLMLAGNAREDYFDEIEEN
        LV LMLA NAREDYF+E+E +
Subjt:  LVTLMLAGNAREDYFDEIEEN

KAG8375297.1 hypothetical protein BUALT_Bualt10G0085700 [Buddleja alternifolia]3.3e-10852.9Show/hide
Query:  MAVSVKIMSLTVATLGVISFIFGVVAENKKPASGTPIPGKGIVICKYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSAPRGAMFKSSSFSIFFNI
        MA++ K  ++ VA LG+ SF+FG++AE KKP +G  I GKG+VICKYP+DP++  GY+S A L  ++IAG+ SLFYPY+GKS P  A+F S     +  +
Subjt:  MAVSVKIMSLTVATLGVISFIFGVVAENKKPASGTPIPGKGIVICKYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSAPRGAMFKSSSFSIFFNI

Query:  ALFTTGLAITLLVWPTVTEQIHLTRNVHQNLETACPTAKTGLLGGGAFLSLDSSLFWLVALMLAENAREDYFDEIEEKGASVFAEIRSGSGSGNESDFPV
        +L T+  A   L+WPTV E +H TRNVH NLE  CPTAKTG+ GGGAF+SL+SSLFWL+  + A                                    
Subjt:  ALFTTGLAITLLVWPTVTEQIHLTRNVHQNLETACPTAKTGLLGGGAFLSLDSSLFWLVALMLAENAREDYFDEIEEKGASVFAEIRSGSGSGNESDFPV

Query:  LLHIFLRRSILCLVKFILGSMAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQG
                           +M +++K MS+ V TLG++SF+FG++AENKKPASGT I GKG+VIC+YP+DPTVALGYLS AFL+ +++AG +SLFYPY+G
Subjt:  LLHIFLRRSILCLVKFILGSMAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQG

Query:  KSVPRGAMFKSSTFSIFFNIALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLVTLMLAGNAREDYFDEIEE
        KSVP GA+F+++ F +FFNIAL TTGLA ++LLWPTV EQ H   N+HHNLE  CPTAKTGLLGGGAFLSLDS LFWLV LMLA NAR+DYF++ ++
Subjt:  KSVPRGAMFKSSTFSIFFNIALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLVTLMLAGNAREDYFDEIEE

XP_022937371.1 uncharacterized protein LOC111443679 [Cucurbita moschata]3.5e-9496.83Show/hide
Query:  MAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSVPRGAMFKSSTFSIFFNI
        MAVSVKIMSLTV TLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSVPRGAMFKSS+FSIFFNI
Subjt:  MAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSVPRGAMFKSSTFSIFFNI

Query:  ALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLVTLMLAGNAREDYFDEIEENGDSAEALKNNA
        ALFTTGLAITLL+WPTV EQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLV LMLAGNAREDYFDEIEENGDS EALKNNA
Subjt:  ALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLVTLMLAGNAREDYFDEIEENGDSAEALKNNA

XP_022976207.1 uncharacterized protein LOC111476666 [Cucurbita maxima]3.4e-97100Show/hide
Query:  MAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSVPRGAMFKSSTFSIFFNI
        MAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSVPRGAMFKSSTFSIFFNI
Subjt:  MAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSVPRGAMFKSSTFSIFFNI

Query:  ALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLVTLMLAGNAREDYFDEIEENGDSAEALKNNA
        ALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLVTLMLAGNAREDYFDEIEENGDSAEALKNNA
Subjt:  ALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLVTLMLAGNAREDYFDEIEENGDSAEALKNNA

TrEMBL top hitse value%identityAlignment
A0A6J1FA61 uncharacterized protein LOC1114436791.7e-9496.83Show/hide
Query:  MAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSVPRGAMFKSSTFSIFFNI
        MAVSVKIMSLTV TLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSVPRGAMFKSS+FSIFFNI
Subjt:  MAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSVPRGAMFKSSTFSIFFNI

Query:  ALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLVTLMLAGNAREDYFDEIEENGDSAEALKNNA
        ALFTTGLAITLL+WPTV EQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLV LMLAGNAREDYFDEIEENGDS EALKNNA
Subjt:  ALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLVTLMLAGNAREDYFDEIEENGDSAEALKNNA

A0A6J1IF65 uncharacterized protein LOC1114766661.7e-97100Show/hide
Query:  MAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSVPRGAMFKSSTFSIFFNI
        MAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSVPRGAMFKSSTFSIFFNI
Subjt:  MAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSVPRGAMFKSSTFSIFFNI

Query:  ALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLVTLMLAGNAREDYFDEIEENGDSAEALKNNA
        ALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLVTLMLAGNAREDYFDEIEENGDSAEALKNNA
Subjt:  ALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLVTLMLAGNAREDYFDEIEENGDSAEALKNNA

A0A6J1ILG3 uncharacterized protein LOC1114766676.1e-92100Show/hide
Query:  MAVSVKIMSLTVATLGVISFIFGVVAENKKPASGTPIPGKGIVICKYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSAPRGAMFKSSSFSIFFNI
        MAVSVKIMSLTVATLGVISFIFGVVAENKKPASGTPIPGKGIVICKYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSAPRGAMFKSSSFSIFFNI
Subjt:  MAVSVKIMSLTVATLGVISFIFGVVAENKKPASGTPIPGKGIVICKYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSAPRGAMFKSSSFSIFFNI

Query:  ALFTTGLAITLLVWPTVTEQIHLTRNVHQNLETACPTAKTGLLGGGAFLSLDSSLFWLVALMLAENAREDYFDEIEEKGAS
        ALFTTGLAITLLVWPTVTEQIHLTRNVHQNLETACPTAKTGLLGGGAFLSLDSSLFWLVALMLAENAREDYFDEIEEKGAS
Subjt:  ALFTTGLAITLLVWPTVTEQIHLTRNVHQNLETACPTAKTGLLGGGAFLSLDSSLFWLVALMLAENAREDYFDEIEEKGAS

F6H2X2 Uncharacterized protein3.1e-12055.01Show/hide
Query:  MAVSVKIMSLTVATLGVISFIFGVVAENKKPASGTPIPGKGIVICKYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSAPRGAMFKSSSFSIFFNI
        MAV++K MS+ V  LGV SFIFG+VAENKKP SGTP  G G++ICKY +D T+ LG+LS  FL ASS AG  S+FYP++GKS P  A+F++++F+IFFNI
Subjt:  MAVSVKIMSLTVATLGVISFIFGVVAENKKPASGTPIPGKGIVICKYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSAPRGAMFKSSSFSIFFNI

Query:  ALFTTGLAITLLVWPTVTEQIHLTRNVHQNLETACPTAKTGLLGGGAFLSLDSSLFWLVALMLAENAREDYFDEIEEKGASVFAEIRSGSGSGNESDFPV
         L   G+A   ++WPT+TE  H   N+H NL+  CPTAKTGL+GGG  LS  SSLFWL++LML +N REDYF+E++ +G                     
Subjt:  ALFTTGLAITLLVWPTVTEQIHLTRNVHQNLETACPTAKTGLLGGGAFLSLDSSLFWLVALMLAENAREDYFDEIEEKGASVFAEIRSGSGSGNESDFPV

Query:  LLHIFLRRSILCLVKFILGSMAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQG
                            MA ++K MS+ V TLG +SF+FGV+AENKKP SGT    +G+VIC+Y +DPTV LGYLSVAFL+AS++AGY SLFYPY+G
Subjt:  LLHIFLRRSILCLVKFILGSMAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQG

Query:  KSVPRGAMFKSSTFSIFFNIALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLVTLMLAGNAREDYFDEIEEN--
        +S+P+ A+F+S++F +FFNIAL   GLA  +LLWPT+ EQ+HL RNVHHNL   CPTAKTGLLGGGAF++LD+SLFWLV+LMLA N REDYF+ +E++  
Subjt:  KSVPRGAMFKSSTFSIFFNIALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLVTLMLAGNAREDYFDEIEEN--

Query:  GDSAEALKN
        G+ +E L N
Subjt:  GDSAEALKN

W9QZ35 Uncharacterized protein6.0e-12458.19Show/hide
Query:  MAVSVKIMSLTVATLGVISFIFGVVAENKKPASGTPIPGKGIVICKYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSAPRGAMFKSSSFSIFFNI
        MA +   M+L V  LG ISF+FG++AENKK  SGTPI GK +VIC YP++P VALGYLS AFL+AS++ GY SLFYPY+GKS P+  +FK++SF +FFNI
Subjt:  MAVSVKIMSLTVATLGVISFIFGVVAENKKPASGTPIPGKGIVICKYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSAPRGAMFKSSSFSIFFNI

Query:  ALFTTGLAITLLVWPTVTEQIHLTRNVHQNLETACPTAKTGLLGGGAFLSLDSSLFWLVALMLAENAREDYFDEIEE-----------------------
        ALFT GL   L +WPT  EQ H    V+ +  + CPTAKTGLLGG AF+SLDSSLFWLVALMLA N R+DYF + E                        
Subjt:  ALFTTGLAITLLVWPTVTEQIHLTRNVHQNLETACPTAKTGLLGGGAFLSLDSSLFWLVALMLAENAREDYFDEIEE-----------------------

Query:  KGASVFAEIRSGSGSGNESDFPVLLHIFLRRSILCLVKFILGSMAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGY
        + A+ + E    S      D P  L +  +   LCL    L +MAV+VK M+L VT LGV+SF+FGV+AENKKPAS TP+ GK +VIC+Y +D TV  GY
Subjt:  KGASVFAEIRSGSGSGNESDFPVLLHIFLRRSILCLVKFILGSMAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGY

Query:  LSVAFLLASSIAGYFSLFYPYQGKSVPRGAMFKSSTFSIFFNIALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFW
        LSV FL ASS  G  SLFYPY+GK VP GA F+SS F +FFNIALFT+GLA  LLLWPT+ EQ H+ RNVH NL+  CPTAKTGLLGGGAF+SLD+SLFW
Subjt:  LSVAFLLASSIAGYFSLFYPYQGKSVPRGAMFKSSTFSIFFNIALFTTGLAITLLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFW

Query:  LVTLMLAGNAREDYFDEIEEN
        LV LMLA NAREDYF+E+E +
Subjt:  LVTLMLAGNAREDYFDEIEEN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G31130.1 Protein of unknown function (DUF1218)8.7e-7575.84Show/hide
Query:  MAVSVKIMSLTVATLGVISFIFGVVAENKKPASGTPIPGKGIVICKYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSAPRGAMFKSSSFSIFFNI
        MAVS+K MSL V+ LGV+SF+ GV+AENKKPASGTPI GKG+VICKYP+DPTVALGYLS AFLLA ++AGY SLF  Y+GKS P   +FKS+SFS+FFNI
Subjt:  MAVSVKIMSLTVATLGVISFIFGVVAENKKPASGTPIPGKGIVICKYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSAPRGAMFKSSSFSIFFNI

Query:  ALFTTGLAITLLVWPTVTEQIHLTRNVHQNLETACPTAKTGLLGGGAFLSLDSSLFWLVALMLAENAREDYFDEIEEK
        AL T+GLA++LL+WPT+TEQ+HLTRNVH+NLET+CPTAKTGLLGGGAF+SLDS LFWLVALMLA+NARED+FDE+E +
Subjt:  ALFTTGLAITLLVWPTVTEQIHLTRNVHQNLETACPTAKTGLLGGGAFLSLDSSLFWLVALMLAENAREDYFDEIEEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTGTCTGTTAAAATAATGTCTCTTACTGTCGCAACTCTAGGTGTGATATCCTTTATATTTGGAGTCGTAGCTGAGAACAAGAAGCCTGCATCCGGAACTCCCAT
CCCAGGCAAAGGCATTGTTATCTGTAAATATCCAGCGGACCCAACTGTGGCCCTGGGATATCTTTCAGTTGCGTTTCTTCTTGCTTCTTCAATTGCAGGATACTTCTCTT
TGTTCTATCCTTACCAAGGAAAATCCGCTCCTCGAGGTGCCATGTTTAAGAGCAGCAGTTTCTCTATTTTCTTCAACATTGCCTTGTTCACAACTGGATTGGCTATAACT
TTGCTCGTATGGCCTACAGTCACCGAGCAAATTCACTTGACTCGTAACGTTCATCAGAATCTCGAGACAGCGTGCCCAACCGCTAAGACGGGTCTTCTGGGTGGTGGTGC
ATTTCTATCACTTGATTCATCCCTCTTCTGGTTGGTTGCCCTGATGTTGGCTGAAAATGCTCGAGAGGATTACTTTGATGAAATAGAGGAAAAAGGAGCCAGTGTCTTCG
CCGAAATCCGGAGTGGGAGTGGGAGCGGGAACGAAAGCGATTTTCCTGTTCTCTTGCACATTTTTCTTCGTCGTTCAATTCTGTGTCTGGTAAAATTTATCTTGGGATCA
ATGGCCGTGTCTGTCAAGATAATGTCTCTTACTGTCACAACTCTAGGTGTGATATCCTTTATATTTGGAGTCGTAGCTGAGAACAAGAAGCCTGCATCCGGAACTCCCAT
CCCAGGCAAAGGCATTGTTATCTGTCAATATCCAGCGGACCCAACTGTGGCCCTGGGATATCTTTCCGTTGCGTTTCTTCTTGCTTCTTCAATTGCAGGATACTTCTCTT
TGTTCTATCCTTACCAAGGAAAATCCGTTCCTCGAGGTGCGATGTTTAAGAGCAGCACTTTCTCCATTTTCTTCAACATTGCCCTGTTCACAACTGGATTGGCTATAACT
TTGCTTCTATGGCCTACAGTCATCGAGCAAATTCACTTGACTCGCAACGTTCATCACAATCTCGAGGCAGCATGCCCGACCGCTAAGACTGGTCTTCTGGGTGGTGGTGC
CTTTCTATCCCTTGATTCATCCCTCTTCTGGTTGGTTACCCTGATGTTGGCTGGAAATGCTCGAGAGGACTACTTTGATGAAATTGAGGAAAACGGAGACAGTGCTGAGG
CTCTTAAGAACAACGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGTGTCTGTTAAAATAATGTCTCTTACTGTCGCAACTCTAGGTGTGATATCCTTTATATTTGGAGTCGTAGCTGAGAACAAGAAGCCTGCATCCGGAACTCCCAT
CCCAGGCAAAGGCATTGTTATCTGTAAATATCCAGCGGACCCAACTGTGGCCCTGGGATATCTTTCAGTTGCGTTTCTTCTTGCTTCTTCAATTGCAGGATACTTCTCTT
TGTTCTATCCTTACCAAGGAAAATCCGCTCCTCGAGGTGCCATGTTTAAGAGCAGCAGTTTCTCTATTTTCTTCAACATTGCCTTGTTCACAACTGGATTGGCTATAACT
TTGCTCGTATGGCCTACAGTCACCGAGCAAATTCACTTGACTCGTAACGTTCATCAGAATCTCGAGACAGCGTGCCCAACCGCTAAGACGGGTCTTCTGGGTGGTGGTGC
ATTTCTATCACTTGATTCATCCCTCTTCTGGTTGGTTGCCCTGATGTTGGCTGAAAATGCTCGAGAGGATTACTTTGATGAAATAGAGGAAAAAGGAGCCAGTGTCTTCG
CCGAAATCCGGAGTGGGAGTGGGAGCGGGAACGAAAGCGATTTTCCTGTTCTCTTGCACATTTTTCTTCGTCGTTCAATTCTGTGTCTGGTAAAATTTATCTTGGGATCA
ATGGCCGTGTCTGTCAAGATAATGTCTCTTACTGTCACAACTCTAGGTGTGATATCCTTTATATTTGGAGTCGTAGCTGAGAACAAGAAGCCTGCATCCGGAACTCCCAT
CCCAGGCAAAGGCATTGTTATCTGTCAATATCCAGCGGACCCAACTGTGGCCCTGGGATATCTTTCCGTTGCGTTTCTTCTTGCTTCTTCAATTGCAGGATACTTCTCTT
TGTTCTATCCTTACCAAGGAAAATCCGTTCCTCGAGGTGCGATGTTTAAGAGCAGCACTTTCTCCATTTTCTTCAACATTGCCCTGTTCACAACTGGATTGGCTATAACT
TTGCTTCTATGGCCTACAGTCATCGAGCAAATTCACTTGACTCGCAACGTTCATCACAATCTCGAGGCAGCATGCCCGACCGCTAAGACTGGTCTTCTGGGTGGTGGTGC
CTTTCTATCCCTTGATTCATCCCTCTTCTGGTTGGTTACCCTGATGTTGGCTGGAAATGCTCGAGAGGACTACTTTGATGAAATTGAGGAAAACGGAGACAGTGCTGAGG
CTCTTAAGAACAACGCATGAAATTTCTTTGTTCGGTGAGAATAATAATTCAGTTTCAGTTTACTGCACTGTAATTGACTATTAAGTAAGGACTTAATTGTGAATATGATA
TTTCAAGGAATGTTCATCAATGAAATTATATCATGATAGTATGAGTTTTTATTTTTCATTTTCAACCTCGAC
Protein sequenceShow/hide protein sequence
MAVSVKIMSLTVATLGVISFIFGVVAENKKPASGTPIPGKGIVICKYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSAPRGAMFKSSSFSIFFNIALFTTGLAIT
LLVWPTVTEQIHLTRNVHQNLETACPTAKTGLLGGGAFLSLDSSLFWLVALMLAENAREDYFDEIEEKGASVFAEIRSGSGSGNESDFPVLLHIFLRRSILCLVKFILGS
MAVSVKIMSLTVTTLGVISFIFGVVAENKKPASGTPIPGKGIVICQYPADPTVALGYLSVAFLLASSIAGYFSLFYPYQGKSVPRGAMFKSSTFSIFFNIALFTTGLAIT
LLLWPTVIEQIHLTRNVHHNLEAACPTAKTGLLGGGAFLSLDSSLFWLVTLMLAGNAREDYFDEIEENGDSAEALKNNA