; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC02G031070 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC02G031070
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
Descriptionlate embryogenesis abundant protein-like
Genome locationCiama_Chr02:5174276..5176918
RNA-Seq ExpressionCaUC02G031070
SyntenyCaUC02G031070
Gene Ontology termsGO:0009415 - response to water (biological process)
InterPro domainsIPR000167 - Dehydrin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572053.1 Embryogenic cell protein 40, partial [Cucurbita argyrosperma subsp. sororia]1.7e-4056.36Show/hide
Query:  MRMAKMADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSR
        MRMAKMADI+DE GNPI+LTDEHGNPVVLTDE GNP+ L+GVATKVG TLGSL+FG G   EDG                      GGHG   DA+ SS 
Subjt:  MRMAKMADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSR

Query:  GGYGDGEQSLPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTTTKIVSST
        GG GDGEQ L P  + EDGGS  HV   TSGSS             SEEEQ+E+   KKKKKKGLTQKIKEKL GGKH++EQ  AS PPTT      ++T
Subjt:  GGYGDGEQSLPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTTTKIVSST

Query:  TTATKTDFPTTTERTHQGES
        TTA     P TT + H GE+
Subjt:  TTATKTDFPTTTERTHQGES

KAG7011719.1 Embryogenic cell protein 40 [Cucurbita argyrosperma subsp. argyrosperma]5.7e-4155.91Show/hide
Query:  MRMAKMADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSR
        MRMAKMADI+DE GNPI+LTDEHGNPVVLTDE GNP+ L+GVATKVG TLGSL+FG G   EDG                      GGHG   DA+ SS 
Subjt:  MRMAKMADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSR

Query:  GGYGDGEQSLPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTTTKIVSST
        GG GDGEQ L P  + EDGGS  HV   TSGSS              EEEQ+E+K  KKKKKKGLTQKIKEKL GGKH++EQ  AS PP          T
Subjt:  GGYGDGEQSLPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTTTKIVSST

Query:  TTATKTDFPTTTERTHQGES
        T AT T  P TT + H GE+
Subjt:  TTATKTDFPTTTERTHQGES

XP_022953104.1 late embryogenesis abundant protein-like [Cucurbita moschata]4.2e-3653.33Show/hide
Query:  RDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSRGGYGDGEQS
        +DE GNPI+L DEHGNPVVLTDE GNP+ L+G+ATKVG TLGSL+FG G   EDG                      GGHG   DA+ SS GG GDGEQ 
Subjt:  RDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSRGGYGDGEQS

Query:  LPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTTTKIVSSTTTATKTDFP
        L P  + EDGGS  HV   TSGSS              EEEQ+E++ +KKKKKKGL QKIKEKL GGKH++EQ  AS PPTTT      +TTTA+    P
Subjt:  LPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTTTKIVSSTTTATKTDFP

Query:  TTTERTHQGE
         TT + H GE
Subjt:  TTTERTHQGE

XP_038887694.1 late embryogenesis abundant protein-like isoform X1 [Benincasa hispida]5.5e-5265.82Show/hide
Query:  MRMAKMADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSR
        M+MAKMADIRDE GNPIRLTDE GNPV+LTDE GNPMWLTGVATKVG TLGSL+FG                        GGG  DGGHGC SDA+ SS 
Subjt:  MRMAKMADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSR

Query:  GGYGDGEQSLPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTTTKI
        GGYGD EQ LPPH +DEDGGS+ HVR T+SGSS            LSEEEQNERK E KKKKKGLTQKIKEKLRGGKHK+EQ +ASP PTTT  ++
Subjt:  GGYGDGEQSLPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTTTKI

XP_038887695.1 embryogenic cell protein 40-like isoform X2 [Benincasa hispida]7.2e-3665.19Show/hide
Query:  MRMAKMADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSR
        M+MAKMADIRDE GNPIRLTDE GNPV+LTDE GNPMWLTGVATKVG TLGSL+FG                        GGG  DGGHGC SDA+ SS 
Subjt:  MRMAKMADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSR

Query:  GGYGDGEQSLPPHGKDEDGGSSAHVRFTTSGSSSV
        GGYGD EQ LPPH +DEDGGS+ HVR T+SGSS +
Subjt:  GGYGDGEQSLPPHGKDEDGGSSAHVRFTTSGSSSV

TrEMBL top hitse value%identityAlignment
A0A6J1C1Y9 late embryogenesis abundant protein-like1.1e-2644.4Show/hide
Query:  KMADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSRGGYG
        KMADIRDE GNPI LTDE GNPVVLTDE GNPM LTGVATK+GPTLGSL+     SG D G                                   GG+G
Subjt:  KMADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSRGGYG

Query:  DGEQSLPPHGKDEDGGSSAHVRFTTSGSSSV------------LIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTT
        DGEQ L PH   +DGG    VR TTSGSSS               +  E Y    +E   +++ +K +KKKG TQKIKEKL G +HK+EQ H   P TTT
Subjt:  DGEQSLPPHGKDEDGGSSAHVRFTTSGSSSV------------LIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTT

Query:  TTKIVSSTTTATKTDFPTTTERTHQGESDYYN
         T+      TA  T     TE   +G  +  N
Subjt:  TTKIVSSTTTATKTDFPTTTERTHQGESDYYN

A0A6J1GNP3 late embryogenesis abundant protein-like2.0e-3653.33Show/hide
Query:  RDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSRGGYGDGEQS
        +DE GNPI+L DEHGNPVVLTDE GNP+ L+G+ATKVG TLGSL+FG G   EDG                      GGHG   DA+ SS GG GDGEQ 
Subjt:  RDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSRGGYGDGEQS

Query:  LPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTTTKIVSSTTTATKTDFP
        L P  + EDGGS  HV   TSGSS              EEEQ+E++ +KKKKKKGL QKIKEKL GGKH++EQ  AS PPTTT      +TTTA+    P
Subjt:  LPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTTTKIVSSTTTATKTDFP

Query:  TTTERTHQGE
         TT + H GE
Subjt:  TTTERTHQGE

A0A6J1IJR5 late embryogenesis abundant protein-like3.2e-2949.77Show/hide
Query:  RDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSRGGYGDGEQS
        +DE GNPI+ TDEHGNPVVLTDE GNP+   GVATKVG TLGSL+FG G  GEDG                      GGHG  SDAK SS GG G  EQ 
Subjt:  RDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSRGGYGDGEQS

Query:  LPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTT-TTTKIVSSTTTATKTDF
        L P  ++EDGGS  HV   TSGSS             SEEEQ      +K+KKKGLTQKIKEKL GGKHK+EQ   S PPTT        +   A  T F
Subjt:  LPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTT-TTTKIVSSTTTATKTDF

Query:  PTTTERTHQGESDYY
        P+  +  +   SD++
Subjt:  PTTTERTHQGESDYY

M5VP39 Uncharacterized protein4.9e-1439.73Show/hide
Query:  MRMAKMADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVAT----KVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAK
        ++  +MA IRDE GN ++LTDEHGNPV LTDE GNPM LTGVAT    + G   GS V  + GSG  GG+ + G  L       GG G   G G   D +
Subjt:  MRMAKMADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVAT----KVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAK

Query:  VSSR-----GGYGDGEQSLPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKD---EQTHASPP
         S +     GG G+  Q  P      DGG +   R + S SSS            S E+  +     ++KKKGL +KIKEKL GGKHKD   +Q +    
Subjt:  VSSR-----GGYGDGEQSLPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKD---EQTHASPP

Query:  PTTTTTKIVSSTTTATKTD
          T T  + ++ TT   T+
Subjt:  PTTTTTKIVSSTTTATKTD

V9M5C3 Dehydrin protein4.4e-1540.96Show/hide
Query:  MADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSRGGYGD
        MAD+RDE GNPI+LTDEHGNPV LTDE GNP+ +TGVAT   PTLG+L+       E    G +  + G+     G  G++  H                
Subjt:  MADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSRGGYGD

Query:  GEQSLPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTT
         E+  P     E  GS    R +TS SSS            SE++        ++KKKGL +KIKEKL GGKHK+EQ H +   T TT
Subjt:  GEQSLPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTT

SwissProt top hitse value%identityAlignment
P21298 Late embryogenesis abundant protein8.3e-1135.18Show/hide
Query:  MADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSRGGYGD
        MAD++DE GNPI LTD +GNPV L+DE GNPM +TGVA+       S+           G+    PT               G   G+ A  ++  G   
Subjt:  MADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSRGGYGD

Query:  GEQSLPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTTTKIVSSTTTAT
         E +    G++  G    H+R + S SSS                 +E   +  ++KK +  KIK+KL GGKHKDEQT     PTT TT   ++TTT T
Subjt:  GEQSLPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTTTKIVSSTTTAT

Q07322 Embryogenic cell protein 406.4e-1131.6Show/hide
Query:  MADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDG-----------------------------------------
        MAD+RDE GNPI+LTD+HGNPV LTDE GNP+ +TGVAT  G T G    G+GG+   G                                         
Subjt:  MADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDG-----------------------------------------

Query:  ----------------GHGKVGPTLGSLMFGSGG---------GGEDGG-------HGCGSDAKVSSRGGYGDGEQSLPPHGKDEDGGSSAHVRFTTSGS
                        G G  G T G    G GG         GG  GG       HG G     ++ GG G G+  L        G  +     +   S
Subjt:  ----------------GHGKVGPTLGSLMFGSGG---------GGEDGG-------HGCGSDAKVSSRGGYGDGEQSLPPHGKDEDGGSSAHVRFTTSGS

Query:  SSVLIQCCEVYTKLSEE--------EQNERKSEKKKKKKGLTQKIKEKLRGGKH-KDEQTHASPPPTTT
        +       E  T L E+          +E   +  ++KKG T KIKEKL GGKH KDE T  +   TTT
Subjt:  SSVLIQCCEVYTKLSEE--------EQNERKSEKKKKKKGLTQKIKEKLRGGKH-KDEQTHASPPPTTT

Q96261 Probable dehydrin LEA1.7e-0832.86Show/hide
Query:  MADIRDELGNPIRLTDEHGNPVV-LTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSRGGYG
        MAD+RDE GNPI LTD  GNP+V LTDE GNPM+LTGV +       S    I         G+  P       G+       G   G+ A  +      
Subjt:  MADIRDELGNPIRLTDEHGNPVV-LTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSRGGYG

Query:  DGEQSLPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTTTKIVSSTTTAT
                 G+   G    H+R + S SSS                 +E   +  ++KK + +KIKEK   GKHKDEQT    P T TTT          
Subjt:  DGEQSLPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTTTKIVSSTTTAT

Query:  KTDFPTTTERTHQ
            P TT++ H+
Subjt:  KTDFPTTTERTHQ

Arabidopsis top hitse value%identityAlignment
AT2G21490.1 dehydrin LEA1.2e-0932.86Show/hide
Query:  MADIRDELGNPIRLTDEHGNPVV-LTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSRGGYG
        MAD+RDE GNPI LTD  GNP+V LTDE GNPM+LTGV +       S    I         G+  P       G+       G   G+ A  +      
Subjt:  MADIRDELGNPIRLTDEHGNPVV-LTDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSRGGYG

Query:  DGEQSLPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTTTKIVSSTTTAT
                 G+   G    H+R + S SSS                 +E   +  ++KK + +KIKEK   GKHKDEQT    P T TTT          
Subjt:  DGEQSLPPHGKDEDGGSSAHVRFTTSGSSSVLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTTTKIVSSTTTAT

Query:  KTDFPTTTERTHQ
            P TT++ H+
Subjt:  KTDFPTTTERTHQ

AT4G39130.1 Dehydrin family protein8.0e-0968.29Show/hide
Query:  MADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKV
        MAD++DE GNPI LTD HG P  L DE GN M LTGVAT V
Subjt:  MADIRDELGNPIRLTDEHGNPVVLTDERGNPMWLTGVATKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTCTGATAGACTGCCTCTGATTTTTTACACAAGCAACACATCATATTCTCTGACCATGGGTGGCTATCGTTTTGCCGCTGCAACCAACAAGGAAACATCA
GGTACGATCAGTGGACTCCAATCTTTTCTCAATTACAGGCTTTTCTTGATTCTCATTGGACTGCTTAGACAGGTACTTAGCTTCTGCTTTCATGGTTATTTTTTA
TGTTTGGCTGTGGGAGAGATGAGAATGGCGAAAATGGCTGATATACGGGACGAGCTTGGCAACCCCATCCGACTCACTGACGAGCATGGCAACCCGGTTGTGCTG
ACTGATGAACGTGGCAACCCCATGTGGCTCACCGGCGTCGCAACAAAGGTCGGCCCGACACTCGGGTCACTGGTGTTTGGCATTGGTGGTAGTGGTGAGGATGGT
GGCCATGGCAAGGTTGGCCCGACGCTCGGGTCACTGATGTTTGGCAGTGGTGGTGGTGGTGAGGATGGTGGCCATGGCTGTGGTTCCGATGCTAAAGTTAGTTCA
AGAGGTGGCTATGGCGACGGCGAGCAGTCGCTGCCGCCGCATGGTAAGGATGAAGATGGTGGCTCTAGCGCCCATGTTCGCTTTACCACTTCAGGCTCTAGTTCG
GTGTTGATTCAATGTTGCGAAGTTTACACTAAACTTTCTGAGGAAGAACAAAATGAGAGGAAGAGTGAGAAGAAGAAGAAGAAGAAAGGACTGACTCAGAAAATA
AAGGAGAAACTAAGAGGAGGGAAACATAAAGATGAGCAGACTCATGCTTCTCCCCCGCCAACCACCACGACTACCAAAATCGTCTCTTCGACCACCACGGCTACA
AAAACCGACTTTCCGACCACCACGGAAAGAACTCATCAAGGAGAATCTGATTATTATAATGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATTCTGATAGACTGCCTCTGATTTTTTACACAAGCAACACATCATATTCTCTGACCATGGGTGGCTATCGTTTTGCCGCTGCAACCAACAAGGAAACATCA
GGTACGATCAGTGGACTCCAATCTTTTCTCAATTACAGGCTTTTCTTGATTCTCATTGGACTGCTTAGACAGGTACTTAGCTTCTGCTTTCATGGTTATTTTTTA
TGTTTGGCTGTGGGAGAGATGAGAATGGCGAAAATGGCTGATATACGGGACGAGCTTGGCAACCCCATCCGACTCACTGACGAGCATGGCAACCCGGTTGTGCTG
ACTGATGAACGTGGCAACCCCATGTGGCTCACCGGCGTCGCAACAAAGGTCGGCCCGACACTCGGGTCACTGGTGTTTGGCATTGGTGGTAGTGGTGAGGATGGT
GGCCATGGCAAGGTTGGCCCGACGCTCGGGTCACTGATGTTTGGCAGTGGTGGTGGTGGTGAGGATGGTGGCCATGGCTGTGGTTCCGATGCTAAAGTTAGTTCA
AGAGGTGGCTATGGCGACGGCGAGCAGTCGCTGCCGCCGCATGGTAAGGATGAAGATGGTGGCTCTAGCGCCCATGTTCGCTTTACCACTTCAGGCTCTAGTTCG
GTGTTGATTCAATGTTGCGAAGTTTACACTAAACTTTCTGAGGAAGAACAAAATGAGAGGAAGAGTGAGAAGAAGAAGAAGAAGAAAGGACTGACTCAGAAAATA
AAGGAGAAACTAAGAGGAGGGAAACATAAAGATGAGCAGACTCATGCTTCTCCCCCGCCAACCACCACGACTACCAAAATCGTCTCTTCGACCACCACGGCTACA
AAAACCGACTTTCCGACCACCACGGAAAGAACTCATCAAGGAGAATCTGATTATTATAATGTGTGA
Protein sequenceShow/hide protein sequence
MNSDRLPLIFYTSNTSYSLTMGGYRFAAATNKETSGTISGLQSFLNYRLFLILIGLLRQVLSFCFHGYFLCLAVGEMRMAKMADIRDELGNPIRLTDEHGNPVVL
TDERGNPMWLTGVATKVGPTLGSLVFGIGGSGEDGGHGKVGPTLGSLMFGSGGGGEDGGHGCGSDAKVSSRGGYGDGEQSLPPHGKDEDGGSSAHVRFTTSGSSS
VLIQCCEVYTKLSEEEQNERKSEKKKKKKGLTQKIKEKLRGGKHKDEQTHASPPPTTTTTKIVSSTTTATKTDFPTTTERTHQGESDYYNV