CuGenDBv2

Sed0027072 (gene) of Chayote v1 genome

Gene ID: Sed0027072
Organism: Sechium edule (Chayote v1)
Description: Unknown protein
Genome location: LG01:67070984..67074102
RNA-Seq Expression: Sed0027072
Synteny: Sed0027072
Gene Ontology terms: GO:0005737 - cytoplasm (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
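
The genome location above follows a `scaffold:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not say so), the span of the gene model could be computed as follows. `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a string such as 'LG01:67070984..67074102' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return abs(end - start) + 1


scaffold, start, end = parse_location("LG01:67070984..67074102")
print(scaffold, start, end, span_length(start, end))  # LG01 67070984 67074102 3119
```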


Homology
GenBank top hits | e value | % identity
KAG6605603.1 hypothetical protein SDJN03_02920, partial [Cucurbita argyrosperma subsp. sororia] | 1.7e-96 | 72.41
Query:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPS--SRKAAVYGSRDRVRASEGVQFQ
        MCRSTDYR RL GDRLKIKAFFVRFSHL+SL  PP +SLTL YLPR+DET+LEI G +IRPDSPAFVTLHRVVSPS   RK  V+GSR+RVRASEGVQFQ
Subjt:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPS--SRKAAVYGSRDRVRASEGVQFQ

Query:  VYLREENVVQGIFRRSDEGEWRLECRCALESEIAGS-TSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGVDED--
        VYLREE VVQGIFRR D+GEWRLEC+CALES+IA S  +AVE+CVD+EGEVAMFEKV L+VRR++++RGFC +E IPE REVDGDCDGC CCC VD D  
Subjt:  VYLREENVVQGIFRRSDEGEWRLECRCALESEIAGS-TSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGVDED--

Query:  --------------GGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF
                      GGET    EVEG  W +DLGIWAVCLGVG+LVSKAA+SKTL+RK IF
Subjt:  --------------GGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF

XP_004143843.1 uncharacterized protein LOC101203014 [Cucumis sativus] | 2.6e-97 | 74.52
Query:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPSSR--KAAVYGSRDRVRASEGVQFQ
        MCRSTDYR   A DRLKIKAFFVRFSHL  LD PPPESLTL YLPR+DET+LEI G +IRPDSPAFVTLHRVVSPSSR  K   +GSR+RVRASEGVQFQ
Subjt:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPSSR--KAAVYGSRDRVRASEGVQFQ

Query:  VYLREENVVQGIFRRSDEGEWRLECRCALESEI--AGSTSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGVD---
        VYLREE VVQGIFR++D+G+WRLECRCALESEI  A + +A EVCVDVEG+ AM EKV LEV RR+++RGFC L  IPEGREVDG C+GC CCCG D   
Subjt:  VYLREENVVQGIFRRSDEGEWRLECRCALESEI--AGSTSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGVD---

Query:  ----------EDGGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF
                  EDGG TEVEMEVEGVRWAVDLGIWAVCLGVGYLVS+ A+SKTL+RK IF
Subjt:  ----------EDGGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF

XP_008437442.1 PREDICTED: uncharacterized protein LOC103482856 [Cucumis melo] | 8.2e-99 | 75
Query:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPSS---RKAAVYGSRDRVRASEGVQF
        MC+STDYR  LA DRLKIKAFFVRFSHL  LD PPPESLTL YLPR+DET+LEI G +IRPDSPAFVTLHRVVSPSS   +K  V+GSR+RVRASEGVQF
Subjt:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPSS---RKAAVYGSRDRVRASEGVQF

Query:  QVYLREENVVQGIFRRSDEGEWRLECRCALESEI--AGSTSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGVD--
        QVYLREE VVQGIFR++D+G+WRLECRCALES+I  A + +A EVCVDVEG+ AMFEKV LEV RR+++RGFC L  IPEGREVDG C+GC CCCG D  
Subjt:  QVYLREENVVQGIFRRSDEGEWRLECRCALESEI--AGSTSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGVD--

Query:  -----------EDGGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF
                   EDGGETEVE+EVEGVRWAVDLGIWAVCLGVGYLVSKAA+SKTL+RK IF
Subjt:  -----------EDGGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF

XP_023533025.1 uncharacterized protein LOC111795033 [Cucurbita pepo subsp. pepo] | 1.4e-95 | 71.6
Query:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPS--SRKAAVYGSRDRVRASEGVQFQ
        MCR+TDYR RL GD LKIKAFFVRFSHL+SL+ PP +SLTL YLPR+DET+LEI G +IRPDSPAFVTLHRV+SPS   RK  V+GSR+RVRASEGVQFQ
Subjt:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPS--SRKAAVYGSRDRVRASEGVQFQ

Query:  VYLREENVVQGIFRRSDEGEWRLECRCALESEIAGS-TSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGVDED--
        VYLREE VVQGIFRR D+GEWRLEC+CALES+IA S  +AVE+CVD+EGEVAMFEKV L+ RR++++RGFC +E IPE REVDGDCDGC CCC VD D  
Subjt:  VYLREENVVQGIFRRSDEGEWRLECRCALESEIAGS-TSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGVDED--

Query:  -------GGETE---VEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF
               GGE +    E E+EG  W +DLGIWAVCLGVG+LVSKAA+SKTL+RK IF
Subjt:  -------GGETE---VEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF

XP_038874855.1 uncharacterized protein LOC120067355 [Benincasa hispida] | 1.7e-96 | 74.03
Query:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPS--SRKAAVYGSRDRVRASEGVQFQ
        MCRSTDYR  L  DRL IKAFFVRFSHL+ L+ P P+SLTL YLPR+DET LEI G +IRPDSPAFVTLHRVVSPS   RK  V+GSR+RVRASEGVQFQ
Subjt:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPS--SRKAAVYGSRDRVRASEGVQFQ

Query:  VYLREENVVQGIFRRSDEGEWRLECRCALESEIAGSTSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVD-GDCDGCACCCGVD----
        V+LREE VV GIFR+ D+GEWRLECRCALES+I G+ +A EVCVDVEG VAMF KV LEV +R+++RGFCGLE IPEGREVD GDC+GC CCCG D    
Subjt:  VYLREENVVQGIFRRSDEGEWRLECRCALESEIAGSTSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVD-GDCDGCACCCGVD----

Query:  ---------EDGGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF
                 EDGGE EVEMEVEGVRWAVDLGIWAVCLGVGYLVS+AA SKTL+RK IF
Subjt:  ---------EDGGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF

TrEMBL top hits | e value | % identity
A0A0A0KQM2 Uncharacterized protein | 1.3e-97 | 74.52
Query:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPSSR--KAAVYGSRDRVRASEGVQFQ
        MCRSTDYR   A DRLKIKAFFVRFSHL  LD PPPESLTL YLPR+DET+LEI G +IRPDSPAFVTLHRVVSPSSR  K   +GSR+RVRASEGVQFQ
Subjt:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPSSR--KAAVYGSRDRVRASEGVQFQ

Query:  VYLREENVVQGIFRRSDEGEWRLECRCALESEI--AGSTSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGVD---
        VYLREE VVQGIFR++D+G+WRLECRCALESEI  A + +A EVCVDVEG+ AM EKV LEV RR+++RGFC L  IPEGREVDG C+GC CCCG D   
Subjt:  VYLREENVVQGIFRRSDEGEWRLECRCALESEI--AGSTSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGVD---

Query:  ----------EDGGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF
                  EDGG TEVEMEVEGVRWAVDLGIWAVCLGVGYLVS+ A+SKTL+RK IF
Subjt:  ----------EDGGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF

A0A1S3AUM0 uncharacterized protein LOC103482856 | 4.0e-99 | 75
Query:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPSS---RKAAVYGSRDRVRASEGVQF
        MC+STDYR  LA DRLKIKAFFVRFSHL  LD PPPESLTL YLPR+DET+LEI G +IRPDSPAFVTLHRVVSPSS   +K  V+GSR+RVRASEGVQF
Subjt:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPSS---RKAAVYGSRDRVRASEGVQF

Query:  QVYLREENVVQGIFRRSDEGEWRLECRCALESEI--AGSTSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGVD--
        QVYLREE VVQGIFR++D+G+WRLECRCALES+I  A + +A EVCVDVEG+ AMFEKV LEV RR+++RGFC L  IPEGREVDG C+GC CCCG D  
Subjt:  QVYLREENVVQGIFRRSDEGEWRLECRCALESEI--AGSTSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGVD--

Query:  -----------EDGGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF
                   EDGGETEVE+EVEGVRWAVDLGIWAVCLGVGYLVSKAA+SKTL+RK IF
Subjt:  -----------EDGGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF

A0A5A7THB4 Uncharacterized protein | 4.0e-99 | 75
Query:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPSS---RKAAVYGSRDRVRASEGVQF
        MC+STDYR  LA DRLKIKAFFVRFSHL  LD PPPESLTL YLPR+DET+LEI G +IRPDSPAFVTLHRVVSPSS   +K  V+GSR+RVRASEGVQF
Subjt:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPSS---RKAAVYGSRDRVRASEGVQF

Query:  QVYLREENVVQGIFRRSDEGEWRLECRCALESEI--AGSTSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGVD--
        QVYLREE VVQGIFR++D+G+WRLECRCALES+I  A + +A EVCVDVEG+ AMFEKV LEV RR+++RGFC L  IPEGREVDG C+GC CCCG D  
Subjt:  QVYLREENVVQGIFRRSDEGEWRLECRCALESEI--AGSTSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGVD--

Query:  -----------EDGGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF
                   EDGGETEVE+EVEGVRWAVDLGIWAVCLGVGYLVSKAA+SKTL+RK IF
Subjt:  -----------EDGGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF

A0A6J1EQA2 uncharacterized protein LOC111435644 | 9.1e-96 | 73.52
Query:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPSS--RKAAVYGSRDRVRASEGVQFQ
        MCRSTDYR+ L GDRLKIKAFFVRFSHL++LD PPPESLTL +LPR+DE  LEI G +IRPDSPAFV+LHRV+ PSS  RK  V+GSR+RVRASEG QF 
Subjt:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPSS--RKAAVYGSRDRVRASEGVQFQ

Query:  VYLREENVVQGIFRRSDEGEWRLECRCALESEIAGSTSA-VEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGV-----
        V LRE+ VVQG+FR+SD+GEWR++C+CALES IAG   A  EVCVD+EGEVAMFEKV LEV RRR++RGFCGLE IPEGREVDGDCDGC CCCG      
Subjt:  VYLREENVVQGIFRRSDEGEWRLECRCALESEIAGSTSA-VEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGV-----

Query:  ---DEDGGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF
           DEDGGE  VEMEVEG +WAVDLGIWAVC GVG LV  AA+SKTL+RK IF
Subjt:  ---DEDGGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF

A0A6J1K4D9 uncharacterized protein LOC111491587 | 2.0e-95 | 70.52
Query:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPS--SRKAAVYGSRDRVRASEGVQFQ
        MCRSTDYR RL GDRLKIKAFFVRFSHL+SL+ P P+SLTL YLPR+DET+LEI G +IRPDSPAFVTLHRVVSPS   RK  V+GSR+RVRASEGVQFQ
Subjt:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPS--SRKAAVYGSRDRVRASEGVQFQ

Query:  VYLREENVVQGIFRRSDEGEWRLECRCALESEIAGS---TSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGV---
        VYLREE VVQGIFRR D+GEWRLEC+CALES+IA S    +AVE+CVD+EGEVAMFEKV L+V RR+++RGFC +E IPE REVDGDCDGC CCC V   
Subjt:  VYLREENVVQGIFRRSDEGEWRLECRCALESEIAGS---TSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGV---

Query:  ------------------DEDGGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF
                          D+DGGET    EVEG  W +DLGIWAVCLGVG+LVSKAA+SKTL+RK IF
Subjt:  ------------------DEDGGETEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTLKRKPIF

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT4G24110.1 unknown protein | 3.1e-56 | 49.8
Query:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPSSRKAAVYGSRDRVRASEGVQFQVY
        MCRS D+   +  + LK+KAFFVRF+ L +     P+SLTL Y PR++E   E+ G +IRPDSPAFVTLHRVV        +YGSR+RVR  EG++F+VY
Subjt:  MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPSSRKAAVYGSRDRVRASEGVQFQVY

Query:  LREENVVQGIFRRSDEGEWRLECRCALESEIAGSTSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDG--CACCCGVDEDGGE
        + EE V++GIFR+ +  +W+LEC C +E E      A EV V  EG VA         RRR+++ GF  LE IPE RE   D DG  C C C  + D GE
Subjt:  LREENVVQGIFRRSDEGEWRLECRCALESEIAGSTSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDG--CACCCGVDEDGGE

Query:  ---------TEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTL
                  E+E E EG+ WAVDLGIW +CLGVGYLVSKA+ +KTL
Subjt:  ---------TEVEMEVEGVRWAVDLGIWAVCLGVGYLVSKAAYSKTL
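
The % identity column in the hit tables is conventionally the number of identical aligned positions divided by the alignment length (gaps included), times 100. In the BLAST-style blocks above, the middle line carries a letter where query and subject match exactly, `+` for a conservative substitution, and a space otherwise. A minimal sketch, assuming that midline convention; `percent_identity` is a hypothetical helper, and the ten-column example is copied from the start of the AT4G24110.1 alignment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity over the aligned columns of one alignment block.

    query   -- the aligned query row (may contain '-' gap characters)
    midline -- the match row: a letter marks an identical position
    """
    if not query:
        raise ValueError("empty alignment")
    # Pad the midline in case trailing spaces were stripped from the dump.
    midline = midline.ljust(len(query))[: len(query)]
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# First ten columns of the Arabidopsis hit: M, C, R, S and D are identical.
print(percent_identity("MCRSTDYRSR", "MCRS D+   "))  # 50.0
```

To reproduce the figure for a whole hit, the wrapped blocks would first need to be concatenated row by row, since the division is over the full alignment length.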


Sequences
CDS sequence
ATGTGCAGGTCTACCGATTACCGGAGCCGCCTCGCCGGCGATCGCCTCAAGATCAAGGCGTTCTTCGTTCGATTCTCTCATCTGGACTCACTCGATCCACCGCCGCCGGA
GTCGCTCACTCTCTGCTACCTCCCTCGCATGGACGAGACCGAGCTCGAGATCGGCGGCTTCGAAATCCGGCCGGACTCGCCGGCTTTCGTCACGCTGCACCGAGTCGTGT
CGCCGTCGTCGAGGAAGGCGGCGGTGTACGGGAGTCGAGATCGGGTACGAGCGAGCGAAGGCGTGCAGTTCCAGGTCTACTTGAGGGAGGAGAATGTGGTGCAGGGGATT
TTCAGGCGGAGCGATGAGGGAGAGTGGAGATTGGAGTGTAGGTGCGCGCTTGAATCTGAGATCGCTGGATCTACATCGGCGGTGGAGGTGTGTGTGGATGTGGAGGGGGA
GGTGGCGATGTTTGAGAAGGTGGCGCTGGAAGTGAGGAGGAGGAGGAGGAGGAGAGGTTTCTGTGGATTGGAGGGGATTCCGGAGGGGCGTGAAGTTGACGGCGACTGCG
ATGGATGTGCTTGTTGTTGTGGAGTGGATGAAGACGGCGGAGAAACGGAGGTGGAAATGGAGGTGGAGGGTGTTCGGTGGGCCGTGGATTTGGGAATTTGGGCCGTCTGT
TTGGGAGTTGGATACTTGGTTTCTAAAGCGGCCTATTCCAAAACATTGAAACGGAAACCAATTTTCTGA
mRNA sequence
GTCGATAAAATAAAAAAAAGAATCAACGGCTCACAATAGAGTTACACACAAAATTGTGTAAACCTCATCACACAGGATTATTCACACCCATCTAACAAACAATGAAGCCA
CGTTTCACGCGTCAAAATACAAAAACGCAAATCCCACACCCCAAATTTGTCCGTTTGTCCACGTCATCATCGAACCCGTGTGCTCGGCACCCCTGGGTTGAACCCTCAGT
CCACGGAAATTTCGCCGCTTCCTCCCATTGTTTAAATATAATAATCTCTCAAACTCACAAATTCAAAATCAAAATTCAAACCGAAATCACCAATGTGCAGGTCTACCGAT
TACCGGAGCCGCCTCGCCGGCGATCGCCTCAAGATCAAGGCGTTCTTCGTTCGATTCTCTCATCTGGACTCACTCGATCCACCGCCGCCGGAGTCGCTCACTCTCTGCTA
CCTCCCTCGCATGGACGAGACCGAGCTCGAGATCGGCGGCTTCGAAATCCGGCCGGACTCGCCGGCTTTCGTCACGCTGCACCGAGTCGTGTCGCCGTCGTCGAGGAAGG
CGGCGGTGTACGGGAGTCGAGATCGGGTACGAGCGAGCGAAGGCGTGCAGTTCCAGGTCTACTTGAGGGAGGAGAATGTGGTGCAGGGGATTTTCAGGCGGAGCGATGAG
GGAGAGTGGAGATTGGAGTGTAGGTGCGCGCTTGAATCTGAGATCGCTGGATCTACATCGGCGGTGGAGGTGTGTGTGGATGTGGAGGGGGAGGTGGCGATGTTTGAGAA
GGTGGCGCTGGAAGTGAGGAGGAGGAGGAGGAGGAGAGGTTTCTGTGGATTGGAGGGGATTCCGGAGGGGCGTGAAGTTGACGGCGACTGCGATGGATGTGCTTGTTGTT
GTGGAGTGGATGAAGACGGCGGAGAAACGGAGGTGGAAATGGAGGTGGAGGGTGTTCGGTGGGCCGTGGATTTGGGAATTTGGGCCGTCTGTTTGGGAGTTGGATACTTG
GTTTCTAAAGCGGCCTATTCCAAAACATTGAAACGGAAACCAATTTTCTGAAGCCCTATAATTTTTATTTTTTTTTGGAATTAATTGTTTATCGAGAATTTTAAAAATAA
TTTAATACTCAATTAAATTAAGGAGAGATTTTGCATGTACAAAACAAGTGGCAAAATATTTATAGTACAAAATCTAAACTAAAAATTCAAATTGAATTATTGATAGAATC
ATTTTATCATGCGTTTTTCTATCGGTGATAAAATTCAACAATTGATATAACTTAATTTAAAATTCTATTAGTTCTAATTTAAAATTCTATTACTAATAGAAAATTCAAGT
GGTGATAGAAGAAGTCTTTTATTGATTAAATTTTATCACTAATAAAATTCAGTTAGTGATAGAATTCAATTTGAATTTTTTTACTACTATAAATGAATTAGTTTTGATTT
TATACCATAGATGAAAATATTTTGTCTTTTTTTTGTACGTATATTAAAAATTTTGTTGTAAATATAATATAAAATGTGTTTGGTAAATGCTCTTAAATTAATTATTTGGG
AGGGATTTCATGACAGATAACTTGCAAGGGGAGATGTTTTTTCGTTACTCTCTTCAAGTCACCATTGATGCTCAAGAGGAAGCCCAAGATTTCACTTGCTTCAAGTGGAT
TAAATTAAATTGATTTTTTTGAGAAAAATAAATAAAATATTCGTTATAAACTAAATTCATAATCTTTTGACATTAGAAAGGGGAAAAGATAAAAGAACGGGTCTAACTGA
CATTAAAGATTAGATGTTACAATTTTGAACATAAACAAAAATGAAGGAATTTTAATTTTTAACTATTCCATAAAATTAGAACACGGCTTATTAGAATATGATAAATCAAA
TGGTGTGAAGTTCCTCGGTATTCCATTTTCATTTACTACTGGAAAAGATTGGAATCTCAAGCTTGGGAAATTCAGCATTGTTGATGAAATATGACCCACATGTCCATTCC
TGCGGAGCTTTGCCAATAAAGAAGCGGAGCTTTGCCCTCACTACAGGGCACCCCTACTCGATAGACAACGAGTGATTTTACAATTAAATTTATTTGTTAATTTTGAACTT
CTCCAAGAAAGCATGTATACCTTAAAAACCATAAGCCTCTCTACCAATGAGGCTGCCTCTTGAAGATAGATTGGTGAATTTATTATTAACAATCTAACATATTTTTACAC
TTAATTAAATATTAGTTTTAGGGTAAACCACAATATATAGGGTAACTACACCCATCCTTAGGTGAGGCACCGGGAGACATCGAAGGAATGGAGGTACGGAAGGTTCTACA
AATTAATGTCACAGGTAAATTTTCAACCAATTACGCCATCTCTTGGGGACAACCACAATATATATGAAGTTAGGACTCCTTGCTAATTATTATTTTGGAATAGAAACAAT
TTCATTGATATAATAAATTTAGGATGTCATAAGGGCAGTTACAAAAGACCTCTTTAATTAGCTAGGAGGGTCATTACTAATTATGAACGAAAAAGATATAAAATTTTATT
CAACTTGCTGAATAAATCAACTATGAATAAAAACTTAAAGTTAAATGATAAAAAACATTGTTACTTTTCTTTATATCTGCCGTGTCTAAATCAATTTACGTGCACCTCAA
TTATTCACACGAGACAATCCTCGACCAATATACATTACTAATTACTAAAAGTGGACAGCTTCAAGAATGAATGTCAAAATCAGTATTAAGGCAGTGCCTTCGTTTAAAGG
CACATCAAGGTCGGCATTTCCAAGGTGCACGCACTAAAGGCATATAGCCACCATTTCCATTCCCCTTTCAATCTCTATTTCTTACTTCTTGTCATTTTCAGTTAAATAAG
AGAACGTTTGGGAGAACAGGCTTTTTTTAATAATTTTTTTAAGAACTATCTTGAATTTTACGAGTAGTGAAAGAGAGTTTTGAGAATCCAAAAGATGATGGATTCAAATA
ATTTCTTATGATTTTCTTGATATTCAGATGTTGTAAAGTCAAATAATATGTCTTATAAGAGTCGAGATGCTCATAATATGGTCTGGACACTCACAACTATCAAGCGCATC
ATATTTTTGGATATGTTTGGCAGCTCTCCTTCCTTCTCT
Protein sequence
MCRSTDYRSRLAGDRLKIKAFFVRFSHLDSLDPPPPESLTLCYLPRMDETELEIGGFEIRPDSPAFVTLHRVVSPSSRKAAVYGSRDRVRASEGVQFQVYLREENVVQGI
FRRSDEGEWRLECRCALESEIAGSTSAVEVCVDVEGEVAMFEKVALEVRRRRRRRGFCGLEGIPEGREVDGDCDGCACCCGVDEDGGETEVEMEVEGVRWAVDLGIWAVC
LGVGYLVSKAAYSKTLKRKPIF
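
The protein sequence should be the conceptual translation of the CDS sequence listed earlier in this record. A minimal sketch for checking that, assuming the standard nuclear genetic code (NCBI translation table 1) and using only the Python standard library; `translate` is a hypothetical helper, and the two short inputs are the first 24 and last 12 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the wrapped listing
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        # Codons with ambiguous bases (for example N) become 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTGCAGGTCTACCGATTACCGG"))  # MCRSTDYR
print(translate("CCAATTTTCTGA"))              # PIF
```

Passing the full wrapped CDS listing to `translate` and comparing the result with the protein sequence (after joining its lines) would confirm that the two entries agree.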