; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G007760 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G007760
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionsorbitol dehydrogenase
Genome locationCG_Chr05:8338595..8349464
RNA-Seq ExpressionClCG05G007760
SyntenyClCG05G007760
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domainsIPR002328 - Alcohol dehydrogenase, zinc-type, conserved site
IPR002885 - Pentatricopeptide repeat
IPR011032 - GroES-like superfamily
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR013149 - Alcohol dehydrogenase, C-terminal
IPR013154 - Alcohol dehydrogenase, N-terminal
IPR036291 - NAD(P)-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7014792.1 Sorbitol dehydrogenase [Cucurbita argyrosperma subsp. argyrosperma]1.1e-16769.89Show/hide
Query:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
        +TLRCAHFVVREPMVIGHECAGIIAEVGA+VKHLVPGDRVALEPGISCWRC LCK+GRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
Subjt:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG

Query:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC
        AMCEPLSVGVHACRRAN+GPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDD+RLSVAKDLGADEV+KVS+DIQ VDQDVAQIQKAM+ EVDVS DC
Subjt:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC

Query:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIKTY
        AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSA GGNAIK  
Subjt:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIKTY

Query:  QDRVPQMGCKLNSPNTPILFKFATKRSKLGGKSINFCPTFCLNSQNQETPTIPTAAKIPKSDISTVHFKSLTACKLGISRYPDFQYNAEGGTGTGSAEIC
              +    N                L  K+  F P             +P +  +                   ISRY  FQY+ +G TGTGS    
Subjt:  QDRVPQMGCKLNSPNTPILFKFATKRSKLGGKSINFCPTFCLNSQNQETPTIPTAAKIPKSDISTVHFKSLTACKLGISRYPDFQYNAEGGTGTGSAEIC

Query:  GDSGSSHVSVSFDVNTLYIPPLTTQTTKFLGLPLPPFLKIDILPELFHGNINQES
                                   KF  L    F+K+D LPE  HGN+N++S
Subjt:  GDSGSSHVSVSFDVNTLYIPPLTTQTTKFLGLPLPPFLKIDILPELFHGNINQES

RXH94987.1 hypothetical protein DVH24_024671 [Malus domestica]1.8e-17547.56Show/hide
Query:  LSEWETADDLVKYIDEHGVVLDIYLGNTLIDMYGRRGMADFAGRVFYQMKEKNIVSWNAMIMGYAKAGNLVAAKKVFSDMPSRDVISWTTMITGYSQAKQ
        L EWE AD +V+YI+E+ V +D+YLGNT+IDMYGRRG+A+ A  VF QM+E+NIVSWNAMI GYAK GNLV A+K+F++MP R+V+SWT+MIT YSQA Q
Subjt:  LSEWETADDLVKYIDEHGVVLDIYLGNTLIDMYGRRGMADFAGRVFYQMKEKNIVSWNAMIMGYAKAGNLVAAKKVFSDMPSRDVISWTTMITGYSQAKQ

Query:  HAEAVKVFQEMMASMVKPDEITVASVLSACAQLGSLDVGETVHDYIRKHGIKSE----------------------------------------------
        H EAV +FQEMMA+ V+PDEIT+ASVLSACA++GSLDVGE VH+YI+KHG+K++                                              
Subjt:  HAEAVKVFQEMMASMVKPDEITVASVLSACAQLGSLDVGETVHDYIRKHGIKSE----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------TLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEV
                            T++CA F V+EPMVIGHECAGI+ E+G +VKHL  GDRVA+EPGISC RC  CK GRYNLCP+MKFFATPPVHGSLAN++
Subjt:  --------------------TLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEV

Query:  VHPADLCFKLPENVSLEEGAMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQ
        VHPADLCFKLPENVSLEEGAMCEPLSVGVHACRRAN+GPE+ VL++GAGPIGLV+++AARAFG+PRIVIVD+DD RL++AK LGA++ VKVS  ++D+D 
Subjt:  VHPADLCFKLPENVSLEEGAMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQ

Query:  DVAQIQKAMETEVDVSFDCAGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQK
        ++AQI+ AM+++VDVSFDC GFNKTMSTAL ATR GGKVCLVGMGH  MTVPLTPAAAREVDV+G+FRYKNTWPLCLEF+RSG+I+VKPLITHRFGF+QK
Subjt:  DVAQIQKAMETEVDVSFDCAGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQK

Query:  EVEDAFETSARGGNAIK
        EVE+AFETSARGGNAIK
Subjt:  EVEDAFETSARGGNAIK

XP_022922617.1 sorbitol dehydrogenase [Cucurbita moschata]6.6e-16595.97Show/hide
Query:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
        +TLRCAHFVVREPMVIGHECAGIIAEVGA+VKHLVPGDRVALEPGISCWRC LCK+GRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
Subjt:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG

Query:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC
        AMCEPLSVGVHACRRAN+GPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDD+RLSVAKDLGADEV+KVS+DIQDVDQDVAQIQKAM+ EVDVS DC
Subjt:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC

Query:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK
        AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSA GGNAIK
Subjt:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK

XP_023553050.1 sorbitol dehydrogenase [Cucurbita pepo subsp. pepo]5.0e-16596.31Show/hide
Query:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
        +TLRCAHFVVREPMVIGHECAGIIAEVGA+VKHLVPGDRVALEPGISCWRC LCK+GRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
Subjt:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG

Query:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC
        AMCEPLSVGVHACRRAN+GPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDD+RLSVAKDLGADEV+KVSVDIQDVDQDVAQIQKAM+ EVDVS DC
Subjt:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC

Query:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK
        AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSA GGNAIK
Subjt:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK

XP_038895302.1 sorbitol dehydrogenase [Benincasa hispida]3.8e-16595.97Show/hide
Query:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
        +TLRCAHFVVREPM+IGHECAGIIAEVGADVKHLVPGD+VALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
Subjt:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG

Query:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC
        AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAK+LGADEVVKVS+DIQDVD+DVAQIQKAM+TEVDVSFDC
Subjt:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC

Query:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK
        AGFNKTMSTALSATRAGGKVCLVGMGHN+MTVPL  AAAREVDV+GVFRYKNTWPLCLEFIRSGKINVKPLITHRFGF+QKEVEDAFETSARGGNAIK
Subjt:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK

TrEMBL top hitse value%identityAlignment
A0A498JJB3 PKS_ER domain-containing protein8.9e-17647.56Show/hide
Query:  LSEWETADDLVKYIDEHGVVLDIYLGNTLIDMYGRRGMADFAGRVFYQMKEKNIVSWNAMIMGYAKAGNLVAAKKVFSDMPSRDVISWTTMITGYSQAKQ
        L EWE AD +V+YI+E+ V +D+YLGNT+IDMYGRRG+A+ A  VF QM+E+NIVSWNAMI GYAK GNLV A+K+F++MP R+V+SWT+MIT YSQA Q
Subjt:  LSEWETADDLVKYIDEHGVVLDIYLGNTLIDMYGRRGMADFAGRVFYQMKEKNIVSWNAMIMGYAKAGNLVAAKKVFSDMPSRDVISWTTMITGYSQAKQ

Query:  HAEAVKVFQEMMASMVKPDEITVASVLSACAQLGSLDVGETVHDYIRKHGIKSE----------------------------------------------
        H EAV +FQEMMA+ V+PDEIT+ASVLSACA++GSLDVGE VH+YI+KHG+K++                                              
Subjt:  HAEAVKVFQEMMASMVKPDEITVASVLSACAQLGSLDVGETVHDYIRKHGIKSE----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------TLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEV
                            T++CA F V+EPMVIGHECAGI+ E+G +VKHL  GDRVA+EPGISC RC  CK GRYNLCP+MKFFATPPVHGSLAN++
Subjt:  --------------------TLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEV

Query:  VHPADLCFKLPENVSLEEGAMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQ
        VHPADLCFKLPENVSLEEGAMCEPLSVGVHACRRAN+GPE+ VL++GAGPIGLV+++AARAFG+PRIVIVD+DD RL++AK LGA++ VKVS  ++D+D 
Subjt:  VHPADLCFKLPENVSLEEGAMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQ

Query:  DVAQIQKAMETEVDVSFDCAGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQK
        ++AQI+ AM+++VDVSFDC GFNKTMSTAL ATR GGKVCLVGMGH  MTVPLTPAAAREVDV+G+FRYKNTWPLCLEF+RSG+I+VKPLITHRFGF+QK
Subjt:  DVAQIQKAMETEVDVSFDCAGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQK

Query:  EVEDAFETSARGGNAIK
        EVE+AFETSARGGNAIK
Subjt:  EVEDAFETSARGGNAIK

A0A5D3DF08 Sorbitol dehydrogenase-like3.8e-15890.27Show/hide
Query:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
        + ++ AHFVV+EPMVIGHECAGI+AEVGADVKHLVPGDR+ALEPGISCWRCS CKEGRYNLCP+MKFFATPP+HGSLANEVVHPADLCFKLPENVSLEEG
Subjt:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG

Query:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC
        AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVT+MAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVS+D QDVD+DV +IQKAM+ EVDVSFDC
Subjt:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC

Query:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK
        AGF KTMSTAL A+R+GGKVCL+GMGHNEMTVPLTPAAAREVD+IGVFRYKNTWP+CLEFI SGKI+VKPLITHRFGFSQKEVE+AFETSARGGNAIK
Subjt:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK

A0A6J1CIE8 sorbitol dehydrogenase-like5.1e-16393.96Show/hide
Query:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
        +TL+CAHFVV+EPMVIGHECAG+IAEVG +VKHLVPGDRVALEPGISCWRC+LCKEGRYNLCP+MKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
Subjt:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG

Query:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC
        AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGAD+VVKVS+DIQDV+QDVA+IQKAM+TEVDVSFDC
Subjt:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC

Query:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK
        AGFNKTMSTALSATR GG+VCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWP+CLEFIRSGKINVK LITHRFGFSQKEVE+AFETSARGGNAIK
Subjt:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK

A0A6J1E3T2 sorbitol dehydrogenase3.2e-16595.97Show/hide
Query:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
        +TLRCAHFVVREPMVIGHECAGIIAEVGA+VKHLVPGDRVALEPGISCWRC LCK+GRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
Subjt:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG

Query:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC
        AMCEPLSVGVHACRRAN+GPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDD+RLSVAKDLGADEV+KVS+DIQDVDQDVAQIQKAM+ EVDVS DC
Subjt:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC

Query:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK
        AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSA GGNAIK
Subjt:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK

A0A6J1JAI4 sorbitol dehydrogenase4.6e-16495.64Show/hide
Query:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
        +TLRCAHFVVREPMVIGHECAGIIAEVGA+VKHLV GDRVALEPGISCWRC LCK+GRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
Subjt:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG

Query:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC
        AMCEPLSVGVHACRRAN+GPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDD+RLSVAKDLGADEV+KVS+DIQDVDQDVAQIQKAM+ EVDVS DC
Subjt:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC

Query:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK
        AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSA GGNAIK
Subjt:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK

SwissProt top hitse value%identityAlignment
P27867 Sorbitol dehydrogenase1.5e-7446.28Show/hide
Query:  GETVHDYIRKHGIKSETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPA
        G  VH +  +HG      R   FVV++PMV+GHE AG + +VG  VKHL PGDRVA+EPG+       CK GRYNL P + F ATPP  G+L     H A
Subjt:  GETVHDYIRKHGIKSETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPA

Query:  DLCFKLPENVSLEEGAMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVA-
        D C+KLP++V+ EEGA+ EPLSVG++ACRR ++     VLV GAGPIG+VT++ A+A GA ++V++D+   RL+ AK++GAD  ++V+   ++   D+A 
Subjt:  DLCFKLPENVSLEEGAMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVA-

Query:  QIQKAMETEVDVSFDCAGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVE
        +++  + ++ +V+ +C G   ++ T + AT +GG + +VGMG   + +PL  AA REVD+ GVFRY NTWP+ +  + S  +NVKPL+THRF   +K VE
Subjt:  QIQKAMETEVDVSFDCAGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVE

Query:  DAFETSARG
         AFET+ +G
Subjt:  DAFETSARG

Q00796 Sorbitol dehydrogenase4.2e-7447.42Show/hide
Query:  RCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEGAMC
        R  +F+V++PMV+GHE +G + +VG+ VKHL PGDRVA+EPG        CK GRYNL P + F ATPP  G+L     H A  C+KLP+NV+ EEGA+ 
Subjt:  RCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEGAMC

Query:  EPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVA-QIQKAMETEVDVSFDCAG
        EPLSVG+HACRR  +     VLV GAGPIG+VT++ A+A GA ++V+ D+   RLS AK++GAD V+++S   ++  Q++A +++  +  + +V+ +C G
Subjt:  EPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVA-QIQKAMETEVDVSFDCAG

Query:  FNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARG
           ++   + ATR+GG + LVG+G    TVPL  AA REVD+ GVFRY NTWP+ +  + S  +NVKPL+THRF    ++  +AFET  +G
Subjt:  FNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARG

Q1PSI9 L-idonate 5-dehydrogenase6.6e-14479.19Show/hide
Query:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
        +T+RCA+F+V++PMVIGHECAGII EVG++VK+LV GDRVALEPGISC RCSLC+ G+YNLC EMKFF +PP +GSLAN+VVHP++LCFKLP+NVSLEEG
Subjt:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG

Query:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC
        AMCEPLSVG+HACRRAN+GPETNVL+MG+GPIGLVTM+AARAFGAPRIV+VDVDD RL++AKDLGAD++++VS +IQD+D++VA+IQ  M T VDVSFDC
Subjt:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC

Query:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK
         GFNKTMSTAL+ATRAGGKVCLVG+  +EMTVPLTPAAAREVD++G+FRY+NTWPLCLEF+RSGKI+VKPLITHRF FSQK+VE+AFETSARGGNAIK
Subjt:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK

Q58D31 Sorbitol dehydrogenase2.5e-7446.28Show/hide
Query:  GETVHDYIRKHGIKSETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPA
        G  VH +  +HG      R   FVV++PMV+GHE +G + +VG+ V+HL PGDRVA+EPG        CK GRYNL P + F ATPP  G+L     H A
Subjt:  GETVHDYIRKHGIKSETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPA

Query:  DLCFKLPENVSLEEGAMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVA-
        + C+KLP+NV+ EEGA+ EPLSVG+HACRRA +     VLV GAGPIGLV+++AA+A GA ++V+ D+   RLS AK++GAD ++++S    +  Q++A 
Subjt:  DLCFKLPENVSLEEGAMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVA-

Query:  QIQKAMETEVDVSFDCAGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVE
        +++  + ++ +V+ +C G   ++   + AT +GG + LVG+G    +VPL  AA REVD+ GVFRY NTWP+ +  + S  +NVKPL+THRF    ++  
Subjt:  QIQKAMETEVDVSFDCAGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVE

Query:  DAFETSARG
        +AFETS +G
Subjt:  DAFETSARG

Q9FJ95 Sorbitol dehydrogenase2.8e-15084.9Show/hide
Query:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
        +T+RCA FVV+EPMVIGHECAGII EVG +VKHLV GDRVALEPGISCWRC+LC+EGRYNLCPEMKFFATPPVHGSLAN+VVHPADLCFKLPENVSLEEG
Subjt:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG

Query:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC
        AMCEPLSVGVHACRRA +GPETNVLVMGAGPIGLVTM+AARAF  PRIVIVDVD+ RL+VAK LGADE+V+V+ +++DV  +V QIQKAM + +DV+FDC
Subjt:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC

Query:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK
        AGFNKTMSTAL+ATR GGKVCLVGMGH  MTVPLTPAAAREVDV+GVFRYKNTWPLCLEF+ SGKI+VKPLITHRFGFSQKEVEDAFETSARG NAIK
Subjt:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK

Arabidopsis top hitse value%identityAlignment
AT2G22410.1 SLOW GROWTH 11.0e-3043.05Show/hide
Query:  LSEWETADDLVKYIDEHGVVLDIYLGNTLIDMYGRRGMADFAGRVFYQMKEKNIVSWNAMIMGYAKAGNLVAAKKVFSDMPSRDVISWTTMITGYSQAKQ
        L +     +  +Y+ E+G+ + I L N L+DM+ + G    A R+F  ++++ IVSW  MI GYA+ G L  ++K+F DM  +DV+ W  MI G  QAK+
Subjt:  LSEWETADDLVKYIDEHGVVLDIYLGNTLIDMYGRRGMADFAGRVFYQMKEKNIVSWNAMIMGYAKAGNLVAAKKVFSDMPSRDVISWTTMITGYSQAKQ

Query:  HAEAVKVFQEMMASMVKPDEITVASVLSACAQLGSLDVGETVHDYIRKHGI
          +A+ +FQEM  S  KPDEIT+   LSAC+QLG+LDVG  +H YI K+ +
Subjt:  HAEAVKVFQEMMASMVKPDEITVASVLSACAQLGSLDVGETVHDYIRKHGI

AT4G18840.1 Pentatricopeptide repeat (PPR-like) superfamily protein2.5e-2949.26Show/hide
Query:  VLDIYLGNTLIDMYGRRGMADFAGRVFYQMKEKNIVSWNAMIMGYAKAGNLVAAKKVFSDMPSRDVISWTTMITGYSQAKQHAEAVKVFQEMM-ASMVKP
        V D    N+L+  Y  +G+ D A  +F +M+E+N+ SWN MI GYA AG +  AK+VF  MP RDV+SW  M+T Y+    + E ++VF +M+  S  KP
Subjt:  VLDIYLGNTLIDMYGRRGMADFAGRVFYQMKEKNIVSWNAMIMGYAKAGNLVAAKKVFSDMPSRDVISWTTMITGYSQAKQHAEAVKVFQEMM-ASMVKP

Query:  DEITVASVLSACAQLGSLDVGETVHDYIRKHGIKSE
        D  T+ SVLSACA LGSL  GE VH YI KHGI+ E
Subjt:  DEITVASVLSACAQLGSLDVGETVHDYIRKHGIKSE

AT5G51970.1 GroES-like zinc-binding alcohol dehydrogenase family protein2.0e-15184.9Show/hide
Query:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
        +T+RCA FVV+EPMVIGHECAGII EVG +VKHLV GDRVALEPGISCWRC+LC+EGRYNLCPEMKFFATPPVHGSLAN+VVHPADLCFKLPENVSLEEG
Subjt:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG

Query:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC
        AMCEPLSVGVHACRRA +GPETNVLVMGAGPIGLVTM+AARAF  PRIVIVDVD+ RL+VAK LGADE+V+V+ +++DV  +V QIQKAM + +DV+FDC
Subjt:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC

Query:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK
        AGFNKTMSTAL+ATR GGKVCLVGMGH  MTVPLTPAAAREVDV+GVFRYKNTWPLCLEF+ SGKI+VKPLITHRFGFSQKEVEDAFETSARG NAIK
Subjt:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK

AT5G51970.2 GroES-like zinc-binding alcohol dehydrogenase family protein2.0e-15184.9Show/hide
Query:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG
        +T+RCA FVV+EPMVIGHECAGII EVG +VKHLV GDRVALEPGISCWRC+LC+EGRYNLCPEMKFFATPPVHGSLAN+VVHPADLCFKLPENVSLEEG
Subjt:  ETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPEMKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEG

Query:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC
        AMCEPLSVGVHACRRA +GPETNVLVMGAGPIGLVTM+AARAF  PRIVIVDVD+ RL+VAK LGADE+V+V+ +++DV  +V QIQKAM + +DV+FDC
Subjt:  AMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSVDIQDVDQDVAQIQKAMETEVDVSFDC

Query:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK
        AGFNKTMSTAL+ATR GGKVCLVGMGH  MTVPLTPAAAREVDV+GVFRYKNTWPLCLEF+ SGKI+VKPLITHRFGFSQKEVEDAFETSARG NAIK
Subjt:  AGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVEDAFETSARGGNAIK

AT5G62140.1 unknown protein2.1e-6054.17Show/hide
Query:  KLNSPNTPILFKFATKRSKLGGKSINFCPTFCLNSQNQETPTIPTAAKIPKSDISTVHFKSLTACKLGISRYPDFQYNAEGGTGTGSAEICGD---SGSS
        K+NS  T I + + + R  +  KS N       NS N +    P   ++P + + +V FK++   KLGISRYPDF+Y+  GG+G G+A+   D   + +S
Subjt:  KLNSPNTPILFKFATKRSKLGGKSINFCPTFCLNSQNQETPTIPTAAKIPKSDISTVHFKSLTACKLGISRYPDFQYNAEGGTGTGSAEICGD---SGSS

Query:  HVSVSFDVNTLYIPPLTTQTTKFLGLPLPPFLKIDILPELFHGNINQESGKVELEFEAQFMFSI-GSLYKAPPLLVKTVLSSEESRGSIRSGKGERLDDK
         +SV F+V TLYIP LT+QTTKFLG PLPPFLKIDI PE+F G INQESGKVELEF A+F F+  G +Y+AP L+V+TVL++EES G  + GKGERLD++
Subjt:  HVSVSFDVNTLYIPPLTTQTTKFLGLPLPPFLKIDILPELFHGNINQESGKVELEFEAQFMFSI-GSLYKAPPLLVKTVLSSEESRGSIRSGKGERLDDK

Query:  GKCRLVGVATVDPIDDLLLNSFLSLPTECIANLNAIITFS
        GKCRLVGVA V+ IDDL +N+FLSLP EC+A+L AII+ S
Subjt:  GKCRLVGVATVDPIDDLLLNSFLSLPTECIANLNAIITFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTGTTTAAGTGAATGGGAAACTGCAGACGATTTGGTGAAGTACATTGATGAACACGGTGTTGTGCTTGATATCTACTTAGGAAACACTTTGATAGATATGTATGG
ACGCCGTGGTATGGCAGATTTTGCAGGTAGAGTGTTTTATCAGATGAAAGAGAAAAATATAGTATCATGGAATGCAATGATCATGGGGTATGCAAAAGCAGGGAATTTAG
TTGCCGCAAAGAAGGTTTTTAGTGATATGCCTTCAAGGGATGTGATCTCATGGACCACCATGATCACAGGCTATTCTCAAGCCAAGCAACATGCCGAGGCAGTGAAGGTT
TTTCAAGAAATGATGGCGTCTATGGTGAAACCAGATGAAATAACTGTGGCTTCTGTGCTTTCTGCTTGTGCCCAATTGGGCTCACTTGATGTGGGAGAGACAGTTCATGA
CTACATACGCAAGCATGGCATCAAATCAGAGACACTGAGATGTGCACATTTTGTGGTTAGAGAGCCAATGGTGATTGGGCATGAATGTGCTGGGATTATCGCAGAAGTTG
GGGCTGATGTTAAGCATTTGGTGCCGGGGGATCGGGTTGCACTGGAGCCTGGAATTAGTTGTTGGAGATGTAGTCTCTGCAAAGAAGGCCGCTACAATCTGTGCCCAGAG
ATGAAGTTCTTTGCCACTCCCCCTGTTCATGGTTCTCTTGCAAATGAGGTGGTTCATCCAGCAGACCTGTGTTTTAAATTGCCAGAAAATGTCAGCTTAGAGGAAGGAGC
CATGTGTGAGCCCTTAAGTGTTGGTGTTCATGCTTGTCGACGTGCCAACATTGGTCCCGAAACAAATGTTTTGGTCATGGGTGCTGGACCAATTGGGCTTGTCACTATGA
TGGCTGCACGTGCATTTGGTGCACCACGGATTGTCATTGTCGATGTGGATGACTATCGATTGTCTGTTGCAAAGGATCTTGGAGCAGATGAAGTTGTTAAAGTTTCAGTT
GACATTCAGGATGTAGATCAAGATGTTGCTCAGATACAAAAAGCCATGGAAACTGAGGTAGATGTGAGTTTCGACTGTGCTGGCTTCAACAAGACAATGTCAACAGCCTT
AAGCGCCACCCGAGCTGGTGGCAAAGTTTGTCTCGTTGGAATGGGTCACAATGAGATGACTGTTCCACTAACTCCAGCTGCAGCAAGGGAAGTCGATGTCATTGGCGTGT
TTCGGTACAAAAACACGTGGCCTCTGTGCTTGGAGTTTATAAGAAGTGGTAAGATCAATGTGAAGCCGCTTATAACACACAGATTTGGTTTCTCACAGAAGGAGGTAGAG
GACGCCTTTGAAACCAGTGCTCGTGGTGGTAATGCTATTAAGACCTACCAAGACAGAGTTCCCCAGATGGGGTGTAAACTAAACTCCCCCAATACCCCAATTCTCTTCAA
ATTTGCCACAAAAAGATCCAAACTTGGAGGTAAAAGCATCAATTTTTGCCCCACTTTCTGTCTCAACAGTCAAAACCAAGAAACACCCACCATTCCCACCGCCGCAAAGA
TCCCAAAATCTGATATTTCCACCGTTCATTTCAAGTCTCTCACGGCCTGCAAGCTTGGTATATCCAGATACCCTGATTTCCAATATAATGCTGAAGGAGGAACAGGAACT
GGGTCTGCCGAGATCTGCGGTGACAGCGGCAGCAGCCATGTTTCAGTTTCTTTCGATGTCAACACCCTCTATATCCCACCATTGACAACTCAAACCACCAAGTTTCTAGG
TCTGCCATTGCCACCGTTTTTGAAGATTGATATTCTTCCTGAATTATTCCATGGGAACATCAATCAAGAGTCGGGCAAGGTTGAGCTTGAATTCGAGGCACAGTTCATGT
TCTCAATTGGGAGTTTGTATAAAGCTCCTCCATTGCTAGTAAAAACAGTGCTGAGTTCTGAAGAATCAAGAGGAAGCATAAGAAGTGGAAAAGGAGAAAGATTAGATGAT
AAAGGAAAGTGCAGATTGGTGGGAGTGGCTACTGTTGACCCCATTGATGATTTGCTTCTCAATTCCTTCCTTTCTCTCCCCACTGAATGTATTGCAAACCTCAATGCTAT
AATCACATTCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACTGTTTAAGTGAATGGGAAACTGCAGACGATTTGGTGAAGTACATTGATGAACACGGTGTTGTGCTTGATATCTACTTAGGAAACACTTTGATAGATATGTATGG
ACGCCGTGGTATGGCAGATTTTGCAGGTAGAGTGTTTTATCAGATGAAAGAGAAAAATATAGTATCATGGAATGCAATGATCATGGGGTATGCAAAAGCAGGGAATTTAG
TTGCCGCAAAGAAGGTTTTTAGTGATATGCCTTCAAGGGATGTGATCTCATGGACCACCATGATCACAGGCTATTCTCAAGCCAAGCAACATGCCGAGGCAGTGAAGGTT
TTTCAAGAAATGATGGCGTCTATGGTGAAACCAGATGAAATAACTGTGGCTTCTGTGCTTTCTGCTTGTGCCCAATTGGGCTCACTTGATGTGGGAGAGACAGTTCATGA
CTACATACGCAAGCATGGCATCAAATCAGAGACACTGAGATGTGCACATTTTGTGGTTAGAGAGCCAATGGTGATTGGGCATGAATGTGCTGGGATTATCGCAGAAGTTG
GGGCTGATGTTAAGCATTTGGTGCCGGGGGATCGGGTTGCACTGGAGCCTGGAATTAGTTGTTGGAGATGTAGTCTCTGCAAAGAAGGCCGCTACAATCTGTGCCCAGAG
ATGAAGTTCTTTGCCACTCCCCCTGTTCATGGTTCTCTTGCAAATGAGGTGGTTCATCCAGCAGACCTGTGTTTTAAATTGCCAGAAAATGTCAGCTTAGAGGAAGGAGC
CATGTGTGAGCCCTTAAGTGTTGGTGTTCATGCTTGTCGACGTGCCAACATTGGTCCCGAAACAAATGTTTTGGTCATGGGTGCTGGACCAATTGGGCTTGTCACTATGA
TGGCTGCACGTGCATTTGGTGCACCACGGATTGTCATTGTCGATGTGGATGACTATCGATTGTCTGTTGCAAAGGATCTTGGAGCAGATGAAGTTGTTAAAGTTTCAGTT
GACATTCAGGATGTAGATCAAGATGTTGCTCAGATACAAAAAGCCATGGAAACTGAGGTAGATGTGAGTTTCGACTGTGCTGGCTTCAACAAGACAATGTCAACAGCCTT
AAGCGCCACCCGAGCTGGTGGCAAAGTTTGTCTCGTTGGAATGGGTCACAATGAGATGACTGTTCCACTAACTCCAGCTGCAGCAAGGGAAGTCGATGTCATTGGCGTGT
TTCGGTACAAAAACACGTGGCCTCTGTGCTTGGAGTTTATAAGAAGTGGTAAGATCAATGTGAAGCCGCTTATAACACACAGATTTGGTTTCTCACAGAAGGAGGTAGAG
GACGCCTTTGAAACCAGTGCTCGTGGTGGTAATGCTATTAAGACCTACCAAGACAGAGTTCCCCAGATGGGGTGTAAACTAAACTCCCCCAATACCCCAATTCTCTTCAA
ATTTGCCACAAAAAGATCCAAACTTGGAGGTAAAAGCATCAATTTTTGCCCCACTTTCTGTCTCAACAGTCAAAACCAAGAAACACCCACCATTCCCACCGCCGCAAAGA
TCCCAAAATCTGATATTTCCACCGTTCATTTCAAGTCTCTCACGGCCTGCAAGCTTGGTATATCCAGATACCCTGATTTCCAATATAATGCTGAAGGAGGAACAGGAACT
GGGTCTGCCGAGATCTGCGGTGACAGCGGCAGCAGCCATGTTTCAGTTTCTTTCGATGTCAACACCCTCTATATCCCACCATTGACAACTCAAACCACCAAGTTTCTAGG
TCTGCCATTGCCACCGTTTTTGAAGATTGATATTCTTCCTGAATTATTCCATGGGAACATCAATCAAGAGTCGGGCAAGGTTGAGCTTGAATTCGAGGCACAGTTCATGT
TCTCAATTGGGAGTTTGTATAAAGCTCCTCCATTGCTAGTAAAAACAGTGCTGAGTTCTGAAGAATCAAGAGGAAGCATAAGAAGTGGAAAAGGAGAAAGATTAGATGAT
AAAGGAAAGTGCAGATTGGTGGGAGTGGCTACTGTTGACCCCATTGATGATTTGCTTCTCAATTCCTTCCTTTCTCTCCCCACTGAATGTATTGCAAACCTCAATGCTAT
AATCACATTCTCTTAGAGAGACAGAAAAAAAAATATGAATATAATGATGTAATTGTTTGGGACAAAACCAGACACATACATATTTGAAAACAAATAATTTAATGTATAGG
T
Protein sequenceShow/hide protein sequence
MDCLSEWETADDLVKYIDEHGVVLDIYLGNTLIDMYGRRGMADFAGRVFYQMKEKNIVSWNAMIMGYAKAGNLVAAKKVFSDMPSRDVISWTTMITGYSQAKQHAEAVKV
FQEMMASMVKPDEITVASVLSACAQLGSLDVGETVHDYIRKHGIKSETLRCAHFVVREPMVIGHECAGIIAEVGADVKHLVPGDRVALEPGISCWRCSLCKEGRYNLCPE
MKFFATPPVHGSLANEVVHPADLCFKLPENVSLEEGAMCEPLSVGVHACRRANIGPETNVLVMGAGPIGLVTMMAARAFGAPRIVIVDVDDYRLSVAKDLGADEVVKVSV
DIQDVDQDVAQIQKAMETEVDVSFDCAGFNKTMSTALSATRAGGKVCLVGMGHNEMTVPLTPAAAREVDVIGVFRYKNTWPLCLEFIRSGKINVKPLITHRFGFSQKEVE
DAFETSARGGNAIKTYQDRVPQMGCKLNSPNTPILFKFATKRSKLGGKSINFCPTFCLNSQNQETPTIPTAAKIPKSDISTVHFKSLTACKLGISRYPDFQYNAEGGTGT
GSAEICGDSGSSHVSVSFDVNTLYIPPLTTQTTKFLGLPLPPFLKIDILPELFHGNINQESGKVELEFEAQFMFSIGSLYKAPPLLVKTVLSSEESRGSIRSGKGERLDD
KGKCRLVGVATVDPIDDLLLNSFLSLPTECIANLNAIITFS