CuGenDBv2

Cmc12g0330541 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc12g0330541
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: U-box domain-containing protein 7
Genome location: CMiso1.1chr12:21513706..21521783
RNA-Seq Expression: Cmc12g0330541
Synteny: Cmc12g0330541
Gene Ontology terms: NA
InterPro domains: IPR011989 - Armadillo-like helical
                  IPR016024 - Armadillo-type fold
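
The genome location string above packs a sequence name and a coordinate range into one field. A minimal sketch of splitting it apart and computing the locus span follows. It assumes the coordinates are 1-based and inclusive, which the page does not state, and `parse_location` is a hypothetical helper name rather than part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("CMiso1.1chr12:21513706..21521783")

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene model spans 8078 bp on CMiso1.1chr12, introns included.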


Homology
GenBank top hits (e value, % identity, alignment)
XP_004149064.2 uncharacterized protein LOC101207857 [Cucumis sativus] (e value: 1.2e-177, % identity: 68.83)
Query:  MPPPSLS-FYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEIEHSIV---DDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERL
        MP PSLS +YYSY+HELRFLSLIR FLR  SKSSRKR RFPSHPSSD LFPEIE S +    D D+L  +SSS LQRTVK LHFGDGDEKERAAKEIERL
Subjt:  MPPPSLS-FYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEIEHSIV---DDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERL

Query:  IKKESGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIY
        IKKESGNSK     ++V+ +L +                                                                             
Subjt:  IKKESGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIY

Query:  QAVDEFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEFPELLLS
                                                   IPALVAMADSDHFAVKALIQLANHTFLNKTLMLE GILTKLP+KDSS+HEFPELLLS
Subjt:  QAVDEFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEFPELLLS

Query:  LSCLANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKF
        LSCLANTQLFLASTEPIISYLL ILN+ ESNS+SKT CLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGK ALETNSKF
Subjt:  LSCLANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKF

Query:  SEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEK
        SEILIEILTWEEKPKCQELSAYIIM++AHQSW QRE+L K SIIVPALLGLALLGSPLAQNRALKLLQWLKDERRA VTAHSGPQVGDGIVEVGSGFSEK
Subjt:  SEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEK

Query:  EIEKGKRVMRSLVKQSLYKNMEIITRRANGGECSSSSSSIRRTLVSSISSKSLPF
        EIEKGKRVMRSLVKQSLYKNMEIITRRANGGEC  SSSSIRRTLVSSISSKSLPF
Subjt:  EIEKGKRVMRSLVKQSLYKNMEIITRRANGGECSSSSSSIRRTLVSSISSKSLPF

XP_008443102.1 PREDICTED: uncharacterized protein LOC103486796 [Cucumis melo] (e value: 5.6e-204, % identity: 75.86)
Query:  MPPPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEIEHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKKE
        MPPPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEIEHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKKE
Subjt:  MPPPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEIEHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKKE

Query:  SGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAVD
        SGNSK        V +L +D+                                                                               
Subjt:  SGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAVD

Query:  EFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEFPELLLSLSCL
                                               IPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEFPELLLSLSCL
Subjt:  EFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEFPELLLSLSCL

Query:  ANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEIL
        ANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEIL
Subjt:  ANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEIL

Query:  IEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEKEIEK
        IEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEKEIEK
Subjt:  IEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEKEIEK

Query:  GKRVMRSLVKQSLYKNMEIITRRANGGECSSSSSSIRRTLVSSISSKSLPF
        GKRVMRSLVKQSLYKNMEIITRRANGGEC  SSSSIRRTLVSSISSKSLPF
Subjt:  GKRVMRSLVKQSLYKNMEIITRRANGGECSSSSSSIRRTLVSSISSKSLPF

XP_022927021.1 uncharacterized protein LOC111433976 [Cucurbita moschata] (e value: 1.7e-128, % identity: 55.81)
Query:  PPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEI---EHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKK
        PPSL F YSY+ ++RFL+ +R+FLR  SKSSRKR R PS P S+I  PEI   E S +   D     +SSVLQRTVKSLHFGDG+EK+RAAKEIERLIK+
Subjt:  PPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEI---EHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKK

Query:  ESGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAV
         +            V +L +D+                                                                              
Subjt:  ESGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAV

Query:  DEFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKK------DSSSHEFPEL
                                                IPALVAMADSD  AV+ALI+LAN T LNKT+M+E GIL+KLPK       DSSS EF EL
Subjt:  DEFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKK------DSSSHEFPEL

Query:  LLSLSCLANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETN
        LLSLSCLANTQLFLASTEP++SYLL ILNN +S+ ++K  CLAT+FNIST+LEN ETLISN V+PTLL+FS +KE SEKALPTLANLAVTSKGKQALE+N
Subjt:  LLSLSCLANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETN

Query:  SKFSEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGF
        S+F+EIL+EILTWEEKPKCQELSA IIMI+ HQSWAQRERL + S I PALLGLALLGS LAQ RALKLLQW KDER A V  HSGPQ G GIV VGSG 
Subjt:  SKFSEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGF

Query:  SEKEIEKGKRVMRSLVKQSLYKNMEIITRRAN-GGECSSSSSSIRRTLVSSISSKSLPF
        SE+E+EKGKR+MRSLVKQSLYKNMEIITRRAN  GEC   S +IRRTLVSSISSKS PF
Subjt:  SEKEIEKGKRVMRSLVKQSLYKNMEIITRRAN-GGECSSSSSSIRRTLVSSISSKSLPF

XP_023519234.1 uncharacterized protein LOC111782668 [Cucurbita pepo subsp. pepo] (e value: 4.3e-127, % identity: 55.64)
Query:  PPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEI---EHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKK
        PPSL F YSY+ ++RFL+ +R+FLR  SKSSRKR R PS P S+I  PEI   E S +   D     +SSVLQRTVKSLHFGDG+EK+RAAKEIERLIK+
Subjt:  PPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEI---EHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKK

Query:  ESGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAV
         +            V +L +D+                                                                              
Subjt:  ESGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAV

Query:  DEFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKK------DSSSHEFPEL
                                                IPALVAMADSD  AV+ALI+LAN T LNKT+M+E GIL+KLPK       DSSS EF EL
Subjt:  DEFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKK------DSSSHEFPEL

Query:  LLSLSCLANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETN
        LLSLSCLANTQLFLASTEP+ISYLL +LNN +S+ K+K  CL T+FNIST+L+N ETLISN V+PTLL+FS ++EFSEKALPTLANLAVTSKGKQALE+N
Subjt:  LLSLSCLANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETN

Query:  SKFSEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGF
        S    ILIEILTWEEKPKCQELSA IIMI+ HQSWAQRERL + S I PALLGLALLGS LAQ RALKLLQW KDER A V  HSGPQ G GIV VGSG 
Subjt:  SKFSEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGF

Query:  SEKEIEKGKRVMRSLVKQSLYKNMEIITRRAN-GGECSSSSSSIRRTLVSSISSKSLPF
        SEKE+EKGKR+MRSLVKQSLYKNMEIITRRAN  GEC   S +IRRTLVSSISSKS PF
Subjt:  SEKEIEKGKRVMRSLVKQSLYKNMEIITRRAN-GGECSSSSSSIRRTLVSSISSKSLPF

XP_038894080.1 U-box domain-containing protein 6-like [Benincasa hispida] (e value: 2.5e-143, % identity: 59.96)
Query:  PSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFP---EIEHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKKE
        PSLS  YSY+ +LRF+S +RRFL   S+SSRKR R PS P SDI  P   EIE SI+   DQ      +VLQRTVKSLHFGDGDEKERAAKEIER I KE
Subjt:  PSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFP---EIEHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKKE

Query:  SGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAVD
        S   + L                                                   IV+ G                                     
Subjt:  SGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAVD

Query:  EFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLP------KKDSSSHEFPELL
                                               IPALVAMADSD  AV+ALIQLANHT+LNKTLM+E GILTKLP      K DSSSHEFPELL
Subjt:  EFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLP------KKDSSSHEFPELL

Query:  LSLSCLANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNS
        LSLSCLANTQLFLASTEP+ISYLL ILNN ESN K+K  CLAT+FNIST+LEN ETLISN V+PTLL+FS IKEFSEKALPTLANLAVTSKGKQALE+NS
Subjt:  LSLSCLANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNS

Query:  KFSEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFS
         F EILIEILTWEEKP CQELS YIIMI+AHQSWAQRERL K+S+IVPALLGLALLGSPLAQ RALKLLQW K+ER+A V  HSGPQ+  GIVEVGSGFS
Subjt:  KFSEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFS

Query:  EKEIEKGKRVMRSLVKQSLYKNMEIITRRANGGECSSSSSSIRRTLVSSISSKSLPF
        EKEIEKGKR+MRSLVKQSLYKNMEIITRRANGGEC   S  IRRTLV S SSKSLPF
Subjt:  EKEIEKGKRVMRSLVKQSLYKNMEIITRRANGGECSSSSSSIRRTLVSSISSKSLPF

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LUN8 Uncharacterized protein (e value: 5.7e-178, % identity: 68.83)
Query:  MPPPSLS-FYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEIEHSIV---DDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERL
        MP PSLS +YYSY+HELRFLSLIR FLR  SKSSRKR RFPSHPSSD LFPEIE S +    D D+L  +SSS LQRTVK LHFGDGDEKERAAKEIERL
Subjt:  MPPPSLS-FYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEIEHSIV---DDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERL

Query:  IKKESGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIY
        IKKESGNSK     ++V+ +L +                                                                             
Subjt:  IKKESGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIY

Query:  QAVDEFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEFPELLLS
                                                   IPALVAMADSDHFAVKALIQLANHTFLNKTLMLE GILTKLP+KDSS+HEFPELLLS
Subjt:  QAVDEFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEFPELLLS

Query:  LSCLANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKF
        LSCLANTQLFLASTEPIISYLL ILN+ ESNS+SKT CLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGK ALETNSKF
Subjt:  LSCLANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKF

Query:  SEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEK
        SEILIEILTWEEKPKCQELSAYIIM++AHQSW QRE+L K SIIVPALLGLALLGSPLAQNRALKLLQWLKDERRA VTAHSGPQVGDGIVEVGSGFSEK
Subjt:  SEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEK

Query:  EIEKGKRVMRSLVKQSLYKNMEIITRRANGGECSSSSSSIRRTLVSSISSKSLPF
        EIEKGKRVMRSLVKQSLYKNMEIITRRANGGEC  SSSSIRRTLVSSISSKSLPF
Subjt:  EIEKGKRVMRSLVKQSLYKNMEIITRRANGGECSSSSSSIRRTLVSSISSKSLPF

A0A1S3B802 uncharacterized protein LOC103486796 (e value: 2.7e-204, % identity: 75.86)
Query:  MPPPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEIEHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKKE
        MPPPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEIEHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKKE
Subjt:  MPPPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEIEHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKKE

Query:  SGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAVD
        SGNSK        V +L +D+                                                                               
Subjt:  SGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAVD

Query:  EFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEFPELLLSLSCL
                                               IPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEFPELLLSLSCL
Subjt:  EFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEFPELLLSLSCL

Query:  ANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEIL
        ANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEIL
Subjt:  ANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEIL

Query:  IEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEKEIEK
        IEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEKEIEK
Subjt:  IEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEKEIEK

Query:  GKRVMRSLVKQSLYKNMEIITRRANGGECSSSSSSIRRTLVSSISSKSLPF
        GKRVMRSLVKQSLYKNMEIITRRANGGEC  SSSSIRRTLVSSISSKSLPF
Subjt:  GKRVMRSLVKQSLYKNMEIITRRANGGECSSSSSSIRRTLVSSISSKSLPF

A0A5A7UPJ5 U-box domain-containing protein 7 (e value: 2.7e-204, % identity: 75.86)
Query:  MPPPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEIEHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKKE
        MPPPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEIEHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKKE
Subjt:  MPPPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEIEHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKKE

Query:  SGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAVD
        SGNSK        V +L +D+                                                                               
Subjt:  SGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAVD

Query:  EFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEFPELLLSLSCL
                                               IPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEFPELLLSLSCL
Subjt:  EFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEFPELLLSLSCL

Query:  ANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEIL
        ANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEIL
Subjt:  ANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEIL

Query:  IEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEKEIEK
        IEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEKEIEK
Subjt:  IEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEKEIEK

Query:  GKRVMRSLVKQSLYKNMEIITRRANGGECSSSSSSIRRTLVSSISSKSLPF
        GKRVMRSLVKQSLYKNMEIITRRANGGEC  SSSSIRRTLVSSISSKSLPF
Subjt:  GKRVMRSLVKQSLYKNMEIITRRANGGECSSSSSSIRRTLVSSISSKSLPF

A0A6J1EGI7 uncharacterized protein LOC111433976 (e value: 8.4e-129, % identity: 55.81)
Query:  PPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEI---EHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKK
        PPSL F YSY+ ++RFL+ +R+FLR  SKSSRKR R PS P S+I  PEI   E S +   D     +SSVLQRTVKSLHFGDG+EK+RAAKEIERLIK+
Subjt:  PPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEI---EHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKK

Query:  ESGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAV
         +            V +L +D+                                                                              
Subjt:  ESGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAV

Query:  DEFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKK------DSSSHEFPEL
                                                IPALVAMADSD  AV+ALI+LAN T LNKT+M+E GIL+KLPK       DSSS EF EL
Subjt:  DEFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKK------DSSSHEFPEL

Query:  LLSLSCLANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETN
        LLSLSCLANTQLFLASTEP++SYLL ILNN +S+ ++K  CLAT+FNIST+LEN ETLISN V+PTLL+FS +KE SEKALPTLANLAVTSKGKQALE+N
Subjt:  LLSLSCLANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETN

Query:  SKFSEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGF
        S+F+EIL+EILTWEEKPKCQELSA IIMI+ HQSWAQRERL + S I PALLGLALLGS LAQ RALKLLQW KDER A V  HSGPQ G GIV VGSG 
Subjt:  SKFSEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGF

Query:  SEKEIEKGKRVMRSLVKQSLYKNMEIITRRAN-GGECSSSSSSIRRTLVSSISSKSLPF
        SE+E+EKGKR+MRSLVKQSLYKNMEIITRRAN  GEC   S +IRRTLVSSISSKS PF
Subjt:  SEKEIEKGKRVMRSLVKQSLYKNMEIITRRAN-GGECSSSSSSIRRTLVSSISSKSLPF

A0A6J1KNH4 uncharacterized protein LOC111495775 (e value: 2.5e-125, % identity: 54.74)
Query:  PPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEI---EHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKK
        PPSL F YSY+ ++RFL+ +R+FLR  SKSSRKR R PS P SDI  PEI   E + +   DQ    +SSVLQRTVKSLHFGDG+EK+RAAKEIERLIK+
Subjt:  PPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEI---EHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKK

Query:  ESGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAV
         +            V +L +D+                                                                              
Subjt:  ESGNSKGLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAV

Query:  DEFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKK------DSSSHEFPEL
                                                IPALVAMADSD  AV+ALI+LAN T LNK +M+E GIL+KLPK       DSSS EF EL
Subjt:  DEFIFERISTATSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKK------DSSSHEFPEL

Query:  LLSLSCLANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETN
        L SLSCLANTQLFLASTEP+ISYLL ILN+ +S+ +++  CLAT+FNIST+LEN ETLISN V+PTLL+FS ++EFSEKALPTLANLAVTSK KQALE+N
Subjt:  LLSLSCLANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETN

Query:  SKFSEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGF
        S F+EIL+EILTWEEKPKCQELSA IIMI+ HQSWAQRERL + S I PALLGLALLGS LAQ RALKLLQW KDER A V  HSGPQ   GIV VGSG 
Subjt:  SKFSEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGF

Query:  SEKEIEKGKRVMRSLVKQSLYKNMEIITRRAN-GGECSSSSSSIRRTLVSSISSKSLPF
        S+KE+EKGKR+MRSLVKQSLYKNMEIITRRAN  GEC   S ++RR LVSSISSKS PF
Subjt:  SEKEIEKGKRVMRSLVKQSLYKNMEIITRRAN-GGECSSSSSSIRRTLVSSISSKSLPF

SwissProt top hits (e value, % identity, alignment)
Q9CAG5 U-box domain-containing protein 7 (e value: 3.7e-09, % identity: 26.91)
Query:  ALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEF---PELLLSLSCLANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLI
        AL  LA +   NK LML  G++  L K  SS+        L L+LSCL   +  + S++ +    L+ L   E  ++ K   L  ++N+ST   N   L+
Subjt:  ALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEF---PELLLSLSCLANTQLFLASTEPIISYLLIILNNPESNSKSKTCCLATIFNISTILENTETLI

Query:  SNSVIPT---LLKFSIIKEFSEKALPTLANLAVTSKGK-QALETNSKFSEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLA
        S+++I +   LL  +    + EK+L  L NLA + +GK +A+ +    S +   +   +   + Q +S  +I+    +S  Q   +     ++P+L+ ++
Subjt:  SNSVIPT---LLKFSIIKEFSEKALPTLANLAVTSKGK-QALETNSKFSEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASIIVPALLGLA

Query:  LLGSPLAQNRALKLLQWLKDERR
        + G+P  + ++ KLL   ++ER+
Subjt:  LLGSPLAQNRALKLLQWLKDERR

Arabidopsis top hits (e value, % identity, alignment)
AT1G48720.1 unknown protein (e value: 6.5e-17, % identity: 45.74)
Query:  MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAVDEFIFERISTATSAK
        MASN   +  Q+P  +  N++ WS++MK + G+ ++W+IVE+G+ E EN+  L+  Q   LR++RK+DKKAL  IYQ +DE  FE++  ATSAK
Subjt:  MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAVDEFIFERISTATSAK

AT2G25130.1 ARM repeat superfamily protein (e value: 3.8e-25, % identity: 27.44)
Query:  IVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAVDEFIFERISTATSAK---AAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDH----
        I E    + E  S+L N  ++E     KK ++ L  + + V +   E  + A +A+   AA   +R   + + ++         IP LV+M D +     
Subjt:  IVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAVDEFIFERISTATSAK---AAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDH----

Query:  ---FAVKALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEFPELL--------LSLSCLANTQLFLASTEPIISYLLIILNNPE-SNSKSKTCCLATIF
            ++ AL+ L     +NK  +++ G++ K+ K   SS    + +        L LS L + +  + S+  II  +  + N  E S+S+++   L  ++
Subjt:  ---FAVKALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEFPELL--------LSLSCLANTQLFLASTEPIISYLLIILNNPE-SNSKSKTCCLATIF

Query:  NISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASI
        N+S   +N   ++   +IP LL      E SE+ L  L N+    +G++A+    +   IL+++L W +  KCQE + YI+M++AH+ +  R  + +A  
Subjt:  NISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEILIEILTWEEKPKCQELSAYIIMIIAHQSWAQRERLTKASI

Query:  IVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEK----EIEKGKRVMRSLVKQSLYKNMEIITRRAN
        I  +LL L L+GSPLAQ RA ++L+ L+           G QV   I    S   E+     +   ++ ++ LV+QSL  NM+ I +RAN
Subjt:  IVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEK----EIEKGKRVMRSLVKQSLYKNMEIITRRAN

AT2G27430.1 ARM repeat superfamily protein (e value: 5.1e-70, % identity: 50.15)
Query:  IPALVAMADSD-----HFAVKALIQLANHTFLNKTLMLEGGILTKLPKK-----DSSSHEFPELLLSLSCLANTQLFLASTEPIISYLLIILNNPESNSK
        I  LV+M  SD       AV ALIQL++ T+ NK LM+   I +KLPK       S+ H F ELLLSLS L NTQL +AS++ I+ +L+  +N+  ++ K
Subjt:  IPALVAMADSD-----HFAVKALIQLANHTFLNKTLMLEGGILTKLPKK-----DSSSHEFPELLLSLSCLANTQLFLASTEPIISYLLIILNNPESNSK

Query:  SKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEILIEILTWEEKPKCQELSAYIIMIIAHQSWA
        +K  CLATI N+  +LEN   L+ N  + TLL     K+ SEKAL +L  L VT  GK+A+E     S+ LIEILTWE+ PKCQE +AYI+M++AHQSW+
Subjt:  SKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEILIEILTWEEKPKCQELSAYIIMIIAHQSWA

Query:  QRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEKEIEKGKRVMRSLVKQSLYKNMEIITRRANGGEC
        QRE++ KA  IVP LL ++LLGSPL Q RA+KLLQW KDER   +  HSGPQ G     +GS  S +  E+G+++M++LVKQSLYKNME+ITRR   G  
Subjt:  QRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEKEIEKGKRVMRSLVKQSLYKNMEIITRRANGGEC

Query:  SSSSSSIR-RTLVSSISSKSLPF
           S S R ++L+ S SSKSL +
Subjt:  SSSSSSIR-RTLVSSISSKSLPF

AT2G27430.1 ARM repeat superfamily protein (e value: 1.4e-03, % identity: 31.25)
Query:  ELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEIEHSIVDDDDQLA-----ASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKKESGNSK
        +L F + IR  L  KSK+S ++    + P     + + E      ++ ++      +   VLQ+TVK +HFG  +EKE+AA EIE+L +++    K
Subjt:  ELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEIEHSIVDDDDQLA-----ASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKKESGNSK

AT4G31890.1 ARM repeat superfamily protein (e value: 9.4e-24, % identity: 29.34)
Query:  IPALVAMADSDHF------AVKALIQLANHTFLNKTLMLEGGILTKLPK----KDSSSHEFPELL----LSLSCLANTQLFLASTEPIISYLLIILNNPE
        IP LV+M D          ++ AL+ L      NK  +++ G + K+ K     ++   E  E +    L LS L + +  + S+  II  +  + N  E
Subjt:  IPALVAMADSDHF------AVKALIQLANHTFLNKTLMLEGGILTKLPK----KDSSSHEFPELL----LSLSCLANTQLFLASTEPIISYLLIILNNPE

Query:  -SNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEILIEILTWEEKPKCQELSAYIIMIIA
         S+S+++   L  ++N+S    N   ++   +I  LL      E SE+ L  L+NL    +G++A+        +L+++L W + P CQE + YI+M++A
Subjt:  -SNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEILIEILTWEEKPKCQELSAYIIMIIA

Query:  HQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDER-------RASVTAHSGPQVGDGIVEVGSGFSEKEIEKGKRVMRSLVKQSLYKNM
        H+ +  R+ + +A  I  ALL L LLGS LAQ RA ++L+ L+ ++         S  A S P  G     +    ++  + + ++ ++ LV+QSL  NM
Subjt:  HQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDER-------RASVTAHSGPQVGDGIVEVGSGFSEKEIEKGKRVMRSLVKQSLYKNM

Query:  EIITRRANGGECSSSSSSIRRTLVSSISSKSLPF
        + I +RAN  +    S   +   +SS +SKSLPF
Subjt:  EIITRRANGGECSSSSSSIRRTLVSSISSKSLPF

AT4G31890.2 ARM repeat superfamily protein (e value: 9.4e-24, % identity: 29.34)
Query:  IPALVAMADSDHF------AVKALIQLANHTFLNKTLMLEGGILTKLPK----KDSSSHEFPELL----LSLSCLANTQLFLASTEPIISYLLIILNNPE
        IP LV+M D          ++ AL+ L      NK  +++ G + K+ K     ++   E  E +    L LS L + +  + S+  II  +  + N  E
Subjt:  IPALVAMADSDHF------AVKALIQLANHTFLNKTLMLEGGILTKLPK----KDSSSHEFPELL----LSLSCLANTQLFLASTEPIISYLLIILNNPE

Query:  -SNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEILIEILTWEEKPKCQELSAYIIMIIA
         S+S+++   L  ++N+S    N   ++   +I  LL      E SE+ L  L+NL    +G++A+        +L+++L W + P CQE + YI+M++A
Subjt:  -SNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEILIEILTWEEKPKCQELSAYIIMIIA

Query:  HQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDER-------RASVTAHSGPQVGDGIVEVGSGFSEKEIEKGKRVMRSLVKQSLYKNM
        H+ +  R+ + +A  I  ALL L LLGS LAQ RA ++L+ L+ ++         S  A S P  G     +    ++  + + ++ ++ LV+QSL  NM
Subjt:  HQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDER-------RASVTAHSGPQVGDGIVEVGSGFSEKEIEKGKRVMRSLVKQSLYKNM

Query:  EIITRRANGGECSSSSSSIRRTLVSSISSKSLPF
        + I +RAN  +    S   +   +SS +SKSLPF
Subjt:  EIITRRANGGECSSSSSSIRRTLVSSISSKSLPF


Sequences
CDS sequence
ATGCCTCCTCCTTCTCTCTCCTTTTATTACTCCTACATGCACGAATTACGCTTCCTCAGTCTCATCCGCCGATTCCTTCGCTCCAAATCCAAATCATCTCGTAAG
CGATTACGTTTCCCGTCTCATCCATCATCGGATATTCTATTTCCCGAAATTGAACACAGTATTGTTGATGACGATGATCAGCTGGCCGCCTCCTCCTCCTCGGTG
TTGCAGAGGACAGTGAAGAGCCTTCACTTCGGCGACGGAGATGAGAAAGAGAGAGCCGCGAAGGAAATTGAGAGGTTGATTAAAAAAGAGAGCGGTAACTCGAAG
GGTCTATTACGTTTCCAACAAGTGGTATCAGAGCTCCGGTTAGATATGGCTTCGAATGGTAACATGTTGCAACCCCAACTTCCAAGGTTCAGCGGAAAGAATTTT
AATCAATGGAGTATTCAAATGAAAGTGTTATATGGCTCTCAAGAATTGTGGGATATTGTTGAAAGAGGATACACTGAAGTTGAGAATCAGAGTGAGCTCACAAAT
CAACAACTTGTTGAGTTAAGAGAAAATCGTAAGAAAGACAAAAAGGCTTTATTCTTCATTTATCAAGCTGTTGATGAATTTATTTTCGAGAGAATTTCAACAGCT
ACTTCTGCAAAGGCGGCTTGGGATATTCTAAGATCTACCTATCAAGGAGAAGATAAGAGTGTTCCAGGGTCTATTACGTTTCCAACAATTCCTGCTTTGGTCGCC
ATGGCGGACTCCGATCACTTCGCGGTGAAGGCCTTGATTCAACTTGCTAATCATACTTTCTTGAACAAAACACTAATGTTGGAGGGAGGGATTTTAACAAAGCTA
CCAAAGAAGGACTCATCCAGCCATGAGTTTCCAGAGCTTTTACTTTCACTTTCTTGTCTAGCAAACACCCAATTATTTCTTGCTTCAACAGAACCAATAATTTCA
TATCTCTTAATCATACTCAACAATCCAGAATCTAACTCCAAATCCAAAACATGCTGTTTAGCAACTATATTCAACATTTCCACCATCCTAGAAAATACAGAAACC
TTAATCTCCAATAGTGTAATTCCAACACTACTCAAATTCTCCATCATCAAAGAATTCTCAGAGAAAGCCCTACCAACATTAGCAAACTTGGCAGTGACTTCAAAA
GGAAAACAAGCTCTAGAAACCAACTCAAAATTCTCCGAGATTTTGATAGAGATTTTGACATGGGAAGAGAAACCCAAATGCCAAGAACTCTCAGCTTATATCATC
ATGATTATAGCACATCAAAGCTGGGCTCAAAGAGAGAGATTGACTAAGGCCAGCATTATTGTCCCTGCACTACTCGGATTGGCTCTGTTAGGAAGTCCATTAGCT
CAAAACAGAGCATTGAAACTGCTGCAATGGTTAAAAGATGAGCGACGAGCGAGCGTGACGGCGCATTCTGGACCTCAGGTGGGTGATGGGATAGTTGAAGTAGGC
TCAGGATTCAGTGAGAAGGAGATTGAGAAAGGGAAGAGGGTGATGAGAAGCTTGGTGAAGCAGAGTTTGTATAAGAATATGGAGATAATAACTAGAAGAGCTAAT
GGTGGGGAATGTTCAAGTTCAAGTTCAAGTATTAGGAGGACTTTGGTTTCCAGTATCAGTTCTAAGAGTTTGCCTTTTTGA
mRNA sequence
GTGAAGAACAAATGGAAATCAAACTGACAGCTCTCTCTATCTGCTATTCACTCTCCTCTCACCATTCTTCCTTCTCCCTTCAAAACACTTTCTCTTCCAAATTCA
GATTCCCAAAACCAATTTCCCTCACTCTCATCTGATCCTTTTCAATCATGCCTCCTCCTTCTCTCTCCTTTTATTACTCCTACATGCACGAATTACGCTTCCTCA
GTCTCATCCGCCGATTCCTTCGCTCCAAATCCAAATCATCTCGTAAGCGATTACGTTTCCCGTCTCATCCATCATCGGATATTCTATTTCCCGAAATTGAACACA
GTATTGTTGATGACGATGATCAGCTGGCCGCCTCCTCCTCCTCGGTGTTGCAGAGGACAGTGAAGAGCCTTCACTTCGGCGACGGAGATGAGAAAGAGAGAGCCG
CGAAGGAAATTGAGAGGTTGATTAAAAAAGAGAGCGGTAACTCGAAGGGTCTATTACGTTTCCAACAAGTGGTATCAGAGCTCCGGTTAGATATGGCTTCGAATG
GTAACATGTTGCAACCCCAACTTCCAAGGTTCAGCGGAAAGAATTTTAATCAATGGAGTATTCAAATGAAAGTGTTATATGGCTCTCAAGAATTGTGGGATATTG
TTGAAAGAGGATACACTGAAGTTGAGAATCAGAGTGAGCTCACAAATCAACAACTTGTTGAGTTAAGAGAAAATCGTAAGAAAGACAAAAAGGCTTTATTCTTCA
TTTATCAAGCTGTTGATGAATTTATTTTCGAGAGAATTTCAACAGCTACTTCTGCAAAGGCGGCTTGGGATATTCTAAGATCTACCTATCAAGGAGAAGATAAGA
GTGTTCCAGGGTCTATTACGTTTCCAACAATTCCTGCTTTGGTCGCCATGGCGGACTCCGATCACTTCGCGGTGAAGGCCTTGATTCAACTTGCTAATCATACTT
TCTTGAACAAAACACTAATGTTGGAGGGAGGGATTTTAACAAAGCTACCAAAGAAGGACTCATCCAGCCATGAGTTTCCAGAGCTTTTACTTTCACTTTCTTGTC
TAGCAAACACCCAATTATTTCTTGCTTCAACAGAACCAATAATTTCATATCTCTTAATCATACTCAACAATCCAGAATCTAACTCCAAATCCAAAACATGCTGTT
TAGCAACTATATTCAACATTTCCACCATCCTAGAAAATACAGAAACCTTAATCTCCAATAGTGTAATTCCAACACTACTCAAATTCTCCATCATCAAAGAATTCT
CAGAGAAAGCCCTACCAACATTAGCAAACTTGGCAGTGACTTCAAAAGGAAAACAAGCTCTAGAAACCAACTCAAAATTCTCCGAGATTTTGATAGAGATTTTGA
CATGGGAAGAGAAACCCAAATGCCAAGAACTCTCAGCTTATATCATCATGATTATAGCACATCAAAGCTGGGCTCAAAGAGAGAGATTGACTAAGGCCAGCATTA
TTGTCCCTGCACTACTCGGATTGGCTCTGTTAGGAAGTCCATTAGCTCAAAACAGAGCATTGAAACTGCTGCAATGGTTAAAAGATGAGCGACGAGCGAGCGTGA
CGGCGCATTCTGGACCTCAGGTGGGTGATGGGATAGTTGAAGTAGGCTCAGGATTCAGTGAGAAGGAGATTGAGAAAGGGAAGAGGGTGATGAGAAGCTTGGTGA
AGCAGAGTTTGTATAAGAATATGGAGATAATAACTAGAAGAGCTAATGGTGGGGAATGTTCAAGTTCAAGTTCAAGTATTAGGAGGACTTTGGTTTCCAGTATCA
GTTCTAAGAGTTTGCCTTTTTGAGAGATTTTAATCCAAATCACATTTATGAACTTTCATCATCTGCCTTTTTGTATTGTATTGTTTTGTTATCTTATGCAGATTA
ACCAATGCCAAATGAAATCTAGAGAATGGATTTGGTGTGGATGAATACATAATATTATCTCTAATATCATATCAAACGGATTATCCAAAGTATTAGCAACATTTA
CACTTTGCTGTGATACTAATTTTTACGGTCATACTTCTCCAAACCACAATATCATATTTTTTTTCTCTTTGATACCATATTATATCT
Protein sequence
MPPPSLSFYYSYMHELRFLSLIRRFLRSKSKSSRKRLRFPSHPSSDILFPEIEHSIVDDDDQLAASSSSVLQRTVKSLHFGDGDEKERAAKEIERLIKKESGNSK
GLLRFQQVVSELRLDMASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAVDEFIFERISTA
TSAKAAWDILRSTYQGEDKSVPGSITFPTIPALVAMADSDHFAVKALIQLANHTFLNKTLMLEGGILTKLPKKDSSSHEFPELLLSLSCLANTQLFLASTEPIIS
YLLIILNNPESNSKSKTCCLATIFNISTILENTETLISNSVIPTLLKFSIIKEFSEKALPTLANLAVTSKGKQALETNSKFSEILIEILTWEEKPKCQELSAYII
MIIAHQSWAQRERLTKASIIVPALLGLALLGSPLAQNRALKLLQWLKDERRASVTAHSGPQVGDGIVEVGSGFSEKEIEKGKRVMRSLVKQSLYKNMEIITRRAN
GGECSSSSSSIRRTLVSSISSKSLPF
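
The CDS and protein sequences above should correspond codon by codon. As a hedged illustration, the sketch below translates short stretches copied from the start and end of the CDS with the standard genetic code. The `translate` helper is hypothetical and written for this example, and it assumes the standard nuclear code applies, which the page does not state explicitly.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 21 and last 15 nucleotides of the CDS listed above.
cds_start = translate("ATGCCTCCTCCTTCTCTCTCC")
cds_end = translate("AGTTTGCCTTTTTGA")
print(cds_start, cds_end)
```

The two fragments translate to MPPPSLS and SLPF followed by a stop, matching the first and last residues of the protein sequence. Passing the full CDS (with line breaks removed) to the same function would allow a complete comparison.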