; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg037502 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg037502
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionSANT domain-containing protein
Genome locationscaffold11:30935024..30958619
RNA-Seq ExpressionSpg037502
SyntenySpg037502
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR022228 - Protein of unknown function DUF3755


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152146.1 uncharacterized protein LOC101222201 isoform X2 [Cucumis sativus]2.3e-0894.44Show/hide
Query:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS
        LNEMPEVMKQMPPLPVKVNEELANTILPPT+HS+QS
Subjt:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS

XP_004152146.1 uncharacterized protein LOC101222201 isoform X2 [Cucumis sativus]2.5e-7654.68Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGI+ DWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  -------------------------ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVS
                                 ERVSD SMKSAQVA RPNVPPYGMPMIPMDNDDGVSYK                                     
Subjt:  -------------------------ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVS

Query:  HQASLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQI
                                                                                         IGGTTGELLEQNAHAMNQI
Subjt:  HQASLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQI

Query:  SSNLASFQIQDNISLFCQTRDNILKIMNEVS
        SSNLASFQIQDNISLFCQTRDNILKIMN+++
Subjt:  SSNLASFQIQDNISLFCQTRDNILKIMNEVS

XP_011653023.1 uncharacterized protein LOC101222201 isoform X1 [Cucumis sativus]2.3e-0894.44Show/hide
Query:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS
        LNEMPEVMKQMPPLPVKVNEELANTILPPT+HS+QS
Subjt:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS

XP_011653023.1 uncharacterized protein LOC101222201 isoform X1 [Cucumis sativus]2.8e-7554.27Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAAD+SSSALAMKHNPGI+ DWTSDEQ+TLEEGLKKYA ESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ----------------------ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQA
                              ERVSDPSMKSAQVATRPNV PYGMPMIPMDNDDGVSYK                                        
Subjt:  ----------------------ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQA

Query:  SLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSN
                                                                                      IGGTTGELLEQNAHAMNQISSN
Subjt:  SLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSN

Query:  LASFQIQDNISLFCQTRDNILKIMNEVS
        LASFQIQDNISLFCQTRDNILKIM++++
Subjt:  LASFQIQDNISLFCQTRDNILKIMNEVS

XP_016901549.1 PREDICTED: uncharacterized protein LOC103494613 isoform X2 [Cucumis melo]1.7e-8059.15Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGI+ DWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQASLKDAVLGLPPVFSKRKAHLKA
        ERVSD SMKSAQVA RPNVPPYGMPMIPMDNDDGVSYK                                                              
Subjt:  ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQASLKDAVLGLPPVFSKRKAHLKA

Query:  TSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSNLASFQIQDNISLFCQTRDNILK
                                                                IGGTTGELLEQNAHAMNQISSNLASFQIQDNISLFCQTRDNILK
Subjt:  TSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSNLASFQIQDNISLFCQTRDNILK

Query:  IMNEVS
        IMN+++
Subjt:  IMNEVS

XP_016901549.1 PREDICTED: uncharacterized protein LOC103494613 isoform X2 [Cucumis melo]2.3e-0894.44Show/hide
Query:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS
        LNEMPEVMKQMPPLPVKVNEELANTILPPT+HS+QS
Subjt:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS

XP_016901549.1 PREDICTED: uncharacterized protein LOC103494613 isoform X2 [Cucumis melo]1.1e-7655.18Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGI+ DWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ----------------------ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQA
                              ERVSD SMKSAQVA RPNVPPYGMPMIPMDNDDGVSYK                                        
Subjt:  ----------------------ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQA

Query:  SLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSN
                                                                                      IGGTTGELLEQNAHAMNQISSN
Subjt:  SLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSN

Query:  LASFQIQDNISLFCQTRDNILKIMNEVS
        LASFQIQDNISLFCQTRDNILKIMN+++
Subjt:  LASFQIQDNISLFCQTRDNILKIMNEVS

XP_022145845.1 uncharacterized protein LOC111015203 [Momordica charantia]3.0e-0894.44Show/hide
Query:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS
        LNEMPEVMKQMPPLPVKVN+ELANTILPPT HSMQS
Subjt:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS

XP_022932738.1 uncharacterized protein LOC111439197 [Cucurbita moschata]3.0e-0894.44Show/hide
Query:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS
        LNEMPEVMKQMPPLPVKVN+ELANTILPPT HSMQS
Subjt:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS

XP_022932738.1 uncharacterized protein LOC111439197 [Cucurbita moschata]3.7e-7554.27Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHQEAGQPSSSFDGGNPSNGNSTPV  ADNSSSALAMKHNPGI+ DWTSDEQVTLEEGLKKYA ESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ----------------------ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQA
                              ERVSDPSMKSAQVA RPNVPPYGMPMIPMDNDDGVSYK                                        
Subjt:  ----------------------ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQA

Query:  SLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSN
                                                                                      IGGTTGELLEQNAHAMNQISSN
Subjt:  SLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSN

Query:  LASFQIQDNISLFCQTRDNILKIMNEVS
        LASFQIQ+NISLFCQTRDNILKIMN+++
Subjt:  LASFQIQDNISLFCQTRDNILKIMNEVS

TrEMBL top hitse value%identityAlignment
A0A0A0KX84 SANT domain-containing protein1.1e-0894.44Show/hide
Query:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS
        LNEMPEVMKQMPPLPVKVNEELANTILPPT+HS+QS
Subjt:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS

A0A0A0KX84 SANT domain-containing protein5.5e-7755.18Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGI+ DWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ----------------------ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQA
                              ERVSD SMKSAQVA RPNVPPYGMPMIPMDNDDGVSYK                                        
Subjt:  ----------------------ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQA

Query:  SLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSN
                                                                                      IGGTTGELLEQNAHAMNQISSN
Subjt:  SLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSN

Query:  LASFQIQDNISLFCQTRDNILKIMNEVS
        LASFQIQDNISLFCQTRDNILKIMN+++
Subjt:  LASFQIQDNISLFCQTRDNILKIMNEVS

A0A1S3BYL7 uncharacterized protein LOC103494613 isoform X11.1e-0894.44Show/hide
Query:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS
        LNEMPEVMKQMPPLPVKVNEELANTILPPT+HS+QS
Subjt:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS

A0A1S3BYL7 uncharacterized protein LOC103494613 isoform X11.4e-7554.27Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAAD+SSSALAMKHNPGI+ DWTSDEQ+TLEEGLKKYA ESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ----------------------ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQA
                              ERVSDPSMKSAQVATRPNV PYGMPMIPMDNDDGVSYK                                        
Subjt:  ----------------------ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQA

Query:  SLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSN
                                                                                      IGGTTGELLEQNAHAMNQISSN
Subjt:  SLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSN

Query:  LASFQIQDNISLFCQTRDNILKIMNEVS
        LASFQIQDNISLFCQTRDNILKIM++++
Subjt:  LASFQIQDNISLFCQTRDNILKIMNEVS

A0A1S4E017 uncharacterized protein LOC103494613 isoform X28.3e-8159.15Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGI+ DWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQASLKDAVLGLPPVFSKRKAHLKA
        ERVSD SMKSAQVA RPNVPPYGMPMIPMDNDDGVSYK                                                              
Subjt:  ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQASLKDAVLGLPPVFSKRKAHLKA

Query:  TSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSNLASFQIQDNISLFCQTRDNILK
                                                                IGGTTGELLEQNAHAMNQISSNLASFQIQDNISLFCQTRDNILK
Subjt:  TSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSNLASFQIQDNISLFCQTRDNILK

Query:  IMNEVS
        IMN+++
Subjt:  IMNEVS

A0A1S4E017 uncharacterized protein LOC103494613 isoform X21.1e-0894.44Show/hide
Query:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS
        LNEMPEVMKQMPPLPVKVNEELANTILPPT+HS+QS
Subjt:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS

A0A1S4E017 uncharacterized protein LOC103494613 isoform X25.5e-7755.18Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGI+ DWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ----------------------ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQA
                              ERVSD SMKSAQVA RPNVPPYGMPMIPMDNDDGVSYK                                        
Subjt:  ----------------------ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQA

Query:  SLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSN
                                                                                      IGGTTGELLEQNAHAMNQISSN
Subjt:  SLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSN

Query:  LASFQIQDNISLFCQTRDNILKIMNEVS
        LASFQIQDNISLFCQTRDNILKIMN+++
Subjt:  LASFQIQDNISLFCQTRDNILKIMNEVS

A0A6J1CVN7 uncharacterized protein LOC1110152031.4e-0894.44Show/hide
Query:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS
        LNEMPEVMKQMPPLPVKVN+ELANTILPPT HSMQS
Subjt:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS

A0A6J1F2K8 uncharacterized protein LOC1114391971.4e-0894.44Show/hide
Query:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS
        LNEMPEVMKQMPPLPVKVN+ELANTILPPT HSMQS
Subjt:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS

A0A6J1F2K8 uncharacterized protein LOC1114391971.8e-7554.27Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHQEAGQPSSSFDGGNPSNGNSTPV  ADNSSSALAMKHNPGI+ DWTSDEQVTLEEGLKKYA ESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ----------------------ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQA
                              ERVSDPSMKSAQVA RPNVPPYGMPMIPMDNDDGVSYK                                        
Subjt:  ----------------------ERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGHVSHQA

Query:  SLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSN
                                                                                      IGGTTGELLEQNAHAMNQISSN
Subjt:  SLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSN

Query:  LASFQIQDNISLFCQTRDNILKIMNEVS
        LASFQIQ+NISLFCQTRDNILKIMN+++
Subjt:  LASFQIQDNISLFCQTRDNILKIMNEVS

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657503.5e-1232.54Show/hide
Query:  SFFWEGNESGKINHLVRWRVVSRAQSDGGLGIGALKHRNVALVTKWGWRFIHEPDSFWRKII-ASIHGSTTYNWHTAGRKGCCLRSPWVSISNVWKQVDS
        +F W      K  HLV+W  V   + +GGLG+ A K  N AL++K GWR + E +S W  ++    H     +      KG    S W SI+   + V S
Subjt:  SFFWEGNESGKINHLVRWRVVSRAQSDGGLGIGALKHRNVALVTKWGWRFIHEPDSFWRKII-ASIHGSTTYNWHTAGRKGCCLRSPWVSISNVWKQVDS

Query:  FALFKL-GNGRRLSFWQDSWLNDLPL
          +  + G+G+++ FW D W++  PL
Subjt:  FALFKL-GNGRRLSFWQDSWLNDLPL

P93295 Uncharacterized mitochondrial protein AtMg003101.6e-0426.61Show/hide
Query:  FFWEGNESGKINHLVRWRVVSRA-QSDGGLGIGALKHRNVALVTKWGWRFIHEPDSFWRKIIASIHGSTTYNWHTAGRKGCCLRSP---WVSISNVWKQV
        F+W   E+ +    V W+ + ++ + DGGLG   L   N AL+ K  +R IH+P +   +++ S      Y  H++  +      P   W SI +  + +
Subjt:  FFWEGNESGKINHLVRWRVVSRA-QSDGGLGIGALKHRNVALVTKWGWRFIHEPDSFWRKIIASIHGSTTYNWHTAGRKGCCLRSP---WVSISNVWKQV

Query:  DSFALFKLGNGRRLSFWQDSWLND
            L  +G+G     W D W+ D
Subjt:  DSFALFKLGNGRRLSFWQDSWLND

Arabidopsis top hitse value%identityAlignment
AT1G10820.2 Protein of unknown function (DUF3755)2.1e-1248.81Show/hide
Query:  GGNPSNGNSTPVPAADNSSS-ALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNER
        G N  N +S   PA D S S A  +K    + MDW+ +EQ  LE GL K   E  + +Y KIA  LP+KTVRDVALRCRWM  +
Subjt:  GGNPSNGNSTPVPAADNSSS-ALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNER

AT1G10820.2 Protein of unknown function (DUF3755)5.6e-0553.49Show/hide
Query:  ELLEQNAHAMNQISSNLASFQIQDNISLFCQTRDNILKIMNEV
        +LL+QNA A +QIS NL++ ++QDNISLF Q R+NI  I+ ++
Subjt:  ELLEQNAHAMNQISSNLASFQIQDNISLFCQTRDNILKIMNEV

AT3G07565.1 Protein of unknown function (DUF3755)7.7e-3129.38Show/hide
Query:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD
        ANPSGN+QE    +      + +  N   V           AADNS +  A++HNPGI+ DWT +EQ  LE+ L KYA E SV RYAKIAM++ +KTVRD
Subjt:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD

Query:  VALRCRWM---------------------NERVSDPSMK-SAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEV
        VALRCRWM                      E+ +D S K S+ +   PN P Y  PM+P+D DDG+SYK                               
Subjt:  VALRCRWM---------------------NERVSDPSMK-SAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEV

Query:  GLGHVSHQASLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNA
                                                                                               IGG +G+LLEQNA
Subjt:  GLGHVSHQASLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNA

Query:  HAMNQISSNLASFQIQDNISLFCQTRDNILKIMNEVS
           NQ+S+N ++FQ+ +N+++ C+ RDNIL I+N+++
Subjt:  HAMNQISSNLASFQIQDNISLFCQTRDNILKIMNEVS

AT3G07565.1 Protein of unknown function (DUF3755)3.2e-0875Show/hide
Query:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS
        LN+MPEVMKQMPPLPVK+NEELAN+ILP  +H  +S
Subjt:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS

AT3G07565.2 Protein of unknown function (DUF3755)8.6e-3042.71Show/hide
Query:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD
        ANPSGN+QE    +      + +  N   V           AADNS +  A++HNPGI+ DWT +EQ  LE+ L KYA E SV RYAKIAM++ +KTVRD
Subjt:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD

Query:  VALRCRWM---------------------NERVSDPSMK-SAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKAL
        VALRCRWM                      E+ +D S K S+ +   PN P Y  PM+P+D DDG+SYKG    L       F LK KS  L
Subjt:  VALRCRWM---------------------NERVSDPSMK-SAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKAL

AT3G07565.3 Protein of unknown function (DUF3755)1.0e-3029.29Show/hide
Query:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD
        ANPSGN+QE    +      + +  N   V           AADNS +  A++HNPGI+ DWT +EQ  LE+ L KYA E SV RYAKIAM++ +KTVRD
Subjt:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD

Query:  VALRCRWM----------------------NERVSDPSMK-SAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGE
        VALRCRWM                       E+ +D S K S+ +   PN P Y  PM+P+D DDG+SYK                              
Subjt:  VALRCRWM----------------------NERVSDPSMK-SAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGE

Query:  VGLGHVSHQASLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQN
                                                                                                IGG +G+LLEQN
Subjt:  VGLGHVSHQASLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQN

Query:  AHAMNQISSNLASFQIQDNISLFCQTRDNILKIMNEVS
        A   NQ+S+N ++FQ+ +N+++ C+ RDNIL I+N+++
Subjt:  AHAMNQISSNLASFQIQDNISLFCQTRDNILKIMNEVS

AT3G07565.3 Protein of unknown function (DUF3755)3.2e-0875Show/hide
Query:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS
        LN+MPEVMKQMPPLPVK+NEELAN+ILP  +H  +S
Subjt:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS

AT3G07565.4 Protein of unknown function (DUF3755)1.6e-2828.53Show/hide
Query:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD
        ANPSGN+QE    +      + +  N   V           AADNS +  A++HNPGI+ DWT +EQ  LE+ L KYA E SV RYAKIAM++ +KTVRD
Subjt:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PAADNSSSALAMKHNPGIAMDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD

Query:  VALRCRWM----------------------NERVSDPSMK-SAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGE
        VALRCRWM                       E+ +D S K S+ +   PN P Y  PM+P+D DDG+SYK                              
Subjt:  VALRCRWM----------------------NERVSDPSMK-SAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGE

Query:  VGLGHVSHQASLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQN
                                                                                                IGG +G+LLEQN
Subjt:  VGLGHVSHQASLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQN

Query:  AHAMNQISSNLASFQI---------QDNISLFCQTRDNILKIMNEVS
        A   NQ+S+N ++FQ+          +N+++ C+ RDNIL I+N+++
Subjt:  AHAMNQISSNLASFQI---------QDNISLFCQTRDNILKIMNEVS

AT3G07565.4 Protein of unknown function (DUF3755)3.2e-0875Show/hide
Query:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS
        LN+MPEVMKQMPPLPVK+NEELAN+ILP  +H  +S
Subjt:  LNEMPEVMKQMPPLPVKVNEELANTILPPTTHSMQS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGGACATCGGTCTCGTTTCTCTCTCTTTTCTGCGAATTCGTATAGTTTATTTTGTCTACCCGAGCTCCCTTCAGAGGTGGGTCTTGATTGGATTTGGATTTGGGA
TTGGAAAACGTTTGAATTGTTGCGAGTTTTGATGGCTAACCCATCTGGGAACCATCAAGAAGCTGGCCAACCATCGTCTTCCTTCGATGGAGGGAACCCCAGCAACGGTA
ATTCGACCCCTGTGCCTGCAGCGGATAATTCCAGCTCGGCTCTTGCGATGAAGCACAACCCAGGTATCGCAATGGATTGGACATCTGATGAGCAGGTCACACTGGAAGAA
GGGCTTAAGAAATATGCCGCAGAGTCTAGTGTTATTCGTTATGCAAAGATTGCAATGCAGCTACCAAATAAGACTGTACGAGATGTTGCCTTGCGTTGCAGATGGATGAA
TGAAAGAGTATCTGACCCTTCAATGAAGTCAGCACAGGTTGCAACTAGGCCTAATGTGCCTCCGTATGGGATGCCTATGATTCCCATGGACAACGATGATGGTGTCTCAT
ATAAAGGTTTTGGTTCTACACTCTTCCTTCACACCACTGCGTGTTTCGGGCTAAAAAAGAAGTCAAAGGCTTTGGCTTCGGCTTCGGATGGAGAAGTTGGATTAGGGCAT
GTATCTCATCAAGCAAGCCTAAAGGATGCTGTTTTAGGATTACCACCAGTGTTTTCAAAGCGCAAGGCGCACCTCAAGGCGACAAGCCCCTTGATCGCCTCAAGGCGAGA
GGCGACAAAAAGGCGCAATTGGGCGAGCGCCTGTTTGAAGACGCTCGCCCCAACTACCTACGAGGCGCAGTACATGCTGGGCGCCTCGCCTCTCGCCATAGGCGCTCGCC
CGACGTCGCCTCAATTATATGAAGGAACTATTGGTGGTACAACTGGAGAGCTTCTTGAACAGAACGCACATGCCATGAATCAAATTTCCTCTAATCTTGCATCTTTTCAG
ATACAAGATAATATCAGTCTCTTCTGCCAAACTCGGGATAACATCCTCAAAATAATGAACGAGGTGAGTTTTGAAGTAGAAGAAGAAGGTCATTGGAATGAAATAAGGCC
TATGGCTCTTTGGCGAAAGGTGATGGATGGAAAATATTTGGTTCATGAGAACATGTGGCTTTCTGCCAACATGAGTCTAGCTCAATTGGTCAAAGCATCTAATACCATCC
TGAAGGGAGATGGAAGACAGGGCCTCGCTTGTGAACATTTCAATCATGAGATTCCGATCTCAGTGATCAAAGGGTGTGGAGGATTTGTAAGAATGGGCTCTTCATGGTTA
GCCACAGTAATTCAAAGAGTGGAGATGGACACTCTCTTTGGGGAGTCAGATCGCCATGGTTGTTCGTCGGAAGCAGCTGTGCGTGATCCACACGCACCATTGGATATTTC
TGCAGTCGCTGCCGCACGCCAACGCGTGAGGGCATGTGGGAGGGTCTTTTTCCTTCCATCAGTTGGGTTAGGTGCATTTATGGCTGGAATTTTCTTCGGTTGTCGTGATT
TGCTGTATCTTGTGCATCAAGAGTATTTCAGACTTGTTTCTATGTCCATCCATGGTGCATCTAAGAAACCATTTGTTAATCCTACTATTGAATCTTTCTTTTGGGAAGGA
AATGAAAGTGGAAAGATAAATCATTTAGTGAGATGGCGTGTGGTTTCAAGAGCCCAATCTGATGGAGGCCTCGGTATTGGGGCTTTGAAGCATAGAAATGTGGCGCTCGT
TACCAAGTGGGGTTGGAGATTTATTCATGAACCAGATTCTTTTTGGCGCAAAATAATTGCTAGTATTCATGGCTCCACAACCTATAATTGGCATACAGCTGGTAGAAAAG
GATGTTGTCTTCGTAGTCCATGGGTGAGTATATCTAATGTATGGAAGCAGGTGGACTCTTTTGCCCTCTTTAAATTGGGCAATGGTCGAAGATTGTCATTTTGGCAAGAT
TCCTGGCTTAATGACTTGCCTCTAAAGCTCCGTTTTCCTCTTCTATTTCGTATTTCTTCAAACCCTGATGGTTCGGTTTTTGACCATTGGGATGCCTATACTTCGTCATG
GAGCATATTTTTTTGTTGTTCGTTAAAGGAAGTGGAAATTTTTGAGTTGCCGCAGACTTTGGGTCTTATTTCTTTGGTTCAGATAGGTTCTTCAGAAGACTCTAGAAGAT
GGTCCCTTGATTCCTCGGGTGTTTTCTCAGTCAAGTCTTTATCACGTTACTTGGCTTCAGCTTCTCCTATGGATAATGAGTGGGTTTTCTCCAATGACTTCAAAGGCAAC
GTTCTTCAATTACTTATTGGTCCATATTCAGCTTCAAAGTTCAAAGTTTTGTGGATCAATGCAGTCAAGCTATTCGTTCTGAATTGTGGTTCAAAAGAAACCAAAGGATC
TTTCATAATAAACATCCCCCTTGGTTTGATCGGTTTGATTCAGCTCGCCTTAAGGCTTCATCCTGGTGCTCTCTTTCCAAATCTTTTGTTGATTTCCCTTTACAGGATAT
ATTTCTTAACTGGAATGCTTTTATTTCTCCGATCACAGGGCGAACGCCTGCCAATAGTTGTGAAGACCTCTACTGCTAATTGCTGTGATTTATCTTTCTGCTTGAATGAA
ATGCCCGAAGTAATGAAGCAGATGCCGCCTCTTCCGGTGAAGGTGAACGAAGAGTTAGCGAACACAATCCTTCCGCCGACCACTCACTCTATGCAATCATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGGACATCGGTCTCGTTTCTCTCTCTTTTCTGCGAATTCGTATAGTTTATTTTGTCTACCCGAGCTCCCTTCAGAGGTGGGTCTTGATTGGATTTGGATTTGGGA
TTGGAAAACGTTTGAATTGTTGCGAGTTTTGATGGCTAACCCATCTGGGAACCATCAAGAAGCTGGCCAACCATCGTCTTCCTTCGATGGAGGGAACCCCAGCAACGGTA
ATTCGACCCCTGTGCCTGCAGCGGATAATTCCAGCTCGGCTCTTGCGATGAAGCACAACCCAGGTATCGCAATGGATTGGACATCTGATGAGCAGGTCACACTGGAAGAA
GGGCTTAAGAAATATGCCGCAGAGTCTAGTGTTATTCGTTATGCAAAGATTGCAATGCAGCTACCAAATAAGACTGTACGAGATGTTGCCTTGCGTTGCAGATGGATGAA
TGAAAGAGTATCTGACCCTTCAATGAAGTCAGCACAGGTTGCAACTAGGCCTAATGTGCCTCCGTATGGGATGCCTATGATTCCCATGGACAACGATGATGGTGTCTCAT
ATAAAGGTTTTGGTTCTACACTCTTCCTTCACACCACTGCGTGTTTCGGGCTAAAAAAGAAGTCAAAGGCTTTGGCTTCGGCTTCGGATGGAGAAGTTGGATTAGGGCAT
GTATCTCATCAAGCAAGCCTAAAGGATGCTGTTTTAGGATTACCACCAGTGTTTTCAAAGCGCAAGGCGCACCTCAAGGCGACAAGCCCCTTGATCGCCTCAAGGCGAGA
GGCGACAAAAAGGCGCAATTGGGCGAGCGCCTGTTTGAAGACGCTCGCCCCAACTACCTACGAGGCGCAGTACATGCTGGGCGCCTCGCCTCTCGCCATAGGCGCTCGCC
CGACGTCGCCTCAATTATATGAAGGAACTATTGGTGGTACAACTGGAGAGCTTCTTGAACAGAACGCACATGCCATGAATCAAATTTCCTCTAATCTTGCATCTTTTCAG
ATACAAGATAATATCAGTCTCTTCTGCCAAACTCGGGATAACATCCTCAAAATAATGAACGAGGTGAGTTTTGAAGTAGAAGAAGAAGGTCATTGGAATGAAATAAGGCC
TATGGCTCTTTGGCGAAAGGTGATGGATGGAAAATATTTGGTTCATGAGAACATGTGGCTTTCTGCCAACATGAGTCTAGCTCAATTGGTCAAAGCATCTAATACCATCC
TGAAGGGAGATGGAAGACAGGGCCTCGCTTGTGAACATTTCAATCATGAGATTCCGATCTCAGTGATCAAAGGGTGTGGAGGATTTGTAAGAATGGGCTCTTCATGGTTA
GCCACAGTAATTCAAAGAGTGGAGATGGACACTCTCTTTGGGGAGTCAGATCGCCATGGTTGTTCGTCGGAAGCAGCTGTGCGTGATCCACACGCACCATTGGATATTTC
TGCAGTCGCTGCCGCACGCCAACGCGTGAGGGCATGTGGGAGGGTCTTTTTCCTTCCATCAGTTGGGTTAGGTGCATTTATGGCTGGAATTTTCTTCGGTTGTCGTGATT
TGCTGTATCTTGTGCATCAAGAGTATTTCAGACTTGTTTCTATGTCCATCCATGGTGCATCTAAGAAACCATTTGTTAATCCTACTATTGAATCTTTCTTTTGGGAAGGA
AATGAAAGTGGAAAGATAAATCATTTAGTGAGATGGCGTGTGGTTTCAAGAGCCCAATCTGATGGAGGCCTCGGTATTGGGGCTTTGAAGCATAGAAATGTGGCGCTCGT
TACCAAGTGGGGTTGGAGATTTATTCATGAACCAGATTCTTTTTGGCGCAAAATAATTGCTAGTATTCATGGCTCCACAACCTATAATTGGCATACAGCTGGTAGAAAAG
GATGTTGTCTTCGTAGTCCATGGGTGAGTATATCTAATGTATGGAAGCAGGTGGACTCTTTTGCCCTCTTTAAATTGGGCAATGGTCGAAGATTGTCATTTTGGCAAGAT
TCCTGGCTTAATGACTTGCCTCTAAAGCTCCGTTTTCCTCTTCTATTTCGTATTTCTTCAAACCCTGATGGTTCGGTTTTTGACCATTGGGATGCCTATACTTCGTCATG
GAGCATATTTTTTTGTTGTTCGTTAAAGGAAGTGGAAATTTTTGAGTTGCCGCAGACTTTGGGTCTTATTTCTTTGGTTCAGATAGGTTCTTCAGAAGACTCTAGAAGAT
GGTCCCTTGATTCCTCGGGTGTTTTCTCAGTCAAGTCTTTATCACGTTACTTGGCTTCAGCTTCTCCTATGGATAATGAGTGGGTTTTCTCCAATGACTTCAAAGGCAAC
GTTCTTCAATTACTTATTGGTCCATATTCAGCTTCAAAGTTCAAAGTTTTGTGGATCAATGCAGTCAAGCTATTCGTTCTGAATTGTGGTTCAAAAGAAACCAAAGGATC
TTTCATAATAAACATCCCCCTTGGTTTGATCGGTTTGATTCAGCTCGCCTTAAGGCTTCATCCTGGTGCTCTCTTTCCAAATCTTTTGTTGATTTCCCTTTACAGGATAT
ATTTCTTAACTGGAATGCTTTTATTTCTCCGATCACAGGGCGAACGCCTGCCAATAGTTGTGAAGACCTCTACTGCTAATTGCTGTGATTTATCTTTCTGCTTGAATGAA
ATGCCCGAAGTAATGAAGCAGATGCCGCCTCTTCCGGTGAAGGTGAACGAAGAGTTAGCGAACACAATCCTTCCGCCGACCACTCACTCTATGCAATCATGA
Protein sequenceShow/hide protein sequence
MQGHRSRFSLFSANSYSLFCLPELPSEVGLDWIWIWDWKTFELLRVLMANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGIAMDWTSDEQVTLEE
GLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNERVSDPSMKSAQVATRPNVPPYGMPMIPMDNDDGVSYKGFGSTLFLHTTACFGLKKKSKALASASDGEVGLGH
VSHQASLKDAVLGLPPVFSKRKAHLKATSPLIASRREATKRRNWASACLKTLAPTTYEAQYMLGASPLAIGARPTSPQLYEGTIGGTTGELLEQNAHAMNQISSNLASFQ
IQDNISLFCQTRDNILKIMNEVSFEVEEEGHWNEIRPMALWRKVMDGKYLVHENMWLSANMSLAQLVKASNTILKGDGRQGLACEHFNHEIPISVIKGCGGFVRMGSSWL
ATVIQRVEMDTLFGESDRHGCSSEAAVRDPHAPLDISAVAAARQRVRACGRVFFLPSVGLGAFMAGIFFGCRDLLYLVHQEYFRLVSMSIHGASKKPFVNPTIESFFWEG
NESGKINHLVRWRVVSRAQSDGGLGIGALKHRNVALVTKWGWRFIHEPDSFWRKIIASIHGSTTYNWHTAGRKGCCLRSPWVSISNVWKQVDSFALFKLGNGRRLSFWQD
SWLNDLPLKLRFPLLFRISSNPDGSVFDHWDAYTSSWSIFFCCSLKEVEIFELPQTLGLISLVQIGSSEDSRRWSLDSSGVFSVKSLSRYLASASPMDNEWVFSNDFKGN
VLQLLIGPYSASKFKVLWINAVKLFVLNCGSKETKGSFIINIPLGLIGLIQLALRLHPGALFPNLLLISLYRIYFLTGMLLFLRSQGERLPIVVKTSTANCCDLSFCLNE
MPEVMKQMPPLPVKVNEELANTILPPTTHSMQS