; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC09G169080 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC09G169080
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
Descriptionprotein SENSITIVE TO UV 2 isoform X1
Genome locationCiama_Chr09:12015400..12023358
RNA-Seq ExpressionCaUC09G169080
SyntenyCaUC09G169080
Gene Ontology termsGO:0006974 - cellular response to DNA damage stimulus (biological process)
InterPro domainsIPR044952 - Protein SENSITIVE TO UV 2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038602.1 uncharacterized protein E6C27_scaffold92G001320 [Cucumis melo var. makuwa]0.0e+0084.02Show/hide
Query:  MRAEDEGFED--WDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPPPEPEPQ----HLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPN
        M   DEGFED  WDADFLDQLIQVEELAISSTANN    PIPI SS S   PPPPPEPEP+    HLVE  + RPISYSPPRELSQ ATGLRSHAIRFPN
Subjt:  MRAEDEGFED--WDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPPPEPEPQ----HLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPN

Query:  GLGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSED
        GL ECGPSSS+LAPCLHR DAAKELEICDLKRELGRVSKQLK+LEQECVELRK R+KKEEQL+VVSSNKDEQYIG R+SES DLRVAGKDG RTGMKSED
Subjt:  GLGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSED

Query:  IAGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSS
        IAGDLGGPH VTSRSK NEQGEKAHSSVGERANDDLPAFDKLSKKL+VFWVPESDSK+GQ++VSELLLSCETD HVLFHSIGT+LS KF V +LAGDNSS
Subjt:  IAGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSS

Query:  DVALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGG
        DVALKQPL L QC EAKKVSNLYTTLTKVSNGIVKMEALF+PLLDLCNLDNVA+VHRSLHILHMFLK+LLWLERKSER              RETV+IGG
Subjt:  DVALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGG

Query:  LGSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNN
        LGSRNNVVDSHGSQSAE +EF+L NMDE+S+GS   A   LPGAELLCKNRNL KNINLVPQ+NWV+FFE M QVAKTHSAKCVRIEA+SVMNLILMRNN
Subjt:  LGSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNN

Query:  TYLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELKLR
         YLEKEKFGQ  LFDSVVEFIRKESG+AIQ HAVRLLFLILNCPTFF AFCSGCKEAE A+  +ENVR AGGFQKF+ ILHGLADC TCFGNGIEELKLR
Subjt:  TYLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELKLR

Query:  RNTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTID
        +NTVLLLAFL+SSG AGFEILISNKLYT SNFLTLILQ VASE+ QEKTV EPVENLEERA LLREVLILLNRLASHSLYS T+LRVLTNSRDMA+LTID
Subjt:  RNTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTID

Query:  VTNKLCRKNKRNCQFDSKKRKMRESEVMELAQ
        VTNKLCRKN RN Q DSKKRKMRESEV+ELAQ
Subjt:  VTNKLCRKNKRNCQFDSKKRKMRESEVMELAQ

KAG7011360.1 hypothetical protein SDJN02_26265, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0082.21Show/hide
Query:  MRAEDEGFEDWDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTY-SPPPPPEPEPQHLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPNGLGEC
        MR+EDEGFEDWDADFLDQLIQVEELAISSTANN    P PI  S STY  PPPPPEPEPQHLVEV HDRPISYSPPRELSQRA GLRSH+IR P GLGEC
Subjt:  MRAEDEGFEDWDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTY-SPPPPPEPEPQHLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPNGLGEC

Query:  GPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSEDIAGDL
        GPSSSALAPCL  PDAAKELEI  LKRELGRVSKQLK+LEQECVELRKKRDKKEEQL VV SNKD+QYI     E  DLRVAGKDGG  G+KSED   + 
Subjt:  GPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSEDIAGDL

Query:  GGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSSDVALK
        GGPHTVTSRSK NEQGEK H+SVGERANDD PAFDKLSKKL+VFWVPE D KMGQ++VSELLLSCE DFHVLF  IGTELS KFSV+SLAG NSSDVALK
Subjt:  GGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSSDVALK

Query:  QPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGGLGSRN
         PLQ+L   E+ KVSNLYTTLTKVSNGIVKMEALFTPL+DLCNLDNVAIVHRSLHILHMFLK+L+WLERKSER              R+TVMIGGLG RN
Subjt:  QPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGGLGSRN

Query:  NVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNNTYLEK
        NVVDS+GS SAE EEFSL+NMDETS G C+PAGM  PGAELL KNRNLNKNINLVP++NWV+FFE+MH+VAKTHSA+C R+EAVSVMNLILMRNNTYLE+
Subjt:  NVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNNTYLEK

Query:  EKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELKLRRNTVL
        EKFGQ  LFDSVVEFI KESG+AIQ HAVRLLFLILNCPTFF+AFCSGCKEAEAADAA+ENVRCAGGFQKFR ILHGLADCLTC GNGI ELKLRRNTVL
Subjt:  EKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELKLRRNTVL

Query:  LLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTIDVTNKL
        LLAFLSSSG AGFEIL+SN L+ DSNFLTLILQ V SE+ QEK V E VE +EERA LLREVLILLNRLASHS+YSATVLRVLT+SRDMASLTIDVTNKL
Subjt:  LLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTIDVTNKL

Query:  CRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL
         RKN RNCQFDSKKRKMRESEV++LAQVFRKR+LTYLGNSI+
Subjt:  CRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL

XP_004136401.1 protein SENSITIVE TO UV 2 [Cucumis sativus]0.0e+0083.71Show/hide
Query:  MRAEDEGFED--WDADFLDQLIQVEELAISSTANNPIPIP------IPIPSSFSTYSPPPPPEPEPQHLVEVFHDRPISYSPPRELSQRATGLRSHAIRF
        MR EDEGFED  WDADFLDQLIQVEELAISSTANNP+PIP      +P+P       P P PEPEPQHLVE  H RPISYSPPRELSQ ATGLRSH IRF
Subjt:  MRAEDEGFED--WDADFLDQLIQVEELAISSTANNPIPIP------IPIPSSFSTYSPPPPPEPEPQHLVEVFHDRPISYSPPRELSQRATGLRSHAIRF

Query:  PNGLGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKS
        PNGL ECGPSSSALAPCLHR DAAKELEICDLKRELGRVSKQLKDLEQECVELRK R+KKEE+L+VVSSNKDEQYIG R+SES DLRVAGKDG RTGMKS
Subjt:  PNGLGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKS

Query:  EDIAGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDN
        EDI+GDLGGPH VTSRSK NEQGEKAHSSVGER NDDLP FDKLSKKL+VFW+PESD K+GQ++VSELLLSCETD HVLFHSIG ELS KFS  +LAGDN
Subjt:  EDIAGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDN

Query:  SSDVALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMI
        SSDVALK PLQLLQC EAKKVSNLYTTLTKVSNGIVKMEALF+PLLDLCNLDNVA+VHRSLHILHMFLK+LLWLERKSER              RETVMI
Subjt:  SSDVALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMI

Query:  GGLGSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMR
        GGLGSRNNVVDSHGSQSAE +EFSL NMDE+S+GS A A   LPGAELLCKNRNL KNINLVPQ+NWV+FFE MHQVAKTHSAKCVRIEA+SVMNLILMR
Subjt:  GGLGSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMR

Query:  NNTYLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELK
        NNTYLEKEKFGQ  LFDSVVEFIRKESG+AIQ HAVRLLFLILNCPTFF+AFCSGCKEAEA +  +ENVR AGGFQKF+ ILHGLADCLTC GNGIEELK
Subjt:  NNTYLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELK

Query:  LRRNTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLT
        LR+NTVLLLAFL+SSG AGFEIL SNKLYT SNFLTLILQ VASE+ QEKTVPEP+ENLEERA LLREVLIL NRLASHSLYS T+LRV+TNSRDMA+LT
Subjt:  LRRNTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLT

Query:  IDVTNKLCRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL
        IDVTNKLCRKN RN Q DSKKRKMRESEV++LAQVFRKRLL+YLGNSIL
Subjt:  IDVTNKLCRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL

XP_008466031.1 PREDICTED: uncharacterized protein LOC103503583 [Cucumis melo]0.0e+0084.2Show/hide
Query:  MRAEDEGFED--WDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPPPEPEPQ----HLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPN
        M   DEGFED  WDADFLDQLIQVEELAISSTANN    PIPI SS S   PPPPPEPEP+    HLVE  + RPISYSPPRELSQ ATGLRSHAIRFPN
Subjt:  MRAEDEGFED--WDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPPPEPEPQ----HLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPN

Query:  GLGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSED
        GL ECGPSSS+LAPCLHR DAAKELEICDLKRELGRVSKQLK+LEQECVELRK R+KKEEQL+VVSSNKDEQYIG R+SES DLRVAGKDG RTGMKSED
Subjt:  GLGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSED

Query:  IAGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSS
        IAGDLGGPH VTSRSK NEQGEKAHSSVGERANDDLPAFDKLSKKL+VFWVPESDSK+GQ++VSELLLSCETD HVLFHSIGT+LS KF V +LAGDNSS
Subjt:  IAGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSS

Query:  DVALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGG
        DVALKQPL L QC EAKKVSNLYTTLTKVSNGIVKMEALF+PLLDLCNLDNVA+VHRSLHILHMFLK+LLWLERKSER              RETV+IGG
Subjt:  DVALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGG

Query:  LGSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNN
        LGSRNNVVDSHGSQSAE +EF+L NMDE+S+GS   A   LPGAELLCKNRNL KNINLVPQ+NWV+FFE M QVAKTHSAKCVRIEA+SVMNLILMRNN
Subjt:  LGSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNN

Query:  TYLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELKLR
         YLEKEKFGQ  LFDSVVEFIRKESG+AIQ HAVRLLFLILNCPTFF AFCSGCKEAE A+  +ENVR AGGFQKF+ ILHGLADC TCFGNGIEELKLR
Subjt:  TYLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELKLR

Query:  RNTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTID
        +NTVLLLAFL+SSG AGFEILISNKLYT SNFLTLILQ VASE+ QEKTV EPVENLEERA LLREVLILLNRLASHSLYS T+LRVLTNSRDMA+LTID
Subjt:  RNTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTID

Query:  VTNKLCRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL
        VTNKLCRKN RN Q DSKKRKMRESEV+ELAQVFRKRLL+YLGNSIL
Subjt:  VTNKLCRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL

XP_038888976.1 protein SENSITIVE TO UV 2 [Benincasa hispida]0.0e+0086.98Show/hide
Query:  MRAEDEGFEDWDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPP----PEPEPQHLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPNGL
        M  EDEGFEDWDADFLDQLIQVEELAISSTANN IPIPI IPSS STY P PP    PEP+PQHLVEVFHDRPISYSPPRELSQRATGLRSHAIR PNG 
Subjt:  MRAEDEGFEDWDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPP----PEPEPQHLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPNGL

Query:  GECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSEDIA
        GE GPSSSALAPCLHRPDAAKELEI DLKRELGRVSKQLKDLEQECVELRKKRDK EEQLKVVSSNKDEQYIGR VSES DLRVAGKDGGRTGMKSEDIA
Subjt:  GECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSEDIA

Query:  GDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSSDV
        GDLGGPHTVTSR K NEQ  KAHSSVGERANDDLPAFDKLSKKL+VFWVPESDSK+GQN+VSELLLSCETDF VLFHSI TELS KFSVD L GDNSSD+
Subjt:  GDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSSDV

Query:  ALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGGLG
             +Q L+C EAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLK+LLWLERKSER              RETVMIGGLG
Subjt:  ALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGGLG

Query:  SRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNNTY
        SRNN VDSHGSQSAE EEF+L NMD+TS+GSCAPAG  LPGA LLCKNRNLNKNINLVPQINWVAFFEVMHQVAK HSAKCVRIEAVSVMNLILMRNNTY
Subjt:  SRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNNTY

Query:  LEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELKLRRN
        LEKEKFGQ  LFDSVVEFIRKESG+AIQ H VRLLFLILNCPTFF+ FCSGCKEAEA DAA+ENVRCAGGFQKFR ILH LADCLTC GNGIEELKLRRN
Subjt:  LEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELKLRRN

Query:  TVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTIDVT
        T+LLLAFL+SSG  GFEILISNKLYT+SNFL LILQ  ASE+ QEKTVPEPVENLEERA LLREVLILLNRLASHSLYS TVLRVLTNSRDMASL IDVT
Subjt:  TVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTIDVT

Query:  NKLCRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL
        NKLCRKN RN QFDSKKRKMRE+EV+ELAQVFRKRLL+YLGN+IL
Subjt:  NKLCRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL

TrEMBL top hitse value%identityAlignment
A0A1S3CQL6 uncharacterized protein LOC1035035830.0e+0084.2Show/hide
Query:  MRAEDEGFED--WDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPPPEPEPQ----HLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPN
        M   DEGFED  WDADFLDQLIQVEELAISSTANN    PIPI SS S   PPPPPEPEP+    HLVE  + RPISYSPPRELSQ ATGLRSHAIRFPN
Subjt:  MRAEDEGFED--WDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPPPEPEPQ----HLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPN

Query:  GLGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSED
        GL ECGPSSS+LAPCLHR DAAKELEICDLKRELGRVSKQLK+LEQECVELRK R+KKEEQL+VVSSNKDEQYIG R+SES DLRVAGKDG RTGMKSED
Subjt:  GLGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSED

Query:  IAGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSS
        IAGDLGGPH VTSRSK NEQGEKAHSSVGERANDDLPAFDKLSKKL+VFWVPESDSK+GQ++VSELLLSCETD HVLFHSIGT+LS KF V +LAGDNSS
Subjt:  IAGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSS

Query:  DVALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGG
        DVALKQPL L QC EAKKVSNLYTTLTKVSNGIVKMEALF+PLLDLCNLDNVA+VHRSLHILHMFLK+LLWLERKSER              RETV+IGG
Subjt:  DVALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGG

Query:  LGSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNN
        LGSRNNVVDSHGSQSAE +EF+L NMDE+S+GS   A   LPGAELLCKNRNL KNINLVPQ+NWV+FFE M QVAKTHSAKCVRIEA+SVMNLILMRNN
Subjt:  LGSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNN

Query:  TYLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELKLR
         YLEKEKFGQ  LFDSVVEFIRKESG+AIQ HAVRLLFLILNCPTFF AFCSGCKEAE A+  +ENVR AGGFQKF+ ILHGLADC TCFGNGIEELKLR
Subjt:  TYLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELKLR

Query:  RNTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTID
        +NTVLLLAFL+SSG AGFEILISNKLYT SNFLTLILQ VASE+ QEKTV EPVENLEERA LLREVLILLNRLASHSLYS T+LRVLTNSRDMA+LTID
Subjt:  RNTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTID

Query:  VTNKLCRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL
        VTNKLCRKN RN Q DSKKRKMRESEV+ELAQVFRKRLL+YLGNSIL
Subjt:  VTNKLCRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL

A0A5A7T562 Uncharacterized protein0.0e+0084.02Show/hide
Query:  MRAEDEGFED--WDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPPPEPEPQ----HLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPN
        M   DEGFED  WDADFLDQLIQVEELAISSTANN    PIPI SS S   PPPPPEPEP+    HLVE  + RPISYSPPRELSQ ATGLRSHAIRFPN
Subjt:  MRAEDEGFED--WDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPPPEPEPQ----HLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPN

Query:  GLGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSED
        GL ECGPSSS+LAPCLHR DAAKELEICDLKRELGRVSKQLK+LEQECVELRK R+KKEEQL+VVSSNKDEQYIG R+SES DLRVAGKDG RTGMKSED
Subjt:  GLGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSED

Query:  IAGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSS
        IAGDLGGPH VTSRSK NEQGEKAHSSVGERANDDLPAFDKLSKKL+VFWVPESDSK+GQ++VSELLLSCETD HVLFHSIGT+LS KF V +LAGDNSS
Subjt:  IAGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSS

Query:  DVALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGG
        DVALKQPL L QC EAKKVSNLYTTLTKVSNGIVKMEALF+PLLDLCNLDNVA+VHRSLHILHMFLK+LLWLERKSER              RETV+IGG
Subjt:  DVALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGG

Query:  LGSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNN
        LGSRNNVVDSHGSQSAE +EF+L NMDE+S+GS   A   LPGAELLCKNRNL KNINLVPQ+NWV+FFE M QVAKTHSAKCVRIEA+SVMNLILMRNN
Subjt:  LGSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNN

Query:  TYLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELKLR
         YLEKEKFGQ  LFDSVVEFIRKESG+AIQ HAVRLLFLILNCPTFF AFCSGCKEAE A+  +ENVR AGGFQKF+ ILHGLADC TCFGNGIEELKLR
Subjt:  TYLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELKLR

Query:  RNTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTID
        +NTVLLLAFL+SSG AGFEILISNKLYT SNFLTLILQ VASE+ QEKTV EPVENLEERA LLREVLILLNRLASHSLYS T+LRVLTNSRDMA+LTID
Subjt:  RNTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTID

Query:  VTNKLCRKNKRNCQFDSKKRKMRESEVMELAQ
        VTNKLCRKN RN Q DSKKRKMRESEV+ELAQ
Subjt:  VTNKLCRKNKRNCQFDSKKRKMRESEVMELAQ

A0A5D3E6M8 Uncharacterized protein0.0e+0081.35Show/hide
Query:  MRAEDEGFED--WDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPPPEPEPQ----HLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPN
        M   DEGFED  WDADFLDQLIQVEELAISSTANN    PIPI SS S   PPPPPEPEP+    HLVE  + RPISYSPPRELSQ ATGLRSHAIRFPN
Subjt:  MRAEDEGFED--WDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPPPEPEPQ----HLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPN

Query:  GLGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSED
        GL ECGPSSS+LAPCLHR DAAKELEICDLKRELGRVSKQLK+LEQECVELRK R+KKEEQL+VVSSNKDEQYIG R+SES DLRVAGKDG RTGMKSED
Subjt:  GLGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSED

Query:  IAGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSS
        IAGDLGGPH VTSRSK NEQGEKAHSSVGERANDDLPAFDKLSKKL+VFWVPESDSK+GQ++VSELLLSCETD HVLFHSIGT+LS KF V +LAGDNSS
Subjt:  IAGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSS

Query:  DVALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGG
        DVALKQPL L QC EAKKVSNLYTTLTKVSNGIVKMEALF+PLLDLCNLDNVA+VHRSLHILHMFLK+LLWLERKSER              RETV+IGG
Subjt:  DVALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGG

Query:  LGSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNN
        LGSRNNVVDSHGSQSAE +EF+L NMDE+S+GS   A   LPGAELLCKNRNL KNINLVPQ+NWV+FFE M QVAKTHSAKCVRIEA+SVMNLILMRNN
Subjt:  LGSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNN

Query:  TYLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIE-----
         YLEKEKFGQ  LFDSVVEFIRKESG+AIQ HAVRLLFLILNCPTFF AFCSGCKEAE A+  +ENVR AGGFQKF+ ILHGLADC TCFGNGIE     
Subjt:  TYLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIE-----

Query:  -------------------ELKLRRNTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLAS
                           ELKLR+NTVLLLAFL+SSG AGFEILISNKLYT SNFLTLILQ VASE+ QEKTV EPVENLEERA LLREVLILLNRLAS
Subjt:  -------------------ELKLRRNTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLAS

Query:  HSLYSATVLRVLTNSRDMASLTIDVTNKLCRKNKRNCQFDSKKRKMRESEVMELAQ
        HSLYS T+LRVLTNSRDMA+LTIDVTNKLCRKN RN Q DSKKRKMRESEV+ELAQ
Subjt:  HSLYSATVLRVLTNSRDMASLTIDVTNKLCRKNKRNCQFDSKKRKMRESEVMELAQ

A0A6J1HLQ0 uncharacterized protein LOC111464104 isoform X10.0e+0081.54Show/hide
Query:  MRAEDEGFEDWDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTY-SPPPPPEPEPQHLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPNGLGEC
        MR+EDEGFEDWDADFLDQLIQVEELAISSTANN    P PI  S STY  PPPPPEPEPQHLVEV HDRPISYSPPRELSQRA GLRSH+IR P GLGEC
Subjt:  MRAEDEGFEDWDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTY-SPPPPPEPEPQHLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPNGLGEC

Query:  GPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSEDIAGDL
        GPSSSALAPCL  PDAAKELEI  LKRELGRVSKQLK+LEQECVELRKKRDKKEEQL VV SNKD+QYI     E  DLRVA KDGG  G+K+ED   + 
Subjt:  GPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSEDIAGDL

Query:  GGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSSDVALK
        GGPHTVTSRSK NEQGEK H+SVGERANDD PAFDKLSKKL+VFWVPE D KMGQ++VSELLLSCE DFHVLF  IGTELS KFSV+SLAG NSSDVALK
Subjt:  GGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSSDVALK

Query:  QPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGGLGSRN
         PLQ+L   E+ KVSNLYTTLTKVSNGIVKMEALFTPL+DLCNLDNVAIVHRSLHILHMFLK+L+WLERKSER              R+TVMIGGLG RN
Subjt:  QPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGGLGSRN

Query:  NVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNNTYLEK
        +VVDS+GS SAE EEFSL+NMDETS G C+PAGM  PGAELL KNRNLNKNINLVP++NWV+FFE+MH+VAK HSA+C R+EAVSVMNLILMRNNTYLE+
Subjt:  NVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNNTYLEK

Query:  EKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELKLRRNTVL
        EKFGQ  LFDSVVEFIRKESG+AIQ HAVRLLFLILNCPTFF+AFCSGCKEAEAADAA+ENVRCAGGFQKF  ILHGLADCLTC GNGI ELKLRR+TVL
Subjt:  EKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELKLRRNTVL

Query:  LLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTIDVTNKL
        LLAFLSSSG AGFEIL+SN L+ DSNFLTLILQ V SE+ QEK V E VE +EERA LLREVLILLNRLASHS+YSATVLRVLT+SRDMASLTIDVTNKL
Subjt:  LLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTIDVTNKL

Query:  CRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL
         RKN RNCQFDSKKRKMRESEV++LAQVFRKR+LTYLGNSI+
Subjt:  CRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL

A0A6J1HUD9 uncharacterized protein LOC111466806 isoform X10.0e+0081.23Show/hide
Query:  MRAEDEGFEDWDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPP-----PEPEPQHLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPNG
        MR+EDEGFEDWDADFLDQLIQVEELAISSTANN    P PI  S STY PPPP     PEPEPQHLVEV HDR ISYSPPRELSQRA G RSHAIR   G
Subjt:  MRAEDEGFEDWDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPP-----PEPEPQHLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPNG

Query:  LGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSEDI
        LGECGPSSSA APCL  PDAAKELEI +LKRELGRVSKQLK+LEQEC+ELRKKRDKKEEQL VV SNKD+QYI     E  +LRVAGKDGG  G+KSEDI
Subjt:  LGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSEDI

Query:  AGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSSD
        + DLGGPHTVTSRSK NEQGEK+H+SVGERA+D+ PAFDKLSKKL+VFWVPE DSKMGQ++VSELLLSCE DFHVL+  IGTELS KFSV+SLAG NSSD
Subjt:  AGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSSD

Query:  VALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGGL
        VALK PLQ L  LE+ KVSNLYTTL KVSNGIVKMEALFTPL+DLCNLDNVAIVHRSLHILHMFLK+L+WLERKSER              R+TVMIGGL
Subjt:  VALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGGL

Query:  GSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNNT
        G RNNVVDS+GS SAE EEFSL+NMDETS G C+PAGM  PGAELL KNRNLNKNINLVP++NWV+FFE+MH+VAKTHSA+C R+EAVSVMNLILMRNNT
Subjt:  GSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNNT

Query:  YLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELKLRR
        YLE+EKFGQ  LFDSVVEFIRKESG+AIQ HAVRLLFLILNCPTFF+AFCSGCKEAEAADAA+EN RCAGGFQKFR ILHGL DCLTC GNGI+ELKLRR
Subjt:  YLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELKLRR

Query:  NTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTIDV
        NTVLLLAFLSSSG AGFEIL+SN L+ DSNFLTLILQAV SE+ QEK V E VE LEERA LLREVLILLNRLASHS+YSATVLRVLT+SRDMASLTIDV
Subjt:  NTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTIDV

Query:  TNKLCRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL
        TNKL RKN RNCQFD KKRKMRESEV++LAQVFRKR+LTYLGNSI+
Subjt:  TNKLCRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL

SwissProt top hitse value%identityAlignment
C8KI33 Protein SENSITIVE TO UV 22.1e-7630.79Show/hide
Query:  MRAEDEGFEDWDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPPPEPE-------PQHL----------VEVFHDRPISYSPPRELSQR-A
        M   DE F D   +FL  +  +E     +    P+P     P    T+ P PPP  +       P  L          ++V  D  +SYSPPRELSQR  
Subjt:  MRAEDEGFEDWDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPPPEPE-------PQHL----------VEVFHDRPISYSPPRELSQR-A

Query:  TGLRSHAIRFPNGLGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAG
        +G     + + N          +      R D+ K+LEI  LK+EL RVSKQL D+EQEC +L+K + K+ E   + + +   Q      S+ IDL    
Subjt:  TGLRSHAIRFPNGLGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAG

Query:  KDGGRTGMKSEDIAGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHK
                   D+A          SR   ++  +++  + G +A  D+     LSKKL   W   +     +N++SELLL+C TD  +LF  +      +
Subjt:  KDGGRTGMKSEDIAGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHK

Query:  FSVDSLAGDNSSDVALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATN
          ++      SSD       Q  + LE++KV  LY+ +TK+S G V ++ L  PLLDLC  +   +VHRSL +LH+ L+ +   E++ E  + A      
Subjt:  FSVDSLAGDNSSDVALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATN

Query:  CNLLRETVMIGGLGSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEA
                                                                                   NW + F++M+Q+A   + + V+ EA
Subjt:  CNLLRETVMIGGLGSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEA

Query:  VSVMNLILMRNNTYLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLT
        +S+MN+I+M  + Y  +E F    +F+S+   +RKE G  ++  A+ L +L+LNCP  +  F S  +E  ++D  +++       + F  I  GLADCLT
Subjt:  VSVMNLILMRNNTYLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLT

Query:  CFGNGIEELKLRRNTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEP-VENLEERASLLREVLILLNRLASHSLYSATVLRV
              E+L+L RN +++LA  +SSG +G+E+L S+KL  DS+FL LIL  + +EI  E T   P  E  + R  L+RE+LILLNRL S    SAT+L+ 
Subjt:  CFGNGIEELKLRRNTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEP-VENLEERASLLREVLILLNRLASHSLYSATVLRV

Query:  LTNSRDMASLTIDVTNKLCRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL
        LT SRDMASLT+D   +L RK     + +S   +MR +E+M+LA++F+KR+  +LG++ +
Subjt:  LTNSRDMASLTIDVTNKLCRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL

Arabidopsis top hitse value%identityAlignment
AT5G45610.1 protein dimerizations1.5e-7730.79Show/hide
Query:  MRAEDEGFEDWDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPPPEPE-------PQHL----------VEVFHDRPISYSPPRELSQR-A
        M   DE F D   +FL  +  +E     +    P+P     P    T+ P PPP  +       P  L          ++V  D  +SYSPPRELSQR  
Subjt:  MRAEDEGFEDWDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPPPEPE-------PQHL----------VEVFHDRPISYSPPRELSQR-A

Query:  TGLRSHAIRFPNGLGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAG
        +G     + + N          +      R D+ K+LEI  LK+EL RVSKQL D+EQEC +L+K + K+ E   + + +   Q      S+ IDL    
Subjt:  TGLRSHAIRFPNGLGECGPSSSALAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAG

Query:  KDGGRTGMKSEDIAGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHK
                   D+A          SR   ++  +++  + G +A  D+     LSKKL   W   +     +N++SELLL+C TD  +LF  +      +
Subjt:  KDGGRTGMKSEDIAGDLGGPHTVTSRSKGNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHK

Query:  FSVDSLAGDNSSDVALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATN
          ++      SSD       Q  + LE++KV  LY+ +TK+S G V ++ L  PLLDLC  +   +VHRSL +LH+ L+ +   E++ E  + A      
Subjt:  FSVDSLAGDNSSDVALKQPLQLLQCLEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATN

Query:  CNLLRETVMIGGLGSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEA
                                                                                   NW + F++M+Q+A   + + V+ EA
Subjt:  CNLLRETVMIGGLGSRNNVVDSHGSQSAEVEEFSLVNMDETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEA

Query:  VSVMNLILMRNNTYLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLT
        +S+MN+I+M  + Y  +E F    +F+S+   +RKE G  ++  A+ L +L+LNCP  +  F S  +E  ++D  +++       + F  I  GLADCLT
Subjt:  VSVMNLILMRNNTYLEKEKFGQTRLFDSVVEFIRKESGTAIQMHAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLT

Query:  CFGNGIEELKLRRNTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEP-VENLEERASLLREVLILLNRLASHSLYSATVLRV
              E+L+L RN +++LA  +SSG +G+E+L S+KL  DS+FL LIL  + +EI  E T   P  E  + R  L+RE+LILLNRL S    SAT+L+ 
Subjt:  CFGNGIEELKLRRNTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLILQAVASEIGQEKTVPEP-VENLEERASLLREVLILLNRLASHSLYSATVLRV

Query:  LTNSRDMASLTIDVTNKLCRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL
        LT SRDMASLT+D   +L RK     + +S   +MR +E+M+LA++F+KR+  +LG++ +
Subjt:  LTNSRDMASLTIDVTNKLCRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTYLGNSIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAGCCGAAGACGAAGGGTTCGAAGATTGGGACGCTGATTTCTTGGACCAACTCATCCAAGTTGAAGAGCTTGCTATCTCCTCCACCGCCAACAATCCCATT
CCCATTCCGATTCCGATTCCGAGTTCTTTTTCTACCTACTCCCCCCCGCCGCCGCCGGAACCGGAGCCGCAGCATTTGGTGGAAGTGTTTCATGACCGTCCCATT
TCTTATTCGCCTCCTCGAGAACTCTCTCAGAGAGCCACTGGTTTACGTTCTCACGCCATTCGCTTTCCCAATGGCTTGGGCGAATGCGGTCCTTCTTCCTCGGCG
CTTGCTCCGTGCTTGCATCGCCCCGATGCTGCAAAAGAACTTGAAATTTGTGATTTGAAGAGGGAGCTAGGGCGCGTCTCAAAGCAACTTAAGGACTTGGAACAA
GAGTGTGTTGAACTCAGGAAGAAAAGAGACAAAAAGGAGGAGCAGCTTAAAGTTGTCTCTTCCAATAAGGATGAGCAATATATTGGCCGTCGTGTTTCAGAAAGT
ATAGATTTGAGAGTGGCAGGGAAAGATGGTGGACGTACTGGCATGAAGAGTGAAGATATAGCTGGTGACCTTGGTGGCCCTCACACTGTTACCTCTAGGAGTAAA
GGCAATGAACAGGGAGAAAAAGCCCATAGTTCTGTTGGAGAGAGAGCCAACGATGATCTACCTGCTTTTGACAAGCTCTCAAAGAAGTTGAAAGTCTTCTGGGTC
CCTGAAAGTGACTCTAAGATGGGACAAAATATGGTTTCAGAATTGCTTTTATCATGTGAAACAGATTTTCATGTGCTTTTTCACAGCATTGGGACGGAGTTATCC
CATAAATTTTCTGTGGATTCCCTGGCTGGTGATAACTCTTCTGATGTAGCTTTAAAGCAACCTTTGCAGCTTCTTCAGTGTCTGGAAGCTAAAAAAGTATCTAAC
CTCTACACCACCTTGACTAAGGTAAGTAATGGAATAGTCAAGATGGAGGCTTTGTTTACTCCGTTGCTTGATCTCTGTAATCTTGACAATGTTGCGATAGTTCAT
AGATCTCTGCATATATTGCATATGTTTCTGAAACAGTTGTTGTGGTTGGAGAGGAAATCAGAAAGAAGGTATGGTGCACTCTACTTTGCAACAAATTGCAATCTG
CTCAGGGAAACAGTCATGATTGGGGGACTTGGCTCTAGGAACAATGTTGTGGATTCTCATGGATCACAGAGTGCAGAAGTTGAAGAATTTTCTTTGGTGAACATG
GATGAGACATCAAATGGCAGCTGTGCCCCAGCTGGCATGAGCCTCCCTGGTGCTGAATTGCTGTGCAAGAACAGAAACTTGAATAAGAATATAAATTTAGTTCCT
CAAATAAACTGGGTAGCTTTCTTTGAGGTGATGCATCAGGTTGCTAAGACACATAGTGCAAAATGTGTGAGAATTGAGGCAGTTTCAGTCATGAATTTGATTCTG
ATGAGAAATAATACTTATTTGGAGAAGGAGAAATTTGGGCAGACACGTCTATTTGATAGTGTAGTGGAGTTTATCAGAAAGGAATCCGGTACAGCCATACAAATG
CATGCCGTGCGTCTGCTATTTTTGATACTAAACTGTCCCACATTTTTTATCGCGTTCTGTTCTGGTTGCAAAGAAGCAGAAGCTGCTGATGCTGCAGATGAAAAT
GTGAGATGTGCAGGAGGTTTCCAGAAATTCAGAATCATCCTTCATGGTTTGGCAGATTGTCTTACATGTTTCGGAAATGGTATTGAGGAGTTGAAACTTCGAAGA
AACACCGTTCTTTTGCTCGCCTTTCTATCATCGTCCGGGGTAGCTGGCTTTGAGATTCTCATAAGCAACAAGCTGTATACAGACTCAAATTTCCTCACGTTAATT
CTTCAAGCTGTGGCTTCAGAAATCGGACAAGAAAAAACAGTTCCAGAGCCAGTGGAAAATCTTGAGGAAAGGGCGTCGCTGTTGCGGGAAGTGCTTATTCTTCTT
AATAGACTTGCATCTCATTCATTATACTCGGCGACGGTCTTGCGAGTGTTAACGAACAGCAGAGATATGGCCAGCCTCACCATTGATGTAACCAACAAGTTGTGC
AGAAAAAACAAGAGAAATTGCCAGTTTGACAGCAAGAAAAGAAAGATGAGGGAATCTGAAGTTATGGAGTTAGCCCAGGTATTTAGGAAAAGACTTCTTACATAT
TTGGGAAATAGCATACTA
mRNA sequenceShow/hide mRNA sequence
AAAAAAAAAAAGAAAAAAAGAAAAGAGAAAAATAATGAATCGTCCATAAACCTCGAGAATTCACACAAGGCGGGAGGCCATTATATTAAAAGAACTTCACTTTCC
TTAGAGCATTGGAAGGCACCACCAAACTCTGACAATGCGAGCCGAAGACGAAGGGTTCGAAGATTGGGACGCTGATTTCTTGGACCAACTCATCCAAGTTGAAGA
GCTTGCTATCTCCTCCACCGCCAACAATCCCATTCCCATTCCGATTCCGATTCCGAGTTCTTTTTCTACCTACTCCCCCCCGCCGCCGCCGGAACCGGAGCCGCA
GCATTTGGTGGAAGTGTTTCATGACCGTCCCATTTCTTATTCGCCTCCTCGAGAACTCTCTCAGAGAGCCACTGGTTTACGTTCTCACGCCATTCGCTTTCCCAA
TGGCTTGGGCGAATGCGGTCCTTCTTCCTCGGCGCTTGCTCCGTGCTTGCATCGCCCCGATGCTGCAAAAGAACTTGAAATTTGTGATTTGAAGAGGGAGCTAGG
GCGCGTCTCAAAGCAACTTAAGGACTTGGAACAAGAGTGTGTTGAACTCAGGAAGAAAAGAGACAAAAAGGAGGAGCAGCTTAAAGTTGTCTCTTCCAATAAGGA
TGAGCAATATATTGGCCGTCGTGTTTCAGAAAGTATAGATTTGAGAGTGGCAGGGAAAGATGGTGGACGTACTGGCATGAAGAGTGAAGATATAGCTGGTGACCT
TGGTGGCCCTCACACTGTTACCTCTAGGAGTAAAGGCAATGAACAGGGAGAAAAAGCCCATAGTTCTGTTGGAGAGAGAGCCAACGATGATCTACCTGCTTTTGA
CAAGCTCTCAAAGAAGTTGAAAGTCTTCTGGGTCCCTGAAAGTGACTCTAAGATGGGACAAAATATGGTTTCAGAATTGCTTTTATCATGTGAAACAGATTTTCA
TGTGCTTTTTCACAGCATTGGGACGGAGTTATCCCATAAATTTTCTGTGGATTCCCTGGCTGGTGATAACTCTTCTGATGTAGCTTTAAAGCAACCTTTGCAGCT
TCTTCAGTGTCTGGAAGCTAAAAAAGTATCTAACCTCTACACCACCTTGACTAAGGTAAGTAATGGAATAGTCAAGATGGAGGCTTTGTTTACTCCGTTGCTTGA
TCTCTGTAATCTTGACAATGTTGCGATAGTTCATAGATCTCTGCATATATTGCATATGTTTCTGAAACAGTTGTTGTGGTTGGAGAGGAAATCAGAAAGAAGGTA
TGGTGCACTCTACTTTGCAACAAATTGCAATCTGCTCAGGGAAACAGTCATGATTGGGGGACTTGGCTCTAGGAACAATGTTGTGGATTCTCATGGATCACAGAG
TGCAGAAGTTGAAGAATTTTCTTTGGTGAACATGGATGAGACATCAAATGGCAGCTGTGCCCCAGCTGGCATGAGCCTCCCTGGTGCTGAATTGCTGTGCAAGAA
CAGAAACTTGAATAAGAATATAAATTTAGTTCCTCAAATAAACTGGGTAGCTTTCTTTGAGGTGATGCATCAGGTTGCTAAGACACATAGTGCAAAATGTGTGAG
AATTGAGGCAGTTTCAGTCATGAATTTGATTCTGATGAGAAATAATACTTATTTGGAGAAGGAGAAATTTGGGCAGACACGTCTATTTGATAGTGTAGTGGAGTT
TATCAGAAAGGAATCCGGTACAGCCATACAAATGCATGCCGTGCGTCTGCTATTTTTGATACTAAACTGTCCCACATTTTTTATCGCGTTCTGTTCTGGTTGCAA
AGAAGCAGAAGCTGCTGATGCTGCAGATGAAAATGTGAGATGTGCAGGAGGTTTCCAGAAATTCAGAATCATCCTTCATGGTTTGGCAGATTGTCTTACATGTTT
CGGAAATGGTATTGAGGAGTTGAAACTTCGAAGAAACACCGTTCTTTTGCTCGCCTTTCTATCATCGTCCGGGGTAGCTGGCTTTGAGATTCTCATAAGCAACAA
GCTGTATACAGACTCAAATTTCCTCACGTTAATTCTTCAAGCTGTGGCTTCAGAAATCGGACAAGAAAAAACAGTTCCAGAGCCAGTGGAAAATCTTGAGGAAAG
GGCGTCGCTGTTGCGGGAAGTGCTTATTCTTCTTAATAGACTTGCATCTCATTCATTATACTCGGCGACGGTCTTGCGAGTGTTAACGAACAGCAGAGATATGGC
CAGCCTCACCATTGATGTAACCAACAAGTTGTGCAGAAAAAACAAGAGAAATTGCCAGTTTGACAGCAAGAAAAGAAAGATGAGGGAATCTGAAGTTATGGAGTT
AGCCCAGGTATTTAGGAAAAGACTTCTTACATATTTGGGAAATAGCATACTA
Protein sequenceShow/hide protein sequence
MRAEDEGFEDWDADFLDQLIQVEELAISSTANNPIPIPIPIPSSFSTYSPPPPPEPEPQHLVEVFHDRPISYSPPRELSQRATGLRSHAIRFPNGLGECGPSSSA
LAPCLHRPDAAKELEICDLKRELGRVSKQLKDLEQECVELRKKRDKKEEQLKVVSSNKDEQYIGRRVSESIDLRVAGKDGGRTGMKSEDIAGDLGGPHTVTSRSK
GNEQGEKAHSSVGERANDDLPAFDKLSKKLKVFWVPESDSKMGQNMVSELLLSCETDFHVLFHSIGTELSHKFSVDSLAGDNSSDVALKQPLQLLQCLEAKKVSN
LYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHILHMFLKQLLWLERKSERRYGALYFATNCNLLRETVMIGGLGSRNNVVDSHGSQSAEVEEFSLVNM
DETSNGSCAPAGMSLPGAELLCKNRNLNKNINLVPQINWVAFFEVMHQVAKTHSAKCVRIEAVSVMNLILMRNNTYLEKEKFGQTRLFDSVVEFIRKESGTAIQM
HAVRLLFLILNCPTFFIAFCSGCKEAEAADAADENVRCAGGFQKFRIILHGLADCLTCFGNGIEELKLRRNTVLLLAFLSSSGVAGFEILISNKLYTDSNFLTLI
LQAVASEIGQEKTVPEPVENLEERASLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTIDVTNKLCRKNKRNCQFDSKKRKMRESEVMELAQVFRKRLLTY
LGNSIL