CuGenDBv2

Spg038841 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg038841
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Nephrocystin-3 isoform X1
Genome location: scaffold12:1723324..1725771
RNA-Seq Expression: Spg038841
Synteny: Spg038841
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
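
The genome location field above uses a `seqid:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say so), the field can be split into its parts and the genomic span computed as follows. The names `Location`, `parse_location`, and `span_length` are hypothetical helpers introduced only for this illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # scaffold or chromosome name
    start: int   # assumed 1-based, inclusive
    end: int     # assumed 1-based, inclusive


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


def span_length(loc: Location) -> int:
    """Length of the genomic span, under the inclusive-coordinate assumption."""
    return loc.end - loc.start + 1


loc = parse_location("scaffold12:1723324..1725771")
print(loc.seqid, loc.start, loc.end, span_length(loc))
```

Under the inclusive assumption the gene spans 2,448 bp on scaffold12.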


Homology
GenBank top hits
KAG6605791.1 Nephrocystin-3, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 1.5e-191; identity: 78.33%)
Query:  VMAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTE
        VMAAAEAKPPRKHS NHFP Q DG R SKLL+TP FSS SECD AS+HR    Y PTKAAH PT  +TV I+L +DEQ N QHH  KDFAVGWAKALQ E
Subjt:  VMAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTE

Query:  VDNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQ
        VD SS+ IANKTTVSG+EH+NIEE +YRVCKNYV TEV+SNNKSVNTTKK D MDEFHYIED FTDLHNFDSQ+SKQGEKV  DL+SHWTGTEKTKPWWQ
Subjt:  VDNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQ

Query:  SASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHF-RTG
        SASKDELAS VA+KSL NLENCDLPQPRTKHQRKDQSTCLECF+QDCFLTSSFTE   S L EYNRGMHP V MGER+S+V  VG   HH    HF RT 
Subjt:  SASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHF-RTG

Query:  NEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKDR
        NEE+N  S  NLN  KAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLFLRQA+QLFAYKQWFQL+QL+NICLQLR+K+ P+TGL SDVLPWVPCKDR
Subjt:  NEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKDR

Query:  QFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT
        QF +PR++RKKRGR  R+FTM+EIA  +GLGLAGA LLLGWTT
Subjt:  QFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT

KAG7035756.1 hypothetical protein SDJN02_02554 [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 5.6e-194; identity: 78.91%)
Query:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTEV
        MAAAEAKPPRKHS NHFP Q DG R SKLL+TP FSS SECD AS+HR    Y PTKAAH PT  +TV I+L +DEQ N QHH  KDFAVGWAKALQ EV
Subjt:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTEV

Query:  DNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQS
        D SS+ IANKTTVSG+EH+NIEE +YRVCKNYV TEV+SNNKSVNTTKK D MDEFHYIED FTDLHNFDSQ+SKQGEKV  DL+SHWTGTEKTKPWWQS
Subjt:  DNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQS

Query:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHFRTGNE
        ASKDELAS VA+KSL NLENCDLPQPRTKHQRKDQSTCLECF+QDCFLTSSFTEM  S L EYNRGMHP V MGER+S+V  VG   HH    HFRT NE
Subjt:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHFRTGNE

Query:  EDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKDRQF
        E+N  S  NLN  KAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLFLRQA+QLFAYKQWFQL+QL+NICLQLR+K+ P+TGLFSDVLPWVPCKDRQF
Subjt:  EDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKDRQF

Query:  NRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT
         +PR++RKKRGR  R+FTM+EIA  +GLGLAGA LLLGWTT
Subjt:  NRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT

XP_022958712.1 uncharacterized protein LOC111459854 [Cucurbita moschata] (e-value: 4.9e-190; identity: 77.93%)
Query:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTEV
        MAAAEAKPPRKHS NHF  Q DG R SKLL+TP FSS SECD AS+HR    Y PTKAAH PT  +TV I+L +DEQ N QHH  KDFAVGWAKALQ EV
Subjt:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTEV

Query:  DNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQS
        D SS+ IANKTTVSG+EH+NIEE +YRVCKNYV TEV+SNNKSVNTTKK D MDEFHYIED FTDLHNFDSQ+SKQGEKV  DL+SHWTGTEKTKPWWQS
Subjt:  DNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQS

Query:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHF---RT
        ASKDELAS VA+KSL NLENCDLPQPRTKHQRKDQSTCLECF+QD FLTSSFTE  FS L EYNRGMHP V MGER+S+V  VG   HH    HF   RT
Subjt:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHF---RT

Query:  GNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKD
         NEE+N  S  NLN  KAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLFLRQA+QLFAYKQWFQL+QL+NICLQLR+K+ P+TGLFSDVLPWVPCKD
Subjt:  GNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKD

Query:  RQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT
        RQF +PR++RKKRGR  R+FTM+EIA  +GLGLAGA LLLGWTT
Subjt:  RQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT

XP_023532371.1 uncharacterized protein LOC111794566 [Cucurbita pepo subsp. pepo] (e-value: 2.2e-190; identity: 77.93%)
Query:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTEV
        MAAAEAKP RKHS NHFP Q DGYR SKLL+TP FSS SE D AS+HR H  Y PTKAAH PT  RTV I+L +DEQ N QHH  KDFAVGWAKALQ EV
Subjt:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTEV

Query:  DNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQS
        D SS+ IANKTTVSG+EH+NIEE +YRVCKNYV TEV+SNNKSVNTTKK + MDEFHYIED  TDLHNFDSQ+SKQGEKV  DL+SHWTGTEKTKPWWQS
Subjt:  DNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQS

Query:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHF---RT
        ASKDELAS VA+KSL NLENCDLPQPRTKHQRKDQSTCLECF+QDCFLTSSFTE  FS L EYNRGMHP V MG+R+S+V  VG   HH    HF   RT
Subjt:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHF---RT

Query:  GNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKD
         NEE+N  S  NLN  KAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLFLRQA+QLFAYKQWFQL+QL+NICLQLR+K+ P+TGLFSDVLPWVPCKD
Subjt:  GNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKD

Query:  RQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT
        RQF +PR++RKKRGR  R+FTM+EIA  +GLGLAGA LLLGWTT
Subjt:  RQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT

XP_038875629.1 uncharacterized protein LOC120068032 isoform X2 [Benincasa hispida] (e-value: 1.1e-189; identity: 78.67%)
Query:  LDVGGVMAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAK
        LDV G MAAAEAKPPRKHSEN+FPA+DD YRDSKLLSTPSFSSNSE   ASDHRAH    PTKAAH P S +TVD EL SD+  +FQHH  K FAVGWAK
Subjt:  LDVGGVMAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAK

Query:  ALQTEVDNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKT
         +Q  +DNSSV IA+KTTVS EEHQNIEEL +RVCK Y T    SNN+SVN TKKTDCMDEFHYIEDHFTDLHNFDS ISKQGEKV FDL+SHWT  EKT
Subjt:  ALQTEVDNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKT

Query:  KPWWQSASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDH
        KPWW+SASKDELASLVA+KSLENLENCDLPQPRTKHQ KD+STC ECF QDCFLTS FTEM FSSL  YNRGMHP   M ERQ VVGSVG+ L H  QDH
Subjt:  KPWWQSASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDH

Query:  F---RTGNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLP
        F   RTGNEE +SSS  NLN+SKAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLFLRQA+QLFAYKQWFQLLQLQN+CLQLRNKD      FSDVLP
Subjt:  F---RTGNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLP

Query:  WVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT
        WVPCKDRQFN+PRNRRKKR RD  +FTM++IA  VGLGLAGA+LL+GWTT
Subjt:  WVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT

TrEMBL top hits
A0A1S3AUK4 uncharacterized protein LOC103482971 (e-value: 1.3e-175; identity: 74.1%)
Query:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTEV
        MA AEA  PRKH ENHF A+DDGYRDSKL STP FS NSE + ASD R H    P+K AH PTS  +VD EL S +Q +FQH   + FAVGWAK LQ  V
Subjt:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTEV

Query:  DNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQS
        DNSSV IA+K TVS E HQNIEEL ++VCK+YVTTE VSNN+SVN TKKTDC+DEFHYIEDHFTDLHN D+QIS +GEKV  DL+SHW G EKTKPWW+S
Subjt:  DNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQS

Query:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHF---RT
        ASKDELASLVA+KSLEN+ENCDLPQPRTKHQ K++STC ECFDQDCFL S FTEM FSSL   NR + P   MGERQ +VG++G+SL H  QDHF   RT
Subjt:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHF---RT

Query:  GNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKD
         NEE+NSS   NLN+SKAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLFLRQA+QLFAY+QW QLLQLQNICLQLRNKDQPITGLFSD LPW PCKD
Subjt:  GNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKD

Query:  RQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT
         QFN+PRNRRKKR +D  +FT   IA  VGL LAGASLLLGWTT
Subjt:  RQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT

A0A5A7TGM0 Uncharacterized protein (e-value: 4.4e-176; identity: 74.32%)
Query:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTEV
        MA AEA  PRKH ENHF A+DDGYRDSKL STP FS NSE + ASD R H    P+K AH PTS  TVD EL S +Q +FQH   + FAVGWAK LQ  V
Subjt:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTEV

Query:  DNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQS
        DNSSV IA+K TVS E HQNIEEL ++VCK+YVTTE VSNN+SVN TKKTDC+DEFHYIEDHFTDLHN D+QIS +GEKV  DL+SHW G EKTKPWW+S
Subjt:  DNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQS

Query:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHF---RT
        ASKDELASLVA+KSLEN+ENCDLPQPRTKHQ K++STC ECFDQDCFL S FTEM FSSL   NR + P   MGERQ +VG++G+SL H  QDHF   RT
Subjt:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHF---RT

Query:  GNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKD
         NEE+NSS   NLN+SKAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLFLRQA+QLFAY+QW QLLQLQNICLQLRNKDQPITGLFSD LPW PCKD
Subjt:  GNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKD

Query:  RQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT
         QFN+PRNRRKKR +D  +FT   IA  VGL LAGASLLLGWTT
Subjt:  RQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT

A0A6J1DUQ0 uncharacterized protein LOC111024628 (e-value: 6.7e-185; identity: 76.58%)
Query:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTEV
        MAAAEAKPP+KH ENHFPAQDDGY  S+LLST   SSNSEC+ + D RA   Y PTK+AH PTSN TVDI+L  D Q +F+HH  KDF + W K LQ EV
Subjt:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTEV

Query:  DNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQS
        D+ S+  A+KTTVSGEEHQNIEEL Y+V + Y  TE VSNNKSVNT KKTDCMDEF YIEDHFTDLH FDS   KQGE V FDL+SHWTGTEKTKPWW+S
Subjt:  DNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQS

Query:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHF---RT
        ASKDELASLVA+KSLE+LENCDLPQPRTKH RKDQS   ECFDQDCFLTSSFTEM FSSL  Y+RGMH  V MGERQS VGSVG+SLH   QDHF   R+
Subjt:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHF---RT

Query:  GNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKD
        GNEE+NSSS  N++ SKAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKD+PI+G+FSDVLPWVPCKD
Subjt:  GNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKD

Query:  RQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT
        RQFN+ RNRRKKRGR R   TM+++AV VGLGL GA LLLGWT+
Subjt:  RQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT

A0A6J1H2L0 uncharacterized protein LOC111459854 (e-value: 2.4e-190; identity: 77.93%)
Query:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTEV
        MAAAEAKPPRKHS NHF  Q DG R SKLL+TP FSS SECD AS+HR    Y PTKAAH PT  +TV I+L +DEQ N QHH  KDFAVGWAKALQ EV
Subjt:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTEV

Query:  DNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQS
        D SS+ IANKTTVSG+EH+NIEE +YRVCKNYV TEV+SNNKSVNTTKK D MDEFHYIED FTDLHNFDSQ+SKQGEKV  DL+SHWTGTEKTKPWWQS
Subjt:  DNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQS

Query:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHF---RT
        ASKDELAS VA+KSL NLENCDLPQPRTKHQRKDQSTCLECF+QD FLTSSFTE  FS L EYNRGMHP V MGER+S+V  VG   HH    HF   RT
Subjt:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHF---RT

Query:  GNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKD
         NEE+N  S  NLN  KAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLFLRQA+QLFAYKQWFQL+QL+NICLQLR+K+ P+TGLFSDVLPWVPCKD
Subjt:  GNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKD

Query:  RQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT
        RQF +PR++RKKRGR  R+FTM+EIA  +GLGLAGA LLLGWTT
Subjt:  RQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT

A0A6J1KA69 uncharacterized protein LOC111491498 (e-value: 1.5e-189; identity: 77.25%)
Query:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTEV
        MAAAEAKPPRKHS NHFP Q DGYR SKLL+TP  SS SECD AS+HR H  Y PTKA H PT  RTV I+L +DEQ N QHH  KDFAVGWAKALQ EV
Subjt:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTEV

Query:  DNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQS
        D SS+ IANKTTVSG+EH+NIEE +Y VCKNYV TEV+SNNKSVNTTKK D MDEFHYIED FTDLHNFDSQ+SKQGEKV   L+S WTGTEKTKPWWQS
Subjt:  DNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQS

Query:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHF---RT
        ASKDELAS VA+KSL NLENCDLPQPRT+HQRKDQSTCLECF+QDCFLTSSFTE  FS L EYNRGMHP V MGER+S+V  VG   HH   +HF   RT
Subjt:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHF---RT

Query:  GNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKD
         NEE+N  S  NLN  KAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLF RQA+QLFAYKQWFQL+QL+NICLQLR+K+ P+TGLFSDVLPWVPCKD
Subjt:  GNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKD

Query:  RQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT
        RQF +PR++RKK+GR  R+FTM++IA  +GLGLAGASLLLGWTT
Subjt:  RQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTT

SwissProt top hits
No hits found

Arabidopsis top hits
AT1G01240.1 unknown protein (e-value: 3.1e-25; identity: 35%)
Query:  EKTKPWWQSAS-KDELASLVAQKSLE-NLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHH
        + T PWW+S + KDELA +VA KS++ N++NCDLP P+  H+                       +H SS GE  +G    V    +Q V          
Subjt:  EKTKPWWQSAS-KDELASLVAQKSLE-NLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHH

Query:  QDQDHFR-----TGNEEDNSSSFLNL----NASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQ
          +D F       G+ E  ++S ++     + SK QLLEAL +SQTRAREAE+AA+EA  EK  ++++ L+QASQ+ AYKQW +LL+++ + LQ++ +++
Subjt:  QDQDHFR-----TGNEEDNSSSFLNL----NASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQ

Query:  PITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWT
               + +  +  K R   + R  +KK+G   R    + +A  +G  L GA LLLGWT
Subjt:  PITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWT

AT1G01240.2 unknown protein (e-value: 3.1e-25; identity: 35%)
Query:  EKTKPWWQSAS-KDELASLVAQKSLE-NLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHH
        + T PWW+S + KDELA +VA KS++ N++NCDLP P+  H+                       +H SS GE  +G    V    +Q V          
Subjt:  EKTKPWWQSAS-KDELASLVAQKSLE-NLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHH

Query:  QDQDHFR-----TGNEEDNSSSFLNL----NASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQ
          +D F       G+ E  ++S ++     + SK QLLEAL +SQTRAREAE+AA+EA  EK  ++++ L+QASQ+ AYKQW +LL+++ + LQ++ +++
Subjt:  QDQDHFR-----TGNEEDNSSSFLNL----NASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQ

Query:  PITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWT
               + +  +  K R   + R  +KK+G   R    + +A  +G  L GA LLLGWT
Subjt:  PITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWT

AT1G01240.3 unknown protein (e-value: 3.1e-25; identity: 35%)
Query:  EKTKPWWQSAS-KDELASLVAQKSLE-NLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHH
        + T PWW+S + KDELA +VA KS++ N++NCDLP P+  H+                       +H SS GE  +G    V    +Q V          
Subjt:  EKTKPWWQSAS-KDELASLVAQKSLE-NLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHH

Query:  QDQDHFR-----TGNEEDNSSSFLNL----NASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQ
          +D F       G+ E  ++S ++     + SK QLLEAL +SQTRAREAE+AA+EA  EK  ++++ L+QASQ+ AYKQW +LL+++ + LQ++ +++
Subjt:  QDQDHFR-----TGNEEDNSSSFLNL----NASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQ

Query:  PITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWT
               + +  +  K R   + R  +KK+G   R    + +A  +G  L GA LLLGWT
Subjt:  PITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWT

AT2G46550.1 unknown protein (e-value: 3.1e-33; identity: 29.2%)
Query:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLL-----STPSFSSNSECDVASDHRAHRLYKPTKAA-----HNPTSNRTVDIELGSDEQSNFQHHKDKDFAV
        MAAAEA+   + + N    Q+D  R  KL      S+ + SS  + +V+       +   T+++       P +    D+   +    +  HH+   F V
Subjt:  MAAAEAKPPRKHSENHFPAQDDGYRDSKLL-----STPSFSSNSECDVASDHRAHRLYKPTKAA-----HNPTSNRTVDIELGSDEQSNFQHHKDKDFAV

Query:  GWAKALQTEVDNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWT-
           + L+ EV+N  V  + K +  G   +  +  N    + ++  E++   +S  +T                     +D    K+  ++ FD  S W  
Subjt:  GWAKALQTEVDNSSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWT-

Query:  -GTEKTKPWWQSASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLH
          +EK  PWW++  KDELASLVAQ+SL+ +ENCDLP P  +  ++        FD D           +S  G+  +G               S G+S  
Subjt:  -GTEKTKPWWQSASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLH

Query:  HQDQDHFRTGNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSD
        ++ +      +E D          SK++LLEAL  SQTRAREAE  A+EA  EK+H+V + L+QA++LF YKQW QLLQL+ + LQ++NK+        D
Subjt:  HQDQDHFRTGNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSD

Query:  VLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWT
            +PC      R   R+++  R +     + + + +G+ L GA LLLGWT
Subjt:  VLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWT

AT2G46550.2 unknown protein (e-value: 9.1e-33; identity: 35.77%)
Query:  FDSQISKQGEKVPFDLDSHWT--GTEKTKPWWQSASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRG
        +D    K+  ++ FD  S W    +EK  PWW++  KDELASLVAQ+SL+ +ENCDLP P  +  ++        FD D           +S  G+  +G
Subjt:  FDSQISKQGEKVPFDLDSHWT--GTEKTKPWWQSASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRG

Query:  MHPLVDMGERQSVVGSVGNSLHHQDQDHFRTGNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLL
                       S G+S  ++ +      +E D          SK++LLEAL  SQTRAREAE  A+EA  EK+H+V + L+QA++LF YKQW QLL
Subjt:  MHPLVDMGERQSVVGSVGNSLHHQDQDHFRTGNEEDNSSSFLNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLL

Query:  QLQNICLQLRNKDQPITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWT
        QL+ + LQ++NK+        D    +PC      R   R+++  R +     + + + +G+ L GA LLLGWT
Subjt:  QLQNICLQLRNKDQPITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWT
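
Each alignment block above carries an unlabeled middle row between the Query and Subjt rows. As a rough sketch, assuming this row follows the usual BLAST convention (a letter marks an identical residue, `+` marks a conservative substitution, a space marks a mismatch or gap), per-row identity can be tallied as below. The function name `midline_stats` is hypothetical, and the percent identity reported for each hit is computed over the whole alignment, so a single row need not reproduce it.

```python
from typing import Optional, Tuple


def midline_stats(midline: str, aligned_length: Optional[int] = None) -> Tuple[int, int, float]:
    """Return (identities, positives, percent identity) for one midline row.

    aligned_length should be the length of the matching Query row when
    trailing spaces may have been stripped from the midline; it defaults
    to len(midline).
    """
    length = len(midline) if aligned_length is None else aligned_length
    if length <= 0:
        raise ValueError("aligned_length must be positive")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, 100.0 * identities / length


# Toy row: three identical residues, one conservative substitution, one mismatch.
print(midline_stats("AB +D"))
```

Passing `aligned_length` explicitly matters for rows whose midline ends in mismatches, because trailing spaces are easily lost when a page is copied as plain text.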


Sequences
CDS sequence
ATGAGCCTGGATGTTGGAGGAGTAATGGCTGCAGCAGAAGCAAAGCCTCCACGAAAGCATTCAGAAAACCACTTCCCTGCCCAAGATGATGGCTACAGAGATTCAAAATT
ATTGTCAACTCCATCTTTTTCCTCAAATTCAGAATGTGATGTTGCATCTGATCATAGAGCTCATAGGCTTTATAAACCAACCAAAGCTGCTCATAACCCAACATCTAATC
GAACAGTTGACATTGAGTTGGGGTCAGATGAGCAATCTAATTTTCAGCACCATAAAGATAAAGATTTTGCAGTAGGATGGGCAAAAGCTTTGCAGACTGAAGTGGATAAT
TCAAGTGTATGGATTGCAAATAAAACTACTGTATCTGGTGAGGAACATCAAAACATAGAAGAACTTAATTATCGAGTTTGCAAGAATTATGTTACCACTGAGGTTGTGAG
TAATAATAAATCAGTAAACACGACTAAGAAAACAGACTGCATGGATGAGTTCCACTACATAGAGGATCATTTTACAGACTTGCACAATTTTGACAGTCAGATCTCCAAGC
AAGGAGAGAAGGTCCCTTTTGATTTGGATTCACATTGGACAGGAACTGAGAAGACTAAACCATGGTGGCAATCTGCTAGTAAAGACGAGTTGGCTTCCTTGGTTGCGCAG
AAGTCTCTTGAAAACTTAGAAAATTGCGACCTTCCTCAACCACGAACCAAACACCAGAGAAAGGACCAATCTACCTGTCTTGAATGTTTTGATCAAGATTGCTTTCTCAC
TTCGTCATTTACTGAGATGCACTTCTCCAGTTTGGGTGAATATAACAGGGGAATGCACCCTTTGGTTGACATGGGTGAGAGACAGTCCGTTGTTGGTAGTGTAGGTAACT
CACTGCATCATCAAGATCAAGATCATTTCAGAACTGGCAATGAAGAAGACAACTCAAGTAGTTTTTTGAATCTGAACGCTAGCAAAGCCCAATTGTTGGAAGCACTATGC
TATTCACAAACTCGAGCAAGGGAGGCCGAGAAAGCAGCACAAGAAGCAGATACAGAGAAGAAGCACATTGTCTCACTCTTTCTCAGACAAGCCAGCCAGCTTTTTGCTTA
TAAACAGTGGTTCCAGTTGCTGCAGTTACAGAACATTTGCCTTCAACTTAGGAACAAAGATCAACCAATAACTGGTCTGTTCTCGGATGTCTTGCCTTGGGTCCCCTGTA
AAGATAGGCAGTTCAATCGGCCTAGAAACAGAAGGAAGAAACGGGGCCGAGACCGTCGTGAGTTTACAATGTTCGAGATTGCTGTCACTGTGGGATTGGGTCTTGCTGGT
GCCAGTTTGCTCCTCGGATGGACAACA
mRNA sequence
ATGAGCCTGGATGTTGGAGGAGTAATGGCTGCAGCAGAAGCAAAGCCTCCACGAAAGCATTCAGAAAACCACTTCCCTGCCCAAGATGATGGCTACAGAGATTCAAAATT
ATTGTCAACTCCATCTTTTTCCTCAAATTCAGAATGTGATGTTGCATCTGATCATAGAGCTCATAGGCTTTATAAACCAACCAAAGCTGCTCATAACCCAACATCTAATC
GAACAGTTGACATTGAGTTGGGGTCAGATGAGCAATCTAATTTTCAGCACCATAAAGATAAAGATTTTGCAGTAGGATGGGCAAAAGCTTTGCAGACTGAAGTGGATAAT
TCAAGTGTATGGATTGCAAATAAAACTACTGTATCTGGTGAGGAACATCAAAACATAGAAGAACTTAATTATCGAGTTTGCAAGAATTATGTTACCACTGAGGTTGTGAG
TAATAATAAATCAGTAAACACGACTAAGAAAACAGACTGCATGGATGAGTTCCACTACATAGAGGATCATTTTACAGACTTGCACAATTTTGACAGTCAGATCTCCAAGC
AAGGAGAGAAGGTCCCTTTTGATTTGGATTCACATTGGACAGGAACTGAGAAGACTAAACCATGGTGGCAATCTGCTAGTAAAGACGAGTTGGCTTCCTTGGTTGCGCAG
AAGTCTCTTGAAAACTTAGAAAATTGCGACCTTCCTCAACCACGAACCAAACACCAGAGAAAGGACCAATCTACCTGTCTTGAATGTTTTGATCAAGATTGCTTTCTCAC
TTCGTCATTTACTGAGATGCACTTCTCCAGTTTGGGTGAATATAACAGGGGAATGCACCCTTTGGTTGACATGGGTGAGAGACAGTCCGTTGTTGGTAGTGTAGGTAACT
CACTGCATCATCAAGATCAAGATCATTTCAGAACTGGCAATGAAGAAGACAACTCAAGTAGTTTTTTGAATCTGAACGCTAGCAAAGCCCAATTGTTGGAAGCACTATGC
TATTCACAAACTCGAGCAAGGGAGGCCGAGAAAGCAGCACAAGAAGCAGATACAGAGAAGAAGCACATTGTCTCACTCTTTCTCAGACAAGCCAGCCAGCTTTTTGCTTA
TAAACAGTGGTTCCAGTTGCTGCAGTTACAGAACATTTGCCTTCAACTTAGGAACAAAGATCAACCAATAACTGGTCTGTTCTCGGATGTCTTGCCTTGGGTCCCCTGTA
AAGATAGGCAGTTCAATCGGCCTAGAAACAGAAGGAAGAAACGGGGCCGAGACCGTCGTGAGTTTACAATGTTCGAGATTGCTGTCACTGTGGGATTGGGTCTTGCTGGT
GCCAGTTTGCTCCTCGGATGGACAACA
Protein sequence
MSLDVGGVMAAAEAKPPRKHSENHFPAQDDGYRDSKLLSTPSFSSNSECDVASDHRAHRLYKPTKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAVGWAKALQTEVDN
SSVWIANKTTVSGEEHQNIEELNYRVCKNYVTTEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPFDLDSHWTGTEKTKPWWQSASKDELASLVAQ
KSLENLENCDLPQPRTKHQRKDQSTCLECFDQDCFLTSSFTEMHFSSLGEYNRGMHPLVDMGERQSVVGSVGNSLHHQDQDHFRTGNEEDNSSSFLNLNASKAQLLEALC
YSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNICLQLRNKDQPITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAG
ASLLLGWTT
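
The CDS and protein sequences listed above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper written for this illustration, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; unknown codons become 'X', stops '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First and last 27 nt of the CDS listed in this record.
print(translate("ATGAGCCTGGATGTTGGAGGAGTAATG"))
print(translate("GCCAGTTTGCTCCTCGGATGGACAACA"))
```

The two fragments translate to `MSLDVGGVM` and `ASLLLGWTT`, matching the start and end of the protein sequence shown above. The listed CDS ends on a sense codon, with no stop codon included.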