; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0030801 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0030801
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionNephrocystin-3 isoform X1
Genome locationchr11:1500984..1504171
RNA-Seq ExpressionLag0030801
SyntenyLag0030801
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605791.1 Nephrocystin-3, partial [Cucurbita argyrosperma subsp. sororia]3.2e-19277.01Show/hide
Query:  VLDVGGVMAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWA
        VLDV  VMAAAEAKPPRK S +HFP Q DG R SKLL+TP FSS SECD AS+HR    Y P+KAAH PT  +TV I+L +DEQ N QHH  KDFA GWA
Subjt:  VLDVGGVMAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWA

Query:  KALQTEVDNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEK
        KALQ EVD SSI IANKTTVSG+EH+NIEE +YRVCKNYVATEV+SNNKSVNTTKK D MDEFHYIED FTDLHNFDSQ+SKQGEKV  DL+SHWTGTEK
Subjt:  KALQTEVDNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEK

Query:  TKPWWQSASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQ
        TKPWWQSASKDELAS VA+KSL NLENCDLPQPRTKHQRKDQSTCLECF+Q+CFLTSSFTE   S LDEYNRG   MHP VGMGER+S+V  VG   H+ 
Subjt:  TKPWWQSASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQ

Query:  DQDHFSISRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFS
           H   SRT NEE+N  S +NLN  KAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLFLRQA+QLFAYKQWFQL+QL+NI LQLR+K+ P+TGL S
Subjt:  DQDHFSISRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFS

Query:  DVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF
        DVLPWVPCKDRQF +PR++RKKRGR  R+FTM+EIA  +GLGLAGA LLLGWTTGWLVPIF
Subjt:  DVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF

KAG7035756.1 hypothetical protein SDJN02_02554 [Cucurbita argyrosperma subsp. argyrosperma]3.2e-19277.53Show/hide
Query:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV
        MAAAEAKPPRK S +HFP Q DG R SKLL+TP FSS SECD AS+HR    Y P+KAAH PT  +TV I+L +DEQ N QHH  KDFA GWAKALQ EV
Subjt:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV

Query:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS
        D SSI IANKTTVSG+EH+NIEE +YRVCKNYVATEV+SNNKSVNTTKK D MDEFHYIED FTDLHNFDSQ+SKQGEKV  DL+SHWTGTEKTKPWWQS
Subjt:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS

Query:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI
        ASKDELAS VA+KSL NLENCDLPQPRTKHQRKDQSTCLECF+Q+CFLTSSFTEM  S LDEYNRG   MHP VGMGER+S+V  VG   H     H S 
Subjt:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI

Query:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP
         RT NEE+N  S +NLN  KAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLFLRQA+QLFAYKQWFQL+QL+NI LQLR+K+ P+TGLFSDVLPWVP
Subjt:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP

Query:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF
        CKDRQF +PR++RKKRGR  R+FTM+EIA  +GLGLAGA LLLGWTTGWLVPIF
Subjt:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF

XP_022958712.1 uncharacterized protein LOC111459854 [Cucurbita moschata]1.9e-19277.75Show/hide
Query:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV
        MAAAEAKPPRK S +HF  Q DG R SKLL+TP FSS SECD AS+HR    Y P+KAAH PT  +TV I+L +DEQ N QHH  KDFA GWAKALQ EV
Subjt:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV

Query:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS
        D SSI IANKTTVSG+EH+NIEE +YRVCKNYVATEV+SNNKSVNTTKK D MDEFHYIED FTDLHNFDSQ+SKQGEKV  DL+SHWTGTEKTKPWWQS
Subjt:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS

Query:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI
        ASKDELAS VA+KSL NLENCDLPQPRTKHQRKDQSTCLECF+Q+ FLTSSFTE  FS LDEYNRG   MHP VGMGER+S+V  VG   H+    HFSI
Subjt:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI

Query:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP
        SRT NEE+N  S +NLN  KAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLFLRQA+QLFAYKQWFQL+QL+NI LQLR+K+ P+TGLFSDVLPWVP
Subjt:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP

Query:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF
        CKDRQF +PR++RKKRGR  R+FTM+EIA  +GLGLAGA LLLGWTTGWLVPIF
Subjt:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF

XP_022996203.1 uncharacterized protein LOC111491498 [Cucurbita maxima]1.2e-19177.09Show/hide
Query:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV
        MAAAEAKPPRK S +HFP Q DGYR SKLL+TP  SS SECD AS+HR H  Y P+KA H PT  RTV I+L +DEQ N QHH  KDFA GWAKALQ EV
Subjt:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV

Query:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS
        D SSI IANKTTVSG+EH+NIEE +Y VCKNYVATEV+SNNKSVNTTKK D MDEFHYIED FTDLHNFDSQ+SKQGEKV   L+S WTGTEKTKPWWQS
Subjt:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS

Query:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI
        ASKDELAS VA+KSL NLENCDLPQPRT+HQRKDQSTCLECF+Q+CFLTSSFTE  FS LDEYNRG   MHP VGMGER+S+V  VG   H+   +HFSI
Subjt:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI

Query:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP
        SRT NEE+N  S +NLN  KAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLF RQA+QLFAYKQWFQL+QL+NI LQLR+K+ P+TGLFSDVLPWVP
Subjt:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP

Query:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF
        CKDRQF +PR++RKK+GR  R+FTM++IA  +GLGLAGASLLLGWTTGWLVPIF
Subjt:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF

XP_023532371.1 uncharacterized protein LOC111794566 [Cucurbita pepo subsp. pepo]8.4e-19377.75Show/hide
Query:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV
        MAAAEAKP RK S +HFP Q DGYR SKLL+TP FSS SE D AS+HR H  Y P+KAAH PT  RTV I+L +DEQ N QHH  KDFA GWAKALQ EV
Subjt:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV

Query:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS
        D SSI IANKTTVSG+EH+NIEE +YRVCKNYVATEV+SNNKSVNTTKK + MDEFHYIED  TDLHNFDSQ+SKQGEKV  DL+SHWTGTEKTKPWWQS
Subjt:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS

Query:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI
        ASKDELAS VA+KSL NLENCDLPQPRTKHQRKDQSTCLECF+Q+CFLTSSFTE  FS LDEYNRG   MHP VGMG+R+S+V  VG   H+    HFSI
Subjt:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI

Query:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP
        SRT NEE+N  S +NLN  KAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLFLRQA+QLFAYKQWFQL+QL+NI LQLR+K+ P+TGLFSDVLPWVP
Subjt:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP

Query:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF
        CKDRQF +PR++RKKRGR  R+FTM+EIA  +GLGLAGA LLLGWTTGWLVPIF
Subjt:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF

TrEMBL top hitse value%identityAlignment
A0A1S3AUK4 uncharacterized protein LOC1034829713.1e-17773.13Show/hide
Query:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV
        MA AEA  PRK  E+HF  +DDGYRDSKL STP FS NSE + ASD R H    PSK AH PTS  +VD EL S +Q +FQH   + FA GWAK LQ  V
Subjt:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV

Query:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS
        DNSS+ IA+K TVS E HQNIEEL ++VCK+YV TE VSNN+SVN TKKTDC+DEFHYIEDHFTDLHN D+QIS +GEKV  DL+SHW G EKTKPWW+S
Subjt:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS

Query:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI
        ASKDELASLVA+KSLEN+ENCDLPQPRTKHQ K++STC ECFDQ+CFL S FTEM FSSLD  NR    + P  GMGERQ +VG++G+SL +  QDHFSI
Subjt:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI

Query:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP
        SRT NEE+NSS  +NLN+SKAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLFLRQA+QLFAY+QW QLLQLQNI LQLRNKDQPITGLFSD LPW P
Subjt:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP

Query:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF
        CKD QFN+PRNRRKKR +D  +FT   IA  VGL LAGASLLLGWTTGWLVP+F
Subjt:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF

A0A5A7TGM0 Uncharacterized protein4.2e-17473.05Show/hide
Query:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV
        MA AEA  PRK  E+HF  +DDGYRDSKL STP FS NSE + ASD R H    PSK AH PTS  TVD EL S +Q +FQH   + FA GWAK LQ  V
Subjt:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV

Query:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS
        DNSS+ IA+K TVS E HQNIEEL ++VCK+YV TE VSNN+SVN TKKTDC+DEFHYIEDHFTDLHN D+QIS +GEKV  DL+SHW G EKTKPWW+S
Subjt:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS

Query:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI
        ASKDELASLVA+KSLEN+ENCDLPQPRTKHQ K++STC ECFDQ+CFL S FTEM FSSLD  NR    + P  GMGERQ +VG++G+SL +  QDHFSI
Subjt:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI

Query:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP
        SRT NEE+NSS  +NLN+SKAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLFLRQA+QLFAY+QW QLLQLQNI LQLRNKDQPITGLFSD LPW P
Subjt:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP

Query:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGW
        CKD QFN+PRNRRKKR +D  +FT   IA  VGL LAGASLLLGWTTG+
Subjt:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGW

A0A6J1DUQ0 uncharacterized protein LOC1110246284.1e-18575.77Show/hide
Query:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV
        MAAAEAKPP+K  E+HFP QDDGY  S+LLST   SSNSEC+ + D RA   Y P+K+AH PTSN TVDI+L  D Q +F+HH  KDF   W K LQ EV
Subjt:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV

Query:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS
        D+ SI  A+KTTVSGEEHQNIEEL Y+V + Y ATE VSNNKSVNT KKTDCMDEF YIEDHFTDLH FDS   KQGE V  DL+SHWTGTEKTKPWW+S
Subjt:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS

Query:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI
        ASKDELASLVA+KSLE+LENCDLPQPRTKH RKDQS   ECFDQ+CFLTSSFTEM FSSLD Y+RGMH+    V MGERQS VGSVG+SLH   QDHFS 
Subjt:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI

Query:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP
        SR+GNEE+NSSS  N++ SKAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNI LQLRNKD+PI+G+FSDVLPWVP
Subjt:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP

Query:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF
        CKDRQFN+ RNRRKKRGR R   TM+++AV VGLGL GA LLLGWT+GWLV IF
Subjt:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF

A0A6J1H2L0 uncharacterized protein LOC1114598549.1e-19377.75Show/hide
Query:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV
        MAAAEAKPPRK S +HF  Q DG R SKLL+TP FSS SECD AS+HR    Y P+KAAH PT  +TV I+L +DEQ N QHH  KDFA GWAKALQ EV
Subjt:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV

Query:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS
        D SSI IANKTTVSG+EH+NIEE +YRVCKNYVATEV+SNNKSVNTTKK D MDEFHYIED FTDLHNFDSQ+SKQGEKV  DL+SHWTGTEKTKPWWQS
Subjt:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS

Query:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI
        ASKDELAS VA+KSL NLENCDLPQPRTKHQRKDQSTCLECF+Q+ FLTSSFTE  FS LDEYNRG   MHP VGMGER+S+V  VG   H+    HFSI
Subjt:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI

Query:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP
        SRT NEE+N  S +NLN  KAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLFLRQA+QLFAYKQWFQL+QL+NI LQLR+K+ P+TGLFSDVLPWVP
Subjt:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP

Query:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF
        CKDRQF +PR++RKKRGR  R+FTM+EIA  +GLGLAGA LLLGWTTGWLVPIF
Subjt:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF

A0A6J1KA69 uncharacterized protein LOC1114914985.9e-19277.09Show/hide
Query:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV
        MAAAEAKPPRK S +HFP Q DGYR SKLL+TP  SS SECD AS+HR H  Y P+KA H PT  RTV I+L +DEQ N QHH  KDFA GWAKALQ EV
Subjt:  MAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQTEV

Query:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS
        D SSI IANKTTVSG+EH+NIEE +Y VCKNYVATEV+SNNKSVNTTKK D MDEFHYIED FTDLHNFDSQ+SKQGEKV   L+S WTGTEKTKPWWQS
Subjt:  DNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQS

Query:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI
        ASKDELAS VA+KSL NLENCDLPQPRT+HQRKDQSTCLECF+Q+CFLTSSFTE  FS LDEYNRG   MHP VGMGER+S+V  VG   H+   +HFSI
Subjt:  ASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSI

Query:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP
        SRT NEE+N  S +NLN  KAQLLEALC+SQTRAREAEKAAQEADTEKKHIVSLF RQA+QLFAYKQWFQL+QL+NI LQLR+K+ P+TGLFSDVLPWVP
Subjt:  SRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVP

Query:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF
        CKDRQF +PR++RKK+GR  R+FTM++IA  +GLGLAGASLLLGWTTGWLVPIF
Subjt:  CKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVPIF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G01240.1 unknown protein1.4e-2835.69Show/hide
Query:  EKTKPWWQSAS-KDELASLVAQKSLE-NLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGER--QSVVGSVG
        + T PWW+S + KDELA +VA KS++ N++NCDLP P+  H+                       +H SS                 GE+  ++ V S  
Subjt:  EKTKPWWQSAS-KDELASLVAQKSLE-NLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGER--QSVVGSVG

Query:  NSLHNQDQDHFSISRTGNEEDNS----SSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRN
             +D+   S+S  G+ E  +    SS  + + SK QLLEAL +SQTRAREAE+AA+EA  EK  ++++ L+QASQ+ AYKQW +LL+++ +YLQ++ 
Subjt:  NSLHNQDQDHFSISRTGNEEDNS----SSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRN

Query:  KDQPITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVP
        +++       + +  +  K R   + R  +KK+G   R    + +A  +G  L GA LLLGWT GWL+P
Subjt:  KDQPITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVP

AT1G01240.2 unknown protein1.4e-2835.69Show/hide
Query:  EKTKPWWQSAS-KDELASLVAQKSLE-NLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGER--QSVVGSVG
        + T PWW+S + KDELA +VA KS++ N++NCDLP P+  H+                       +H SS                 GE+  ++ V S  
Subjt:  EKTKPWWQSAS-KDELASLVAQKSLE-NLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGER--QSVVGSVG

Query:  NSLHNQDQDHFSISRTGNEEDNS----SSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRN
             +D+   S+S  G+ E  +    SS  + + SK QLLEAL +SQTRAREAE+AA+EA  EK  ++++ L+QASQ+ AYKQW +LL+++ +YLQ++ 
Subjt:  NSLHNQDQDHFSISRTGNEEDNS----SSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRN

Query:  KDQPITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVP
        +++       + +  +  K R   + R  +KK+G   R    + +A  +G  L GA LLLGWT GWL+P
Subjt:  KDQPITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVP

AT1G01240.3 unknown protein1.4e-2835.69Show/hide
Query:  EKTKPWWQSAS-KDELASLVAQKSLE-NLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGER--QSVVGSVG
        + T PWW+S + KDELA +VA KS++ N++NCDLP P+  H+                       +H SS                 GE+  ++ V S  
Subjt:  EKTKPWWQSAS-KDELASLVAQKSLE-NLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGER--QSVVGSVG

Query:  NSLHNQDQDHFSISRTGNEEDNS----SSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRN
             +D+   S+S  G+ E  +    SS  + + SK QLLEAL +SQTRAREAE+AA+EA  EK  ++++ L+QASQ+ AYKQW +LL+++ +YLQ++ 
Subjt:  NSLHNQDQDHFSISRTGNEEDNS----SSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRN

Query:  KDQPITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVP
        +++       + +  +  K R   + R  +KK+G   R    + +A  +G  L GA LLLGWT GWL+P
Subjt:  KDQPITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWLVP

AT2G46550.1 unknown protein5.0e-3435.79Show/hide
Query:  FDSQISKQGEKVPCDLDSHWT--GTEKTKPWWQSASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRG
        +D    K+  ++  D  S W    +EK  PWW++  KDELASLVAQ+SL+ +ENCDLP P  +  ++        FD +              L +Y+  
Subjt:  FDSQISKQGEKVPCDLDSHWT--GTEKTKPWWQSASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRG

Query:  MHQMHPLVGMGERQSVVG-SVGNSLHNQDQDHFSISRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAY
                     Q++ G S G+S  N+ +              +SS ++L  SK++LLEAL  SQTRAREAE  A+EA  EK+H+V + L+QA++LF Y
Subjt:  MHQMHPLVGMGERQSVVG-SVGNSLHNQDQDHFSISRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAY

Query:  KQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWL
        KQW QLLQL+ +YLQ++NK+        D    +PC      R   R+++  R +     + + + +G+ L GA LLLGWT GW+
Subjt:  KQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWL

AT2G46550.2 unknown protein5.0e-3435.79Show/hide
Query:  FDSQISKQGEKVPCDLDSHWT--GTEKTKPWWQSASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRG
        +D    K+  ++  D  S W    +EK  PWW++  KDELASLVAQ+SL+ +ENCDLP P  +  ++        FD +              L +Y+  
Subjt:  FDSQISKQGEKVPCDLDSHWT--GTEKTKPWWQSASKDELASLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRG

Query:  MHQMHPLVGMGERQSVVG-SVGNSLHNQDQDHFSISRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAY
                     Q++ G S G+S  N+ +              +SS ++L  SK++LLEAL  SQTRAREAE  A+EA  EK+H+V + L+QA++LF Y
Subjt:  MHQMHPLVGMGERQSVVG-SVGNSLHNQDQDHFSISRTGNEEDNSSSFTNLNASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAY

Query:  KQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWL
        KQW QLLQL+ +YLQ++NK+        D    +PC      R   R+++  R +     + + + +G+ L GA LLLGWT GW+
Subjt:  KQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFEIAVTVGLGLAGASLLLGWTTGWL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAATTCCCTTTGTGTCCTGGATGTTGGAGGAGTAATGGCTGCAGCAGAAGCAAAGCCTCCACGAAAGCGTTCAGAAGACCACTTCCCTACCCAAGATGATGGCTA
CAGAGATTCAAAATTATTGTCAACTCCATTTTTTTCCTCAAATTCAGAATGTGATGTTGCATCTGATCATAGAGCTCATAGGCTTTATAAACCAAGCAAAGCTGCTCATA
ACCCAACATCTAATCGAACAGTTGACATTGAGTTGGGGTCAGATGAGCAGTCTAATTTTCAGCACCATAAAGATAAAGATTTTGCAGAAGGATGGGCAAAAGCTTTGCAG
ACTGAAGTGGATAATTCAAGTATATGGATTGCAAATAAAACTACTGTATCTGGTGAGGAACATCAAAACATAGAAGAACTTAATTATCGAGTTTGCAAGAATTATGTTGC
CACTGAGGTTGTGAGTAATAATAAATCAGTAAACACAACTAAGAAAACAGACTGCATGGATGAGTTCCACTACATAGAGGATCATTTTACAGACTTGCACAATTTTGACA
GTCAGATCTCCAAGCAAGGAGAGAAGGTTCCTTGTGATTTGGATTCACATTGGACAGGAACTGAGAAGACTAAACCATGGTGGCAATCTGCTAGTAAAGACGAGTTGGCT
TCCTTGGTTGCGCAGAAGTCTCTTGAAAACTTAGAAAATTGCGACCTTCCTCAACCACGAACAAAACACCAGAGAAAGGACCAATCTACCTGTCTTGAATGTTTTGATCA
AAATTGCTTTCTCACTTCATCATTTACTGAGATGCACTTCTCCAGTTTGGATGAATATAACAGGGGAATGCACCAAATGCACCCTTTGGTTGGCATGGGTGAGAGACAGT
CCGTTGTTGGTAGTGTAGGTAACTCACTGCATAATCAAGATCAAGATCATTTCAGCATTAGCAGAACTGGCAATGAAGAAGACAACTCAAGTAGTTTTACGAATCTGAAC
GCTAGCAAAGCCCAATTGTTGGAAGCACTATGCTATTCACAAACTCGAGCAAGGGAAGCCGAGAAAGCAGCACAAGAAGCAGATACAGAGAAGAAGCACATTGTCTCACT
CTTTCTCAGACAAGCCAGCCAGCTTTTTGCTTATAAACAGTGGTTCCAGTTGCTGCAGTTACAGAACATTTACCTTCAACTTAGGAACAAAGATCAACCAATAACTGGTC
TGTTCTCGGATGTCTTGCCTTGGGTCCCCTGTAAAGATAGGCAGTTCAATCGGCCTAGAAACAGAAGGAAGAAACGGGGCCGAGACCGTCGTGAGTTTACAATGTTCGAG
ATTGCTGTCACTGTGGGATTGGGTCTTGCTGGTGCCAGTTTGCTCCTCGGATGGACAACAGGTTGGTTGGTTCCCATTTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGAATTCCCTTTGTGTCCTGGATGTTGGAGGAGTAATGGCTGCAGCAGAAGCAAAGCCTCCACGAAAGCGTTCAGAAGACCACTTCCCTACCCAAGATGATGGCTA
CAGAGATTCAAAATTATTGTCAACTCCATTTTTTTCCTCAAATTCAGAATGTGATGTTGCATCTGATCATAGAGCTCATAGGCTTTATAAACCAAGCAAAGCTGCTCATA
ACCCAACATCTAATCGAACAGTTGACATTGAGTTGGGGTCAGATGAGCAGTCTAATTTTCAGCACCATAAAGATAAAGATTTTGCAGAAGGATGGGCAAAAGCTTTGCAG
ACTGAAGTGGATAATTCAAGTATATGGATTGCAAATAAAACTACTGTATCTGGTGAGGAACATCAAAACATAGAAGAACTTAATTATCGAGTTTGCAAGAATTATGTTGC
CACTGAGGTTGTGAGTAATAATAAATCAGTAAACACAACTAAGAAAACAGACTGCATGGATGAGTTCCACTACATAGAGGATCATTTTACAGACTTGCACAATTTTGACA
GTCAGATCTCCAAGCAAGGAGAGAAGGTTCCTTGTGATTTGGATTCACATTGGACAGGAACTGAGAAGACTAAACCATGGTGGCAATCTGCTAGTAAAGACGAGTTGGCT
TCCTTGGTTGCGCAGAAGTCTCTTGAAAACTTAGAAAATTGCGACCTTCCTCAACCACGAACAAAACACCAGAGAAAGGACCAATCTACCTGTCTTGAATGTTTTGATCA
AAATTGCTTTCTCACTTCATCATTTACTGAGATGCACTTCTCCAGTTTGGATGAATATAACAGGGGAATGCACCAAATGCACCCTTTGGTTGGCATGGGTGAGAGACAGT
CCGTTGTTGGTAGTGTAGGTAACTCACTGCATAATCAAGATCAAGATCATTTCAGCATTAGCAGAACTGGCAATGAAGAAGACAACTCAAGTAGTTTTACGAATCTGAAC
GCTAGCAAAGCCCAATTGTTGGAAGCACTATGCTATTCACAAACTCGAGCAAGGGAAGCCGAGAAAGCAGCACAAGAAGCAGATACAGAGAAGAAGCACATTGTCTCACT
CTTTCTCAGACAAGCCAGCCAGCTTTTTGCTTATAAACAGTGGTTCCAGTTGCTGCAGTTACAGAACATTTACCTTCAACTTAGGAACAAAGATCAACCAATAACTGGTC
TGTTCTCGGATGTCTTGCCTTGGGTCCCCTGTAAAGATAGGCAGTTCAATCGGCCTAGAAACAGAAGGAAGAAACGGGGCCGAGACCGTCGTGAGTTTACAATGTTCGAG
ATTGCTGTCACTGTGGGATTGGGTCTTGCTGGTGCCAGTTTGCTCCTCGGATGGACAACAGGTTGGTTGGTTCCCATTTTTTGA
Protein sequenceShow/hide protein sequence
MQNSLCVLDVGGVMAAAEAKPPRKRSEDHFPTQDDGYRDSKLLSTPFFSSNSECDVASDHRAHRLYKPSKAAHNPTSNRTVDIELGSDEQSNFQHHKDKDFAEGWAKALQ
TEVDNSSIWIANKTTVSGEEHQNIEELNYRVCKNYVATEVVSNNKSVNTTKKTDCMDEFHYIEDHFTDLHNFDSQISKQGEKVPCDLDSHWTGTEKTKPWWQSASKDELA
SLVAQKSLENLENCDLPQPRTKHQRKDQSTCLECFDQNCFLTSSFTEMHFSSLDEYNRGMHQMHPLVGMGERQSVVGSVGNSLHNQDQDHFSISRTGNEEDNSSSFTNLN
ASKAQLLEALCYSQTRAREAEKAAQEADTEKKHIVSLFLRQASQLFAYKQWFQLLQLQNIYLQLRNKDQPITGLFSDVLPWVPCKDRQFNRPRNRRKKRGRDRREFTMFE
IAVTVGLGLAGASLLLGWTTGWLVPIF