; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh16G004030 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh16G004030
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionSynaptojanin-1
Genome locationCmo_Chr16:1851860..1855035
RNA-Seq ExpressionCmoCh16G004030
SyntenyCmoCh16G004030
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576973.1 Transmembrane emp24 domain-containing protein p24delta3, partial [Cucurbita argyrosperma subsp. sororia]7.4e-24399.09Show/hide
Query:  DALQMGKGEDQNLPQQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLP
        DALQMGKGEDQNLPQQHRREDSSGFICRECSI+FCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLP
Subjt:  DALQMGKGEDQNLPQQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLP

Query:  HIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQR
        HIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQR
Subjt:  HIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQR

Query:  ASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSV
        ASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGR TTQRLQQLAAIINTSSERNLGLDYSV
Subjt:  ASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSV

Query:  FGEVKGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQ
        FGEVKGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPAN SPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQ
Subjt:  FGEVKGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQ

Query:  PRVSSPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDHD
        PR SSPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDHD
Subjt:  PRVSSPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDHD

KAG7014992.1 Pathogenesis-related protein 5, partial [Cucurbita argyrosperma subsp. argyrosperma]2.1e-23797.26Show/hide
Query:  DALQMGKGEDQNLPQQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLP
        DALQMGKGEDQNLPQQHRREDSSGFICRECSI+F R STELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKD IKLSATVQVYFVLEKPV+ELLP
Subjt:  DALQMGKGEDQNLPQQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLP

Query:  HIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQR
        HIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYI TPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPS FQILKFPGGISIIPFQR
Subjt:  HIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQR

Query:  ASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSV
        ASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSV
Subjt:  ASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSV

Query:  FGEVKGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQ
        FGEVKGISLSSYPKGTSMAMPP FSPAPAPAPGDH+ELPSAPHPSRSARSPAN SPPRANCETSSPALSMVPAPSLHEHSMPPI YPKSTRLIVVPPADQ
Subjt:  FGEVKGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQ

Query:  PRVSSPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDH
        PRVSSPRASPVLFHYKPGKTKEDSHRVWQPTHSSH+DH
Subjt:  PRVSSPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDH

XP_022922926.1 uncharacterized protein LOC111430758 isoform X1 [Cucurbita moschata]2.5e-243100Show/hide
Query:  MGKGEDQNLPQQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKR
        MGKGEDQNLPQQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKR
Subjt:  MGKGEDQNLPQQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKR

Query:  LEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQRASIW
        LEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQRASIW
Subjt:  LEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQRASIW

Query:  QFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEV
        QFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEV
Subjt:  QFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEV

Query:  KGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQPRVS
        KGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQPRVS
Subjt:  KGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQPRVS

Query:  SPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDHD
        SPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDHD
Subjt:  SPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDHD

XP_022984786.1 uncharacterized protein LOC111482968 isoform X1 [Cucurbita maxima]5.1e-22895.4Show/hide
Query:  MGKGEDQNLPQQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKR
        MGKGEDQNLPQQHRREDSSGF+CRECSI+F R STELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKR
Subjt:  MGKGEDQNLPQQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKR

Query:  LEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQRASIW
        LEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPS FQILKFPGGISIIPFQ ASIW
Subjt:  LEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQRASIW

Query:  QFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEV
        QFPQIVFNFTLTNSISEILNKFAKFMSQ KLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEV
Subjt:  QFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEV

Query:  KGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQPRVS
        KGISL SYPKGTSMAMPPSFSPAPAPAPGDHVEL SAP PSRSAR PAN SPP+ANCETSSPALSMVPAPS HEHSMPPI YPKSTRLIVVPPADQPRVS
Subjt:  KGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQPRVS

Query:  SPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDHD
        SPRAS +LF YKPGKTKEDSHRV QPTHSSH DHD
Subjt:  SPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDHD

XP_023552690.1 uncharacterized protein LOC111810265 [Cucurbita pepo subsp. pepo]9.7e-22794.94Show/hide
Query:  MGKGEDQNLPQQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKR
        MGKGEDQNLPQQHRRE SSGF+C ECS +F R S ELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKR
Subjt:  MGKGEDQNLPQQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKR

Query:  LEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQRASIW
        LEFDINGELDIPNVKVS+LSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLH+SNLTLTTSIFGQPS FQILKFPGGISIIPFQRASIW
Subjt:  LEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQRASIW

Query:  QFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEV
        QFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEV
Subjt:  QFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEV

Query:  KGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQPRVS
        KGISLSSYPKGTSMAMPPSFSPAPAPAPGDHV L SAPHPSRSAR PAN SPPRANCETSSPA S VPAPS HEHSMPPI YPKSTRLIVVPPADQPRVS
Subjt:  KGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQPRVS

Query:  SPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDHD
        SPRASPVLF YKPGKTKEDSHRV QPTH SHRDHD
Subjt:  SPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDHD

TrEMBL top hitse value%identityAlignment
A0A0A0L6J0 Uncharacterized protein3.7e-18477.88Show/hide
Query:  MGKGEDQNLPQQHRRE-----DSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELL
        MGKGE+QNLP Q RRE     DSSGF+C +CSI+F R   ELNFKC FVL+LGF VF+PGFFWLLPLHERN GFEAKD IKLSATVQVYFVLEKPV ELL
Subjt:  MGKGEDQNLPQQHRRE-----DSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELL

Query:  PHIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQ
        PHIKRLEFDINGELDIPNVKVS+LSMHD+GESNRTYVVFGLLSEYIT PINPVSLSLLRSSLYD FL +SNLTLTTSIFGQPS  QILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQ

Query:  RASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYS
         ASIW+FPQIVFNFTLTNSISEIL+ FAKF SQLK  L LR YENVYLQITNKIGST+QP+V+VQASI+SELGRIT+QRLQQLAAIINTS ERNLGLDYS
Subjt:  RASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYS

Query:  VFGEVKGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPAD
        VFGEVK +SLSSYPK TS AMPPSFSPAPAPAPG+HVE+PS PHP RS R PAN SPP ANC++SSP  SMVPA S HEHS+PPI YPKSTRLI VPPA+
Subjt:  VFGEVKGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPAD

Query:  QPRVSSPRASPV----------------LFHYKPGKTKEDSHRVWQPTHSSH
        QPRV SPRASPV                 F  K G+T ED      P+H  H
Subjt:  QPRVSSPRASPV----------------LFHYKPGKTKEDSHRVWQPTHSSH

A0A1S3C2E0 uncharacterized protein LOC103496125 isoform X13.5e-18277.98Show/hide
Query:  MGKGEDQNLPQQHRRE-----DSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELL
        MGKGE+QNLP Q RRE     DSSGF+C +CSI+F R   ELNFKC FVL+LGF VF+PG FWLLPLHERN GFEAK+ +KLSATVQVYFVLEKPV ELL
Subjt:  MGKGEDQNLPQQHRRE-----DSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELL

Query:  PHIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQ
        PHIKRLEFDINGELDIP+VKVS+LSMHD+GESNRTYVVFGLLSEYIT PINPVSLSLLRSSLYD FL +SNLTLTTSIFGQPS  QILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQ

Query:  RASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYS
         ASIW+FPQIVFNFTLTNSISEIL+ FAKF S+LK  L LR YENVYLQITNKIGST+QP+V+VQASI+SELGRIT+QRLQQLAAIINTS ERNLGLDYS
Subjt:  RASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYS

Query:  VFGEVKGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPAD
        VFGEVK +SLSSYPK TS AMPPSFSPAPAPAPGDHVE+PS PH  RS R PAN SPP ANC++ SP  SMVPA S HEHS+PPI YPKSTRL VVPPA+
Subjt:  VFGEVKGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPAD

Query:  QPRVSSPRASPV----------------LFHYKPGKTKED-SHRV
        QPRVSSPRASP+                 FH K G+T ED SH V
Subjt:  QPRVSSPRASPV----------------LFHYKPGKTKED-SHRV

A0A5A7TCD6 Synaptojanin-11.3e-18177.75Show/hide
Query:  MGKGEDQNLPQQHRRE-----DSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELL
        MGKGE+QNLP Q RRE     DSSGF+C +CSI+F R   ELNFKC FVL+LGF VF+PG FWLLPLHERN GFEAK+ +KLSATVQVYFVLEKPV ELL
Subjt:  MGKGEDQNLPQQHRRE-----DSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELL

Query:  PHIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQ
        PHIKRLEFDINGELDIP+VKVS+LSMHD+GESNRTYVVFGLLSEYIT PINPVSLSLLRSSLYD FL +SNLTLTTSIFGQPS  QILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQ

Query:  RASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYS
         ASIW+FPQIVFNFTLTNSISEIL+ FAKF S+LK  L LR YENVYLQITNKIGST+QP+V+VQASI+SELGRIT+QRLQQLAAIINTS ERNLGLDYS
Subjt:  RASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYS

Query:  VFGEVKGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPAD
        VFGEVK +SLSSYPK TS AMPPSFSPAPAPAPGDHVE+PS PH  RS R PAN SPP ANC++ SP   MVPA S HEHS+PPI YPKSTRL VVPPA+
Subjt:  VFGEVKGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPAD

Query:  QPRVSSPRASPV----------------LFHYKPGKTKED-SHRV
        QPRVSSPRASP+                 FH K G+T ED SH V
Subjt:  QPRVSSPRASPV----------------LFHYKPGKTKED-SHRV

A0A6J1E5G3 uncharacterized protein LOC111430758 isoform X11.2e-243100Show/hide
Query:  MGKGEDQNLPQQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKR
        MGKGEDQNLPQQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKR
Subjt:  MGKGEDQNLPQQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKR

Query:  LEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQRASIW
        LEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQRASIW
Subjt:  LEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQRASIW

Query:  QFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEV
        QFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEV
Subjt:  QFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEV

Query:  KGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQPRVS
        KGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQPRVS
Subjt:  KGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQPRVS

Query:  SPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDHD
        SPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDHD
Subjt:  SPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDHD

A0A6J1J689 uncharacterized protein LOC111482968 isoform X12.5e-22895.4Show/hide
Query:  MGKGEDQNLPQQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKR
        MGKGEDQNLPQQHRREDSSGF+CRECSI+F R STELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKR
Subjt:  MGKGEDQNLPQQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKR

Query:  LEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQRASIW
        LEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPS FQILKFPGGISIIPFQ ASIW
Subjt:  LEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQRASIW

Query:  QFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEV
        QFPQIVFNFTLTNSISEILNKFAKFMSQ KLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEV
Subjt:  QFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEV

Query:  KGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQPRVS
        KGISL SYPKGTSMAMPPSFSPAPAPAPGDHVEL SAP PSRSAR PAN SPP+ANCETSSPALSMVPAPS HEHSMPPI YPKSTRLIVVPPADQPRVS
Subjt:  KGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAPSLHEHSMPPIVYPKSTRLIVVPPADQPRVS

Query:  SPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDHD
        SPRAS +LF YKPGKTKEDSHRV QPTHSSH DHD
Subjt:  SPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDHD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2)3.2e-5040.46Show/hide
Query:  RECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKRLEFDINGELDIP-NVKVSVLSMH
        R CS +F R    +  +CL VL+L  A+ L   FWL P    +  F+A   +KL+A+VQ  F L+KPV E++ H  ++E DI   + +  N KV+VLS++
Subjt:  RECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKRLEFDINGELDIP-NVKVSVLSMH

Query:  DLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQRASIWQFPQIVFNFTLTNSISEILNKF
          G SN T V F +L       I+  SLSLLRSS   LF  +S L LTTS FG+P++FQ+LKFPGGI++ P + A +     ++F+ T+  SIS + ++ 
Subjt:  DLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQRASIWQFPQIVFNFTLTNSISEILNKF

Query:  AKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEVKGISLSSYPKGTSMAMPPSFSP
               +  L L PYE+V+ Q+TNK GST+ P +  Q  ++  + +   QRL     II TS  +NLGLD +VFGEVK I+ S+Y  G         +P
Subjt:  AKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEVKGISLSSYPKGTSMAMPPSFSP

Query:  APAP
        AP P
Subjt:  APAP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein5.4e-4236.2Show/hide
Query:  MGKGEDQNLPQQHRREDSSGFICRECSISFCR-ASTELNFKCLFVLILGFAVFLPGFFWLLPL----HERNLGFEAKDAIKLSATVQVYFVLEKPVEELL
        MGK ED    +    E +     R      C+  S+ + FKCLFVL+L  A+FL   F LLP      + NL     D       +   F + +    L 
Subjt:  MGKGEDQNLPQQHRREDSSGFICRECSISFCR-ASTELNFKCLFVLILGFAVFLPGFFWLLPL----HERNLGFEAKDAIKLSATVQVYFVLEKPVEELL

Query:  PHIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQ
         +  +L+ DI  E+   ++KV++L++    E N T VVFG+  +     I P+SLS ++     + +++S L LT S+FG+   F++LKFPGGI++IP Q
Subjt:  PHIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQ

Query:  RASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRI-TTQRLQQLAAIINTSSERNLGLDY
         A   Q  +IVFNFTL  SI +I   F    SQLK  L L PYEN+Y+ ++N  GST+ P   V +S+   +G   ++ RL+QL   I  S  +NLGL+ 
Subjt:  RASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRI-TTQRLQQLAAIINTSSERNLGLDY

Query:  SVFGEVKGISLSSYPKGTSMAMPPSFSPAPAPAPGDH
        ++FG+VK + LSS+   +S +   S SP+P+P    H
Subjt:  SVFGEVKGISLSSYPKGTSMAMPPSFSPAPAPAPGDH

AT3G56590.1 hydroxyproline-rich glycoprotein family protein6.4e-4332.89Show/hide
Query:  MGKG--EDQNLP----QQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSA-----TVQVYFVLEK
        MGK   E+QNLP        R +  G I   C   +   S+  + +C+ +L    AVFL   FWL P     LGF     + L        +   F + K
Subjt:  MGKG--EDQNLP----QQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSA-----TVQVYFVLEK

Query:  PVEELLPHIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGI
        P+  +  ++ +LE DI  E+  P  KV VL++  LG+ NRT V+F +  E   + I     SL++++   L   + +  LT S+FG+P  F++LKFPGGI
Subjt:  PVEELLPHIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGI

Query:  SIIPFQRASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERN
        ++IP Q     Q  Q++FNFTL  SI +I + F +  SQLK  + L  YEN+Y+ ++N  GST+ P  +V +S+    G  ++ RL+QLA  I +S  +N
Subjt:  SIIPFQRASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERN

Query:  LGLDYSVFGEVKGISLSS-YPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAP-----------------S
        LGL+++VFG+VK + LSS  P   + +  PS SP P      H   P   H         + SPP      +S      P P                 +
Subjt:  LGLDYSVFGEVKGISLSS-YPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAP-----------------S

Query:  LHEHSMPPIVYPKSTRLIVVPPADQPR--------VSSPRASPVLFHYKP
        L+ H+ PP   P   R    PPA  P         VSSP    V  H  P
Subjt:  LHEHSMPPIVYPKSTRLIVVPPADQPR--------VSSPRASPVLFHYKP

AT3G56590.2 hydroxyproline-rich glycoprotein family protein6.4e-4332.89Show/hide
Query:  MGKG--EDQNLP----QQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSA-----TVQVYFVLEK
        MGK   E+QNLP        R +  G I   C   +   S+  + +C+ +L    AVFL   FWL P     LGF     + L        +   F + K
Subjt:  MGKG--EDQNLP----QQHRREDSSGFICRECSISFCRASTELNFKCLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSA-----TVQVYFVLEK

Query:  PVEELLPHIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGI
        P+  +  ++ +LE DI  E+  P  KV VL++  LG+ NRT V+F +  E   + I     SL++++   L   + +  LT S+FG+P  F++LKFPGGI
Subjt:  PVEELLPHIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSLSLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGI

Query:  SIIPFQRASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERN
        ++IP Q     Q  Q++FNFTL  SI +I + F +  SQLK  + L  YEN+Y+ ++N  GST+ P  +V +S+    G  ++ RL+QLA  I +S  +N
Subjt:  SIIPFQRASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQASISSELGRITTQRLQQLAAIINTSSERN

Query:  LGLDYSVFGEVKGISLSS-YPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAP-----------------S
        LGL+++VFG+VK + LSS  P   + +  PS SP P      H   P   H         + SPP      +S      P P                 +
Subjt:  LGLDYSVFGEVKGISLSS-YPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAP-----------------S

Query:  LHEHSMPPIVYPKSTRLIVVPPADQPR--------VSSPRASPVLFHYKP
        L+ H+ PP   P   R    PPA  P         VSSP    V  H  P
Subjt:  LHEHSMPPIVYPKSTRLIVVPPADQPR--------VSSPRASPVLFHYKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAATCCTTGATCAACGCACCGATTTCTACTTCATTTCATCAAATCTCCGCCAAACCCAAAGATTACAAATCCGTTCCCTTTGTCCATTTGATATTCGTTCTAAAAT
CAATGGACACTTAGGATTTGCAGAAAGGGTTTTGTTTTTTAGTGCGTTTTACCGGCTTCGATTTGGTTTTCCGTGGGCGATTGTTACAGATGCTCTGCAAATGGGGAAAG
GGGAAGATCAAAATCTGCCGCAGCAGCACCGCCGTGAGGATTCTTCTGGGTTTATTTGTCGTGAATGTTCGATTTCGTTTTGTAGGGCTTCTACGGAGTTGAATTTCAAG
TGTTTGTTCGTTTTGATTTTGGGATTTGCGGTGTTTCTCCCTGGATTCTTTTGGCTTCTTCCGCTTCATGAAAGAAATTTAGGGTTTGAGGCGAAAGACGCCATTAAACT
CAGTGCTACAGTTCAGGTGTATTTCGTTCTCGAAAAGCCGGTCGAGGAGCTTCTCCCTCACATCAAGAGATTAGAGTTTGATATCAATGGTGAATTAGACATACCAAACG
TGAAGGTTTCCGTTCTGTCGATGCACGATTTAGGTGAGTCGAACAGGACTTACGTGGTTTTTGGTCTTCTTTCTGAATACATTACTACTCCAATAAATCCAGTGTCCTTA
AGTCTGCTGAGATCGTCTTTATATGACCTTTTCCTTCACAAATCCAACCTTACTTTGACGACTTCGATTTTTGGACAGCCATCGGCATTTCAAATCCTCAAGTTTCCAGG
GGGAATCTCTATAATCCCGTTTCAACGTGCTTCGATTTGGCAGTTTCCCCAGATCGTATTTAACTTCACTCTGACTAACTCCATTTCTGAAATACTCAACAAATTTGCAA
AGTTCATGAGCCAGCTTAAGTTGGAATTGTGTCTGAGGCCTTATGAGAATGTGTATTTGCAAATAACGAACAAGATTGGCTCGACGATGCAGCCCATCGTAGTTGTTCAG
GCTTCTATTTCGTCGGAATTGGGACGCATAACGACGCAGAGATTACAGCAGTTGGCTGCAATCATCAATACCTCTTCCGAAAGAAATCTCGGCCTTGATTATTCTGTTTT
TGGAGAAGTCAAGGGCATCAGTTTGTCTTCTTATCCAAAGGGAACCTCCATGGCTATGCCACCGAGTTTTTCTCCAGCTCCTGCCCCAGCACCTGGTGATCATGTAGAAC
TACCGAGTGCCCCACACCCATCGAGATCTGCACGATCACCTGCAAATTGTTCCCCCCCTCGAGCAAATTGTGAAACCTCGTCTCCAGCCCTTTCTATGGTTCCTGCACCT
TCCCTTCATGAACATTCAATGCCTCCAATCGTCTATCCGAAGTCTACAAGACTGATCGTCGTTCCTCCAGCTGATCAACCCCGGGTATCATCTCCACGTGCATCTCCGGT
GCTGTTTCACTACAAACCAGGGAAAACAAAGGAAGATTCGCATAGAGTTTGGCAGCCCACACATTCCTCACATCGAGATCATGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAATCCTTGATCAACGCACCGATTTCTACTTCATTTCATCAAATCTCCGCCAAACCCAAAGATTACAAATCCGTTCCCTTTGTCCATTTGATATTCGTTCTAAAAT
CAATGGACACTTAGGATTTGCAGAAAGGGTTTTGTTTTTTAGTGCGTTTTACCGGCTTCGATTTGGTTTTCCGTGGGCGATTGTTACAGATGCTCTGCAAATGGGGAAAG
GGGAAGATCAAAATCTGCCGCAGCAGCACCGCCGTGAGGATTCTTCTGGGTTTATTTGTCGTGAATGTTCGATTTCGTTTTGTAGGGCTTCTACGGAGTTGAATTTCAAG
TGTTTGTTCGTTTTGATTTTGGGATTTGCGGTGTTTCTCCCTGGATTCTTTTGGCTTCTTCCGCTTCATGAAAGAAATTTAGGGTTTGAGGCGAAAGACGCCATTAAACT
CAGTGCTACAGTTCAGGTGTATTTCGTTCTCGAAAAGCCGGTCGAGGAGCTTCTCCCTCACATCAAGAGATTAGAGTTTGATATCAATGGTGAATTAGACATACCAAACG
TGAAGGTTTCCGTTCTGTCGATGCACGATTTAGGTGAGTCGAACAGGACTTACGTGGTTTTTGGTCTTCTTTCTGAATACATTACTACTCCAATAAATCCAGTGTCCTTA
AGTCTGCTGAGATCGTCTTTATATGACCTTTTCCTTCACAAATCCAACCTTACTTTGACGACTTCGATTTTTGGACAGCCATCGGCATTTCAAATCCTCAAGTTTCCAGG
GGGAATCTCTATAATCCCGTTTCAACGTGCTTCGATTTGGCAGTTTCCCCAGATCGTATTTAACTTCACTCTGACTAACTCCATTTCTGAAATACTCAACAAATTTGCAA
AGTTCATGAGCCAGCTTAAGTTGGAATTGTGTCTGAGGCCTTATGAGAATGTGTATTTGCAAATAACGAACAAGATTGGCTCGACGATGCAGCCCATCGTAGTTGTTCAG
GCTTCTATTTCGTCGGAATTGGGACGCATAACGACGCAGAGATTACAGCAGTTGGCTGCAATCATCAATACCTCTTCCGAAAGAAATCTCGGCCTTGATTATTCTGTTTT
TGGAGAAGTCAAGGGCATCAGTTTGTCTTCTTATCCAAAGGGAACCTCCATGGCTATGCCACCGAGTTTTTCTCCAGCTCCTGCCCCAGCACCTGGTGATCATGTAGAAC
TACCGAGTGCCCCACACCCATCGAGATCTGCACGATCACCTGCAAATTGTTCCCCCCCTCGAGCAAATTGTGAAACCTCGTCTCCAGCCCTTTCTATGGTTCCTGCACCT
TCCCTTCATGAACATTCAATGCCTCCAATCGTCTATCCGAAGTCTACAAGACTGATCGTCGTTCCTCCAGCTGATCAACCCCGGGTATCATCTCCACGTGCATCTCCGGT
GCTGTTTCACTACAAACCAGGGAAAACAAAGGAAGATTCGCATAGAGTTTGGCAGCCCACACATTCCTCACATCGAGATCATGATTGA
Protein sequenceShow/hide protein sequence
MGILDQRTDFYFISSNLRQTQRLQIRSLCPFDIRSKINGHLGFAERVLFFSAFYRLRFGFPWAIVTDALQMGKGEDQNLPQQHRREDSSGFICRECSISFCRASTELNFK
CLFVLILGFAVFLPGFFWLLPLHERNLGFEAKDAIKLSATVQVYFVLEKPVEELLPHIKRLEFDINGELDIPNVKVSVLSMHDLGESNRTYVVFGLLSEYITTPINPVSL
SLLRSSLYDLFLHKSNLTLTTSIFGQPSAFQILKFPGGISIIPFQRASIWQFPQIVFNFTLTNSISEILNKFAKFMSQLKLELCLRPYENVYLQITNKIGSTMQPIVVVQ
ASISSELGRITTQRLQQLAAIINTSSERNLGLDYSVFGEVKGISLSSYPKGTSMAMPPSFSPAPAPAPGDHVELPSAPHPSRSARSPANCSPPRANCETSSPALSMVPAP
SLHEHSMPPIVYPKSTRLIVVPPADQPRVSSPRASPVLFHYKPGKTKEDSHRVWQPTHSSHRDHD