CuGenDBv2

MS012490 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS012490
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Synaptojanin-1
Genome location: scaffold63:1079258..1081592
RNA-Seq Expression: MS012490
Synteny: MS012490
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0039065.1 synaptojanin-1 [Cucumis melo var. makuwa]; e-value 1.5e-186; identity 76.3%
Query:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL
        MGKGEEQNLP+Q+RREVA SGDSSGFLCGQCSIA  RV +ELNFKC FVL+LGF+VFVPG FWLLPL ERNSGFEAK+++KLSATVQVYFVLEKPV ELL
Subjt:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL

Query:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ
        PHIKRLEFDINGELDI +VKVS+LSMHD+GESNRTYVVFG+LSEYITAPINPVSLSL+RS+LYD FL ESNLTLTT IFGQPST +ILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS
        HASIW+FPQIVFNFTL+NSISE+LDNFAKF+SEL FGLRLR YENVY QITNKIGST+QP +IVQASI+SELG +TSQRLQQLAAIIN SPERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS

Query:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPP-ATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIV
        VFGEVKSVSLSSY K TS ++PPS SPAPAPAPGDH E  S P   RS+     +PPAN SPP A C++LSP P +VPAHSPH HS+PP SYP STRL+V
Subjt:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPP-ATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIV

Query:  SP-----------PPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDHVS
         P            P+ F PLLPPDLLPKPKP F  K G   E+         SHP HVS
Subjt:  SP-----------PPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDHVS

XP_004149972.2 uncharacterized protein LOC101222031 isoform X2 [Cucumis sativus]; e-value 9.6e-186; identity 76.64%
Query:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL
        MGKGEEQNLP+Q+RREVA +GDSSGFLCGQCSIA  RV +ELNFKC FVL+LGF+VFVPGFFWLLPL ERNSGFEAKD+IKLSATVQVYFVLEKPV ELL
Subjt:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL

Query:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ
        PHIKRLEFDINGELDI NVKVS+LSMHD+GESNRTYVVFG+LSEYITAPINPVSLSL+RS+LYD FL ESNLTLTT IFGQPST +ILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS
        HASIW+FPQIVFNFTL+NSISE+LDNFAKF+S+L FGLRLR YENVY QITNKIGST+QP +IVQASI+SELG +TSQRLQQLAAIIN SPERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS

Query:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPP-ATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIV
        VFGEVKSVSLSSY K TS ++PPS SPAPAPAPG+H E  S P   RS     ++PPAN SPP A C++ SP PS+VPA+SPH HS+PP SYP STRLIV
Subjt:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPP-ATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIV

Query:  SP-----------PPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDH
         P            PV   PLLPPDLLPKPKP F  K G   E+P        SHP H
Subjt:  SP-----------PPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDH

XP_008456084.1 PREDICTED: uncharacterized protein LOC103496125 isoform X1 [Cucumis melo]; e-value 4.3e-186; identity 76.42%
Query:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL
        MGKGEEQNLP+Q+RREVA SGDSSGFLCGQCSIA  RV +ELNFKC FVL+LGF+VFVPG FWLLPL ERNSGFEAK+++KLSATVQVYFVLEKPV ELL
Subjt:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL

Query:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ
        PHIKRLEFDINGELDI +VKVS+LSMHD+GESNRTYVVFG+LSEYITAPINPVSLSL+RS+LYD FL ESNLTLTT IFGQPST +ILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS
        HASIW+FPQIVFNFTL+NSISE+LDNFAKF+SEL FGLRLR YENVY QITNKIGST+QP +IVQASI+SELG +TSQRLQQLAAIIN SPERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS

Query:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPP-ATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIV
        VFGEVKSVSLSSY K TS ++PPS SPAPAPAPGDH E  S P   RS+     +PPAN SPP A C++LSP PS+VPAHSPH HS+PP SYP STRL+V
Subjt:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPP-ATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIV

Query:  SP-----------PPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDH
         P            P+ F PLLPPDLLPKPKP F  K G   E+         SHP H
Subjt:  SP-----------PPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDH

XP_011651267.1 uncharacterized protein LOC101222031 isoform X1 [Cucumis sativus]; e-value 2.5e-186; identity 76.69%
Query:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL
        MGKGEEQNLP+Q+RREVA +GDSSGFLCGQCSIA  RV +ELNFKC FVL+LGF+VFVPGFFWLLPL ERNSGFEAKD+IKLSATVQVYFVLEKPV ELL
Subjt:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL

Query:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ
        PHIKRLEFDINGELDI NVKVS+LSMHD+GESNRTYVVFG+LSEYITAPINPVSLSL+RS+LYD FL ESNLTLTT IFGQPST +ILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS
        HASIW+FPQIVFNFTL+NSISE+LDNFAKF+S+L FGLRLR YENVY QITNKIGST+QP +IVQASI+SELG +TSQRLQQLAAIIN SPERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS

Query:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPP-ATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIV
        VFGEVKSVSLSSY K TS ++PPS SPAPAPAPG+H E  S P   RS     ++PPAN SPP A C++ SP PS+VPA+SPH HS+PP SYP STRLIV
Subjt:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPP-ATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIV

Query:  SP-----------PPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDHV
         P            PV   PLLPPDLLPKPKP F  K G   E+P        SHP HV
Subjt:  SP-----------PPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDHV

XP_022149235.1 uncharacterized protein LOC111017707 [Momordica charantia]; e-value 3.0e-248; identity 99.78%
Query:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL
        MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL
Subjt:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL

Query:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ
        PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS
        HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELG LTSQRLQQLAAIINASPERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS

Query:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPPATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIVS
        VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPPATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIVS
Subjt:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPPATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIVS

Query:  PPPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDH
        PPPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDH
Subjt:  PPPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDH

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0L6J0 Uncharacterized protein; e-value 4.6e-186; identity 76.64%
Query:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL
        MGKGEEQNLP+Q+RREVA +GDSSGFLCGQCSIA  RV +ELNFKC FVL+LGF+VFVPGFFWLLPL ERNSGFEAKD+IKLSATVQVYFVLEKPV ELL
Subjt:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL

Query:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ
        PHIKRLEFDINGELDI NVKVS+LSMHD+GESNRTYVVFG+LSEYITAPINPVSLSL+RS+LYD FL ESNLTLTT IFGQPST +ILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS
        HASIW+FPQIVFNFTL+NSISE+LDNFAKF+S+L FGLRLR YENVY QITNKIGST+QP +IVQASI+SELG +TSQRLQQLAAIIN SPERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS

Query:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPP-ATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIV
        VFGEVKSVSLSSY K TS ++PPS SPAPAPAPG+H E  S P   RS     ++PPAN SPP A C++ SP PS+VPA+SPH HS+PP SYP STRLIV
Subjt:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPP-ATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIV

Query:  SP-----------PPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDH
         P            PV   PLLPPDLLPKPKP F  K G   E+P        SHP H
Subjt:  SP-----------PPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDH

A0A1S3C2E0 uncharacterized protein LOC103496125 isoform X1; e-value 2.1e-186; identity 76.42%
Query:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL
        MGKGEEQNLP+Q+RREVA SGDSSGFLCGQCSIA  RV +ELNFKC FVL+LGF+VFVPG FWLLPL ERNSGFEAK+++KLSATVQVYFVLEKPV ELL
Subjt:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL

Query:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ
        PHIKRLEFDINGELDI +VKVS+LSMHD+GESNRTYVVFG+LSEYITAPINPVSLSL+RS+LYD FL ESNLTLTT IFGQPST +ILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS
        HASIW+FPQIVFNFTL+NSISE+LDNFAKF+SEL FGLRLR YENVY QITNKIGST+QP +IVQASI+SELG +TSQRLQQLAAIIN SPERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS

Query:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPP-ATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIV
        VFGEVKSVSLSSY K TS ++PPS SPAPAPAPGDH E  S P   RS+     +PPAN SPP A C++LSP PS+VPAHSPH HS+PP SYP STRL+V
Subjt:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPP-ATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIV

Query:  SP-----------PPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDH
         P            P+ F PLLPPDLLPKPKP F  K G   E+         SHP H
Subjt:  SP-----------PPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDH

A0A5A7TCD6 Synaptojanin-1; e-value 7.2e-187; identity 76.3%
Query:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL
        MGKGEEQNLP+Q+RREVA SGDSSGFLCGQCSIA  RV +ELNFKC FVL+LGF+VFVPG FWLLPL ERNSGFEAK+++KLSATVQVYFVLEKPV ELL
Subjt:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL

Query:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ
        PHIKRLEFDINGELDI +VKVS+LSMHD+GESNRTYVVFG+LSEYITAPINPVSLSL+RS+LYD FL ESNLTLTT IFGQPST +ILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS
        HASIW+FPQIVFNFTL+NSISE+LDNFAKF+SEL FGLRLR YENVY QITNKIGST+QP +IVQASI+SELG +TSQRLQQLAAIIN SPERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS

Query:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPP-ATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIV
        VFGEVKSVSLSSY K TS ++PPS SPAPAPAPGDH E  S P   RS+     +PPAN SPP A C++LSP P +VPAHSPH HS+PP SYP STRL+V
Subjt:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPP-ATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIV

Query:  SP-----------PPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDHVS
         P            P+ F PLLPPDLLPKPKP F  K G   E+         SHP HVS
Subjt:  SP-----------PPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDHVS

A0A6J1D566 uncharacterized protein LOC111017707; e-value 1.5e-248; identity 99.78%
Query:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL
        MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL
Subjt:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL

Query:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ
        PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS
        HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELG LTSQRLQQLAAIINASPERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS

Query:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPPATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIVS
        VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPPATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIVS
Subjt:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPPATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIVS

Query:  PPPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDH
        PPPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDH
Subjt:  PPPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPARSSHPDH

A0A6J1J689 uncharacterized protein LOC111482968 isoform X1; e-value 5.9e-173; identity 74.61%
Query:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL
        MGKGE+QNLP Q RRE     DSSGFLC +CSIA RRV  ELNFKC+FVLILGF VF+PGFFWLLPL ERN GFEAKD IKLSATVQVYFVLEKPV+ELL
Subjt:  MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELL

Query:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ
        PHIKRLEFDINGELDI NVKVSVLSMHD+GESNRTYVVFG+LSEYIT PINPVSLSL+RS+LYDLFL +SNLTLTT IFGQPSTF+ILKFPGGISIIPFQ
Subjt:  PHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQ

Query:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS
        HASIWQFPQIVFNFTL+NSISE+L+ FAKF S+    L LRPYENVY QITNKIGSTMQP ++VQASISSELG +T+QRLQQLAAIIN S ERNLGLDYS
Subjt:  HASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYS

Query:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPP-ATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIV
        VFGEVK +SL SY KGTS ++PPS SPAPAPAPGDH E  SAP+ SRS+     +PPANRSPP A C   SPA S+VPA SPH HSMPP  YP STRLIV
Subjt:  VFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPP-ATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIV

Query:  SPPPVGFTPLLPPDLLPKPKPR-----FGFKPGWRKENPTRVK-PARSSHPDH
         P         P D      PR     F +KPG  KE+  RV+ P  SSHPDH
Subjt:  SPPPVGFTPLLPPDLLPKPKPR-----FGFKPGWRKENPTRVK-PARSSHPDH

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2); e-value 1.6e-53; identity 38.91%
Query:  EEQNLPIQRRREVADSGDSSGFLCGQ-CSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELLPHI
        +E  L +Q+     ++ +SS    G+ CS A  R+   +  +C+ VL+L   + +   FWL P R   S F+A   +KL+A+VQ  F L+KPV E++ H 
Subjt:  EEQNLPIQRRREVADSGDSSGFLCGQ-CSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELLPHI

Query:  KRLEFDINGELDIS-NVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQHA
         ++E DI   + +S N KV+VLS++  G SN T V F +L       I+  SLSL+RS+   LF + S L LTT  FG+P++F++LKFPGGI++ P + A
Subjt:  KRLEFDINGELDIS-NVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQHA

Query:  SIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYSVF
         +     ++F+ T+  SIS V D            L L PYE+V+FQ+TNK GST+ PP+  Q  ++  +     QRL     II  S  +NLGLD +VF
Subjt:  SIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYSVF

Query:  GEVKSVSLSSYLKGTSNSIPPSLSPAPAP
        GEVK ++ S+YL G        L+PAP P
Subjt:  GEVKSVSLSSYLKGTSNSIPPSLSPAPAP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein; e-value 3.3e-43; identity 32.94%
Query:  SGDSS--GFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELLPHIKRLEFDINGELDIS
        +GDS+     CG C    + +   + FKC+FVL+L   +F+   F LLP              +  A V   F + +    L  +  +L+ DI  E+   
Subjt:  SGDSS--GFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELLPHIKRLEFDINGELDIS

Query:  NVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQHASIWQFPQIVFNFTLS
        ++KV++L++    E N T VVFGI  +     I P+SLS ++     + + +S L LT  +FG+   FE+LKFPGGI++IP Q A   Q  +IVFNFTL+
Subjt:  NVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQHASIWQFPQIVFNFTLS

Query:  NSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELG-HLTSQRLQQLAAIINASPERNLGLDYSVFGEVKSVSLSSYLKG
         SI ++  NF    S+L  GL L PYEN+Y  ++N  GST+ PP  V +S+   +G   +S RL+QL   I  S  +NLGL+ ++FG+VK V LSS+L  
Subjt:  NSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELG-HLTSQRLQQLAAIINASPERNLGLDYSVFGEVKSVSLSSYLKG

Query:  TSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPPATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIVSPPPVGFTPLLPPDLLP
        +S+S   S SP+P+P    H              H+      +   P     +SP  S  P  S  R    P       R+      V F+        P
Subjt:  TSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPPATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIVSPPPVGFTPLLPPDLLP

Query:  KPKPRFGFKPGWRKENPTRVKPARS
         P P  G  P  +  +P  +  A+S
Subjt:  KPKPRFGFKPGWRKENPTRVKPARS

AT3G56590.1 hydroxyproline-rich glycoprotein family protein; e-value 4.5e-48; identity 35.34%
Query:  MGKG--EEQNLPIQRRREVADSGDSSGF-LCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSA-----TVQVYFVL
        MGK   EEQNLP+      A +    G   C  C      +    + +CV +L     VF+   FWL P      GF    D+ L        +   F +
Subjt:  MGKG--EEQNLPIQRRREVADSGDSSGF-LCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSA-----TVQVYFVL

Query:  EKPVKELLPHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPG
         KP+  +  ++ +LE DI  E+     KV VL++  +G+ NRT V+F I  E   + I     SL+++    L  ++ +  LT  +FG+P  FE+LKFPG
Subjt:  EKPVKELLPHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPG

Query:  GISIIPFQHASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPE
        GI++IP Q     Q  Q++FNFTL+ SI ++  NF +  S+L  G+ L  YEN+Y  ++N  GST+ PP IV +S+    G  +S RL+QLA  I +S  
Subjt:  GISIIPFQHASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPE

Query:  RNLGLDYSVFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPPATCRALSPAPSVVPAHSPHRHSMPPSSY
        +NLGL+++VFG+VK V LSS L    +S   S +P+P+P P  H  P   P       H  + P  + SPP    A + AP+    HSP     PP  Y
Subjt:  RNLGLDYSVFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPPATCRALSPAPSVVPAHSPHRHSMPPSSY

AT3G56590.2 hydroxyproline-rich glycoprotein family protein; e-value 4.5e-48; identity 35.34%
Query:  MGKG--EEQNLPIQRRREVADSGDSSGF-LCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSA-----TVQVYFVL
        MGK   EEQNLP+      A +    G   C  C      +    + +CV +L     VF+   FWL P      GF    D+ L        +   F +
Subjt:  MGKG--EEQNLPIQRRREVADSGDSSGF-LCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSA-----TVQVYFVL

Query:  EKPVKELLPHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPG
         KP+  +  ++ +LE DI  E+     KV VL++  +G+ NRT V+F I  E   + I     SL+++    L  ++ +  LT  +FG+P  FE+LKFPG
Subjt:  EKPVKELLPHIKRLEFDINGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPG

Query:  GISIIPFQHASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPE
        GI++IP Q     Q  Q++FNFTL+ SI ++  NF +  S+L  G+ L  YEN+Y  ++N  GST+ PP IV +S+    G  +S RL+QLA  I +S  
Subjt:  GISIIPFQHASIWQFPQIVFNFTLSNSISEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPE

Query:  RNLGLDYSVFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPPATCRALSPAPSVVPAHSPHRHSMPPSSY
        +NLGL+++VFG+VK V LSS L    +S   S +P+P+P P  H  P   P       H  + P  + SPP    A + AP+    HSP     PP  Y
Subjt:  RNLGLDYSVFGEVKSVSLSSYLKGTSNSIPPSLSPAPAPAPGDHAEPSSAPRASRSSSHSRVQPPANRSPPATCRALSPAPSVVPAHSPHRHSMPPSSY


Sequences
CDS sequence
ATGGGGAAAGGCGAAGAGCAAAATCTGCCGATTCAGCGGCGTCGTGAGGTGGCTGACAGTGGCGATTCTTCTGGGTTTCTTTGTGGCCAATGCTCGATTGCTCTTCGTAG
AGTTCGTGAGGAGTTGAATTTCAAGTGTGTGTTCGTTTTGATTCTTGGGTTCTTGGTGTTCGTCCCTGGGTTCTTTTGGCTTCTTCCTCTTCGTGAAAGAAATTCAGGGT
TTGAGGCCAAAGACGACATTAAACTCAGTGCTACAGTTCAGGTCTATTTTGTTCTTGAAAAGCCTGTGAAGGAGCTTCTCCCTCACATCAAGAGATTAGAGTTTGATATC
AATGGTGAATTAGACATTTCTAATGTGAAGGTTTCTGTTCTATCCATGCACGATGTAGGTGAGTCGAACAGGACATACGTGGTTTTTGGTATTCTTTCTGAATACATAAC
TGCTCCAATAAATCCAGTGTCCTTAAGTCTTGTGAGGTCGACTTTATATGACCTTTTCCTTCGCGAATCCAACCTTACTTTGACGACACCGATCTTTGGACAGCCATCTA
CTTTTGAAATTCTGAAGTTTCCTGGGGGGATCTCTATAATCCCGTTCCAACATGCTTCGATTTGGCAGTTTCCCCAGATCGTATTTAACTTCACTCTTAGTAACTCCATT
TCTGAAGTACTTGACAATTTCGCCAAGTTCAGGAGCGAGCTGACGTTTGGATTGCGTCTCAGGCCTTACGAGAATGTGTATTTCCAAATAACAAACAAGATTGGCTCGAC
GATGCAACCACCCATAATTGTTCAGGCTTCAATTTCATCAGAGTTGGGGCACTTAACATCGCAGAGATTACAGCAGTTGGCTGCAATCATCAATGCCTCTCCCGAAAGAA
ATCTCGGCCTTGATTACTCTGTTTTCGGAGAAGTTAAGAGTGTGAGTTTGTCTTCTTATCTGAAGGGAACCTCTAACTCAATTCCGCCTAGTCTTTCTCCAGCTCCTGCC
CCAGCACCGGGCGATCATGCAGAACCATCAAGTGCCCCACGTGCTTCAAGATCATCGTCTCACAGCCGAGTGCAACCACCTGCAAATCGTTCCCCACCTGCAACTTGCAG
AGCCTTGTCTCCAGCCCCTTCTGTGGTTCCTGCACATTCCCCTCATCGACATTCAATGCCTCCAAGCTCCTATCCGGATTCTACAAGACTGATTGTTTCTCCACCTCCTG
TAGGTTTTACACCATTGTTGCCCCCTGATCTCTTACCTAAGCCAAAGCCTCGTTTCGGGTTCAAACCAGGGTGGAGAAAGGAAAATCCGACTAGAGTTAAGCCTGCCCGG
TCTTCTCATCCAGATCATGTAAGC
mRNA sequence
ATGGGGAAAGGCGAAGAGCAAAATCTGCCGATTCAGCGGCGTCGTGAGGTGGCTGACAGTGGCGATTCTTCTGGGTTTCTTTGTGGCCAATGCTCGATTGCTCTTCGTAG
AGTTCGTGAGGAGTTGAATTTCAAGTGTGTGTTCGTTTTGATTCTTGGGTTCTTGGTGTTCGTCCCTGGGTTCTTTTGGCTTCTTCCTCTTCGTGAAAGAAATTCAGGGT
TTGAGGCCAAAGACGACATTAAACTCAGTGCTACAGTTCAGGTCTATTTTGTTCTTGAAAAGCCTGTGAAGGAGCTTCTCCCTCACATCAAGAGATTAGAGTTTGATATC
AATGGTGAATTAGACATTTCTAATGTGAAGGTTTCTGTTCTATCCATGCACGATGTAGGTGAGTCGAACAGGACATACGTGGTTTTTGGTATTCTTTCTGAATACATAAC
TGCTCCAATAAATCCAGTGTCCTTAAGTCTTGTGAGGTCGACTTTATATGACCTTTTCCTTCGCGAATCCAACCTTACTTTGACGACACCGATCTTTGGACAGCCATCTA
CTTTTGAAATTCTGAAGTTTCCTGGGGGGATCTCTATAATCCCGTTCCAACATGCTTCGATTTGGCAGTTTCCCCAGATCGTATTTAACTTCACTCTTAGTAACTCCATT
TCTGAAGTACTTGACAATTTCGCCAAGTTCAGGAGCGAGCTGACGTTTGGATTGCGTCTCAGGCCTTACGAGAATGTGTATTTCCAAATAACAAACAAGATTGGCTCGAC
GATGCAACCACCCATAATTGTTCAGGCTTCAATTTCATCAGAGTTGGGGCACTTAACATCGCAGAGATTACAGCAGTTGGCTGCAATCATCAATGCCTCTCCCGAAAGAA
ATCTCGGCCTTGATTACTCTGTTTTCGGAGAAGTTAAGAGTGTGAGTTTGTCTTCTTATCTGAAGGGAACCTCTAACTCAATTCCGCCTAGTCTTTCTCCAGCTCCTGCC
CCAGCACCGGGCGATCATGCAGAACCATCAAGTGCCCCACGTGCTTCAAGATCATCGTCTCACAGCCGAGTGCAACCACCTGCAAATCGTTCCCCACCTGCAACTTGCAG
AGCCTTGTCTCCAGCCCCTTCTGTGGTTCCTGCACATTCCCCTCATCGACATTCAATGCCTCCAAGCTCCTATCCGGATTCTACAAGACTGATTGTTTCTCCACCTCCTG
TAGGTTTTACACCATTGTTGCCCCCTGATCTCTTACCTAAGCCAAAGCCTCGTTTCGGGTTCAAACCAGGGTGGAGAAAGGAAAATCCGACTAGAGTTAAGCCTGCCCGG
TCTTCTCATCCAGATCATGTAAGC
Protein sequence
MGKGEEQNLPIQRRREVADSGDSSGFLCGQCSIALRRVREELNFKCVFVLILGFLVFVPGFFWLLPLRERNSGFEAKDDIKLSATVQVYFVLEKPVKELLPHIKRLEFDI
NGELDISNVKVSVLSMHDVGESNRTYVVFGILSEYITAPINPVSLSLVRSTLYDLFLRESNLTLTTPIFGQPSTFEILKFPGGISIIPFQHASIWQFPQIVFNFTLSNSI
SEVLDNFAKFRSELTFGLRLRPYENVYFQITNKIGSTMQPPIIVQASISSELGHLTSQRLQQLAAIINASPERNLGLDYSVFGEVKSVSLSSYLKGTSNSIPPSLSPAPA
PAPGDHAEPSSAPRASRSSSHSRVQPPANRSPPATCRALSPAPSVVPAHSPHRHSMPPSSYPDSTRLIVSPPPVGFTPLLPPDLLPKPKPRFGFKPGWRKENPTRVKPAR
SSHPDHVS