; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS017082 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS017082
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionARM repeat superfamily protein
Genome locationscaffold197:1311233..1313318
RNA-Seq ExpressionMS017082
SyntenyMS017082
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0032797 - SMN complex (cellular component)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK08426.1 ARM repeat superfamily protein [Cucumis melo var. makuwa]2.0e-29587.95Show/hide
Query:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
        MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
Subjt:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS

Query:  KVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNP
        KVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLT+GAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKSTQTNSHMGLVM+LAKRNP
Subjt:  KVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNP

Query:  RIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDKSPSSV
        RIVEPYARLLLQAGLRILK GVVEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTAK+I ADKGSKMDKSPSSV
Subjt:  RIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDKSPSSV

Query:  TGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVSDTMSL
        TGSNFIDHRRRSPWRNGGSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNS FD RSVNRKLWSYENGGVDISLKDGLSLFS +TRG DVSDTMSL
Subjt:  TGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVSDTMSL

Query:  ISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPRSSFHN
         S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSRSYI VEDMIFKTPRKLVHSLQDLNE NSD+AS S RR +RSLSSGNLEWSP  +F N
Subjt:  ISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPRSSFHN

Query:  QNGFPDDQKLSKEDVGGLDI-NGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLINDQDQG
        +NG  D++KLSKED  GLDI NGEQSQGSSES+SSTDG+P H D+QA PV V  QS +K Q  G+EMAYKKTALKLVCGFSFLLFT+FTS L I+D DQG
Subjt:  QNGFPDDQKLSKEDVGGLDI-NGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLINDQDQG

Query:  SYLVPT
        SYLVPT
Subjt:  SYLVPT

XP_004147557.1 protein SINE1 [Cucumis sativus]6.3e-29787.73Show/hide
Query:  KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL
        KNLSPMLRREFANLDKDADSRRSAMKAL+TYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL
Subjt:  KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL

Query:  QQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMTL
        QQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLT+GAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKSTQTNSHMGLVMTL
Subjt:  QQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMTL

Query:  AKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDK
        AKRNPRIVEPYARLLLQAGLRILK GVVEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTAK+I ADKGSKMDK
Subjt:  AKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDK

Query:  SPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVS
        SPSSVTGSNF+DHRRRSPWRNGGSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNSGFD RSVNRKLWSYENGGVDISLKDGLSLFS +TRG DVS
Subjt:  SPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVS

Query:  DTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPR
        DTMS+ S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNE  SD+AS S R  +RSLSSGNLEWSP 
Subjt:  DTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPR

Query:  SSFHNQNGFPDDQKLSKEDVGGL-DINGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLIN
         +F NQNGF D+ KLSKED  GL + NGEQSQGS ES+SS DG P H D+QA PV VA QS MK Q  G+EMAYKKTALKLVCGFSFLLFT+FTS L I+
Subjt:  SSFHNQNGFPDDQKLSKEDVGGL-DINGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLIN

Query:  DQDQGSYLVPT
        D DQGSYLVPT
Subjt:  DQDQGSYLVPT

XP_008441975.1 PREDICTED: uncharacterized protein LOC103485976 [Cucumis melo]2.0e-29888.05Show/hide
Query:  KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL
        KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL
Subjt:  KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL

Query:  QQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMTL
        QQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLT+GAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKSTQTNSHMGLVM+L
Subjt:  QQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMTL

Query:  AKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDK
        AKRNPRIVEPYARLLLQAGLRILK GVVEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTAK+I ADKGSKMDK
Subjt:  AKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDK

Query:  SPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVS
        SPSSVTGSNFIDHRRRSPWRNGGSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNS FD RSVNRKLWSYENGGVDISLKDGLSLFS +TRG DVS
Subjt:  SPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVS

Query:  DTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPR
        DTMSL S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSRSYI VEDMIFKTPRKLVHSLQDLNE NSD+AS S RR +RSLSSGNLEWSP 
Subjt:  DTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPR

Query:  SSFHNQNGFPDDQKLSKEDVGGLDI-NGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLIN
         +F N+NG  D++KLSKED  GLDI NGEQSQGSSES+SSTDG+P H D+QA PV V  QS +K Q  G+EMAYKKTALKLVCGFSFLLFT+FTS L I+
Subjt:  SSFHNQNGFPDDQKLSKEDVGGLDI-NGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLIN

Query:  DQDQGSYLVPT
        D DQGSYLVPT
Subjt:  DQDQGSYLVPT

XP_022156223.1 uncharacterized protein LOC111023161 [Momordica charantia]0.0e+0099.51Show/hide
Query:  FGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        FGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
Subjt:  FGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVM
        PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV 
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVM

Query:  TLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKM
        TLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKM
Subjt:  TLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKM

Query:  DKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGND
        DKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGND
Subjt:  DKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGND

Query:  VSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWS
        VSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWS
Subjt:  VSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWS

Query:  PRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLI
        PRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQG SESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGI+MAYKKTALKLVCGFSFLLFTVFTSFLLI
Subjt:  PRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLI

Query:  NDQDQGSYLVPT
        NDQDQGSYLVPT
Subjt:  NDQDQGSYLVPT

XP_038883420.1 protein SINE1 [Benincasa hispida]7.7e-29587.56Show/hide
Query:  KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL
        KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL
Subjt:  KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL

Query:  QQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMTL
        QQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLT+GAALCLKALVDSDNWRFASDE++NKVCQNVAGALEEKSTQTNSHMGLVMTL
Subjt:  QQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMTL

Query:  AKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDK
        AKRNPRIVEPYARLLLQAGLRILK G+VEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTAK+I ADKGSKMDK
Subjt:  AKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDK

Query:  SPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVS
        SPSSVTGSNFID  RRSPWRNGGSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNSGFD RSVNRKLWSYENGGVDISLKDGLSLFS ITRG DVS
Subjt:  SPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVS

Query:  DTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPR
        DTMS+ S SH  G NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSR YINVEDMIFKTPRKLV SLQDLNEANS++ SKS RR +RSLSSGNLEWSP 
Subjt:  DTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPR

Query:  SSFHNQNGFPDDQKLSKEDVGGLDIN-GEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLIN
         SF NQ  FPDDQK SKED GGLD +  EQSQGSSES+SS+DG+P H D++A PV VA QS +K Q SG+EMAYKKTALKLVCGFSFLLFT+FTS L I+
Subjt:  SSFHNQNGFPDDQKLSKEDVGGLDIN-GEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLIN

Query:  DQDQGSYLVPT
        D DQGSYLVPT
Subjt:  DQDQGSYLVPT

TrEMBL top hitse value%identityAlignment
A0A0A0KYP2 Uncharacterized protein3.0e-29787.73Show/hide
Query:  KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL
        KNLSPMLRREFANLDKDADSRRSAMKAL+TYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL
Subjt:  KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL

Query:  QQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMTL
        QQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLT+GAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKSTQTNSHMGLVMTL
Subjt:  QQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMTL

Query:  AKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDK
        AKRNPRIVEPYARLLLQAGLRILK GVVEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTAK+I ADKGSKMDK
Subjt:  AKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDK

Query:  SPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVS
        SPSSVTGSNF+DHRRRSPWRNGGSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNSGFD RSVNRKLWSYENGGVDISLKDGLSLFS +TRG DVS
Subjt:  SPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVS

Query:  DTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPR
        DTMS+ S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNE  SD+AS S R  +RSLSSGNLEWSP 
Subjt:  DTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPR

Query:  SSFHNQNGFPDDQKLSKEDVGGL-DINGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLIN
         +F NQNGF D+ KLSKED  GL + NGEQSQGS ES+SS DG P H D+QA PV VA QS MK Q  G+EMAYKKTALKLVCGFSFLLFT+FTS L I+
Subjt:  SSFHNQNGFPDDQKLSKEDVGGL-DINGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLIN

Query:  DQDQGSYLVPT
        D DQGSYLVPT
Subjt:  DQDQGSYLVPT

A0A1S3B5D3 uncharacterized protein LOC1034859769.5e-29988.05Show/hide
Query:  KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL
        KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL
Subjt:  KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL

Query:  QQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMTL
        QQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLT+GAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKSTQTNSHMGLVM+L
Subjt:  QQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMTL

Query:  AKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDK
        AKRNPRIVEPYARLLLQAGLRILK GVVEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTAK+I ADKGSKMDK
Subjt:  AKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDK

Query:  SPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVS
        SPSSVTGSNFIDHRRRSPWRNGGSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNS FD RSVNRKLWSYENGGVDISLKDGLSLFS +TRG DVS
Subjt:  SPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVS

Query:  DTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPR
        DTMSL S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSRSYI VEDMIFKTPRKLVHSLQDLNE NSD+AS S RR +RSLSSGNLEWSP 
Subjt:  DTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPR

Query:  SSFHNQNGFPDDQKLSKEDVGGLDI-NGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLIN
         +F N+NG  D++KLSKED  GLDI NGEQSQGSSES+SSTDG+P H D+QA PV V  QS +K Q  G+EMAYKKTALKLVCGFSFLLFT+FTS L I+
Subjt:  SSFHNQNGFPDDQKLSKEDVGGLDI-NGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLIN

Query:  DQDQGSYLVPT
        D DQGSYLVPT
Subjt:  DQDQGSYLVPT

A0A5A7UWA1 ARM repeat superfamily protein9.5e-29988.05Show/hide
Query:  KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL
        KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL
Subjt:  KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPL

Query:  QQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMTL
        QQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLT+GAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKSTQTNSHMGLVM+L
Subjt:  QQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMTL

Query:  AKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDK
        AKRNPRIVEPYARLLLQAGLRILK GVVEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTAK+I ADKGSKMDK
Subjt:  AKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDK

Query:  SPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVS
        SPSSVTGSNFIDHRRRSPWRNGGSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNS FD RSVNRKLWSYENGGVDISLKDGLSLFS +TRG DVS
Subjt:  SPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVS

Query:  DTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPR
        DTMSL S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSRSYI VEDMIFKTPRKLVHSLQDLNE NSD+AS S RR +RSLSSGNLEWSP 
Subjt:  DTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPR

Query:  SSFHNQNGFPDDQKLSKEDVGGLDI-NGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLIN
         +F N+NG  D++KLSKED  GLDI NGEQSQGSSES+SSTDG+P H D+QA PV V  QS +K Q  G+EMAYKKTALKLVCGFSFLLFT+FTS L I+
Subjt:  SSFHNQNGFPDDQKLSKEDVGGLDI-NGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLIN

Query:  DQDQGSYLVPT
        D DQGSYLVPT
Subjt:  DQDQGSYLVPT

A0A5D3CDJ7 ARM repeat superfamily protein9.8e-29687.95Show/hide
Query:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
        MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
Subjt:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS

Query:  KVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNP
        KVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLT+GAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKSTQTNSHMGLVM+LAKRNP
Subjt:  KVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNP

Query:  RIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDKSPSSV
        RIVEPYARLLLQAGLRILK GVVEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTAK+I ADKGSKMDKSPSSV
Subjt:  RIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDKSPSSV

Query:  TGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVSDTMSL
        TGSNFIDHRRRSPWRNGGSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNS FD RSVNRKLWSYENGGVDISLKDGLSLFS +TRG DVSDTMSL
Subjt:  TGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVSDTMSL

Query:  ISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPRSSFHN
         S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSRSYI VEDMIFKTPRKLVHSLQDLNE NSD+AS S RR +RSLSSGNLEWSP  +F N
Subjt:  ISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPRSSFHN

Query:  QNGFPDDQKLSKEDVGGLDI-NGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLINDQDQG
        +NG  D++KLSKED  GLDI NGEQSQGSSES+SSTDG+P H D+QA PV V  QS +K Q  G+EMAYKKTALKLVCGFSFLLFT+FTS L I+D DQG
Subjt:  QNGFPDDQKLSKEDVGGLDI-NGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLINDQDQG

Query:  SYLVPT
        SYLVPT
Subjt:  SYLVPT

A0A6J1DQ15 uncharacterized protein LOC1110231610.0e+0099.51Show/hide
Query:  FGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        FGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
Subjt:  FGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVM
        PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV 
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVM

Query:  TLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKM
        TLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKM
Subjt:  TLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKM

Query:  DKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGND
        DKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGND
Subjt:  DKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGND

Query:  VSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWS
        VSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWS
Subjt:  VSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWS

Query:  PRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLI
        PRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQG SESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGI+MAYKKTALKLVCGFSFLLFTVFTSFLLI
Subjt:  PRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQGSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLI

Query:  NDQDQGSYLVPT
        NDQDQGSYLVPT
Subjt:  NDQDQGSYLVPT

SwissProt top hitse value%identityAlignment
Q5XVI1 Protein SINE18.6e-15654.91Show/hide
Query:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP
        G NL+P+LR+E ANLDKD +SR+SAMKAL++YVK+LDSKAIPGFLAQV ET+ET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSFP
Subjt:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP

Query:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMT
        LQQACSKV+PAIARYGIDPTT +DKK+ +IHSLC PL +SLL+SQESLTSGAALCLKALVDSDNWRFASDEM+N+VCQNV  AL+  S QT+  MGLVM+
Subjt:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMT

Query:  LAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMD
        LAK NP IVE YARLL+  GLRIL  GV E NSQKRLSA+QM+NFLMKCLDP SI SE++ II+EME CQSDQMAYV+GAA+E + T+KRIAA+  SKM+
Subjt:  LAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMD

Query:  KSPSSVTGSNFIDHRRRSPWRNGGSRTPS-SESQESQTLDSFFDYGSLV-GSPISPRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSGITR
        K   SVTGSNF         RN  S  P  S S ESQTL SF  Y S V  SPIS    S NS FD RSVNRKLW   ENGG VDISLKDG  LFS +T+
Subjt:  KSPSSVTGSNFIDHRRRSPWRNGGSRTPS-SESQESQTLDSFFDYGSLV-GSPISPRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSGITR

Query:  GNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRS-YINVEDM-IFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSG
        G+      + +S+S +   +  E  D+F GFL  S       ++TT SP R RS  IN ED  IF TPRKL+ SLQ                        
Subjt:  GNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRS-YINVEDM-IFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSG

Query:  NLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQ--GSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTV
                       +PDD  L   D+    + GE+ +  GS ++       P   +  ++ + V+  +      +G +   K +  KLV   SF++  +
Subjt:  NLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQ--GSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTV

Query:  FTSFLLI--NDQDQGSYLVPT
        F + +L+   D D G Y VPT
Subjt:  FTSFLLI--NDQDQGSYLVPT

Q9SQR5 Protein SINE21.3e-8756.83Show/hide
Query:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP
        G+NL    R+E ANLDKD DS ++AM  LR+ VK+LD+K +  F+AQ+S+ +E G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS  
Subjt:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP

Query:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSS--QESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV
        +QQACS+ V A+ARYGIDPTTP+DKK +VIHSLC PL +SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EM+N VCQ++A ALE  S++  SHM LV
Subjt:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSS--QESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV

Query:  MTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSK
        M L+K NP  VE YARL +++GLRIL +GVVE +SQKRL AIQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A+R+  +    
Subjt:  MTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSK

Query:  MD----KSPSSVTGS
         D    K  +S++GS
Subjt:  MD----KSPSSVTGS

Arabidopsis top hitse value%identityAlignment
AT1G54385.1 ARM repeat superfamily protein6.1e-15754.91Show/hide
Query:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP
        G NL+P+LR+E ANLDKD +SR+SAMKAL++YVK+LDSKAIPGFLAQV ET+ET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSFP
Subjt:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP

Query:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMT
        LQQACSKV+PAIARYGIDPTT +DKK+ +IHSLC PL +SLL+SQESLTSGAALCLKALVDSDNWRFASDEM+N+VCQNV  AL+  S QT+  MGLVM+
Subjt:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMT

Query:  LAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMD
        LAK NP IVE YARLL+  GLRIL  GV E NSQKRLSA+QM+NFLMKCLDP SI SE++ II+EME CQSDQMAYV+GAA+E + T+KRIAA+  SKM+
Subjt:  LAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMD

Query:  KSPSSVTGSNFIDHRRRSPWRNGGSRTPS-SESQESQTLDSFFDYGSLV-GSPISPRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSGITR
        K   SVTGSNF         RN  S  P  S S ESQTL SF  Y S V  SPIS    S NS FD RSVNRKLW   ENGG VDISLKDG  LFS +T+
Subjt:  KSPSSVTGSNFIDHRRRSPWRNGGSRTPS-SESQESQTLDSFFDYGSLV-GSPISPRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSGITR

Query:  GNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRS-YINVEDM-IFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSG
        G+      + +S+S +   +  E  D+F GFL  S       ++TT SP R RS  IN ED  IF TPRKL+ SLQ                        
Subjt:  GNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRS-YINVEDM-IFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSG

Query:  NLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQ--GSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTV
                       +PDD  L   D+    + GE+ +  GS ++       P   +  ++ + V+  +      +G +   K +  KLV   SF++  +
Subjt:  NLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQ--GSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTV

Query:  FTSFLLI--NDQDQGSYLVPT
        F + +L+   D D G Y VPT
Subjt:  FTSFLLI--NDQDQGSYLVPT

AT1G54385.2 ARM repeat superfamily protein6.1e-15754.91Show/hide
Query:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP
        G NL+P+LR+E ANLDKD +SR+SAMKAL++YVK+LDSKAIPGFLAQV ET+ET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSFP
Subjt:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP

Query:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMT
        LQQACSKV+PAIARYGIDPTT +DKK+ +IHSLC PL +SLL+SQESLTSGAALCLKALVDSDNWRFASDEM+N+VCQNV  AL+  S QT+  MGLVM+
Subjt:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMT

Query:  LAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMD
        LAK NP IVE YARLL+  GLRIL  GV E NSQKRLSA+QM+NFLMKCLDP SI SE++ II+EME CQSDQMAYV+GAA+E + T+KRIAA+  SKM+
Subjt:  LAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMD

Query:  KSPSSVTGSNFIDHRRRSPWRNGGSRTPS-SESQESQTLDSFFDYGSLV-GSPISPRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSGITR
        K   SVTGSNF         RN  S  P  S S ESQTL SF  Y S V  SPIS    S NS FD RSVNRKLW   ENGG VDISLKDG  LFS +T+
Subjt:  KSPSSVTGSNFIDHRRRSPWRNGGSRTPS-SESQESQTLDSFFDYGSLV-GSPISPRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSGITR

Query:  GNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRS-YINVEDM-IFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSG
        G+      + +S+S +   +  E  D+F GFL  S       ++TT SP R RS  IN ED  IF TPRKL+ SLQ                        
Subjt:  GNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRS-YINVEDM-IFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSG

Query:  NLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQ--GSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTV
                       +PDD  L   D+    + GE+ +  GS ++       P   +  ++ + V+  +      +G +   K +  KLV   SF++  +
Subjt:  NLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQ--GSSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTV

Query:  FTSFLLI--NDQDQGSYLVPT
        F + +L+   D D G Y VPT
Subjt:  FTSFLLI--NDQDQGSYLVPT

AT3G03970.1 ARM repeat superfamily protein9.3e-8956.83Show/hide
Query:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP
        G+NL    R+E ANLDKD DS ++AM  LR+ VK+LD+K +  F+AQ+S+ +E G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS  
Subjt:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP

Query:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSS--QESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV
        +QQACS+ V A+ARYGIDPTTP+DKK +VIHSLC PL +SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EM+N VCQ++A ALE  S++  SHM LV
Subjt:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSS--QESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV

Query:  MTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSK
        M L+K NP  VE YARL +++GLRIL +GVVE +SQKRL AIQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A+R+  +    
Subjt:  MTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSK

Query:  MD----KSPSSVTGS
         D    K  +S++GS
Subjt:  MD----KSPSSVTGS

AT3G03970.2 ARM repeat superfamily protein9.3e-8956.83Show/hide
Query:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP
        G+NL    R+E ANLDKD DS ++AM  LR+ VK+LD+K +  F+AQ+S+ +E G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS  
Subjt:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP

Query:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSS--QESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV
        +QQACS+ V A+ARYGIDPTTP+DKK +VIHSLC PL +SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EM+N VCQ++A ALE  S++  SHM LV
Subjt:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSS--QESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV

Query:  MTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSK
        M L+K NP  VE YARL +++GLRIL +GVVE +SQKRL AIQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A+R+  +    
Subjt:  MTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSK

Query:  MD----KSPSSVTGS
         D    K  +S++GS
Subjt:  MD----KSPSSVTGS

AT3G03970.3 ARM repeat superfamily protein9.3e-8956.83Show/hide
Query:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP
        G+NL    R+E ANLDKD DS ++AM  LR+ VK+LD+K +  F+AQ+S+ +E G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS  
Subjt:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP

Query:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSS--QESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV
        +QQACS+ V A+ARYGIDPTTP+DKK +VIHSLC PL +SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EM+N VCQ++A ALE  S++  SHM LV
Subjt:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSS--QESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV

Query:  MTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSK
        M L+K NP  VE YARL +++GLRIL +GVVE +SQKRL AIQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A+R+  +    
Subjt:  MTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSK

Query:  MD----KSPSSVTGS
         D    K  +S++GS
Subjt:  MD----KSPSSVTGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TTCGGCAAAAATTTGAGTCCAATGCTTCGGCGGGAATTTGCTAATCTTGATAAAGATGCTGATAGTCGCAGATCTGCGATGAAGGCACTGAGGACTTATGTGAAGGAATT
GGACTCCAAGGCCATCCCGGGTTTTCTTGCTCAAGTTTCTGAGACCAGAGAAACTGGTGCATTGACTGGGGAGTGCACGATTTCTCTCTACGAAGTTCTCGCCCGTGTTC
ATGGCGTCAACATCGTGCCGCAGATCGACAGGATTATGACTTCCATAATCAAGACTTTGGCTTCAAGTGCTGGTTCGTTCCCTCTTCAACAGGCATGCTCTAAGGTTGTT
CCGGCGATTGCGAGATATGGGATCGATCCTACCACTCCTGATGATAAGAAGAAGCATGTTATTCACTCTCTGTGTAATCCTCTTTGTGAATCTTTGTTGAGTTCTCAAGA
GAGCCTCACTTCTGGGGCTGCCCTCTGCTTGAAGGCTCTGGTGGATTCGGATAACTGGCGGTTCGCTTCCGATGAGATGATTAACAAGGTTTGCCAGAATGTTGCTGGAG
CTCTGGAGGAGAAGTCCACGCAAACTAATTCACACATGGGGCTTGTTATGACATTAGCAAAGCGAAATCCTCGGATTGTCGAACCATATGCTAGATTGTTACTACAGGCC
GGGCTGCGAATATTGAAGGTCGGGGTAGTAGAGAAGAATTCTCAGAAGAGATTGTCTGCCATTCAAATGATTAATTTCTTGATGAAGTGTCTGGATCCTTGGAGCATATT
GTCGGAACTTCAGACTATAATTGAGGAGATGGAGAATTGCCAGTCCGATCAAATGGCTTATGTCAAAGGTGCAGCCTTTGAAACTCTGCAAACTGCGAAGAGAATAGCTG
CGGATAAAGGGTCGAAAATGGACAAGTCTCCGAGCTCGGTCACTGGATCAAATTTCATTGATCACAGGCGGAGAAGTCCATGGCGGAATGGTGGAAGCCGCACTCCGTCA
TCTGAGTCTCAAGAATCCCAGACCCTTGATTCATTTTTTGATTATGGTTCACTAGTTGGATCACCCATTTCACCAAGACAAGCCTCTCGTAATTCTGGTTTCGATTGTAG
GAGTGTGAATCGTAAGCTTTGGAGTTATGAGAATGGTGGGGTTGATATTTCCCTCAAAGATGGCTTGTCTTTGTTCTCAGGTATCACTCGTGGAAACGACGTCTCCGACA
CCATGTCTTTGATCTCTGAAAGTCATATATTTGGCCAAAATGGCGAAGAATATGCAGATGATTTTGCAGGGTTTCTTCAAATAAGTCCTCCTAGACGCAGAGTATCAAAA
AGCACTACAACCAGTCCCCTTAGGTCGCGCAGTTACATAAACGTCGAAGATATGATCTTCAAAACTCCTCGGAAGCTTGTCCATTCTCTTCAAGATCTAAACGAGGCGAA
CTCGGACCATGCTAGCAAAAGTTTCAGACGTGCCTACAGGAGCCTATCATCAGGCAATTTGGAGTGGAGTCCAAGATCATCTTTCCATAACCAAAATGGGTTCCCAGACG
ATCAGAAACTCAGCAAAGAGGATGTAGGCGGCCTAGACATCAATGGTGAACAATCACAAGGCAGTTCGGAATCTGTCTCTTCGACAGATGGCATTCCAGCCCACACCGAC
ATCCAAGCTACACCAGTGGAGGTGGCTTATCAAAGCAACATGAAAACTCAATGTTCTGGCATTGAAATGGCATACAAGAAGACTGCTTTGAAACTGGTCTGTGGCTTCTC
ATTTTTGCTTTTCACAGTGTTCACTTCATTCCTGTTGATTAATGATCAGGACCAAGGCTCCTATCTTGTGCCAACC
mRNA sequenceShow/hide mRNA sequence
TTCGGCAAAAATTTGAGTCCAATGCTTCGGCGGGAATTTGCTAATCTTGATAAAGATGCTGATAGTCGCAGATCTGCGATGAAGGCACTGAGGACTTATGTGAAGGAATT
GGACTCCAAGGCCATCCCGGGTTTTCTTGCTCAAGTTTCTGAGACCAGAGAAACTGGTGCATTGACTGGGGAGTGCACGATTTCTCTCTACGAAGTTCTCGCCCGTGTTC
ATGGCGTCAACATCGTGCCGCAGATCGACAGGATTATGACTTCCATAATCAAGACTTTGGCTTCAAGTGCTGGTTCGTTCCCTCTTCAACAGGCATGCTCTAAGGTTGTT
CCGGCGATTGCGAGATATGGGATCGATCCTACCACTCCTGATGATAAGAAGAAGCATGTTATTCACTCTCTGTGTAATCCTCTTTGTGAATCTTTGTTGAGTTCTCAAGA
GAGCCTCACTTCTGGGGCTGCCCTCTGCTTGAAGGCTCTGGTGGATTCGGATAACTGGCGGTTCGCTTCCGATGAGATGATTAACAAGGTTTGCCAGAATGTTGCTGGAG
CTCTGGAGGAGAAGTCCACGCAAACTAATTCACACATGGGGCTTGTTATGACATTAGCAAAGCGAAATCCTCGGATTGTCGAACCATATGCTAGATTGTTACTACAGGCC
GGGCTGCGAATATTGAAGGTCGGGGTAGTAGAGAAGAATTCTCAGAAGAGATTGTCTGCCATTCAAATGATTAATTTCTTGATGAAGTGTCTGGATCCTTGGAGCATATT
GTCGGAACTTCAGACTATAATTGAGGAGATGGAGAATTGCCAGTCCGATCAAATGGCTTATGTCAAAGGTGCAGCCTTTGAAACTCTGCAAACTGCGAAGAGAATAGCTG
CGGATAAAGGGTCGAAAATGGACAAGTCTCCGAGCTCGGTCACTGGATCAAATTTCATTGATCACAGGCGGAGAAGTCCATGGCGGAATGGTGGAAGCCGCACTCCGTCA
TCTGAGTCTCAAGAATCCCAGACCCTTGATTCATTTTTTGATTATGGTTCACTAGTTGGATCACCCATTTCACCAAGACAAGCCTCTCGTAATTCTGGTTTCGATTGTAG
GAGTGTGAATCGTAAGCTTTGGAGTTATGAGAATGGTGGGGTTGATATTTCCCTCAAAGATGGCTTGTCTTTGTTCTCAGGTATCACTCGTGGAAACGACGTCTCCGACA
CCATGTCTTTGATCTCTGAAAGTCATATATTTGGCCAAAATGGCGAAGAATATGCAGATGATTTTGCAGGGTTTCTTCAAATAAGTCCTCCTAGACGCAGAGTATCAAAA
AGCACTACAACCAGTCCCCTTAGGTCGCGCAGTTACATAAACGTCGAAGATATGATCTTCAAAACTCCTCGGAAGCTTGTCCATTCTCTTCAAGATCTAAACGAGGCGAA
CTCGGACCATGCTAGCAAAAGTTTCAGACGTGCCTACAGGAGCCTATCATCAGGCAATTTGGAGTGGAGTCCAAGATCATCTTTCCATAACCAAAATGGGTTCCCAGACG
ATCAGAAACTCAGCAAAGAGGATGTAGGCGGCCTAGACATCAATGGTGAACAATCACAAGGCAGTTCGGAATCTGTCTCTTCGACAGATGGCATTCCAGCCCACACCGAC
ATCCAAGCTACACCAGTGGAGGTGGCTTATCAAAGCAACATGAAAACTCAATGTTCTGGCATTGAAATGGCATACAAGAAGACTGCTTTGAAACTGGTCTGTGGCTTCTC
ATTTTTGCTTTTCACAGTGTTCACTTCATTCCTGTTGATTAATGATCAGGACCAAGGCTCCTATCTTGTGCCAACC
Protein sequenceShow/hide protein sequence
FGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVV
PAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNPRIVEPYARLLLQA
GLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPS
SESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSK
STTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQGSSESVSSTDGIPAHTD
IQATPVEVAYQSNMKTQCSGIEMAYKKTALKLVCGFSFLLFTVFTSFLLINDQDQGSYLVPT