CuGenDBv2

MC01g0315 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g0315
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: ARM repeat superfamily protein
Genome location: MC01:9878616..9882180
RNA-Seq Expression: MC01g0315
Synteny: MC01g0315
Gene Ontology terms:
  GO:0000387 - spliceosomal snRNP assembly (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0032797 - SMN complex (cellular component)
InterPro domains:
  IPR011989 - Armadillo-like helical
  IPR016024 - Armadillo-type fold
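
The genome location above is given as a sequence ID followed by a start..end range. As a minimal sketch, the span it covers could be computed as below. The helper names `parse_location` and `span_length` are hypothetical and not part of CuGenDB, and the coordinates are assumed to be 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("MC01:9878616..9882180")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 3565 bp on MC01. With 0-based or half-open coordinates the result would differ by one.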


Homology
GenBank top hits (columns: hit, e value, % identity; the alignment follows each hit)
TYK08426.1 ARM repeat superfamily protein [Cucumis melo var. makuwa] | e value: 0.0 | % identity: 87.46
Query:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
        MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
Subjt:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS

Query:  KVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVTTLAKRNP
        KVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLT+GAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKSTQTNSHMGLV +LAKRNP
Subjt:  KVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVTTLAKRNP

Query:  RIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDKSPSSV
        RIVEPYARLLLQAGLRILK GVVEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTAK+I ADKGSKMDKSPSSV
Subjt:  RIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDKSPSSV

Query:  TGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVSDTMSL
        TGSNFIDHRRRSPWRNGGSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNS FD RSVNRKLWSYENGGVDISLKDGLSLFS +TRG DVSDTMSL
Subjt:  TGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVSDTMSL

Query:  ISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPRSSFHN
         S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSRSYI VEDMIFKTPRKLVHSLQDLNE NSD+AS S RR +RSLSSGNLEWSP  +F N
Subjt:  ISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPRSSFHN

Query:  QNGFPDDQKLSKEDVGGLDI-NGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSFLLFTVFTSFLLINDQDQG
        +NG  D++KLSKED  GLDI NGEQSQG SES+SSTDG+P H D+QA PV V  QS +K Q  G++MAYKKTALKLVCGFSFLLFT+FTS L I+D DQG
Subjt:  QNGFPDDQKLSKEDVGGLDI-NGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSFLLFTVFTSFLLINDQDQG

Query:  SYLVPT
        SYLVPT
Subjt:  SYLVPT

XP_004147557.1 protein SINE1 [Cucumis sativus] | e value: 0.0 | % identity: 86.54
Query:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA  ETQR    KNLSPMLRREFANLDKDADSRRSAMKAL+TYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLT+GAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS

Query:  TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLV TLAKRNPRIVEPYARLLLQAGLRILK GVVEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTA
Subjt:  TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
        K+I ADKGSKMDKSPSSVTGSNF+DHRRRSPWRNGGSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNSGFD RSVNRKLWSYENGGVDISLKDGL
Subjt:  KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY
        SLFS +TRG DVSDTMS+ S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNE  SD+AS S R  +
Subjt:  SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY

Query:  RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGL-DINGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSF
        RSLSSGNLEWSP  +F NQNGF D+ KLSKED  GL + NGEQSQG  ES+SS DG P H D+QA PV VA QS MK Q  G++MAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGL-DINGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSF

Query:  LLFTVFTSFLLINDQDQGSYLVPT
        LLFT+FTS L I+D DQGSYLVPT
Subjt:  LLFTVFTSFLLINDQDQGSYLVPT

XP_008441975.1 PREDICTED: uncharacterized protein LOC103485976 [Cucumis melo] | e value: 0.0 | % identity: 86.86
Query:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA  ETQR    KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLT+GAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS

Query:  TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLV +LAKRNPRIVEPYARLLLQAGLRILK GVVEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTA
Subjt:  TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
        K+I ADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNS FD RSVNRKLWSYENGGVDISLKDGL
Subjt:  KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY
        SLFS +TRG DVSDTMSL S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSRSYI VEDMIFKTPRKLVHSLQDLNE NSD+AS S RR +
Subjt:  SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY

Query:  RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDI-NGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSF
        RSLSSGNLEWSP  +F N+NG  D++KLSKED  GLDI NGEQSQG SES+SSTDG+P H D+QA PV V  QS +K Q  G++MAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDI-NGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSF

Query:  LLFTVFTSFLLINDQDQGSYLVPT
        LLFT+FTS L I+D DQGSYLVPT
Subjt:  LLFTVFTSFLLINDQDQGSYLVPT

XP_022156223.1 uncharacterized protein LOC111023161 [Momordica charantia] | e value: 0.0 | % identity: 100
Query:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS

Query:  TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
Subjt:  TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
        KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
Subjt:  KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY
        SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY
Subjt:  SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY

Query:  RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSFL
        RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSFL
Subjt:  RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSFL

Query:  LFTVFTSFLLINDQDQGSYLVPT
        LFTVFTSFLLINDQDQGSYLVPT
Subjt:  LFTVFTSFLLINDQDQGSYLVPT

XP_038883420.1 protein SINE1 [Benincasa hispida] | e value: 0.0 | % identity: 86.38
Query:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA  ETQR    KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLT+GAALCLKALVDSDNWRFASDE++NKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS

Query:  TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLV TLAKRNPRIVEPYARLLLQAGLRILK G+VEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTA
Subjt:  TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
        K+I ADKGSKMDKSPSSVTGSNFID  RRSPWRNGGSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNSGFD RSVNRKLWSYENGGVDISLKDGL
Subjt:  KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY
        SLFS ITRG DVSDTMS+ S SH  G NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSR YINVEDMIFKTPRKLV SLQDLNEANS++ SKS RR +
Subjt:  SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY

Query:  RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDIN-GEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSF
        RSLSSGNLEWSP  SF NQ  FPDDQK SKED GGLD +  EQSQG SES+SS+DG+P H D++A PV VA QS +K Q SG++MAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDIN-GEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSF

Query:  LLFTVFTSFLLINDQDQGSYLVPT
        LLFT+FTS L I+D DQGSYLVPT
Subjt:  LLFTVFTSFLLINDQDQGSYLVPT

TrEMBL top hits (columns: hit, e value, % identity; the alignment follows each hit)
A0A0A0KYP2 Uncharacterized protein | e value: 0.0 | % identity: 86.54
Query:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA  ETQR    KNLSPMLRREFANLDKDADSRRSAMKAL+TYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLT+GAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS

Query:  TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLV TLAKRNPRIVEPYARLLLQAGLRILK GVVEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTA
Subjt:  TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
        K+I ADKGSKMDKSPSSVTGSNF+DHRRRSPWRNGGSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNSGFD RSVNRKLWSYENGGVDISLKDGL
Subjt:  KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY
        SLFS +TRG DVSDTMS+ S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNE  SD+AS S R  +
Subjt:  SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY

Query:  RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGL-DINGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSF
        RSLSSGNLEWSP  +F NQNGF D+ KLSKED  GL + NGEQSQG  ES+SS DG P H D+QA PV VA QS MK Q  G++MAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGL-DINGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSF

Query:  LLFTVFTSFLLINDQDQGSYLVPT
        LLFT+FTS L I+D DQGSYLVPT
Subjt:  LLFTVFTSFLLINDQDQGSYLVPT

A0A1S3B5D3 uncharacterized protein LOC103485976 | e value: 0.0 | % identity: 86.86
Query:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA  ETQR    KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLT+GAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS

Query:  TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLV +LAKRNPRIVEPYARLLLQAGLRILK GVVEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTA
Subjt:  TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
        K+I ADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNS FD RSVNRKLWSYENGGVDISLKDGL
Subjt:  KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY
        SLFS +TRG DVSDTMSL S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSRSYI VEDMIFKTPRKLVHSLQDLNE NSD+AS S RR +
Subjt:  SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY

Query:  RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDI-NGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSF
        RSLSSGNLEWSP  +F N+NG  D++KLSKED  GLDI NGEQSQG SES+SSTDG+P H D+QA PV V  QS +K Q  G++MAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDI-NGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSF

Query:  LLFTVFTSFLLINDQDQGSYLVPT
        LLFT+FTS L I+D DQGSYLVPT
Subjt:  LLFTVFTSFLLINDQDQGSYLVPT

A0A5A7UWA1 ARM repeat superfamily protein | e value: 0.0 | % identity: 86.86
Query:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA  ETQR    KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLT+GAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS

Query:  TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLV +LAKRNPRIVEPYARLLLQAGLRILK GVVEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTA
Subjt:  TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
        K+I ADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNS FD RSVNRKLWSYENGGVDISLKDGL
Subjt:  KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY
        SLFS +TRG DVSDTMSL S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSRSYI VEDMIFKTPRKLVHSLQDLNE NSD+AS S RR +
Subjt:  SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY

Query:  RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDI-NGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSF
        RSLSSGNLEWSP  +F N+NG  D++KLSKED  GLDI NGEQSQG SES+SSTDG+P H D+QA PV V  QS +K Q  G++MAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDI-NGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSF

Query:  LLFTVFTSFLLINDQDQGSYLVPT
        LLFT+FTS L I+D DQGSYLVPT
Subjt:  LLFTVFTSFLLINDQDQGSYLVPT

A0A5D3CDJ7 ARM repeat superfamily protein | e value: 0.0 | % identity: 87.46
Query:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
        MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
Subjt:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS

Query:  KVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVTTLAKRNP
        KVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLT+GAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKSTQTNSHMGLV +LAKRNP
Subjt:  KVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVTTLAKRNP

Query:  RIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDKSPSSV
        RIVEPYARLLLQAGLRILK GVVEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTAK+I ADKGSKMDKSPSSV
Subjt:  RIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDKSPSSV

Query:  TGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVSDTMSL
        TGSNFIDHRRRSPWRNGGSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNS FD RSVNRKLWSYENGGVDISLKDGLSLFS +TRG DVSDTMSL
Subjt:  TGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVSDTMSL

Query:  ISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPRSSFHN
         S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSRSYI VEDMIFKTPRKLVHSLQDLNE NSD+AS S RR +RSLSSGNLEWSP  +F N
Subjt:  ISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPRSSFHN

Query:  QNGFPDDQKLSKEDVGGLDI-NGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSFLLFTVFTSFLLINDQDQG
        +NG  D++KLSKED  GLDI NGEQSQG SES+SSTDG+P H D+QA PV V  QS +K Q  G++MAYKKTALKLVCGFSFLLFT+FTS L I+D DQG
Subjt:  QNGFPDDQKLSKEDVGGLDI-NGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSFLLFTVFTSFLLINDQDQG

Query:  SYLVPT
        SYLVPT
Subjt:  SYLVPT

A0A6J1DQ15 uncharacterized protein LOC111023161 | e value: 0.0 | % identity: 100
Query:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKS

Query:  TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
Subjt:  TQTNSHMGLVTTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
        KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
Subjt:  KRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY
        SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY
Subjt:  SLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAY

Query:  RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSFL
        RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSFL
Subjt:  RSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQGGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSFL

Query:  LFTVFTSFLLINDQDQGSYLVPT
        LFTVFTSFLLINDQDQGSYLVPT
Subjt:  LFTVFTSFLLINDQDQGSYLVPT

SwissProt top hits (columns: hit, e value, % identity; the alignment follows each hit)
F4IK92 TORTIFOLIA1-like protein 2 | e value: 3.1e-04 | % identity: 21.18
Query:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSET--RETGALTGECTISLYEVLARVHGVNIVPQIDRIMT
        MKA   TQ+ +F      ++     N   D D+ +  +  L   V+ L    +  FL+ + +T   +  A+  EC I L   LAR H   + P + ++++
Subjt:  MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSET--RETGALTGECTISLYEVLARVHGVNIVPQIDRIMT

Query:  SIIKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEE
        SI+K L        ++ AC + +  +A      +  +D+   V  SL  PL E++    + + SGAALCL  ++DS      S E    + Q +   +  
Subjt:  SIIKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEE

Query:  KSTQTNSHMGLVTTLAKRNPRIV---EPYARLLLQAGLRILKVGVVEKN-SQKRLSAIQMINFLM---KCLDPWSILSELQTIIEEMENCQSDQMAYVKG
             NSH      + + N  I+      ++ +L + +   +  +  K+ + ++ +++ ++       K L P        + I  +E+C+ D++  V+ 
Subjt:  KSTQTNSHMGLVTTLAKRNPRIV---EPYARLLLQAGLRILKVGVVEKN-SQKRLSAIQMINFLM---KCLDPWSILSELQTIIEEMENCQSDQMAYVKG

Query:  AAFETLQTAKRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGS-------------PISPRQASRNSGFDCR
        +    L+  K +      +  ++ SSV  S            NG        ++ES  L S  D+    G              P+S RQ       D R
Subjt:  AAFETLQTAKRIAADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESQESQTLDSFFDYGSLVGS-------------PISPRQASRNSGFDCR

Query:  SVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVSDTMS
          N+  W  E    + S    + L++  + G+ ++ T +
Subjt:  SVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVSDTMS

Q5XVI1 Protein SINE1 | e value: 1.3e-154 | % identity: 54.59
Query:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP
        G NL+P+LR+E ANLDKD +SR+SAMKAL++YVK+LDSKAIPGFLAQV ET+ET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSFP
Subjt:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP

Query:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVTT
        LQQACSKV+PAIARYGIDPTT +DKK+ +IHSLC PL +SLL+SQESLTSGAALCLKALVDSDNWRFASDEM+N+VCQNV  AL+  S QT+  MGLV +
Subjt:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVTT

Query:  LAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMD
        LAK NP IVE YARLL+  GLRIL  GV E NSQKRLSA+QM+NFLMKCLDP SI SE++ II+EME CQSDQMAYV+GAA+E + T+KRIAA+  SKM+
Subjt:  LAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMD

Query:  KSPSSVTGSNFIDHRRRSPWRNGGSRTPS-SESQESQTLDSFFDYGSLV-GSPISPRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSGITR
        K   SVTGSNF         RN  S  P  S S ESQTL SF  Y S V  SPIS    S NS FD RSVNRKLW   ENGG VDISLKDG  LFS +T+
Subjt:  KSPSSVTGSNFIDHRRRSPWRNGGSRTPS-SESQESQTLDSFFDYGSLV-GSPISPRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSGITR

Query:  GNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRS-YINVEDM-IFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSG
        G+      + +S+S +   +  E  D+F GFL  S       ++TT SP R RS  IN ED  IF TPRKL+ SLQ                        
Subjt:  GNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRS-YINVEDM-IFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSG

Query:  NLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQ--GGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSFLLFTV
                       +PDD  L   D+    + GE+ +  G  ++       P   +  ++ + V+  +      +G     K +  KLV   SF++  +
Subjt:  NLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQ--GGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSFLLFTV

Query:  FTSFLLI--NDQDQGSYLVPT
        F + +L+   D D G Y VPT
Subjt:  FTSFLLI--NDQDQGSYLVPT

Q9SQR5 Protein SINE2 | e value: 8.6e-87 | % identity: 56.51
Query:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP
        G+NL    R+E ANLDKD DS ++AM  LR+ VK+LD+K +  F+AQ+S+ +E G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS  
Subjt:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP

Query:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSS--QESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV
        +QQACS+ V A+ARYGIDPTTP+DKK +VIHSLC PL +SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EM+N VCQ++A ALE  S++  SHM LV
Subjt:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSS--QESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV

Query:  TTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSK
          L+K NP  VE YARL +++GLRIL +GVVE +SQKRL AIQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A+R+  +    
Subjt:  TTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSK

Query:  MD----KSPSSVTGS
         D    K  +S++GS
Subjt:  MD----KSPSSVTGS

Arabidopsis top hits (columns: hit, e value, % identity; the alignment follows each hit)
AT1G54385.1 ARM repeat superfamily protein | e value: 9.0e-156 | % identity: 54.59
Query:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP
        G NL+P+LR+E ANLDKD +SR+SAMKAL++YVK+LDSKAIPGFLAQV ET+ET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSFP
Subjt:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP

Query:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVTT
        LQQACSKV+PAIARYGIDPTT +DKK+ +IHSLC PL +SLL+SQESLTSGAALCLKALVDSDNWRFASDEM+N+VCQNV  AL+  S QT+  MGLV +
Subjt:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVTT

Query:  LAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMD
        LAK NP IVE YARLL+  GLRIL  GV E NSQKRLSA+QM+NFLMKCLDP SI SE++ II+EME CQSDQMAYV+GAA+E + T+KRIAA+  SKM+
Subjt:  LAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMD

Query:  KSPSSVTGSNFIDHRRRSPWRNGGSRTPS-SESQESQTLDSFFDYGSLV-GSPISPRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSGITR
        K   SVTGSNF         RN  S  P  S S ESQTL SF  Y S V  SPIS    S NS FD RSVNRKLW   ENGG VDISLKDG  LFS +T+
Subjt:  KSPSSVTGSNFIDHRRRSPWRNGGSRTPS-SESQESQTLDSFFDYGSLV-GSPISPRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSGITR

Query:  GNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRS-YINVEDM-IFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSG
        G+      + +S+S +   +  E  D+F GFL  S       ++TT SP R RS  IN ED  IF TPRKL+ SLQ                        
Subjt:  GNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRS-YINVEDM-IFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSG

Query:  NLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQ--GGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSFLLFTV
                       +PDD  L   D+    + GE+ +  G  ++       P   +  ++ + V+  +      +G     K +  KLV   SF++  +
Subjt:  NLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQ--GGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSFLLFTV

Query:  FTSFLLI--NDQDQGSYLVPT
        F + +L+   D D G Y VPT
Subjt:  FTSFLLI--NDQDQGSYLVPT

AT1G54385.2 ARM repeat superfamily protein | e value: 9.0e-156 | % identity: 54.59
Query:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP
        G NL+P+LR+E ANLDKD +SR+SAMKAL++YVK+LDSKAIPGFLAQV ET+ET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSFP
Subjt:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP

Query:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVTT
        LQQACSKV+PAIARYGIDPTT +DKK+ +IHSLC PL +SLL+SQESLTSGAALCLKALVDSDNWRFASDEM+N+VCQNV  AL+  S QT+  MGLV +
Subjt:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVTT

Query:  LAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMD
        LAK NP IVE YARLL+  GLRIL  GV E NSQKRLSA+QM+NFLMKCLDP SI SE++ II+EME CQSDQMAYV+GAA+E + T+KRIAA+  SKM+
Subjt:  LAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMD

Query:  KSPSSVTGSNFIDHRRRSPWRNGGSRTPS-SESQESQTLDSFFDYGSLV-GSPISPRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSGITR
        K   SVTGSNF         RN  S  P  S S ESQTL SF  Y S V  SPIS    S NS FD RSVNRKLW   ENGG VDISLKDG  LFS +T+
Subjt:  KSPSSVTGSNFIDHRRRSPWRNGGSRTPS-SESQESQTLDSFFDYGSLV-GSPISPRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSGITR

Query:  GNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRS-YINVEDM-IFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSG
        G+      + +S+S +   +  E  D+F GFL  S       ++TT SP R RS  IN ED  IF TPRKL+ SLQ                        
Subjt:  GNDVSDTMSLISESHIFGQNGEEYADDFAGFLQISPPRRRVSKSTTTSPLRSRS-YINVEDM-IFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSG

Query:  NLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQ--GGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSFLLFTV
                       +PDD  L   D+    + GE+ +  G  ++       P   +  ++ + V+  +      +G     K +  KLV   SF++  +
Subjt:  NLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQ--GGSESVSSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSFLLFTV

Query:  FTSFLLI--NDQDQGSYLVPT
        F + +L+   D D G Y VPT
Subjt:  FTSFLLI--NDQDQGSYLVPT

AT3G03970.1 ARM repeat superfamily protein | e value: 6.1e-88 | % identity: 56.51
Query:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP
        G+NL    R+E ANLDKD DS ++AM  LR+ VK+LD+K +  F+AQ+S+ +E G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS  
Subjt:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP

Query:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSS--QESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV
        +QQACS+ V A+ARYGIDPTTP+DKK +VIHSLC PL +SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EM+N VCQ++A ALE  S++  SHM LV
Subjt:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSS--QESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV

Query:  TTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSK
          L+K NP  VE YARL +++GLRIL +GVVE +SQKRL AIQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A+R+  +    
Subjt:  TTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSK

Query:  MD----KSPSSVTGS
         D    K  +S++GS
Subjt:  MD----KSPSSVTGS

AT3G03970.2 ARM repeat superfamily protein    e value: 6.1e-88    %identity: 56.51
Query:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP
        G+NL    R+E ANLDKD DS ++AM  LR+ VK+LD+K +  F+AQ+S+ +E G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS  
Subjt:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP

Query:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSS--QESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV
        +QQACS+ V A+ARYGIDPTTP+DKK +VIHSLC PL +SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EM+N VCQ++A ALE  S++  SHM LV
Subjt:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSS--QESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV

Query:  TTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSK
          L+K NP  VE YARL +++GLRIL +GVVE +SQKRL AIQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A+R+  +    
Subjt:  TTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSK

Query:  MD----KSPSSVTGS
         D    K  +S++GS
Subjt:  MD----KSPSSVTGS

AT3G03970.3 ARM repeat superfamily protein    e value: 6.1e-88    %identity: 56.51
Query:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP
        G+NL    R+E ANLDKD DS ++AM  LR+ VK+LD+K +  F+AQ+S+ +E G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS  
Subjt:  GKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFP

Query:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSS--QESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV
        +QQACS+ V A+ARYGIDPTTP+DKK +VIHSLC PL +SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EM+N VCQ++A ALE  S++  SHM LV
Subjt:  LQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSS--QESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLV

Query:  TTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSK
          L+K NP  VE YARL +++GLRIL +GVVE +SQKRL AIQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A+R+  +    
Subjt:  TTLAKRNPRIVEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSK

Query:  MD----KSPSSVTGS
         D    K  +S++GS
Subjt:  MD----KSPSSVTGS


Sequences
CDS sequence
ATGAAAGCAACCCCAGAAACTCAAAGGTTTGTTTTCGGCAAAAATTTGAGTCCAATGCTTCGGCGGGAATTTGCTAATCTTGATAAAGATGCTGATAGTCGCAGATCTGC
GATGAAGGCACTGAGGACTTATGTGAAGGAATTGGACTCCAAGGCCATCCCGGGTTTTCTTGCTCAAGTTTCTGAGACCAGAGAAACTGGTGCATTGACTGGGGAGTGCA
CGATTTCTCTCTACGAAGTTCTCGCCCGTGTTCATGGCGTCAACATCGTGCCGCAGATCGACAGGATTATGACTTCCATAATCAAGACTTTGGCTTCAAGTGCTGGGTCG
TTCCCTCTTCAACAGGCATGCTCTAAGGTTGTTCCGGCGATTGCGAGATATGGGATCGATCCTACCACTCCTGATGATAAGAAGAAGCATGTTATTCACTCTCTGTGTAA
TCCTCTTTGTGAATCTTTGTTGAGTTCTCAAGAGAGCCTCACTTCTGGGGCTGCCCTCTGCTTGAAGGCTCTGGTGGATTCAGATAACTGGCGGTTCGCTTCCGATGAGA
TGATTAACAAGGTTTGCCAGAATGTTGCTGGAGCTCTGGAGGAGAAGTCCACGCAAACTAATTCACACATGGGGCTTGTTACGACATTAGCAAAGCGAAATCCTCGGATT
GTCGAACCATATGCTAGATTGTTACTACAGGCCGGGCTGCGAATATTGAAGGTCGGGGTAGTAGAGAAGAATTCTCAGAAGAGATTGTCTGCCATTCAAATGATTAATTT
CTTGATGAAGTGTCTGGATCCTTGGAGCATATTGTCGGAACTTCAGACTATAATTGAGGAGATGGAGAATTGCCAGTCCGATCAAATGGCTTATGTCAAAGGTGCAGCCT
TTGAAACTCTGCAAACTGCGAAGAGAATAGCTGCGGATAAAGGGTCGAAAATGGACAAGTCTCCGAGCTCGGTCACTGGATCAAATTTCATTGATCACAGGCGGAGAAGT
CCATGGCGGAATGGTGGAAGCCGCACTCCGTCATCTGAGTCTCAAGAATCCCAGACCCTTGATTCATTTTTTGATTATGGTTCACTAGTTGGATCACCCATTTCACCAAG
ACAAGCCTCTCGTAATTCTGGTTTTGATTGTAGGAGTGTGAATCGTAAGCTTTGGAGTTATGAGAATGGTGGGGTTGATATTTCCCTCAAAGATGGCTTGTCTTTGTTCT
CAGGAATCACTCGTGGAAACGACGTCTCCGACACCATGTCTTTGATCTCTGAAAGTCATATATTTGGCCAAAATGGCGAAGAATATGCAGATGATTTTGCAGGGTTTCTT
CAAATAAGTCCTCCTAGACGCAGAGTATCAAAAAGCACTACAACCAGTCCCCTTAGGTCGCGCAGTTACATAAATGTCGAAGATATGATCTTCAAAACTCCTCGGAAGCT
TGTCCATTCTCTTCAAGATCTAAACGAGGCGAACTCGGACCATGCTAGCAAAAGTTTCAGACGTGCCTACAGGAGCCTATCATCAGGCAATTTGGAGTGGAGTCCAAGAT
CATCTTTCCATAACCAAAATGGGTTCCCAGACGATCAGAAACTCAGCAAAGAGGATGTAGGCGGCCTAGACATCAATGGTGAACAATCACAAGGCGGTTCGGAATCTGTC
TCTTCTACCGATGGCATTCCAGCCCACACCGACATCCAAGCTACACCAGTGGAGGTGGCTTATCAAAGCAACATGAAAACTCAATGTTCTGGAATTGATATGGCATACAA
GAAGACTGCTTTGAAACTGGTCTGTGGCTTCTCATTTTTGCTTTTCACAGTGTTCACTTCATTCCTGTTGATTAATGATCAGGACCAAGGCTCCTATCTTGTGCCAACCT
AA
mRNA sequence
CACAAATTAAAATTGAAAATACAAAAACCAAAATCAACGTTTTAAAAAGGAACCGAAACAACCAGTACTTAAACCCTAGAAATAAAAACAATGAATATTATTTAAGAACA
AAGGTGGCAATAGCAAAATCAAAGGAGTGATGAGAAAAGAAAGATTTGAATTTGGTATAGTTTGGAAGGGTCATTATGGTCCAATGAGCAAAGTTTAAGAGAGGGGCCCA
CAGAAGAAGGGATAGCGGTGGAATTTCATGAAAAGATTTTGAATTCAAAGAAACAGAGCGCTGCTCAGTGCTGAATCCCCAGAAAAGGAAGGGAAAAAAAAATCCCCAAA
TCAAATACAAAATCGGACAGCACCACACCACACAACAAAATCCCTAATCCCTCTCCTCGAGAATCATAGTTTCTTTCTTCCATCTCCACGCCATGAAAGCAACCCCAGAA
ACTCAAAGGTTTGTTTTCGGCAAAAATTTGAGTCCAATGCTTCGGCGGGAATTTGCTAATCTTGATAAAGATGCTGATAGTCGCAGATCTGCGATGAAGGCACTGAGGAC
TTATGTGAAGGAATTGGACTCCAAGGCCATCCCGGGTTTTCTTGCTCAAGTTTCTGAGACCAGAGAAACTGGTGCATTGACTGGGGAGTGCACGATTTCTCTCTACGAAG
TTCTCGCCCGTGTTCATGGCGTCAACATCGTGCCGCAGATCGACAGGATTATGACTTCCATAATCAAGACTTTGGCTTCAAGTGCTGGGTCGTTCCCTCTTCAACAGGCA
TGCTCTAAGGTTGTTCCGGCGATTGCGAGATATGGGATCGATCCTACCACTCCTGATGATAAGAAGAAGCATGTTATTCACTCTCTGTGTAATCCTCTTTGTGAATCTTT
GTTGAGTTCTCAAGAGAGCCTCACTTCTGGGGCTGCCCTCTGCTTGAAGGCTCTGGTGGATTCAGATAACTGGCGGTTCGCTTCCGATGAGATGATTAACAAGGTTTGCC
AGAATGTTGCTGGAGCTCTGGAGGAGAAGTCCACGCAAACTAATTCACACATGGGGCTTGTTACGACATTAGCAAAGCGAAATCCTCGGATTGTCGAACCATATGCTAGA
TTGTTACTACAGGCCGGGCTGCGAATATTGAAGGTCGGGGTAGTAGAGAAGAATTCTCAGAAGAGATTGTCTGCCATTCAAATGATTAATTTCTTGATGAAGTGTCTGGA
TCCTTGGAGCATATTGTCGGAACTTCAGACTATAATTGAGGAGATGGAGAATTGCCAGTCCGATCAAATGGCTTATGTCAAAGGTGCAGCCTTTGAAACTCTGCAAACTG
CGAAGAGAATAGCTGCGGATAAAGGGTCGAAAATGGACAAGTCTCCGAGCTCGGTCACTGGATCAAATTTCATTGATCACAGGCGGAGAAGTCCATGGCGGAATGGTGGA
AGCCGCACTCCGTCATCTGAGTCTCAAGAATCCCAGACCCTTGATTCATTTTTTGATTATGGTTCACTAGTTGGATCACCCATTTCACCAAGACAAGCCTCTCGTAATTC
TGGTTTTGATTGTAGGAGTGTGAATCGTAAGCTTTGGAGTTATGAGAATGGTGGGGTTGATATTTCCCTCAAAGATGGCTTGTCTTTGTTCTCAGGAATCACTCGTGGAA
ACGACGTCTCCGACACCATGTCTTTGATCTCTGAAAGTCATATATTTGGCCAAAATGGCGAAGAATATGCAGATGATTTTGCAGGGTTTCTTCAAATAAGTCCTCCTAGA
CGCAGAGTATCAAAAAGCACTACAACCAGTCCCCTTAGGTCGCGCAGTTACATAAATGTCGAAGATATGATCTTCAAAACTCCTCGGAAGCTTGTCCATTCTCTTCAAGA
TCTAAACGAGGCGAACTCGGACCATGCTAGCAAAAGTTTCAGACGTGCCTACAGGAGCCTATCATCAGGCAATTTGGAGTGGAGTCCAAGATCATCTTTCCATAACCAAA
ATGGGTTCCCAGACGATCAGAAACTCAGCAAAGAGGATGTAGGCGGCCTAGACATCAATGGTGAACAATCACAAGGCGGTTCGGAATCTGTCTCTTCTACCGATGGCATT
CCAGCCCACACCGACATCCAAGCTACACCAGTGGAGGTGGCTTATCAAAGCAACATGAAAACTCAATGTTCTGGAATTGATATGGCATACAAGAAGACTGCTTTGAAACT
GGTCTGTGGCTTCTCATTTTTGCTTTTCACAGTGTTCACTTCATTCCTGTTGATTAATGATCAGGACCAAGGCTCCTATCTTGTGCCAACCTAATATTCTCCTTCTACTG
CCTCACCTGTCTGAAATAGGCTTAAACTGACTGTTGTAAGCTATGATGAGTGAGTTCAATGTAGAAGAAGGAAATATAGAGCTTACTCAGAAAAAGAAAGCAAATCCTAT
TAGATAGTTTTGGAAAATGACTCGATGTAGCACATTTCAGTGTGGTCAATGAATCTCTTGACTGTCAAAGTTTCTACACCTGTTTTTTCTCCTTTTTGTTTTTTGTACAA
TCTGGATTGAATCCTTGGAAGTTTTTAGGAGTGAGGAGATTTTAGCTAGTAGAGATTTGGGTTGTGTAAATGTTGAGAGCCAGTCCATACTTGTTGAGGCTGCAGGAGGA
TTGGTTTGAGTATGGCTGACTGAGCAACACATATGTAAGAATATTCTCCATAGTAGACATGGTTTGATCCAGGGCTGAACATGTACAGTTCATCAAATCCATCTCTCCCC
ACCACCACAGAGTTTCTCCTTCCCTGTGCCATGCCAAAACAAAAATGCAGGTGATTAAACCCAATCATGGAGCTTAGAAACTGGCTGGTGTGTCTAAATCTTGTAACTTT
TGTGTATTTTGGTTTTTGAATTTTAAAAATGTCTATTTTGATCCTTCGAACGTAAAAAAAAATTATTATTATTTTAATCCTTGACAAGAAAAAAGGACAATGATCAATTT
TTATCTTGAAAGTTGATGAGGTGTATCTATTTTTATTATAAACTTTCAATTTTGCCAAATTGCATCTATTCACTTTAAACTTTAAAAAGTGTTGCAATT
Protein sequence
MKATPETQRFVFGKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPGFLAQVSETRETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGS
FPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLCESLLSSQESLTSGAALCLKALVDSDNWRFASDEMINKVCQNVAGALEEKSTQTNSHMGLVTTLAKRNPRI
VEPYARLLLQAGLRILKVGVVEKNSQKRLSAIQMINFLMKCLDPWSILSELQTIIEEMENCQSDQMAYVKGAAFETLQTAKRIAADKGSKMDKSPSSVTGSNFIDHRRRS
PWRNGGSRTPSSESQESQTLDSFFDYGSLVGSPISPRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSGITRGNDVSDTMSLISESHIFGQNGEEYADDFAGFL
QISPPRRRVSKSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQDLNEANSDHASKSFRRAYRSLSSGNLEWSPRSSFHNQNGFPDDQKLSKEDVGGLDINGEQSQGGSESV
SSTDGIPAHTDIQATPVEVAYQSNMKTQCSGIDMAYKKTALKLVCGFSFLLFTVFTSFLLINDQDQGSYLVPT
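
The protein sequence listed above appears to be the translation of the CDS sequence under the standard genetic code. The following is a minimal sketch for checking that correspondence. It is not part of the database record: the `translate` helper and the variable names are hypothetical, and the standard nuclear genetic code (NCBI translation table 1) is an assumption. Only the first 30 bases of the CDS are used here; the full CDS can be passed in the same way.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Hypothetical helper for illustration; it raises KeyError on
    ambiguous bases such as N.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS sequence listed in this record.
cds_start = "ATGAAAGCAACCCCAGAAACTCAAAGGTTT"
print(translate(cds_start))  # MKATPETQRF
```

The printed peptide matches the first ten residues of the protein sequence in this record. Translating the last codons of the CDS (`GTGCCAACCTAA`) in the same way gives `VPT`, which matches the C-terminal end of the protein.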