; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10017450 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10017450
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionARM repeat superfamily protein
Genome locationChr03:14403194..14405528
RNA-Seq ExpressionHG10017450
SyntenyHG10017450
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0032797 - SMN complex (cellular component)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK08426.1 ARM repeat superfamily protein [Cucumis melo var. makuwa]0.0e+0094.07Show/hide
Query:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
        MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
Subjt:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS

Query:  KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNP
        KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM+LAKRNP
Subjt:  KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNP

Query:  RIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKLDKSPSSV
        RIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGSK+DKSPSSV
Subjt:  RIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKLDKSPSSV

Query:  TGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSEVARGTDVSDTMSV
        TGSNFID RRRSPWRNGGSRTPSSESPESQTL+SFFDYGSLVGSPFSSRQASRNS FD RSVNRKLWSYENGGVDISLKDGLSLFSEV RGTDVSDTMS+
Subjt:  TGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSEVARGTDVSDTMSV

Query:  HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRHRSLSSGNLEWSPPRSFLN
        HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR YI VEDMIFKTPRKLV SLQDLNE NSDYAS SSR RHRSLSSGNLEWSPPR+FLN
Subjt:  HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRHRSLSSGNLEWSPPRSFLN

Query:  QNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWIDDQDQ
        +NG  D++KLSK+D   GLD DNGEQSQGSSESISSTDGVP H DVQA+PVA  CQS IKPQY G+EMAYKKTALKLVCGFSFLLFTIFTSLLWIDD DQ
Subjt:  QNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWIDDQDQ

Query:  GSYLVPT
        GSYLVPT
Subjt:  GSYLVPT

XP_004147557.1 protein SINE1 [Cucumis sativus]0.0e+0093.76Show/hide
Query:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKAL+TYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA

Query:  KKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSK+DKSPSSVTGSNF+D RRRSPWRNGGSRTPSSESPESQTL+SFFDYGSLVGSPFSSRQASRNSGFD RSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRH
        SLFSEV RGTDVSDTMS++SGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR YINVEDMIFKTPRKLV SLQDLNE  SDYAS SSR RH
Subjt:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRH

Query:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFS
        RSLSSGNLEWSPPR+FLNQNGF D+ KLSK+D   GL N NGEQSQGS ESISS DG P H DVQAIPVA ACQS +KPQY G+EMAYKKTALKLVCGFS
Subjt:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFS

Query:  FLLFTIFTSLLWIDDQDQGSYLVPT
        FLLFTIFTSLLWIDD DQGSYLVPT
Subjt:  FLLFTIFTSLLWIDDQDQGSYLVPT

XP_008441975.1 PREDICTED: uncharacterized protein LOC103485976 [Cucumis melo]0.0e+0094.08Show/hide
Query:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA
        TQTNSHMGLVM+LAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA

Query:  KKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSK+DKSPSSVTGSNFID RRRSPWRNGGSRTPSSESPESQTL+SFFDYGSLVGSPFSSRQASRNS FD RSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRH
        SLFSEV RGTDVSDTMS+HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR YI VEDMIFKTPRKLV SLQDLNE NSDYAS SSR RH
Subjt:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRH

Query:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFS
        RSLSSGNLEWSPPR+FLN+NG  D++KLSK+D   GLD DNGEQSQGSSESISSTDGVP H DVQA+PVA  CQS IKPQY G+EMAYKKTALKLVCGFS
Subjt:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFS

Query:  FLLFTIFTSLLWIDDQDQGSYLVPT
        FLLFTIFTSLLWIDD DQGSYLVPT
Subjt:  FLLFTIFTSLLWIDDQDQGSYLVPT

XP_022156223.1 uncharacterized protein LOC111023161 [Momordica charantia]1.5e-30187.68Show/hide
Query:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA  ETQR    KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLTSGAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA
        TQTNSHMGLV TLAKRNPRIVEPYARLLLQAGLRILK G+VEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA

Query:  KKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
        K+I ADKGSK+DKSPSSVTGSNFID RRRSPWRNGGSRTPSSES ESQTL+SFFDYGSLVGSP S RQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRH
        SLFS + RG DVSDTMS+ S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSR YINVEDMIFKTPRKLV SLQDLNEANSD+ASKS R  +
Subjt:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRH

Query:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFS
        RSLSSGNLEWSP  SF NQNGFPDDQKLSK+D  GGLD  NGEQSQG SES+SSTDG+P H D+QA PV  A QS +K Q SGI+MAYKKTALKLVCGFS
Subjt:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFS

Query:  FLLFTIFTSLLWIDDQDQGSYLVPT
        FLLFT+FTS L I+DQDQGSYLVPT
Subjt:  FLLFTIFTSLLWIDDQDQGSYLVPT

XP_038883420.1 protein SINE1 [Benincasa hispida]0.0e+0095.2Show/hide
Query:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFM+KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDE+VNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA

Query:  KKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSK+DKSPSSVTGSNFIDR RRSPWRNGGSRTPSSESPESQTL+SFFDYGSLVGSPFSSRQASRNSGFD RSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRH
        SLFS++ RGTDVSDTMSVHSGSHK GHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANS+Y SKSSR RH
Subjt:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRH

Query:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFS
        RSLSSGNLEWSPPRSFLNQ  FPDDQK SK+D GGGLDND  EQSQGSSESISS+DGVP HGDV+AIPVA ACQS IKPQYSG+EMAYKKTALKLVCGFS
Subjt:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFS

Query:  FLLFTIFTSLLWIDDQDQGSYLVPT
        FLLFTIFTSLLWIDD DQGSYLVPT
Subjt:  FLLFTIFTSLLWIDDQDQGSYLVPT

TrEMBL top hitse value%identityAlignment
A0A0A0KYP2 Uncharacterized protein0.0e+0093.76Show/hide
Query:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKAL+TYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA

Query:  KKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSK+DKSPSSVTGSNF+D RRRSPWRNGGSRTPSSESPESQTL+SFFDYGSLVGSPFSSRQASRNSGFD RSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRH
        SLFSEV RGTDVSDTMS++SGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR YINVEDMIFKTPRKLV SLQDLNE  SDYAS SSR RH
Subjt:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRH

Query:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFS
        RSLSSGNLEWSPPR+FLNQNGF D+ KLSK+D   GL N NGEQSQGS ESISS DG P H DVQAIPVA ACQS +KPQY G+EMAYKKTALKLVCGFS
Subjt:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFS

Query:  FLLFTIFTSLLWIDDQDQGSYLVPT
        FLLFTIFTSLLWIDD DQGSYLVPT
Subjt:  FLLFTIFTSLLWIDDQDQGSYLVPT

A0A1S3B5D3 uncharacterized protein LOC1034859760.0e+0094.08Show/hide
Query:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA
        TQTNSHMGLVM+LAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA

Query:  KKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSK+DKSPSSVTGSNFID RRRSPWRNGGSRTPSSESPESQTL+SFFDYGSLVGSPFSSRQASRNS FD RSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRH
        SLFSEV RGTDVSDTMS+HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR YI VEDMIFKTPRKLV SLQDLNE NSDYAS SSR RH
Subjt:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRH

Query:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFS
        RSLSSGNLEWSPPR+FLN+NG  D++KLSK+D   GLD DNGEQSQGSSESISSTDGVP H DVQA+PVA  CQS IKPQY G+EMAYKKTALKLVCGFS
Subjt:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFS

Query:  FLLFTIFTSLLWIDDQDQGSYLVPT
        FLLFTIFTSLLWIDD DQGSYLVPT
Subjt:  FLLFTIFTSLLWIDDQDQGSYLVPT

A0A5A7UWA1 ARM repeat superfamily protein0.0e+0094.08Show/hide
Query:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA
        TQTNSHMGLVM+LAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA

Query:  KKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSK+DKSPSSVTGSNFID RRRSPWRNGGSRTPSSESPESQTL+SFFDYGSLVGSPFSSRQASRNS FD RSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRH
        SLFSEV RGTDVSDTMS+HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR YI VEDMIFKTPRKLV SLQDLNE NSDYAS SSR RH
Subjt:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRH

Query:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFS
        RSLSSGNLEWSPPR+FLN+NG  D++KLSK+D   GLD DNGEQSQGSSESISSTDGVP H DVQA+PVA  CQS IKPQY G+EMAYKKTALKLVCGFS
Subjt:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFS

Query:  FLLFTIFTSLLWIDDQDQGSYLVPT
        FLLFTIFTSLLWIDD DQGSYLVPT
Subjt:  FLLFTIFTSLLWIDDQDQGSYLVPT

A0A5D3CDJ7 ARM repeat superfamily protein0.0e+0094.07Show/hide
Query:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
        MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
Subjt:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS

Query:  KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNP
        KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM+LAKRNP
Subjt:  KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNP

Query:  RIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKLDKSPSSV
        RIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGSK+DKSPSSV
Subjt:  RIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKLDKSPSSV

Query:  TGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSEVARGTDVSDTMSV
        TGSNFID RRRSPWRNGGSRTPSSESPESQTL+SFFDYGSLVGSPFSSRQASRNS FD RSVNRKLWSYENGGVDISLKDGLSLFSEV RGTDVSDTMS+
Subjt:  TGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSEVARGTDVSDTMSV

Query:  HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRHRSLSSGNLEWSPPRSFLN
        HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR YI VEDMIFKTPRKLV SLQDLNE NSDYAS SSR RHRSLSSGNLEWSPPR+FLN
Subjt:  HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRHRSLSSGNLEWSPPRSFLN

Query:  QNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWIDDQDQ
        +NG  D++KLSK+D   GLD DNGEQSQGSSESISSTDGVP H DVQA+PVA  CQS IKPQY G+EMAYKKTALKLVCGFSFLLFTIFTSLLWIDD DQ
Subjt:  QNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWIDDQDQ

Query:  GSYLVPT
        GSYLVPT
Subjt:  GSYLVPT

A0A6J1DQ15 uncharacterized protein LOC1110231617.2e-30287.68Show/hide
Query:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA  ETQR    KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLTSGAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA
        TQTNSHMGLV TLAKRNPRIVEPYARLLLQAGLRILK G+VEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+IIEEMENCQSDQM YVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTA

Query:  KKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
        K+I ADKGSK+DKSPSSVTGSNFID RRRSPWRNGGSRTPSSES ESQTL+SFFDYGSLVGSP S RQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRH
        SLFS + RG DVSDTMS+ S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSR YINVEDMIFKTPRKLV SLQDLNEANSD+ASKS R  +
Subjt:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRH

Query:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFS
        RSLSSGNLEWSP  SF NQNGFPDDQKLSK+D  GGLD  NGEQSQG SES+SSTDG+P H D+QA PV  A QS +K Q SGI+MAYKKTALKLVCGFS
Subjt:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFS

Query:  FLLFTIFTSLLWIDDQDQGSYLVPT
        FLLFT+FTS L I+DQDQGSYLVPT
Subjt:  FLLFTIFTSLLWIDDQDQGSYLVPT

SwissProt top hitse value%identityAlignment
F4IK92 TORTIFOLIA1-like protein 29.2e-0420.66Show/hide
Query:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQV--SENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMT
        MKA + TQ+         ++     N   D D+ +  +  L   V+ L    +  FL+ +  +++++  A+  EC I L   LAR H   + P + ++++
Subjt:  MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQV--SENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMT

Query:  SIIKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEE
        SI+K L        ++ AC + +  +A      +  +D+   V  SL  PL E++    + + SGAALCL  ++DS      S E    + Q +   +  
Subjt:  SIIKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEE

Query:  KSTQTNSHMGLVMTLAKRNPRIV---EPYARLLLQAGLRILKCGIVEKN-SQKRLSAIQMINFLM---RCLDPWSIFSELQSIIEEMENCQSDQMPYVKG
             NSH      + + N  I+      ++ +L + +   +  +  K+ + ++ +++ ++       + L P        S I  +E+C+ D++  V+ 
Subjt:  KSTQTNSHMGLVMTLAKRNPRIV---EPYARLLLQAGLRILKCGIVEKN-SQKRLSAIQMINFLM---RCLDPWSIFSELQSIIEEMENCQSDQMPYVKG

Query:  AAFETLQTAKKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGG
        +    L+  K +      +  ++ SSV  S +   R  S   +    T   +  +  ++    D  +    P S+RQ       D R  N+  W  E   
Subjt:  AAFETLQTAKKILADKGSKLDKSPSSVTGSNFIDRRRRSPWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGG

Query:  VDISLKDGLSLFSEVARGTDVSDTMS
         + S    + L++E + G+ ++ T +
Subjt:  VDISLKDGLSLFSEVARGTDVSDTMS

Q5XVI1 Protein SINE12.3e-14853.61Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M  NL+P+LR+E ANLDKD +SR+SAMKAL++YVK+LDSKAIP FLAQV E KET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSF
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM
        PLQQACSKV+PAIARYGIDPTT +DKK+ +I+SLC PL++SLL SQESLTSGAALCLKALVDSDNWRFASDEMVN+VCQNV  AL+  S QT+  MGLVM
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM

Query:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKL
        +LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLSA+QM+NFLM+CLDP SI+SE++ II+EME CQSDQM YV+GAA+E + T+K+I A+  SK+
Subjt:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKL

Query:  DKSPSSVTGSNFIDRRRRSPWRNGGSRTPS-SESPESQTLNSFFDYGSLV-GSPFSSRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVA
        +K   SVTGSNF         RN  S  P  S SPESQTL SF  Y S V  SP S    S NS FD RSVNRKLW   ENGG VDISLKDG  LFS V 
Subjt:  DKSPSSVTGSNFIDRRRRSPWRNGGSRTPS-SESPESQTLNSFFDYGSLV-GSPFSSRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVA

Query:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-YINVEDM-IFKTPRKLVQSLQDLNEANSDYASKSSRHRHRSLS
        +G T VSD+  V        ++  E  D+F GF   S       R+TT SP R R   IN ED  IF TPRKL+ SLQ                      
Subjt:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-YINVEDM-IFKTPRKLVQSLQDLNEANSDYASKSSRHRHRSLS

Query:  SGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFSFLLF
                         +PDD  L   D    +     E++ GS ++       P   +  +  +  +  +A     +G +   K +  KLV   SF++ 
Subjt:  SGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFSFLLF

Query:  TIFTSLLWI--DDQDQGSYLVPT
         +F +++ +   D D G Y VPT
Subjt:  TIFTSLLWI--DDQDQGSYLVPT

Q9SQR5 Protein SINE27.8e-8856.65Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M +NL    R+E ANLDKD DS ++AM  LR+ VK+LD+K + VF+AQ+S+ KE G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS 
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL
         +QQACS+ V A+ARYGIDPTTP+DKK +VI+SLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VCQ++A ALE  S++  SHM L
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL

Query:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGS
        VM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL AIQM+NFLM+ L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A++++ +   
Subjt:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGS

Query:  KLD----KSPSSVTGS
          D    K  +S++GS
Subjt:  KLD----KSPSSVTGS

Arabidopsis top hitse value%identityAlignment
AT1G54385.1 ARM repeat superfamily protein1.6e-14953.61Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M  NL+P+LR+E ANLDKD +SR+SAMKAL++YVK+LDSKAIP FLAQV E KET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSF
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM
        PLQQACSKV+PAIARYGIDPTT +DKK+ +I+SLC PL++SLL SQESLTSGAALCLKALVDSDNWRFASDEMVN+VCQNV  AL+  S QT+  MGLVM
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM

Query:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKL
        +LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLSA+QM+NFLM+CLDP SI+SE++ II+EME CQSDQM YV+GAA+E + T+K+I A+  SK+
Subjt:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKL

Query:  DKSPSSVTGSNFIDRRRRSPWRNGGSRTPS-SESPESQTLNSFFDYGSLV-GSPFSSRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVA
        +K   SVTGSNF         RN  S  P  S SPESQTL SF  Y S V  SP S    S NS FD RSVNRKLW   ENGG VDISLKDG  LFS V 
Subjt:  DKSPSSVTGSNFIDRRRRSPWRNGGSRTPS-SESPESQTLNSFFDYGSLV-GSPFSSRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVA

Query:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-YINVEDM-IFKTPRKLVQSLQDLNEANSDYASKSSRHRHRSLS
        +G T VSD+  V        ++  E  D+F GF   S       R+TT SP R R   IN ED  IF TPRKL+ SLQ                      
Subjt:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-YINVEDM-IFKTPRKLVQSLQDLNEANSDYASKSSRHRHRSLS

Query:  SGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFSFLLF
                         +PDD  L   D    +     E++ GS ++       P   +  +  +  +  +A     +G +   K +  KLV   SF++ 
Subjt:  SGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFSFLLF

Query:  TIFTSLLWI--DDQDQGSYLVPT
         +F +++ +   D D G Y VPT
Subjt:  TIFTSLLWI--DDQDQGSYLVPT

AT1G54385.2 ARM repeat superfamily protein1.6e-14953.61Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M  NL+P+LR+E ANLDKD +SR+SAMKAL++YVK+LDSKAIP FLAQV E KET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSF
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM
        PLQQACSKV+PAIARYGIDPTT +DKK+ +I+SLC PL++SLL SQESLTSGAALCLKALVDSDNWRFASDEMVN+VCQNV  AL+  S QT+  MGLVM
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM

Query:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKL
        +LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLSA+QM+NFLM+CLDP SI+SE++ II+EME CQSDQM YV+GAA+E + T+K+I A+  SK+
Subjt:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKL

Query:  DKSPSSVTGSNFIDRRRRSPWRNGGSRTPS-SESPESQTLNSFFDYGSLV-GSPFSSRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVA
        +K   SVTGSNF         RN  S  P  S SPESQTL SF  Y S V  SP S    S NS FD RSVNRKLW   ENGG VDISLKDG  LFS V 
Subjt:  DKSPSSVTGSNFIDRRRRSPWRNGGSRTPS-SESPESQTLNSFFDYGSLV-GSPFSSRQASRNSGFDCRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVA

Query:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-YINVEDM-IFKTPRKLVQSLQDLNEANSDYASKSSRHRHRSLS
        +G T VSD+  V        ++  E  D+F GF   S       R+TT SP R R   IN ED  IF TPRKL+ SLQ                      
Subjt:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-YINVEDM-IFKTPRKLVQSLQDLNEANSDYASKSSRHRHRSLS

Query:  SGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFSFLLF
                         +PDD  L   D    +     E++ GS ++       P   +  +  +  +  +A     +G +   K +  KLV   SF++ 
Subjt:  SGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFSFLLF

Query:  TIFTSLLWI--DDQDQGSYLVPT
         +F +++ +   D D G Y VPT
Subjt:  TIFTSLLWI--DDQDQGSYLVPT

AT3G03970.1 ARM repeat superfamily protein5.6e-8956.65Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M +NL    R+E ANLDKD DS ++AM  LR+ VK+LD+K + VF+AQ+S+ KE G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS 
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL
         +QQACS+ V A+ARYGIDPTTP+DKK +VI+SLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VCQ++A ALE  S++  SHM L
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL

Query:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGS
        VM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL AIQM+NFLM+ L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A++++ +   
Subjt:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGS

Query:  KLD----KSPSSVTGS
          D    K  +S++GS
Subjt:  KLD----KSPSSVTGS

AT3G03970.2 ARM repeat superfamily protein5.6e-8956.65Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M +NL    R+E ANLDKD DS ++AM  LR+ VK+LD+K + VF+AQ+S+ KE G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS 
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL
         +QQACS+ V A+ARYGIDPTTP+DKK +VI+SLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VCQ++A ALE  S++  SHM L
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL

Query:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGS
        VM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL AIQM+NFLM+ L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A++++ +   
Subjt:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGS

Query:  KLD----KSPSSVTGS
          D    K  +S++GS
Subjt:  KLD----KSPSSVTGS

AT3G03970.3 ARM repeat superfamily protein5.6e-8956.65Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M +NL    R+E ANLDKD DS ++AM  LR+ VK+LD+K + VF+AQ+S+ KE G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS 
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL
         +QQACS+ V A+ARYGIDPTTP+DKK +VI+SLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VCQ++A ALE  S++  SHM L
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL

Query:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGS
        VM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL AIQM+NFLM+ L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A++++ +   
Subjt:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGS

Query:  KLD----KSPSSVTGS
          D    K  +S++GS
Subjt:  KLD----KSPSSVTGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGCAGCATCAGAAACTCAAAGGTCTTTTATGAGCAAAAATTTGAGTCCAATGCTTCGGCGGGAGTTTGCTAATCTTGATAAAGATGCTGATAGTCGCAGATCTGC
AATGAAGGCATTGAGAACTTATGTAAAGGAATTAGACTCCAAGGCTATTCCTGTTTTTCTTGCTCAAGTTTCTGAGAACAAAGAAACTGGTGCTTTGAATGGGGAATGTA
CTATTTCTCTCTATGAAGTTCTAGCTCGTGTTCATGGCGTCAATATCGTGCCGCAGATCGATCGGATTATGACTTCTATTATCAAGACTTTGGCTTCAAGTGCTGGCTCT
TTTCCTCTTCAACAAGCTTGCTCTAAAGTTGTTCCGGCGATTGCGAGATACGGGATCGATCCCACCACTCCAGATGATAAGAAGAAGCATGTGATTTACTCTCTTTGTAA
TCCGCTTTCGGAATCTTTGTTAGGTTCTCAAGAGAGCCTCACTTCTGGTGCTGCCCTATGCTTGAAGGCTCTGGTGGATTCGGATAACTGGCGGTTCGCTTCTGATGAGA
TGGTTAACAAGGTTTGCCAGAATGTTGCTGGAGCTCTGGAGGAGAAATCTACACAAACTAATTCACACATGGGGCTTGTTATGACACTAGCTAAGAGAAATCCTCGGATT
GTTGAACCGTATGCTAGATTGTTGTTACAGGCTGGGCTACGAATATTGAAATGTGGGATTGTGGAGAAGAATTCTCAGAAAAGATTGTCTGCTATTCAAATGATTAATTT
CTTGATGAGATGTCTTGATCCTTGGAGTATATTTTCGGAGCTTCAGTCTATAATTGAGGAGATGGAGAATTGTCAGTCTGATCAAATGCCTTATGTCAAAGGTGCAGCAT
TTGAAACTTTGCAAACGGCTAAGAAAATATTGGCTGATAAAGGGTCAAAACTGGACAAATCTCCAAGCTCGGTGACGGGATCAAACTTCATTGATCGCAGGAGAAGAAGT
CCATGGAGGAATGGTGGAAGCCGAACTCCCTCGTCCGAGTCTCCAGAATCCCAGACCCTCAATTCATTCTTCGATTATGGCTCACTTGTAGGATCGCCCTTTTCATCAAG
ACAAGCTTCTCGTAACTCAGGATTCGACTGTAGGAGTGTGAATCGTAAACTTTGGAGTTATGAGAATGGTGGGGTTGATATATCACTCAAGGATGGCTTGTCTTTGTTCT
CAGAAGTCGCTCGTGGAACCGACGTTTCCGACACCATGTCCGTGCACTCTGGAAGTCATAAATTTGGCCATAATGGTGAAGAATATGCAGATGATTTTTCAGGGTTTTTT
CAAATGAGTCCTCCTCGACGCAGACTCTCGAGAAGCACTACAACCAGCCCCCTTCGGAGTCGTGGTTACATAAATGTTGAAGATATGATCTTCAAAACTCCTCGGAAGCT
CGTCCAATCCCTTCAGGATCTAAACGAGGCGAACTCCGACTATGCTAGCAAAAGTAGCAGACATAGGCATAGGAGTTTGTCATCAGGAAATTTGGAATGGAGTCCTCCAA
GATCATTTCTCAATCAAAATGGGTTCCCAGATGATCAGAAACTCAGCAAAGACGACGGAGGAGGCGGCTTAGACAACGATAACGGTGAACAATCACAAGGTAGTTCCGAA
TCGATCTCTTCAACTGATGGTGTCCCTAACCATGGTGATGTGCAAGCTATACCTGTGGCAGCGGCTTGTCAAAGTGCAATCAAACCTCAATATTCTGGCATTGAGATGGC
ATATAAGAAGACTGCTTTGAAATTGGTCTGTGGCTTCTCATTTTTGCTTTTCACAATATTCACTTCGTTGCTATGGATTGATGATCAGGACCAAGGTTCCTATCTTGTTC
CAACATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGCAGCATCAGAAACTCAAAGGTCTTTTATGAGCAAAAATTTGAGTCCAATGCTTCGGCGGGAGTTTGCTAATCTTGATAAAGATGCTGATAGTCGCAGATCTGC
AATGAAGGCATTGAGAACTTATGTAAAGGAATTAGACTCCAAGGCTATTCCTGTTTTTCTTGCTCAAGTTTCTGAGAACAAAGAAACTGGTGCTTTGAATGGGGAATGTA
CTATTTCTCTCTATGAAGTTCTAGCTCGTGTTCATGGCGTCAATATCGTGCCGCAGATCGATCGGATTATGACTTCTATTATCAAGACTTTGGCTTCAAGTGCTGGCTCT
TTTCCTCTTCAACAAGCTTGCTCTAAAGTTGTTCCGGCGATTGCGAGATACGGGATCGATCCCACCACTCCAGATGATAAGAAGAAGCATGTGATTTACTCTCTTTGTAA
TCCGCTTTCGGAATCTTTGTTAGGTTCTCAAGAGAGCCTCACTTCTGGTGCTGCCCTATGCTTGAAGGCTCTGGTGGATTCGGATAACTGGCGGTTCGCTTCTGATGAGA
TGGTTAACAAGGTTTGCCAGAATGTTGCTGGAGCTCTGGAGGAGAAATCTACACAAACTAATTCACACATGGGGCTTGTTATGACACTAGCTAAGAGAAATCCTCGGATT
GTTGAACCGTATGCTAGATTGTTGTTACAGGCTGGGCTACGAATATTGAAATGTGGGATTGTGGAGAAGAATTCTCAGAAAAGATTGTCTGCTATTCAAATGATTAATTT
CTTGATGAGATGTCTTGATCCTTGGAGTATATTTTCGGAGCTTCAGTCTATAATTGAGGAGATGGAGAATTGTCAGTCTGATCAAATGCCTTATGTCAAAGGTGCAGCAT
TTGAAACTTTGCAAACGGCTAAGAAAATATTGGCTGATAAAGGGTCAAAACTGGACAAATCTCCAAGCTCGGTGACGGGATCAAACTTCATTGATCGCAGGAGAAGAAGT
CCATGGAGGAATGGTGGAAGCCGAACTCCCTCGTCCGAGTCTCCAGAATCCCAGACCCTCAATTCATTCTTCGATTATGGCTCACTTGTAGGATCGCCCTTTTCATCAAG
ACAAGCTTCTCGTAACTCAGGATTCGACTGTAGGAGTGTGAATCGTAAACTTTGGAGTTATGAGAATGGTGGGGTTGATATATCACTCAAGGATGGCTTGTCTTTGTTCT
CAGAAGTCGCTCGTGGAACCGACGTTTCCGACACCATGTCCGTGCACTCTGGAAGTCATAAATTTGGCCATAATGGTGAAGAATATGCAGATGATTTTTCAGGGTTTTTT
CAAATGAGTCCTCCTCGACGCAGACTCTCGAGAAGCACTACAACCAGCCCCCTTCGGAGTCGTGGTTACATAAATGTTGAAGATATGATCTTCAAAACTCCTCGGAAGCT
CGTCCAATCCCTTCAGGATCTAAACGAGGCGAACTCCGACTATGCTAGCAAAAGTAGCAGACATAGGCATAGGAGTTTGTCATCAGGAAATTTGGAATGGAGTCCTCCAA
GATCATTTCTCAATCAAAATGGGTTCCCAGATGATCAGAAACTCAGCAAAGACGACGGAGGAGGCGGCTTAGACAACGATAACGGTGAACAATCACAAGGTAGTTCCGAA
TCGATCTCTTCAACTGATGGTGTCCCTAACCATGGTGATGTGCAAGCTATACCTGTGGCAGCGGCTTGTCAAAGTGCAATCAAACCTCAATATTCTGGCATTGAGATGGC
ATATAAGAAGACTGCTTTGAAATTGGTCTGTGGCTTCTCATTTTTGCTTTTCACAATATTCACTTCGTTGCTATGGATTGATGATCAGGACCAAGGTTCCTATCTTGTTC
CAACATAA
Protein sequenceShow/hide protein sequence
MKAASETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGS
FPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNPRI
VEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKLDKSPSSVTGSNFIDRRRRS
PWRNGGSRTPSSESPESQTLNSFFDYGSLVGSPFSSRQASRNSGFDCRSVNRKLWSYENGGVDISLKDGLSLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFF
QMSPPRRRLSRSTTTSPLRSRGYINVEDMIFKTPRKLVQSLQDLNEANSDYASKSSRHRHRSLSSGNLEWSPPRSFLNQNGFPDDQKLSKDDGGGGLDNDNGEQSQGSSE
SISSTDGVPNHGDVQAIPVAAACQSAIKPQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWIDDQDQGSYLVPT