; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG04G005280 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG04G005280
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionARM repeat superfamily protein
Genome locationCG_Chr04:18938401..18940940
RNA-Seq ExpressionClCG04G005280
SyntenyClCG04G005280
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0032797 - SMN complex (cellular component)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK08426.1 ARM repeat superfamily protein [Cucumis melo var. makuwa]0.0e+0094.72Show/hide
Query:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
        MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
Subjt:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS

Query:  KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNP
        KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM+LAKRNP
Subjt:  KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNP

Query:  RIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKMDKSPSSV
        RIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSII+EMENCQSDQMPYVKGAAFETLQTAKKILADKGSKMDKSPSSV
Subjt:  RIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKMDKSPSSV

Query:  TGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLSLFSEVARGTDVSDTMSV
        TGSNFID RRRSPWRN GSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNS FDRRSVNRKLWSYENGGVDISLKDGLSLFSEV RGTDVSDTMS+
Subjt:  TGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLSLFSEVARGTDVSDTMSV

Query:  HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRHRSLSSGNLEWSPPRSFLN
        HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR +I VEDMI+KTPRKLV SLQDLNE NSDYAS SSRRRHRSLSSGNLEWSPPR+FLN
Subjt:  HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRHRSLSSGNLEWSPPRSFLN

Query:  QNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWIDDQDQG
        +NG  D++KLSKEDE GLD DNGEQSQGSSESISSTDGVP H DVQA+P AV CQSKIKPQY G+EMAYKKTALKLVCGFSFLLFTIFTSLLWIDD DQG
Subjt:  QNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWIDDQDQG

Query:  SYLVPT
        SYLVPT
Subjt:  SYLVPT

XP_004147557.1 protein SINE1 [Cucumis sativus]0.0e+0094.39Show/hide
Query:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKAL+TYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSII+EMENCQSDQMPYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTA

Query:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSKMDKSPSSVTGSNF+D RRRSPWRN GSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRH
        SLFSEV RGTDVSDTMS++SGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR +INVEDMI+KTPRKLV SLQDLNEG SDYAS SSR RH
Subjt:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRH

Query:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWSPPR+FLNQNGF D+ KLSKEDE GL N NGEQSQGS ESISS DG P H DVQAIP AVACQSK+KPQY G+EMAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWIDDQDQGSYLVPT
        LLFTIFTSLLWIDD DQGSYLVPT
Subjt:  LLFTIFTSLLWIDDQDQGSYLVPT

XP_008441975.1 PREDICTED: uncharacterized protein LOC103485976 [Cucumis melo]0.0e+0094.71Show/hide
Query:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTA
        TQTNSHMGLVM+LAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSII+EMENCQSDQMPYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTA

Query:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSKMDKSPSSVTGSNFID RRRSPWRN GSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNS FDRRSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRH
        SLFSEV RGTDVSDTMS+HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR +I VEDMI+KTPRKLV SLQDLNE NSDYAS SSRRRH
Subjt:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRH

Query:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWSPPR+FLN+NG  D++KLSKEDE GLD DNGEQSQGSSESISSTDGVP H DVQA+P AV CQSKIKPQY G+EMAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWIDDQDQGSYLVPT
        LLFTIFTSLLWIDD DQGSYLVPT
Subjt:  LLFTIFTSLLWIDDQDQGSYLVPT

XP_022156223.1 uncharacterized protein LOC111023161 [Momordica charantia]8.6e-30287.66Show/hide
Query:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKAT ETQR    KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLTSGAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTA
        TQTNSHMGLV TLAKRNPRIVEPYARLLLQAGLRILK G+VEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+II+EMENCQSDQM YVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTA

Query:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        K+I ADKGSKMDKSPSSVTGSNFID RRRSPWRN GSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNSGFD RSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRH
        SLFS + RG DVSDTMS+ S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSR +INVEDMI+KTPRKLV SLQDLNE NSD+ASKS RR +
Subjt:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRH

Query:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWSP  SF NQNGFPDDQKLSKED GGLD  NGEQSQG SES+SSTDG+P H D+QA P  VA QS +K Q SGI+MAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWIDDQDQGSYLVPT
        LLFT+FTS L I+DQDQGSYLVPT
Subjt:  LLFTIFTSLLWIDDQDQGSYLVPT

XP_038883420.1 protein SINE1 [Benincasa hispida]0.0e+0095.35Show/hide
Query:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFM+KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDE+VNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSII+EMENCQSDQMPYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTA

Query:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSKMDKSPSSVTGSNFIDR RRSPWRN GSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRH
        SLFS++ RGTDVSDTMSVHSGSHK GHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG+INVEDMI+KTPRKLVQSLQDLNE NS+Y SKSSRRRH
Subjt:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRH

Query:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWSPPRSFLNQ  FPDDQK SKED GGLDND  EQSQGSSESISS+DGVP HGDV+AIP AVACQSKIKPQYSG+EMAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWIDDQDQGSYLVPT
        LLFTIFTSLLWIDD DQGSYLVPT
Subjt:  LLFTIFTSLLWIDDQDQGSYLVPT

TrEMBL top hitse value%identityAlignment
A0A0A0KYP2 Uncharacterized protein0.0e+0094.39Show/hide
Query:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKAL+TYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSII+EMENCQSDQMPYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTA

Query:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSKMDKSPSSVTGSNF+D RRRSPWRN GSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRH
        SLFSEV RGTDVSDTMS++SGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR +INVEDMI+KTPRKLV SLQDLNEG SDYAS SSR RH
Subjt:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRH

Query:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWSPPR+FLNQNGF D+ KLSKEDE GL N NGEQSQGS ESISS DG P H DVQAIP AVACQSK+KPQY G+EMAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWIDDQDQGSYLVPT
        LLFTIFTSLLWIDD DQGSYLVPT
Subjt:  LLFTIFTSLLWIDDQDQGSYLVPT

A0A1S3B5D3 uncharacterized protein LOC1034859760.0e+0094.71Show/hide
Query:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTA
        TQTNSHMGLVM+LAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSII+EMENCQSDQMPYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTA

Query:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSKMDKSPSSVTGSNFID RRRSPWRN GSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNS FDRRSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRH
        SLFSEV RGTDVSDTMS+HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR +I VEDMI+KTPRKLV SLQDLNE NSDYAS SSRRRH
Subjt:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRH

Query:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWSPPR+FLN+NG  D++KLSKEDE GLD DNGEQSQGSSESISSTDGVP H DVQA+P AV CQSKIKPQY G+EMAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWIDDQDQGSYLVPT
        LLFTIFTSLLWIDD DQGSYLVPT
Subjt:  LLFTIFTSLLWIDDQDQGSYLVPT

A0A5A7UWA1 ARM repeat superfamily protein0.0e+0094.71Show/hide
Query:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTA
        TQTNSHMGLVM+LAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSII+EMENCQSDQMPYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTA

Query:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSKMDKSPSSVTGSNFID RRRSPWRN GSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNS FDRRSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRH
        SLFSEV RGTDVSDTMS+HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR +I VEDMI+KTPRKLV SLQDLNE NSDYAS SSRRRH
Subjt:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRH

Query:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWSPPR+FLN+NG  D++KLSKEDE GLD DNGEQSQGSSESISSTDGVP H DVQA+P AV CQSKIKPQY G+EMAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWIDDQDQGSYLVPT
        LLFTIFTSLLWIDD DQGSYLVPT
Subjt:  LLFTIFTSLLWIDDQDQGSYLVPT

A0A5D3CDJ7 ARM repeat superfamily protein0.0e+0094.72Show/hide
Query:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
        MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
Subjt:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS

Query:  KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNP
        KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM+LAKRNP
Subjt:  KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNP

Query:  RIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKMDKSPSSV
        RIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSII+EMENCQSDQMPYVKGAAFETLQTAKKILADKGSKMDKSPSSV
Subjt:  RIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKMDKSPSSV

Query:  TGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLSLFSEVARGTDVSDTMSV
        TGSNFID RRRSPWRN GSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNS FDRRSVNRKLWSYENGGVDISLKDGLSLFSEV RGTDVSDTMS+
Subjt:  TGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLSLFSEVARGTDVSDTMSV

Query:  HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRHRSLSSGNLEWSPPRSFLN
        HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR +I VEDMI+KTPRKLV SLQDLNE NSDYAS SSRRRHRSLSSGNLEWSPPR+FLN
Subjt:  HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRHRSLSSGNLEWSPPRSFLN

Query:  QNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWIDDQDQG
        +NG  D++KLSKEDE GLD DNGEQSQGSSESISSTDGVP H DVQA+P AV CQSKIKPQY G+EMAYKKTALKLVCGFSFLLFTIFTSLLWIDD DQG
Subjt:  QNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWIDDQDQG

Query:  SYLVPT
        SYLVPT
Subjt:  SYLVPT

A0A6J1DQ15 uncharacterized protein LOC1110231614.2e-30287.66Show/hide
Query:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKAT ETQR    KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLTSGAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTA
        TQTNSHMGLV TLAKRNPRIVEPYARLLLQAGLRILK G+VEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+II+EMENCQSDQM YVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTA

Query:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        K+I ADKGSKMDKSPSSVTGSNFID RRRSPWRN GSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNSGFD RSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRH
        SLFS + RG DVSDTMS+ S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSR +INVEDMI+KTPRKLV SLQDLNE NSD+ASKS RR +
Subjt:  SLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRH

Query:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWSP  SF NQNGFPDDQKLSKED GGLD  NGEQSQG SES+SSTDG+P H D+QA P  VA QS +K Q SGI+MAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPPRSFLNQNGFPDDQKLSKEDEGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWIDDQDQGSYLVPT
        LLFT+FTS L I+DQDQGSYLVPT
Subjt:  LLFTIFTSLLWIDDQDQGSYLVPT

SwissProt top hitse value%identityAlignment
F4IK92 TORTIFOLIA1-like protein 27.0e-0420.66Show/hide
Query:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQV--SENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMT
        MKA + TQ+         ++     N   D D+ +  +  L   V+ L    +  FL+ +  +++++  A+  EC I L   LAR H   + P + ++++
Subjt:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQV--SENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMT

Query:  SIIKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEE
        SI+K L        ++ AC + +  +A      +  +D+   V  SL  PL E++    + + SGAALCL  ++DS      S E    + Q +   +  
Subjt:  SIIKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEE

Query:  KSTQTNSHMGLVMTLAKRNPRIV---EPYARLLLQAGLRILKCGIVEKN-SQKRLSAIQMINFLM---RCLDPWSIFSELQSIIDEMENCQSDQMPYVKG
             NSH      + + N  I+      ++ +L + +   +  +  K+ + ++ +++ ++       + L P        S I  +E+C+ D++  V+ 
Subjt:  KSTQTNSHMGLVMTLAKRNPRIV---EPYARLLLQAGLRILKCGIVEKN-SQKRLSAIQMINFLM---RCLDPWSIFSELQSIIDEMENCQSDQMPYVKG

Query:  AAFETLQTAKKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGG
        +    L+  K +      +  ++ SSV  S   +  R S   ++   T   +  +  ++    D  +    P S+RQ       D R  N+  W  E   
Subjt:  AAFETLQTAKKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGG

Query:  VDISLKDGLSLFSEVARGTDVSDTMS
         + S    + L++E + G+ ++ T +
Subjt:  VDISLKDGLSLFSEVARGTDVSDTMS

Q5XVI1 Protein SINE16.7e-14853.77Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M  NL+P+LR+E ANLDKD +SR+SAMKAL++YVK+LDSKAIP FLAQV E KET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSF
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM
        PLQQACSKV+PAIARYGIDPTT +DKK+ +I+SLC PL++SLL SQESLTSGAALCLKALVDSDNWRFASDEMVN+VCQNV  AL+  S QT+  MGLVM
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM

Query:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKM
        +LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLSA+QM+NFLM+CLDP SI+SE++ II EME CQSDQM YV+GAA+E + T+K+I A+  SKM
Subjt:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKM

Query:  DKSPSSVTGSNFIDRRRRSPWRNDGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVA
        +K   SVTGSNF         RN  S  P  S SPESQTL SF  Y S V  SP S    S NS FDRRSVNRKLW   ENGG VDISLKDG  LFS V 
Subjt:  DKSPSSVTGSNFIDRRRRSPWRNDGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVA

Query:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-FINVEDM-IYKTPRKLVQSLQDLNEGNSDYASKSSRRRHRSLS
        +G T VSD+  V        ++  E  D+F GF   S       R+TT SP R R   IN ED  I+ TPRKL+ SLQ                      
Subjt:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-FINVEDM-IYKTPRKLVQSLQDLNEGNSDYASKSSRRRHRSLS

Query:  SGNLEWSPPRSFLNQNGFPDDQKLSKED-EGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSFLLF
                         +PDD  L   D +  +     E++ GS ++       P   +  +    V+  +      +G +   K +  KLV   SF++ 
Subjt:  SGNLEWSPPRSFLNQNGFPDDQKLSKED-EGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSFLLF

Query:  TIFTSLLWI--DDQDQGSYLVPT
         +F +++ +   D D G Y VPT
Subjt:  TIFTSLLWI--DDQDQGSYLVPT

Q9SQR5 Protein SINE21.3e-8756.65Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M +NL    R+E ANLDKD DS ++AM  LR+ VK+LD+K + VF+AQ+S+ KE G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS 
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL
         +QQACS+ V A+ARYGIDPTTP+DKK +VI+SLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VCQ++A ALE  S++  SHM L
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL

Query:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGS
        VM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL AIQM+NFLM+ L+P SI SEL+ I  EME  Q DQ  YVK AA ET++ A++++ +   
Subjt:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGS

Query:  KMD----KSPSSVTGS
          D    K  +S++GS
Subjt:  KMD----KSPSSVTGS

Arabidopsis top hitse value%identityAlignment
AT1G54385.1 ARM repeat superfamily protein4.8e-14953.77Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M  NL+P+LR+E ANLDKD +SR+SAMKAL++YVK+LDSKAIP FLAQV E KET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSF
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM
        PLQQACSKV+PAIARYGIDPTT +DKK+ +I+SLC PL++SLL SQESLTSGAALCLKALVDSDNWRFASDEMVN+VCQNV  AL+  S QT+  MGLVM
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM

Query:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKM
        +LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLSA+QM+NFLM+CLDP SI+SE++ II EME CQSDQM YV+GAA+E + T+K+I A+  SKM
Subjt:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKM

Query:  DKSPSSVTGSNFIDRRRRSPWRNDGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVA
        +K   SVTGSNF         RN  S  P  S SPESQTL SF  Y S V  SP S    S NS FDRRSVNRKLW   ENGG VDISLKDG  LFS V 
Subjt:  DKSPSSVTGSNFIDRRRRSPWRNDGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVA

Query:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-FINVEDM-IYKTPRKLVQSLQDLNEGNSDYASKSSRRRHRSLS
        +G T VSD+  V        ++  E  D+F GF   S       R+TT SP R R   IN ED  I+ TPRKL+ SLQ                      
Subjt:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-FINVEDM-IYKTPRKLVQSLQDLNEGNSDYASKSSRRRHRSLS

Query:  SGNLEWSPPRSFLNQNGFPDDQKLSKED-EGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSFLLF
                         +PDD  L   D +  +     E++ GS ++       P   +  +    V+  +      +G +   K +  KLV   SF++ 
Subjt:  SGNLEWSPPRSFLNQNGFPDDQKLSKED-EGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSFLLF

Query:  TIFTSLLWI--DDQDQGSYLVPT
         +F +++ +   D D G Y VPT
Subjt:  TIFTSLLWI--DDQDQGSYLVPT

AT1G54385.2 ARM repeat superfamily protein4.8e-14953.77Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M  NL+P+LR+E ANLDKD +SR+SAMKAL++YVK+LDSKAIP FLAQV E KET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSF
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM
        PLQQACSKV+PAIARYGIDPTT +DKK+ +I+SLC PL++SLL SQESLTSGAALCLKALVDSDNWRFASDEMVN+VCQNV  AL+  S QT+  MGLVM
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM

Query:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKM
        +LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLSA+QM+NFLM+CLDP SI+SE++ II EME CQSDQM YV+GAA+E + T+K+I A+  SKM
Subjt:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKM

Query:  DKSPSSVTGSNFIDRRRRSPWRNDGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVA
        +K   SVTGSNF         RN  S  P  S SPESQTL SF  Y S V  SP S    S NS FDRRSVNRKLW   ENGG VDISLKDG  LFS V 
Subjt:  DKSPSSVTGSNFIDRRRRSPWRNDGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVA

Query:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-FINVEDM-IYKTPRKLVQSLQDLNEGNSDYASKSSRRRHRSLS
        +G T VSD+  V        ++  E  D+F GF   S       R+TT SP R R   IN ED  I+ TPRKL+ SLQ                      
Subjt:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-FINVEDM-IYKTPRKLVQSLQDLNEGNSDYASKSSRRRHRSLS

Query:  SGNLEWSPPRSFLNQNGFPDDQKLSKED-EGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSFLLF
                         +PDD  L   D +  +     E++ GS ++       P   +  +    V+  +      +G +   K +  KLV   SF++ 
Subjt:  SGNLEWSPPRSFLNQNGFPDDQKLSKED-EGGLDNDNGEQSQGSSESISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSFLLF

Query:  TIFTSLLWI--DDQDQGSYLVPT
         +F +++ +   D D G Y VPT
Subjt:  TIFTSLLWI--DDQDQGSYLVPT

AT3G03970.1 ARM repeat superfamily protein9.5e-8956.65Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M +NL    R+E ANLDKD DS ++AM  LR+ VK+LD+K + VF+AQ+S+ KE G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS 
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL
         +QQACS+ V A+ARYGIDPTTP+DKK +VI+SLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VCQ++A ALE  S++  SHM L
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL

Query:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGS
        VM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL AIQM+NFLM+ L+P SI SEL+ I  EME  Q DQ  YVK AA ET++ A++++ +   
Subjt:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGS

Query:  KMD----KSPSSVTGS
          D    K  +S++GS
Subjt:  KMD----KSPSSVTGS

AT3G03970.2 ARM repeat superfamily protein9.5e-8956.65Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M +NL    R+E ANLDKD DS ++AM  LR+ VK+LD+K + VF+AQ+S+ KE G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS 
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL
         +QQACS+ V A+ARYGIDPTTP+DKK +VI+SLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VCQ++A ALE  S++  SHM L
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL

Query:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGS
        VM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL AIQM+NFLM+ L+P SI SEL+ I  EME  Q DQ  YVK AA ET++ A++++ +   
Subjt:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGS

Query:  KMD----KSPSSVTGS
          D    K  +S++GS
Subjt:  KMD----KSPSSVTGS

AT3G03970.3 ARM repeat superfamily protein9.5e-8956.65Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M +NL    R+E ANLDKD DS ++AM  LR+ VK+LD+K + VF+AQ+S+ KE G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS 
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL
         +QQACS+ V A+ARYGIDPTTP+DKK +VI+SLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VCQ++A ALE  S++  SHM L
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL

Query:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGS
        VM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL AIQM+NFLM+ L+P SI SEL+ I  EME  Q DQ  YVK AA ET++ A++++ +   
Subjt:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGS

Query:  KMD----KSPSSVTGS
          D    K  +S++GS
Subjt:  KMD----KSPSSVTGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGCAACCTCAGAAACTCAAAGGTCTTTTATGAGCAAAAATTTGAGTCCAATGCTTCGGCGGGAGTTTGCTAATCTTGATAAAGATGCCGATAGTCGTAGATCTGC
GATGAAGGCATTGAGAACTTATGTGAAGGAATTAGACTCCAAGGCTATCCCTGTTTTTCTTGCCCAAGTTTCTGAGAATAAAGAAACTGGTGCTTTGAACGGGGAATGTA
CCATTTCTCTCTATGAAGTTCTAGCTCGTGTTCATGGCGTCAATATCGTGCCACAGATCGATCGGATTATGACTTCTATTATCAAGACTTTGGCTTCAAGTGCTGGCTCT
TTCCCTCTTCAACAAGCTTGTTCCAAAGTTGTTCCGGCGATTGCGAGATATGGGATTGATCCCACCACTCCTGACGATAAGAAGAAGCATGTGATTTACTCTCTTTGTAA
TCCGCTTTCGGAATCTTTGTTGGGTTCTCAAGAGAGCCTCACTTCTGGTGCTGCCCTATGCTTGAAGGCTCTTGTCGATTCTGATAACTGGCGGTTCGCTTCTGATGAGA
TGGTTAACAAGGTTTGCCAGAATGTTGCTGGAGCTTTGGAGGAGAAATCTACACAAACCAATTCACATATGGGGCTCGTTATGACTCTAGCTAAGCGGAATCCTCGGATT
GTCGAACCGTATGCTAGATTGTTACTACAGGCTGGGCTGCGGATATTGAAGTGTGGGATTGTGGAGAAGAATTCTCAGAAAAGATTGTCTGCTATTCAAATGATTAATTT
CTTGATGAGATGTCTAGATCCTTGGAGTATATTTTCGGAGCTTCAGTCTATAATTGATGAGATGGAGAATTGTCAGTCTGATCAAATGCCTTATGTCAAAGGTGCCGCTT
TTGAAACTTTGCAAACGGCTAAGAAAATATTGGCTGATAAAGGGTCGAAAATGGACAAATCTCCAAGCTCGGTGACGGGATCAAACTTCATTGATCGCAGGAGGAGAAGT
CCATGGAGAAATGATGGAAGCCGAACTCCCTCGTCCGAGTCCCCAGAATCCCAGACCCTTGATTCATTCTTTGATTATGGCTCACTTGTAGGATCACCCTTTTCATCAAG
ACAAGCTTCTCGTAACTCAGGATTCGACCGAAGGAGTGTGAATCGTAAACTTTGGAGTTATGAGAATGGTGGGGTTGATATATCCCTCAAGGATGGCTTGTCTTTGTTCT
CGGAAGTCGCTCGTGGAACCGATGTTTCCGACACCATGTCCGTGCACTCTGGAAGTCACAAATTTGGCCATAATGGTGAAGAATATGCTGATGATTTTTCAGGGTTTTTT
CAAATGAGTCCTCCTCGACGCAGACTCTCAAGAAGCACTACAACCAGCCCCCTTCGGAGTCGTGGTTTCATAAACGTTGAAGATATGATCTACAAAACTCCTCGGAAGCT
CGTCCAATCCCTTCAGGATCTAAACGAGGGGAACTCCGACTATGCTAGCAAAAGTAGCAGACGTAGGCATAGGAGTTTGTCATCAGGCAATTTGGAGTGGAGTCCTCCAA
GGTCATTTCTAAATCAAAATGGGTTCCCAGATGATCAGAAACTCAGCAAAGAGGATGAAGGCGGCTTAGACAACGATAACGGTGAACAATCACAAGGTAGCTCCGAATCG
ATCTCTTCAACTGATGGTGTCCCTAACCATGGTGATGTCCAAGCTATACCTGCGGCAGTGGCTTGTCAAAGTAAAATCAAACCTCAATATTCTGGCATTGAGATGGCATA
TAAGAAGACTGCTTTGAAATTGGTTTGTGGCTTCTCATTTTTGCTTTTCACAATATTCACATCATTGCTATGGATTGATGATCAGGACCAAGGTTCCTATCTTGTACCAA
CATAA
mRNA sequenceShow/hide mRNA sequence
CTCCACGCCATGAAAGCAACCTCAGAAACTCAAAGGTCTTTTATGAGCAAAAATTTGAGTCCAATGCTTCGGCGGGAGTTTGCTAATCTTGATAAAGATGCCGATAGTCG
TAGATCTGCGATGAAGGCATTGAGAACTTATGTGAAGGAATTAGACTCCAAGGCTATCCCTGTTTTTCTTGCCCAAGTTTCTGAGAATAAAGAAACTGGTGCTTTGAACG
GGGAATGTACCATTTCTCTCTATGAAGTTCTAGCTCGTGTTCATGGCGTCAATATCGTGCCACAGATCGATCGGATTATGACTTCTATTATCAAGACTTTGGCTTCAAGT
GCTGGCTCTTTCCCTCTTCAACAAGCTTGTTCCAAAGTTGTTCCGGCGATTGCGAGATATGGGATTGATCCCACCACTCCTGACGATAAGAAGAAGCATGTGATTTACTC
TCTTTGTAATCCGCTTTCGGAATCTTTGTTGGGTTCTCAAGAGAGCCTCACTTCTGGTGCTGCCCTATGCTTGAAGGCTCTTGTCGATTCTGATAACTGGCGGTTCGCTT
CTGATGAGATGGTTAACAAGGTTTGCCAGAATGTTGCTGGAGCTTTGGAGGAGAAATCTACACAAACCAATTCACATATGGGGCTCGTTATGACTCTAGCTAAGCGGAAT
CCTCGGATTGTCGAACCGTATGCTAGATTGTTACTACAGGCTGGGCTGCGGATATTGAAGTGTGGGATTGTGGAGAAGAATTCTCAGAAAAGATTGTCTGCTATTCAAAT
GATTAATTTCTTGATGAGATGTCTAGATCCTTGGAGTATATTTTCGGAGCTTCAGTCTATAATTGATGAGATGGAGAATTGTCAGTCTGATCAAATGCCTTATGTCAAAG
GTGCCGCTTTTGAAACTTTGCAAACGGCTAAGAAAATATTGGCTGATAAAGGGTCGAAAATGGACAAATCTCCAAGCTCGGTGACGGGATCAAACTTCATTGATCGCAGG
AGGAGAAGTCCATGGAGAAATGATGGAAGCCGAACTCCCTCGTCCGAGTCCCCAGAATCCCAGACCCTTGATTCATTCTTTGATTATGGCTCACTTGTAGGATCACCCTT
TTCATCAAGACAAGCTTCTCGTAACTCAGGATTCGACCGAAGGAGTGTGAATCGTAAACTTTGGAGTTATGAGAATGGTGGGGTTGATATATCCCTCAAGGATGGCTTGT
CTTTGTTCTCGGAAGTCGCTCGTGGAACCGATGTTTCCGACACCATGTCCGTGCACTCTGGAAGTCACAAATTTGGCCATAATGGTGAAGAATATGCTGATGATTTTTCA
GGGTTTTTTCAAATGAGTCCTCCTCGACGCAGACTCTCAAGAAGCACTACAACCAGCCCCCTTCGGAGTCGTGGTTTCATAAACGTTGAAGATATGATCTACAAAACTCC
TCGGAAGCTCGTCCAATCCCTTCAGGATCTAAACGAGGGGAACTCCGACTATGCTAGCAAAAGTAGCAGACGTAGGCATAGGAGTTTGTCATCAGGCAATTTGGAGTGGA
GTCCTCCAAGGTCATTTCTAAATCAAAATGGGTTCCCAGATGATCAGAAACTCAGCAAAGAGGATGAAGGCGGCTTAGACAACGATAACGGTGAACAATCACAAGGTAGC
TCCGAATCGATCTCTTCAACTGATGGTGTCCCTAACCATGGTGATGTCCAAGCTATACCTGCGGCAGTGGCTTGTCAAAGTAAAATCAAACCTCAATATTCTGGCATTGA
GATGGCATATAAGAAGACTGCTTTGAAATTGGTTTGTGGCTTCTCATTTTTGCTTTTCACAATATTCACATCATTGCTATGGATTGATGATCAGGACCAAGGTTCCTATC
TTGTACCAACATAATGTTCTTGTGCTTCACCTGAAATAGGGTTAAACTGGTTGTTGTAAGTTTTGTTGAGTTCAAATGTTGTGTGTCAAGTAGCAAGAAGGCATATAGAA
GTTAGAAGCAAAAGCAAATCTGTTAGAAGGATTTGGAAAATGATTCAAAGTAAACATTTCAGTGTGGTCAATGAACTTCTTGACTCCAATAGGTATCTCTTACCTTAGCA
AAAATTATAGAAT
Protein sequenceShow/hide protein sequence
MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGS
FPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNPRI
VEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAFETLQTAKKILADKGSKMDKSPSSVTGSNFIDRRRRS
PWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLSLFSEVARGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFF
QMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASKSSRRRHRSLSSGNLEWSPPRSFLNQNGFPDDQKLSKEDEGGLDNDNGEQSQGSSES
ISSTDGVPNHGDVQAIPAAVACQSKIKPQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWIDDQDQGSYLVPT