; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC04G064010 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC04G064010
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionARM repeat superfamily protein
Genome locationCicolChr04:20325671..20344973
RNA-Seq ExpressionCcUC04G064010
SyntenyCcUC04G064010
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0032797 - SMN complex (cellular component)
InterPro domainsIPR001965 - Zinc finger, PHD-type
IPR011011 - Zinc finger, FYVE/PHD-type
IPR011989 - Armadillo-like helical
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR016024 - Armadillo-type fold
IPR024610 - Inhibitor of growth protein, N-terminal histone-binding


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK08426.1 ARM repeat superfamily protein [Cucumis melo var. makuwa]0.0e+0094.55Show/hide
Query:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
        MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
Subjt:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS

Query:  KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNP
        KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM+LAKRNP
Subjt:  KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNP

Query:  RIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTAKKILADKGSKMDKSPSSV
        RIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSII+EMENCQSDQMPYVKGAA+ETLQTAKKILADKGSKMDKSPSSV
Subjt:  RIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTAKKILADKGSKMDKSPSSV

Query:  TGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLSLFSEVTRGTDVSDTMSV
        TGSNFID RRRSPWRN GSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNS FDRRSVNRKLWSYENGGVDISLKDGLSLFSEVTRGTDVSDTMS+
Subjt:  TGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLSLFSEVTRGTDVSDTMSV

Query:  HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRHRSLSSGNLEWSPPRSFLN
        HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR +I VEDMI+KTPRKLV SLQDLNE NSDYAS SSRRRHRSLSSGNLEWSPPR+FLN
Subjt:  HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRHRSLSSGNLEWSPPRSFLN

Query:  QNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWIDDQDQG
        +N   D++KLSKEDE GLD DNGEQSQGSSESISSTDGVP H DVQA+PVAV CQSKIKPQY G+EMAYKKTALKLVCGFSFLLFTIFTSLLWIDD DQG
Subjt:  QNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWIDDQDQG

Query:  SYLIP
        SYL+P
Subjt:  SYLIP

XP_004147557.1 protein SINE1 [Cucumis sativus]0.0e+0093.74Show/hide
Query:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKAL+TYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSII+EMENCQSDQMPYVKGAA+ETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTA

Query:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSKMDKSPSSVTGSNF+D RRRSPWRN GSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVTRGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRH
        SLFSEVTRGTDVSDTMS++SGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR +INVEDMI+KTPRKLV SLQDLNEG SDYAS SSR RH
Subjt:  SLFSEVTRGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRH

Query:  RSLSSGNLEWSPPRSFLNQNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWSPPR+FLNQN   D+ KLSKEDE GL + NGEQSQGS ESISS DG P H DVQAIPVAV CQSK+KPQY G+EMAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPPRSFLNQNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWIDDQDQGSYLIP
        LLFTIFTSLLWIDD DQGSYL+P
Subjt:  LLFTIFTSLLWIDDQDQGSYLIP

XP_008441975.1 PREDICTED: uncharacterized protein LOC103485976 [Cucumis melo]0.0e+0094.54Show/hide
Query:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTA
        TQTNSHMGLVM+LAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSII+EMENCQSDQMPYVKGAA+ETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTA

Query:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSKMDKSPSSVTGSNFID RRRSPWRN GSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNS FDRRSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVTRGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRH
        SLFSEVTRGTDVSDTMS+HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR +I VEDMI+KTPRKLV SLQDLNE NSDYAS SSRRRH
Subjt:  SLFSEVTRGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRH

Query:  RSLSSGNLEWSPPRSFLNQNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWSPPR+FLN+N   D++KLSKEDE GLD DNGEQSQGSSESISSTDGVP H DVQA+PVAV CQSKIKPQY G+EMAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPPRSFLNQNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWIDDQDQGSYLIP
        LLFTIFTSLLWIDD DQGSYL+P
Subjt:  LLFTIFTSLLWIDDQDQGSYLIP

XP_022156223.1 uncharacterized protein LOC111023161 [Momordica charantia]1.8e-29987Show/hide
Query:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKAT ETQR    KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLTSGAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTA
        TQTNSHMGLV TLAKRNPRIVEPYARLLLQAGLRILK G+VEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+II+EMENCQSDQM YVKGAA+ETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTA

Query:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        K+I ADKGSKMDKSPSSVTGSNFID RRRSPWRN GSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNSGFD RSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVTRGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRH
        SLFS +TRG DVSDTMS+ S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSR +INVEDMI+KTPRKLV SLQDLNE NSD+AS+S RR +
Subjt:  SLFSEVTRGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRH

Query:  RSLSSGNLEWSPPRSFLNQNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWSP  SF NQN  PDDQKLSKED GGLD  NGEQSQG SES+SSTDG+P H D+QA PV V  QS +K Q SGI+MAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPPRSFLNQNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWIDDQDQGSYLIP
        LLFT+FTS L I+DQDQGSYL+P
Subjt:  LLFTIFTSLLWIDDQDQGSYLIP

XP_038883420.1 protein SINE1 [Benincasa hispida]0.0e+0094.7Show/hide
Query:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFM+KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDE+VNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSII+EMENCQSDQMPYVKGAA+ETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTA

Query:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSKMDKSPSSVTGSNFIDR RRSPWRN GSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVTRGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRH
        SLFS++TRGTDVSDTMSVHSGSHK GHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG+INVEDMI+KTPRKLVQSLQDLNE NS+Y S+SSRRRH
Subjt:  SLFSEVTRGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRH

Query:  RSLSSGNLEWSPPRSFLNQNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWSPPRSFLNQ + PDDQK SKED GGLD+D  EQSQGSSESISS+DGVP HGDV+AIPVAV CQSKIKPQYSG+EMAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPPRSFLNQNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWIDDQDQGSYLIP
        LLFTIFTSLLWIDD DQGSYL+P
Subjt:  LLFTIFTSLLWIDDQDQGSYLIP

TrEMBL top hitse value%identityAlignment
A0A0A0KYP2 Uncharacterized protein0.0e+0093.74Show/hide
Query:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKAL+TYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSII+EMENCQSDQMPYVKGAA+ETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTA

Query:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSKMDKSPSSVTGSNF+D RRRSPWRN GSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVTRGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRH
        SLFSEVTRGTDVSDTMS++SGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR +INVEDMI+KTPRKLV SLQDLNEG SDYAS SSR RH
Subjt:  SLFSEVTRGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRH

Query:  RSLSSGNLEWSPPRSFLNQNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWSPPR+FLNQN   D+ KLSKEDE GL + NGEQSQGS ESISS DG P H DVQAIPVAV CQSK+KPQY G+EMAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPPRSFLNQNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWIDDQDQGSYLIP
        LLFTIFTSLLWIDD DQGSYL+P
Subjt:  LLFTIFTSLLWIDDQDQGSYLIP

A0A1S3B5D3 uncharacterized protein LOC1034859760.0e+0094.54Show/hide
Query:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTA
        TQTNSHMGLVM+LAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSII+EMENCQSDQMPYVKGAA+ETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTA

Query:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSKMDKSPSSVTGSNFID RRRSPWRN GSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNS FDRRSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVTRGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRH
        SLFSEVTRGTDVSDTMS+HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR +I VEDMI+KTPRKLV SLQDLNE NSDYAS SSRRRH
Subjt:  SLFSEVTRGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRH

Query:  RSLSSGNLEWSPPRSFLNQNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWSPPR+FLN+N   D++KLSKEDE GLD DNGEQSQGSSESISSTDGVP H DVQA+PVAV CQSKIKPQY G+EMAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPPRSFLNQNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWIDDQDQGSYLIP
        LLFTIFTSLLWIDD DQGSYL+P
Subjt:  LLFTIFTSLLWIDDQDQGSYLIP

A0A5A7UWA1 ARM repeat superfamily protein0.0e+0094.54Show/hide
Query:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTA
        TQTNSHMGLVM+LAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSII+EMENCQSDQMPYVKGAA+ETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTA

Query:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        KKILADKGSKMDKSPSSVTGSNFID RRRSPWRN GSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNS FDRRSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVTRGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRH
        SLFSEVTRGTDVSDTMS+HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR +I VEDMI+KTPRKLV SLQDLNE NSDYAS SSRRRH
Subjt:  SLFSEVTRGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRH

Query:  RSLSSGNLEWSPPRSFLNQNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWSPPR+FLN+N   D++KLSKEDE GLD DNGEQSQGSSESISSTDGVP H DVQA+PVAV CQSKIKPQY G+EMAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPPRSFLNQNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWIDDQDQGSYLIP
        LLFTIFTSLLWIDD DQGSYL+P
Subjt:  LLFTIFTSLLWIDDQDQGSYLIP

A0A5D3CDJ7 ARM repeat superfamily protein0.0e+0094.55Show/hide
Query:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
        MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS
Subjt:  MLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACS

Query:  KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNP
        KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM+LAKRNP
Subjt:  KVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNP

Query:  RIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTAKKILADKGSKMDKSPSSV
        RIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSII+EMENCQSDQMPYVKGAA+ETLQTAKKILADKGSKMDKSPSSV
Subjt:  RIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTAKKILADKGSKMDKSPSSV

Query:  TGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLSLFSEVTRGTDVSDTMSV
        TGSNFID RRRSPWRN GSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNS FDRRSVNRKLWSYENGGVDISLKDGLSLFSEVTRGTDVSDTMS+
Subjt:  TGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLSLFSEVTRGTDVSDTMSV

Query:  HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRHRSLSSGNLEWSPPRSFLN
        HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSR +I VEDMI+KTPRKLV SLQDLNE NSDYAS SSRRRHRSLSSGNLEWSPPR+FLN
Subjt:  HSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRHRSLSSGNLEWSPPRSFLN

Query:  QNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWIDDQDQG
        +N   D++KLSKEDE GLD DNGEQSQGSSESISSTDGVP H DVQA+PVAV CQSKIKPQY G+EMAYKKTALKLVCGFSFLLFTIFTSLLWIDD DQG
Subjt:  QNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWIDDQDQG

Query:  SYLIP
        SYL+P
Subjt:  SYLIP

A0A6J1DQ15 uncharacterized protein LOC1110231618.9e-30087Show/hide
Query:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKAT ETQR    KNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIP FLAQVSE +ETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPL ESLL SQESLTSGAALCLKALVDSDNWRFASDEM+NKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTA
        TQTNSHMGLV TLAKRNPRIVEPYARLLLQAGLRILK G+VEKNSQKRLSAIQMINFLM+CLDPWSI SELQ+II+EMENCQSDQM YVKGAA+ETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTA

Query:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        K+I ADKGSKMDKSPSSVTGSNFID RRRSPWRN GSRTPSSES ESQTLDSFFDYGSLVGSP S RQASRNSGFD RSVNRKLWSYENGGVDISLKDGL
Subjt:  KKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEVTRGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRH
        SLFS +TRG DVSDTMS+ S SH FG NGEEYADDF+GF Q+SPPRRR+S+STTTSPLRSR +INVEDMI+KTPRKLV SLQDLNE NSD+AS+S RR +
Subjt:  SLFSEVTRGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRH

Query:  RSLSSGNLEWSPPRSFLNQNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWSP  SF NQN  PDDQKLSKED GGLD  NGEQSQG SES+SSTDG+P H D+QA PV V  QS +K Q SGI+MAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWSPPRSFLNQNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWIDDQDQGSYLIP
        LLFT+FTS L I+DQDQGSYL+P
Subjt:  LLFTIFTSLLWIDDQDQGSYLIP

SwissProt top hitse value%identityAlignment
B3H615 PHD finger protein ING21.6e-8874.77Show/hide
Query:  TMIDQTRQQTKYCLGLSTQSSKKG---YSNSNTEDEESAFEKLRKDIEANQDSALSLCTEKVLLARQAGDLIDSHIKRLDEDLNNFAEDLKQEGKISPDE
        ++I+QTRQQTKYCLGL++QSSKKG   + N+   DEE   EK+RK+IE++Q++ALSLCTEKVLLARQA DLIDSH+KRLDEDLNNFAEDLKQEGKI PDE
Subjt:  TMIDQTRQQTKYCLGLSTQSSKKG---YSNSNTEDEESAFEKLRKDIEANQDSALSLCTEKVLLARQAGDLIDSHIKRLDEDLNNFAEDLKQEGKISPDE

Query:  PAILPPLPLVSKNERRRPVFITPQSKRPDYRDRDWDRERDRDFELMPPPGGHKKDFAPSLDVDQPIDPNEPTYCICHQVSFGDMIACDNENCQGGEWFHY
        P++LPPLP+V K E+R+  + TPQ K+ DYRDRDWD  RDRDFELMPPPG ++KD  P    +QPIDPNEPTYC+CHQVSFGDMIACDNENCQGGEWFHY
Subjt:  PAILPPLPLVSKNERRRPVFITPQSKRPDYRDRDWDRERDRDFELMPPPGGHKKDFAPSLDVDQPIDPNEPTYCICHQVSFGDMIACDNENCQGGEWFHY

Query:  SCVGLTPETRFKGK
        +CVGLTPETRFKGK
Subjt:  SCVGLTPETRFKGK

Q5XVI1 Protein SINE11.2e-14753.86Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M  NL+P+LR+E ANLDKD +SR+SAMKAL++YVK+LDSKAIP FLAQV E KET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSF
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM
        PLQQACSKV+PAIARYGIDPTT +DKK+ +I+SLC PL++SLL SQESLTSGAALCLKALVDSDNWRFASDEMVN+VCQNV  AL+  S QT+  MGLVM
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM

Query:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTAKKILADKGSKM
        +LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLSA+QM+NFLM+CLDP SI+SE++ II EME CQSDQM YV+GAAYE + T+K+I A+  SKM
Subjt:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTAKKILADKGSKM

Query:  DKSPSSVTGSNFIDRRRRSPWRNDGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVT
        +K   SVTGSNF         RN  S  P  S SPESQTL SF  Y S V  SP S    S NS FDRRSVNRKLW   ENGG VDISLKDG  LFS VT
Subjt:  DKSPSSVTGSNFIDRRRRSPWRNDGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVT

Query:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-FINVEDM-IYKTPRKLVQSLQDLNEGNSDYASRSSRRRHRSLS
        +G T VSD+  V        ++  E  D+F GF   S       R+TT SP R R   IN ED  I+ TPRKL+ SLQ                      
Subjt:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-FINVEDM-IYKTPRKLVQSLQDLNEGNSDYASRSSRRRHRSLS

Query:  SGNLEWSPPRSFLNQNRLPDDQKLSKED-EGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSFLLF
                          PDD  L   D +  +     E++ GS ++       P   +  +  + V   +      +G +   K +  KLV   SF++ 
Subjt:  SGNLEWSPPRSFLNQNRLPDDQKLSKED-EGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSFLLF

Query:  TIFTSLLWI--DDQDQGSYLIP
         +F +++ +   D D G Y +P
Subjt:  TIFTSLLWI--DDQDQGSYLIP

Q8C0D7 Inhibitor of growth protein 43.2e-2038.04Show/hide
Query:  EKVLLARQAGDLIDSHIKRLDEDLNNFAEDLKQEGKISPDEPAILPPLPLVSKNERRRPVFITPQSKRPDYRDRDWDRERDRDFE-------LMPPPGGH
        +KV LA Q  +++D HI+RLD DL  F  DLK++   S D  +        SK +++       ++ R   + ++ D E  +  +         P  G  
Subjt:  EKVLLARQAGDLIDSHIKRLDEDLNNFAEDLKQEGKISPDEPAILPPLPLVSKNERRRPVFITPQSKRPDYRDRDWDRERDRDFE-------LMPPPGGH

Query:  KKDFA---PSLDVDQPIDPNEPTYCICHQVSFGDMIACDNENCQGGEWFHYSCVGLTPETRFK
           F    PS  +D P+DPNEPTYC+CHQVS+G+MI CDN +C   EWFH++CVGLT + R K
Subjt:  KKDFA---PSLDVDQPIDPNEPTYCICHQVSFGDMIACDNENCQGGEWFHYSCVGLTPETRFK

Q9SQR5 Protein SINE21.0e-8756.65Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M +NL    R+E ANLDKD DS ++AM  LR+ VK+LD+K + VF+AQ+S+ KE G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS 
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL
         +QQACS+ V A+ARYGIDPTTP+DKK +VI+SLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VCQ++A ALE  S++  SHM L
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL

Query:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTAKKILADKGS
        VM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL AIQM+NFLM+ L+P SI SEL+ I  EME  Q DQ  YVK AA+ET++ A++++ +   
Subjt:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTAKKILADKGS

Query:  KMD----KSPSSVTGS
          D    K  +S++GS
Subjt:  KMD----KSPSSVTGS

Q9UNL4 Inhibitor of growth protein 42.4e-2036.11Show/hide
Query:  LRKDIEANQDSALSLCTEKVLLARQAGDLIDSHIKRLDEDLNNFAEDLKQEGKISPDEPAILPPLPLVSKNERRRPVFITPQSKRPDYRDRDWDRERDRD
        L K I+           +KV LA Q  +++D HI+RLD DL  F  DLK++   S D  +        SK +++       ++ R   + ++ D E  + 
Subjt:  LRKDIEANQDSALSLCTEKVLLARQAGDLIDSHIKRLDEDLNNFAEDLKQEGKISPDEPAILPPLPLVSKNERRRPVFITPQSKRPDYRDRDWDRERDRD

Query:  FE-------LMPPPGGHKKDFA---PSLDVDQPIDPNEPTYCICHQVSFGDMIACDNENCQGGEWFHYSCVGLTPETRFK
         +         P  G     F    PS  +D P+DPNEPTYC+CHQVS+G+MI CDN +C   EWFH++CVGLT + R K
Subjt:  FE-------LMPPPGGHKKDFA---PSLDVDQPIDPNEPTYCICHQVSFGDMIACDNENCQGGEWFHYSCVGLTPETRFK

Arabidopsis top hitse value%identityAlignment
AT1G54385.1 ARM repeat superfamily protein8.3e-14953.86Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M  NL+P+LR+E ANLDKD +SR+SAMKAL++YVK+LDSKAIP FLAQV E KET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSF
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM
        PLQQACSKV+PAIARYGIDPTT +DKK+ +I+SLC PL++SLL SQESLTSGAALCLKALVDSDNWRFASDEMVN+VCQNV  AL+  S QT+  MGLVM
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM

Query:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTAKKILADKGSKM
        +LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLSA+QM+NFLM+CLDP SI+SE++ II EME CQSDQM YV+GAAYE + T+K+I A+  SKM
Subjt:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTAKKILADKGSKM

Query:  DKSPSSVTGSNFIDRRRRSPWRNDGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVT
        +K   SVTGSNF         RN  S  P  S SPESQTL SF  Y S V  SP S    S NS FDRRSVNRKLW   ENGG VDISLKDG  LFS VT
Subjt:  DKSPSSVTGSNFIDRRRRSPWRNDGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVT

Query:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-FINVEDM-IYKTPRKLVQSLQDLNEGNSDYASRSSRRRHRSLS
        +G T VSD+  V        ++  E  D+F GF   S       R+TT SP R R   IN ED  I+ TPRKL+ SLQ                      
Subjt:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-FINVEDM-IYKTPRKLVQSLQDLNEGNSDYASRSSRRRHRSLS

Query:  SGNLEWSPPRSFLNQNRLPDDQKLSKED-EGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSFLLF
                          PDD  L   D +  +     E++ GS ++       P   +  +  + V   +      +G +   K +  KLV   SF++ 
Subjt:  SGNLEWSPPRSFLNQNRLPDDQKLSKED-EGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSFLLF

Query:  TIFTSLLWI--DDQDQGSYLIP
         +F +++ +   D D G Y +P
Subjt:  TIFTSLLWI--DDQDQGSYLIP

AT1G54385.2 ARM repeat superfamily protein8.3e-14953.86Show/hide
Query:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        M  NL+P+LR+E ANLDKD +SR+SAMKAL++YVK+LDSKAIP FLAQV E KET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSF
Subjt:  MSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM
        PLQQACSKV+PAIARYGIDPTT +DKK+ +I+SLC PL++SLL SQESLTSGAALCLKALVDSDNWRFASDEMVN+VCQNV  AL+  S QT+  MGLVM
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM

Query:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTAKKILADKGSKM
        +LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLSA+QM+NFLM+CLDP SI+SE++ II EME CQSDQM YV+GAAYE + T+K+I A+  SKM
Subjt:  TLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLSAIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTAKKILADKGSKM

Query:  DKSPSSVTGSNFIDRRRRSPWRNDGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVT
        +K   SVTGSNF         RN  S  P  S SPESQTL SF  Y S V  SP S    S NS FDRRSVNRKLW   ENGG VDISLKDG  LFS VT
Subjt:  DKSPSSVTGSNFIDRRRRSPWRNDGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEVT

Query:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-FINVEDM-IYKTPRKLVQSLQDLNEGNSDYASRSSRRRHRSLS
        +G T VSD+  V        ++  E  D+F GF   S       R+TT SP R R   IN ED  I+ TPRKL+ SLQ                      
Subjt:  RG-TDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRG-FINVEDM-IYKTPRKLVQSLQDLNEGNSDYASRSSRRRHRSLS

Query:  SGNLEWSPPRSFLNQNRLPDDQKLSKED-EGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSFLLF
                          PDD  L   D +  +     E++ GS ++       P   +  +  + V   +      +G +   K +  KLV   SF++ 
Subjt:  SGNLEWSPPRSFLNQNRLPDDQKLSKED-EGGLDSDNGEQSQGSSESISSTDGVPNHGDVQAIPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSFLLF

Query:  TIFTSLLWI--DDQDQGSYLIP
         +F +++ +   D D G Y +P
Subjt:  TIFTSLLWI--DDQDQGSYLIP

AT1G54390.3 PHD finger protein-related1.1e-8974.77Show/hide
Query:  TMIDQTRQQTKYCLGLSTQSSKKG---YSNSNTEDEESAFEKLRKDIEANQDSALSLCTEKVLLARQAGDLIDSHIKRLDEDLNNFAEDLKQEGKISPDE
        ++I+QTRQQTKYCLGL++QSSKKG   + N+   DEE   EK+RK+IE++Q++ALSLCTEKVLLARQA DLIDSH+KRLDEDLNNFAEDLKQEGKI PDE
Subjt:  TMIDQTRQQTKYCLGLSTQSSKKG---YSNSNTEDEESAFEKLRKDIEANQDSALSLCTEKVLLARQAGDLIDSHIKRLDEDLNNFAEDLKQEGKISPDE

Query:  PAILPPLPLVSKNERRRPVFITPQSKRPDYRDRDWDRERDRDFELMPPPGGHKKDFAPSLDVDQPIDPNEPTYCICHQVSFGDMIACDNENCQGGEWFHY
        P++LPPLP+V K E+R+  + TPQ K+ DYRDRDWD  RDRDFELMPPPG ++KD  P    +QPIDPNEPTYC+CHQVSFGDMIACDNENCQGGEWFHY
Subjt:  PAILPPLPLVSKNERRRPVFITPQSKRPDYRDRDWDRERDRDFELMPPPGGHKKDFAPSLDVDQPIDPNEPTYCICHQVSFGDMIACDNENCQGGEWFHY

Query:  SCVGLTPETRFKGK
        +CVGLTPETRFKGK
Subjt:  SCVGLTPETRFKGK

AT1G54390.4 PHD finger protein-related1.1e-8974.77Show/hide
Query:  TMIDQTRQQTKYCLGLSTQSSKKG---YSNSNTEDEESAFEKLRKDIEANQDSALSLCTEKVLLARQAGDLIDSHIKRLDEDLNNFAEDLKQEGKISPDE
        ++I+QTRQQTKYCLGL++QSSKKG   + N+   DEE   EK+RK+IE++Q++ALSLCTEKVLLARQA DLIDSH+KRLDEDLNNFAEDLKQEGKI PDE
Subjt:  TMIDQTRQQTKYCLGLSTQSSKKG---YSNSNTEDEESAFEKLRKDIEANQDSALSLCTEKVLLARQAGDLIDSHIKRLDEDLNNFAEDLKQEGKISPDE

Query:  PAILPPLPLVSKNERRRPVFITPQSKRPDYRDRDWDRERDRDFELMPPPGGHKKDFAPSLDVDQPIDPNEPTYCICHQVSFGDMIACDNENCQGGEWFHY
        P++LPPLP+V K E+R+  + TPQ K+ DYRDRDWD  RDRDFELMPPPG ++KD  P    +QPIDPNEPTYC+CHQVSFGDMIACDNENCQGGEWFHY
Subjt:  PAILPPLPLVSKNERRRPVFITPQSKRPDYRDRDWDRERDRDFELMPPPGGHKKDFAPSLDVDQPIDPNEPTYCICHQVSFGDMIACDNENCQGGEWFHY

Query:  SCVGLTPETRFKGK
        +CVGLTPETRFKGK
Subjt:  SCVGLTPETRFKGK

AT1G54390.5 PHD finger protein-related1.5e-8975.12Show/hide
Query:  MIDQTRQQTKYCLGLSTQSSKKG---YSNSNTEDEESAFEKLRKDIEANQDSALSLCTEKVLLARQAGDLIDSHIKRLDEDLNNFAEDLKQEGKISPDEP
        +I+QTRQQTKYCLGL++QSSKKG   + N+   DEE   EK+RK+IE++Q++ALSLCTEKVLLARQA DLIDSH+KRLDEDLNNFAEDLKQEGKI PDEP
Subjt:  MIDQTRQQTKYCLGLSTQSSKKG---YSNSNTEDEESAFEKLRKDIEANQDSALSLCTEKVLLARQAGDLIDSHIKRLDEDLNNFAEDLKQEGKISPDEP

Query:  AILPPLPLVSKNERRRPVFITPQSKRPDYRDRDWDRERDRDFELMPPPGGHKKDFAPSLDVDQPIDPNEPTYCICHQVSFGDMIACDNENCQGGEWFHYS
        ++LPPLP+V K E+R+  + TPQ K+ DYRDRDWD  RDRDFELMPPPG ++KD  P    +QPIDPNEPTYC+CHQVSFGDMIACDNENCQGGEWFHY+
Subjt:  AILPPLPLVSKNERRRPVFITPQSKRPDYRDRDWDRERDRDFELMPPPGGHKKDFAPSLDVDQPIDPNEPTYCICHQVSFGDMIACDNENCQGGEWFHYS

Query:  CVGLTPETRFKGK
        CVGLTPETRFKGK
Subjt:  CVGLTPETRFKGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GAGTGGGCGCGCCAAGCCAAGGCGAAACGAAACGACCTTCGATCAAGCCCTAGATCACAAAGCTCCACTCAATCTCAACCATGGCGATTGCTCGAACAGGAGTCT
ACGTTGATGATTATTTGGAATATGCCAGCACATTGCCTGCTGAGCTTCAGAGGCTGCTCAATACAATCAGAGAACTCGATGATCGTTCCCACGATGATTGACCAG
ACGAGGCAGCAAACGAAATACTGCTTGGGATTGTCGACACAGAGTTCGAAGAAAGGATATAGTAATAGTAATACTGAGGATGAGGAGTCCGCCTTTGAGAAACTG
CGGAAGGATATCGAGGCCAATCAGGATAGCGCGTTGAGTTTGTGCACTGAGAAGGTTTTGTTGGCACGTCAAGCAGGCGACCTGATAGATAGCCATATTAAACGA
CTTGATGAAGATCTAAACAACTTTGCAGAAGATCTGAAACAAGAGGGAAAGATTTCACCAGACGAACCGGCAATTCTTCCGCCATTGCCATTAGTTTCAAAAAAT
GAAAGGCGCAGACCAGTATTTATAACACCTCAATCTAAGAGGCCAGATTACAGGGACCGGGATTGGGATAGGGAGCGTGATCGTGATTTTGAGCTCATGCCTCCT
CCTGGAGGCCATAAAAAAGACTTTGCACCTTCACTTGATGTGGATCAACCCATTGATCCAAACGAACCTACGTATTGCATTTGTCATCAGGTTTCATTTGGAGAT
ATGATTGCCTGTGACAATGAGAATTGTCAAGGAGGCGAATGGTTCCATTATTCCTGCGTTGGACTGACTCCAGAAACAAGATTTAAAGGAAAGGCCATTATGGTC
CAAAATCAAAAGTTTAAGAGAGGGACCCACAAAAGAAGAAGAAGGGATAGCGGTGGAATTTCAAGAAAAGATTTTGAATTCATAGTGTCGAAGAGCGCTTTGCAA
AGACCAAACCCCAGAAAAGGAAAAACCTTCAAATCAAAGCCAAAATCAGACAGCACCGAACCCATAAATTCAAATCCCTCAAAAATCCCTCAAAAATCCCTAACC
CTTCTCTCTCTCTTTCACTCTTCTTCTTCTTCTTCTTATTCCTCCTTATCTTCTTCTTTCCATTTCTTCTCAGACATACATCTCCACGCCATGAAAGCAACCTCA
GAAACTCAAAGGTCTTTTATGAGCAAAAATTTGAGTCCAATGCTTCGGCGGGAGTTTGCTAATCTTGATAAAGATGCCGATAGTCGTAGATCTGCGATGAAGGCA
TTGAGAACTTATGTGAAGGAATTAGACTCCAAGGCTATCCCTGTTTTTCTTGCCCAAGTTTCTGAGAATAAAGAAACTGGTGCTTTGAATGGGGAATGTACCATT
TCTCTCTATGAAGTTCTCGCTCGTGTTCATGGCGTCAATATCGTGCCACAGATCGATCGGATTATGACTTCTATTATCAAGACTTTGGCTTCAAGTGCTGGCTCT
TTCCCTCTTCAACAAGCTTGTTCCAAAGTTGTTCCGGCGATTGCGAGATATGGGATTGATCCCACCACTCCTGATGATAAGAAGAAGCATGTGATTTACTCTCTT
TGTAATCCGCTTTCGGAATCTTTGTTGGGTTCTCAAGAGAGCCTCACTTCTGGTGCTGCCCTATGCTTGAAGGCTCTTGTCGATTCTGATAACTGGCGGTTTGCT
TCTGATGAGATGGTTAACAAGGTTTGCCAGAATGTCGCTGGAGCTTTGGAGGAGAAATCTACACAAACCAATTCACATATGGGGCTCGTTATGACTCTAGCTAAG
CGGAATCCTCGGATTGTCGAACCGTATGCTAGATTGTTGCTACAGGCTGGGCTGCGGATATTGAAGTGTGGGATTGTGGAGAAGAATTCTCAGAAAAGATTGTCT
GCTATTCAAATGATTAATTTCTTGATGAGATGTCTAGATCCTTGGAGTATATTTTCGGAGCTTCAGTCTATAATTGATGAGATGGAGAATTGTCAGTCTGATCAA
ATGCCTTATGTCAAAGGTGCCGCTTATGAAACTTTGCAAACGGCTAAGAAAATATTGGCTGATAAAGGGTCGAAAATGGACAAATCTCCAAGCTCGGTGACGGGA
TCAAACTTCATTGATCGCAGGAGGAGAAGTCCATGGAGAAATGATGGAAGCCGAACTCCCTCATCCGAGTCCCCAGAATCCCAGACCCTTGATTCATTCTTTGAT
TATGGCTCACTTGTAGGATCACCCTTTTCATCAAGACAAGCTTCTCGTAACTCAGGATTCGACCGAAGGAGTGTGAATCGTAAACTTTGGAGTTATGAGAATGGT
GGGGTTGATATATCCCTCAAGGATGGCTTGTCTTTGTTCTCAGAAGTCACTCGTGGAACCGATGTTTCCGACACCATGTCCGTGCACTCTGGAAGTCACAAATTT
GGCCATAATGGTGAAGAATATGCTGATGATTTTTCAGGGTTTTTTCAAATGAGTCCTCCTCGACGCAGACTCTCAAGAAGCACTACAACCAGCCCCCTTCGGAGT
CGTGGTTTCATAAACGTTGAAGATATGATCTACAAAACTCCTCGGAAGCTCGTCCAATCCCTTCAGGATCTAAACGAGGGGAATTCCGACTATGCTAGCAGAAGT
AGCAGACGCAGGCATAGGAGTTTGTCATCAGGCAATTTGGAGTGGAGTCCTCCAAGGTCATTTCTAAATCAAAATCGGCTCCCAGATGATCAGAAACTCAGCAAA
GAGGATGAAGGCGGCTTAGACAGCGATAACGGTGAACAATCACAAGGTAGCTCCGAATCGATCTCTTCAACTGATGGTGTCCCTAACCATGGTGATGTCCAAGCT
ATACCTGTGGCAGTGGTTTGTCAAAGTAAAATCAAACCTCAATATTCTGGCATTGAGATGGCATATAAGAAGACTGCTTTGAAATTGGTTTGTGGCTTCTCATTT
TTGCTTTTCACAATATTCACATCATTACTATGGATTGATGATCAGGACCAAGGTTCCTATCTTATACCGAAGAGTGAAGCAACTCGAGCGATCCTCCATGCCACA
TCGGAGATTTGTATGAGCCACTCCGGGAAGTCGGAATCCTCTTCCTCCACTGCGGCGGCTTCCGGTGGATCGATGAAGGCAATGACAAACAAATCGGCGCCGGCA
CGTGAACGGACGCTCGTAGGGAATGCCTGTGGGCGGATGGAGAAGCTTAGGAGTTACGCTCATGGCGGTCGGACGCCGGCGGCGGCGGCTCTCTCGACTTCGCGA
mRNA sequenceShow/hide mRNA sequence
GAGTGGGCGCGCCAAGCCAAGGCGAAACGAAACGACCTTCGATCAAGCCCTAGATCACAAAGCTCCACTCAATCTCAACCATGGCGATTGCTCGAACAGGAGTCT
ACGTTGATGATTATTTGGAATATGCCAGCACATTGCCTGCTGAGCTTCAGAGGCTGCTCAATACAATCAGAGAACTCGATGATCGTTCCCACGATGATTGACCAG
ACGAGGCAGCAAACGAAATACTGCTTGGGATTGTCGACACAGAGTTCGAAGAAAGGATATAGTAATAGTAATACTGAGGATGAGGAGTCCGCCTTTGAGAAACTG
CGGAAGGATATCGAGGCCAATCAGGATAGCGCGTTGAGTTTGTGCACTGAGAAGGTTTTGTTGGCACGTCAAGCAGGCGACCTGATAGATAGCCATATTAAACGA
CTTGATGAAGATCTAAACAACTTTGCAGAAGATCTGAAACAAGAGGGAAAGATTTCACCAGACGAACCGGCAATTCTTCCGCCATTGCCATTAGTTTCAAAAAAT
GAAAGGCGCAGACCAGTATTTATAACACCTCAATCTAAGAGGCCAGATTACAGGGACCGGGATTGGGATAGGGAGCGTGATCGTGATTTTGAGCTCATGCCTCCT
CCTGGAGGCCATAAAAAAGACTTTGCACCTTCACTTGATGTGGATCAACCCATTGATCCAAACGAACCTACGTATTGCATTTGTCATCAGGTTTCATTTGGAGAT
ATGATTGCCTGTGACAATGAGAATTGTCAAGGAGGCGAATGGTTCCATTATTCCTGCGTTGGACTGACTCCAGAAACAAGATTTAAAGGAAAGGCCATTATGGTC
CAAAATCAAAAGTTTAAGAGAGGGACCCACAAAAGAAGAAGAAGGGATAGCGGTGGAATTTCAAGAAAAGATTTTGAATTCATAGTGTCGAAGAGCGCTTTGCAA
AGACCAAACCCCAGAAAAGGAAAAACCTTCAAATCAAAGCCAAAATCAGACAGCACCGAACCCATAAATTCAAATCCCTCAAAAATCCCTCAAAAATCCCTAACC
CTTCTCTCTCTCTTTCACTCTTCTTCTTCTTCTTCTTATTCCTCCTTATCTTCTTCTTTCCATTTCTTCTCAGACATACATCTCCACGCCATGAAAGCAACCTCA
GAAACTCAAAGGTCTTTTATGAGCAAAAATTTGAGTCCAATGCTTCGGCGGGAGTTTGCTAATCTTGATAAAGATGCCGATAGTCGTAGATCTGCGATGAAGGCA
TTGAGAACTTATGTGAAGGAATTAGACTCCAAGGCTATCCCTGTTTTTCTTGCCCAAGTTTCTGAGAATAAAGAAACTGGTGCTTTGAATGGGGAATGTACCATT
TCTCTCTATGAAGTTCTCGCTCGTGTTCATGGCGTCAATATCGTGCCACAGATCGATCGGATTATGACTTCTATTATCAAGACTTTGGCTTCAAGTGCTGGCTCT
TTCCCTCTTCAACAAGCTTGTTCCAAAGTTGTTCCGGCGATTGCGAGATATGGGATTGATCCCACCACTCCTGATGATAAGAAGAAGCATGTGATTTACTCTCTT
TGTAATCCGCTTTCGGAATCTTTGTTGGGTTCTCAAGAGAGCCTCACTTCTGGTGCTGCCCTATGCTTGAAGGCTCTTGTCGATTCTGATAACTGGCGGTTTGCT
TCTGATGAGATGGTTAACAAGGTTTGCCAGAATGTCGCTGGAGCTTTGGAGGAGAAATCTACACAAACCAATTCACATATGGGGCTCGTTATGACTCTAGCTAAG
CGGAATCCTCGGATTGTCGAACCGTATGCTAGATTGTTGCTACAGGCTGGGCTGCGGATATTGAAGTGTGGGATTGTGGAGAAGAATTCTCAGAAAAGATTGTCT
GCTATTCAAATGATTAATTTCTTGATGAGATGTCTAGATCCTTGGAGTATATTTTCGGAGCTTCAGTCTATAATTGATGAGATGGAGAATTGTCAGTCTGATCAA
ATGCCTTATGTCAAAGGTGCCGCTTATGAAACTTTGCAAACGGCTAAGAAAATATTGGCTGATAAAGGGTCGAAAATGGACAAATCTCCAAGCTCGGTGACGGGA
TCAAACTTCATTGATCGCAGGAGGAGAAGTCCATGGAGAAATGATGGAAGCCGAACTCCCTCATCCGAGTCCCCAGAATCCCAGACCCTTGATTCATTCTTTGAT
TATGGCTCACTTGTAGGATCACCCTTTTCATCAAGACAAGCTTCTCGTAACTCAGGATTCGACCGAAGGAGTGTGAATCGTAAACTTTGGAGTTATGAGAATGGT
GGGGTTGATATATCCCTCAAGGATGGCTTGTCTTTGTTCTCAGAAGTCACTCGTGGAACCGATGTTTCCGACACCATGTCCGTGCACTCTGGAAGTCACAAATTT
GGCCATAATGGTGAAGAATATGCTGATGATTTTTCAGGGTTTTTTCAAATGAGTCCTCCTCGACGCAGACTCTCAAGAAGCACTACAACCAGCCCCCTTCGGAGT
CGTGGTTTCATAAACGTTGAAGATATGATCTACAAAACTCCTCGGAAGCTCGTCCAATCCCTTCAGGATCTAAACGAGGGGAATTCCGACTATGCTAGCAGAAGT
AGCAGACGCAGGCATAGGAGTTTGTCATCAGGCAATTTGGAGTGGAGTCCTCCAAGGTCATTTCTAAATCAAAATCGGCTCCCAGATGATCAGAAACTCAGCAAA
GAGGATGAAGGCGGCTTAGACAGCGATAACGGTGAACAATCACAAGGTAGCTCCGAATCGATCTCTTCAACTGATGGTGTCCCTAACCATGGTGATGTCCAAGCT
ATACCTGTGGCAGTGGTTTGTCAAAGTAAAATCAAACCTCAATATTCTGGCATTGAGATGGCATATAAGAAGACTGCTTTGAAATTGGTTTGTGGCTTCTCATTT
TTGCTTTTCACAATATTCACATCATTACTATGGATTGATGATCAGGACCAAGGTTCCTATCTTATACCGAAGAGTGAAGCAACTCGAGCGATCCTCCATGCCACA
TCGGAGATTTGTATGAGCCACTCCGGGAAGTCGGAATCCTCTTCCTCCACTGCGGCGGCTTCCGGTGGATCGATGAAGGCAATGACAAACAAATCGGCGCCGGCA
CGTGAACGGACGCTCGTAGGGAATGCCTGTGGGCGGATGGAGAAGCTTAGGAGTTACGCTCATGGCGGTCGGACGCCGGCGGCGGCGGCTCTCTCGACTTCGCGA
Protein sequenceShow/hide protein sequence
EWARQAKAKRNDLRSSPRSQSSTQSQPWRLLEQESTLMIIWNMPAHCLLSFRGCSIQSENSMIVPTMIDQTRQQTKYCLGLSTQSSKKGYSNSNTEDEESAFEKL
RKDIEANQDSALSLCTEKVLLARQAGDLIDSHIKRLDEDLNNFAEDLKQEGKISPDEPAILPPLPLVSKNERRRPVFITPQSKRPDYRDRDWDRERDRDFELMPP
PGGHKKDFAPSLDVDQPIDPNEPTYCICHQVSFGDMIACDNENCQGGEWFHYSCVGLTPETRFKGKAIMVQNQKFKRGTHKRRRRDSGGISRKDFEFIVSKSALQ
RPNPRKGKTFKSKPKSDSTEPINSNPSKIPQKSLTLLSLFHSSSSSSYSSLSSSFHFFSDIHLHAMKATSETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKA
LRTYVKELDSKAIPVFLAQVSENKETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIYSL
CNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGIVEKNSQKRLS
AIQMINFLMRCLDPWSIFSELQSIIDEMENCQSDQMPYVKGAAYETLQTAKKILADKGSKMDKSPSSVTGSNFIDRRRRSPWRNDGSRTPSSESPESQTLDSFFD
YGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLSLFSEVTRGTDVSDTMSVHSGSHKFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRS
RGFINVEDMIYKTPRKLVQSLQDLNEGNSDYASRSSRRRHRSLSSGNLEWSPPRSFLNQNRLPDDQKLSKEDEGGLDSDNGEQSQGSSESISSTDGVPNHGDVQA
IPVAVVCQSKIKPQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWIDDQDQGSYLIPKSEATRAILHATSEICMSHSGKSESSSSTAAASGGSMKAMTNKSAPA
RERTLVGNACGRMEKLRSYAHGGRTPAAAALSTSR