; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012205 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012205
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionARM repeat superfamily protein
Genome locationchr1:38624669..38627055
RNA-Seq ExpressionLag0012205
SyntenyLag0012205
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0032797 - SMN complex (cellular component)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595508.1 Protein SINE1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0090.24Show/hide
Query:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKAT E HRSF+GKSFSPMLRRELAN DKDADSRR+AMKALKTYVKELDSKAIP FLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKK +VIHSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEK 
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILK GMVEKNSQKRLS+IQMINFLMKCLDPWSIFSELQTI EEMENCQSDQMAYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  RRIAADKGSKMDKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLS
        RRIAAD+GSKM KSPSSVTGSNFIDRRRSPWRNGGS++PSSESPESQT DSFFDYGSL GSPFSS+QAS NSGFDRRS+NRKLW YENGGVDISLKDGLS
Subjt:  RRIAADKGSKMDKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLS

Query:  LFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQR
        LFS+I RGTDVSDT+S+HS SHKFGH+GEEYAD+F GFFQMSPPR RLSRSTTTSP+RSRSCINVEDMIFKTPRKLVHSLQDLNEANS YASKSC+ RQR
Subjt:  LFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQR

Query:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGG----QSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFS
        SLS GNLEWSPRS HNQNG P+DQKLSK+D   DN +      QSPGGSESVSST GVP    VQA PVVVA HSKIKTQYSGIEMAYKKTALKLVCGFS
Subjt:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGG----QSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFS

Query:  FLLFTIFTSLLWINDQDQGSYLVPT
        FLLFTIFTSLLWIN+QDQG+YLVPT
Subjt:  FLLFTIFTSLLWINDQDQGSYLVPT

KAG7027494.1 hypothetical protein SDJN02_11507, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0090.08Show/hide
Query:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKAT E HRSF+GKSFSPMLRRELANLDKDADSRR+AMKALKTYVKELDSKAIP FLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKK +VIHSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEK 
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILK GMVEKNSQKRLS+IQMINFLMKCLDPWSIFSELQTI EEMENCQSDQMAYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  RRIAADKGSKMDKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLS
        RRIAAD+GSKM KSPSSVTGSNFIDRRRSPWRNGGS++PSSESPESQT DSFFDYGSL GSPFSS+QAS NSGFDRRS+NRKLW YENGGVDISLKDGLS
Subjt:  RRIAADKGSKMDKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLS

Query:  LFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQR
        LFS+I RGTDVSDT+S+HS SHKFGH+GEEYAD+F GFFQMSPPR RLSRSTTTSP+RSRSCINVEDMIFKTPRKLVHSLQDLNEANS YASKSC+ RQR
Subjt:  LFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQR

Query:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGG----QSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFS
        SLS GNLEWSPRS HNQNG P+DQKLSK+D   DN +      QSPGGSESVSST GVP    VQA PVVV  HSKIKTQYSGIEMAYKKTALKLVCGFS
Subjt:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGG----QSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFS

Query:  FLLFTIFTSLLWINDQDQGSYLVPT
        FLLFTIFTSL WIN+QDQG+YLVPT
Subjt:  FLLFTIFTSLLWINDQDQGSYLVPT

XP_022925214.1 protein SINE1-like [Cucurbita moschata]0.0e+0090.08Show/hide
Query:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        M+AT E HRSF+GKSFSPMLRRELANLDKDADSRR+AMKALKTYVKELDSKAIP FLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKK +VIHSLCNPLSESLLG QESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEK 
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILK GMVEKNSQKRLS+IQMINFLMKCLDPWSIFSELQTI EEMENCQSDQMAYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  RRIAADKGSKMDKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLS
        RRIAAD+GSKM KSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQT DSFFDYGSL GSPFSS+QAS NSGFDRRS+NRKLW YENGGVDISLKDGLS
Subjt:  RRIAADKGSKMDKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLS

Query:  LFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQR
        LFS+I RGTDVSDT+S+HS SHKFGH+GEEYAD+F GFFQM PPR RLSRSTTTSP+RSR+CINVEDMIFKTPRKLVHSLQDLNEANSDYASKSC+ RQR
Subjt:  LFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQR

Query:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGG----QSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFS
        SLS GNLEWSPRS HNQNG PDDQKLSK+D   DN +      QSP GSESVSST GVP    VQA PVVVA H+KIKTQYSGIEMAYKKTALKLVCGFS
Subjt:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGG----QSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFS

Query:  FLLFTIFTSLLWINDQDQGSYLVPT
        FLLFTIFTSLLWIN+QDQG+YLVPT
Subjt:  FLLFTIFTSLLWINDQDQGSYLVPT

XP_022966522.1 protein SINE1-like [Cucurbita maxima]0.0e+0090.14Show/hide
Query:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKAT E HRSF+GKSFSPMLRRELANLDKDADSRR+AMKALKTYVKELDSKAIP FLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKK +VIHSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEK 
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGL+ILK GMVEKNSQKRLS+IQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  RRIAADKGSKMDKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLS
        RRIAAD+GSKM KSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQT DSFFDYGSL GSPFSS+QAS NSGFDRRS+NRKLW YENGGVDISLKDGLS
Subjt:  RRIAADKGSKMDKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLS

Query:  LFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQR
        LFS+I RGTDVSDTMS+HS SHKF H+GEEYADEF GFFQMSPPR RLSRSTTTSP+RSRSCINVEDMIFKTPRKLVHSLQDLN+ANSDYASKSC+ RQR
Subjt:  LFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQR

Query:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGG--------QSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLV
        SLS GNLEWSPRS HNQNG PD QKLSK+D   DN +          +SPGGSESVSST GVP    VQA PVVVA HSKIKTQYSGIEMAYKKTALKLV
Subjt:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGG--------QSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLV

Query:  CGFSFLLFTIFTSLLWINDQDQGSYLVPT
        CGFSFLLFTIFTSLLWIN+QDQG+YLVPT
Subjt:  CGFSFLLFTIFTSLLWINDQDQGSYLVPT

XP_023517874.1 protein SINE1-like isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0089.7Show/hide
Query:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKAT E HRSF+GKSFSPMLRRELANLDKDADSRR+AMKALKTYVKELDSKAIP FLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKK +VIHSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEK 
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILK GMVEKNSQKRLS+IQMINFLMKCLDPWSIFSELQTI EEMENCQSDQMAYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  RRIAADKGSKMDKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLS
        RRIAAD+GSKM KSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQT DSFFDYGSL GSPFSS+QAS NSGFDRRS+NRKLW YENGGVDISLKDGLS
Subjt:  RRIAADKGSKMDKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLS

Query:  LFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQR
        LFS+I RGTDVSDT+S+HS SHKF H+GEEYAD+F GFFQMSPPR RLSRSTTTSP+RSR+CINVEDMIFKTPRKLVHSLQDLN+ANSDYASKSC+ RQR
Subjt:  LFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQR

Query:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGG----------QSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALK
        SLS GNLEWSPRS HNQNG PDDQKLSK+D   DN +            QSPGGSESVSST GVP    VQA PVVVA HSKIKTQYSGIEMAYKKTALK
Subjt:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGG----------QSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALK

Query:  LVCGFSFLLFTIFTSLLWINDQDQGSYLVPT
        LVCGFSFLLFTIFTSLLWIN+QDQG+YLVPT
Subjt:  LVCGFSFLLFTIFTSLLWINDQDQGSYLVPT

TrEMBL top hitse value%identityAlignment
A0A0A0KYP2 Uncharacterized protein0.0e+0090.06Show/hide
Query:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SET RSF+ K+ SPMLRRE ANLDKDADSRRSAMKALKTYVKELDSKAIP FLAQVSENKETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLM+CLDPWSIFSELQ+IIEEMENCQSDQM YVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  RRIAADKGSKMDKSPSSVTGSNFID-RRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        ++I ADKGSKMDKSPSSVTGSNF+D RRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
Subjt:  RRIAADKGSKMDKSPSSVTGSNFID-RRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQ
        SLFSE+TRGTDVSDTMS++SGSHKFGHNGEEYAD+F+GFFQMSPPRRRLSRSTTTSPLRSRS INVEDMIFKTPRKLVHSLQDLNE  SDYAS S R R 
Subjt:  SLFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQ

Query:  RSLSSGNLEWS-PRSFHNQNGFPDDQKLSKED-GVLDNGSGGQSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWS PR+F NQNGF D+ KLSKED   L NG+G QS G  ES+SS DG P H DVQA PV VA  SK+K QY G+EMAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWS-PRSFHNQNGFPDDQKLSKED-GVLDNGSGGQSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWINDQDQGSYLVPT
        LLFTIFTSLLWI+D DQGSYLVPT
Subjt:  LLFTIFTSLLWINDQDQGSYLVPT

A0A1S3B5D3 uncharacterized protein LOC1034859762.3e-31090.06Show/hide
Query:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SET RSF+ K+ SPMLRRE ANLDKDADSRRSAMKAL+TYVKELDSKAIP FLAQVSENKETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLVM+LAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLM+CLDPWSIFSELQ+IIEEMENCQSDQM YVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  RRIAADKGSKMDKSPSSVTGSNFID-RRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        ++I ADKGSKMDKSPSSVTGSNFID RRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNS FDRRSVNRKLWSYENGGVDISLKDGL
Subjt:  RRIAADKGSKMDKSPSSVTGSNFID-RRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQ
        SLFSE+TRGTDVSDTMS+HSGSHKFGHNGEEYAD+F+GFFQMSPPRRRLSRSTTTSPLRSRS I VEDMIFKTPRKLVHSLQDLNE NSDYAS S RRR 
Subjt:  SLFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQ

Query:  RSLSSGNLEWS-PRSFHNQNGFPDDQKLSKED-GVLDNGSGGQSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWS PR+F N+NG  D++KLSKED   LD  +G QS G SES+SSTDGVP H DVQA PV V   SKIK QY G+EMAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWS-PRSFHNQNGFPDDQKLSKED-GVLDNGSGGQSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWINDQDQGSYLVPT
        LLFTIFTSLLWI+D DQGSYLVPT
Subjt:  LLFTIFTSLLWINDQDQGSYLVPT

A0A5A7UWA1 ARM repeat superfamily protein2.3e-31090.06Show/hide
Query:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA SET RSF+ K+ SPMLRRE ANLDKDADSRRSAMKAL+TYVKELDSKAIP FLAQVSENKETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLVM+LAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLM+CLDPWSIFSELQ+IIEEMENCQSDQM YVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  RRIAADKGSKMDKSPSSVTGSNFID-RRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL
        ++I ADKGSKMDKSPSSVTGSNFID RRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNS FDRRSVNRKLWSYENGGVDISLKDGL
Subjt:  RRIAADKGSKMDKSPSSVTGSNFID-RRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGL

Query:  SLFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQ
        SLFSE+TRGTDVSDTMS+HSGSHKFGHNGEEYAD+F+GFFQMSPPRRRLSRSTTTSPLRSRS I VEDMIFKTPRKLVHSLQDLNE NSDYAS S RRR 
Subjt:  SLFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQ

Query:  RSLSSGNLEWS-PRSFHNQNGFPDDQKLSKED-GVLDNGSGGQSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSF
        RSLSSGNLEWS PR+F N+NG  D++KLSKED   LD  +G QS G SES+SSTDGVP H DVQA PV V   SKIK QY G+EMAYKKTALKLVCGFSF
Subjt:  RSLSSGNLEWS-PRSFHNQNGFPDDQKLSKED-GVLDNGSGGQSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSF

Query:  LLFTIFTSLLWINDQDQGSYLVPT
        LLFTIFTSLLWI+D DQGSYLVPT
Subjt:  LLFTIFTSLLWINDQDQGSYLVPT

A0A6J1EBI1 protein SINE1-like0.0e+0090.08Show/hide
Query:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        M+AT E HRSF+GKSFSPMLRRELANLDKDADSRR+AMKALKTYVKELDSKAIP FLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKK +VIHSLCNPLSESLLG QESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEK 
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILK GMVEKNSQKRLS+IQMINFLMKCLDPWSIFSELQTI EEMENCQSDQMAYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  RRIAADKGSKMDKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLS
        RRIAAD+GSKM KSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQT DSFFDYGSL GSPFSS+QAS NSGFDRRS+NRKLW YENGGVDISLKDGLS
Subjt:  RRIAADKGSKMDKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLS

Query:  LFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQR
        LFS+I RGTDVSDT+S+HS SHKFGH+GEEYAD+F GFFQM PPR RLSRSTTTSP+RSR+CINVEDMIFKTPRKLVHSLQDLNEANSDYASKSC+ RQR
Subjt:  LFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQR

Query:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGG----QSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFS
        SLS GNLEWSPRS HNQNG PDDQKLSK+D   DN +      QSP GSESVSST GVP    VQA PVVVA H+KIKTQYSGIEMAYKKTALKLVCGFS
Subjt:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGG----QSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFS

Query:  FLLFTIFTSLLWINDQDQGSYLVPT
        FLLFTIFTSLLWIN+QDQG+YLVPT
Subjt:  FLLFTIFTSLLWINDQDQGSYLVPT

A0A6J1HN73 protein SINE1-like0.0e+0090.14Show/hide
Query:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKAT E HRSF+GKSFSPMLRRELANLDKDADSRR+AMKALKTYVKELDSKAIP FLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKK +VIHSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEK 
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGL+ILK GMVEKNSQKRLS+IQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTA

Query:  RRIAADKGSKMDKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLS
        RRIAAD+GSKM KSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQT DSFFDYGSL GSPFSS+QAS NSGFDRRS+NRKLW YENGGVDISLKDGLS
Subjt:  RRIAADKGSKMDKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLS

Query:  LFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQR
        LFS+I RGTDVSDTMS+HS SHKF H+GEEYADEF GFFQMSPPR RLSRSTTTSP+RSRSCINVEDMIFKTPRKLVHSLQDLN+ANSDYASKSC+ RQR
Subjt:  LFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQR

Query:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGG--------QSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLV
        SLS GNLEWSPRS HNQNG PD QKLSK+D   DN +          +SPGGSESVSST GVP    VQA PVVVA HSKIKTQYSGIEMAYKKTALKLV
Subjt:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGG--------QSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLV

Query:  CGFSFLLFTIFTSLLWINDQDQGSYLVPT
        CGFSFLLFTIFTSLLWIN+QDQG+YLVPT
Subjt:  CGFSFLLFTIFTSLLWINDQDQGSYLVPT

SwissProt top hitse value%identityAlignment
Q5XVI1 Protein SINE16.7e-15657.14Show/hide
Query:  LGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        +G + +P+LR+ELANLDKD +SR+SAMKALK+YVK+LDSKAIPGFLAQV E KET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSF
Subjt:  LGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM
        PLQQACSKV+PAIARYGIDPTT +DKK+ +IHSLC PL++SLL SQESLTSGAALCLKALVDSDNWRFASDEMVN+VCQNV  AL+  S QT+  MGLVM
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM

Query:  TLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTARRIAADKGSKM
        +LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLSA+QM+NFLMKCLDP SI+SE++ II+EME CQSDQMAYV+GAA+E + T++RIAA+  SKM
Subjt:  TLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTARRIAADKGSKM

Query:  DKSPSSVTGSNFIDRRRSPWRNGGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEITR
        +K   SVTGSNF        RN  S  P  S SPESQTL SF  Y S V  SP S    S NS FDRRSVNRKLW   ENGG VDISLKDG  LFS +T+
Subjt:  DKSPSSVTGSNFIDRRRSPWRNGGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEITR

Query:  G-TDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRS-CINVEDM-IFKTPRKLVHSLQ---DLNEANSDYASKSCR-RRQR
        G T VSD+  V        ++  E  DEF GF   S       R+TT SP R RS  IN ED  IF TPRKL+ SLQ   D++  +SD  S   R  R++
Subjt:  G-TDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRS-CINVEDM-IFKTPRKLVHSLQ---DLNEANSDYASKSCR-RRQR

Query:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGGQSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLLF
        ++ S                  + KL K           Q P   E++SST  + V  D   T ++     K K  Y+           KLV   SF++ 
Subjt:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGGQSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLLF

Query:  TIF-TSLLWIN-DQDQGSYLVPT
         +F T +L +N D D G Y VPT
Subjt:  TIF-TSLLWIN-DQDQGSYLVPT

Q9SQR5 Protein SINE25.0e-8756.65Show/hide
Query:  LGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        +G++     R+ELANLDKD DS ++AM  L++ VK+LD+K +  F+AQ+S+ KE G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS 
Subjt:  LGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL
         +QQACS+ V A+ARYGIDPTTP+DKK +VIHSLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VCQ++A ALE  S++  SHM L
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL

Query:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTARRIAADKGS
        VM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL AIQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A R+  +   
Subjt:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTARRIAADKGS

Query:  KMD----KSPSSVTGS
          D    K  +S++GS
Subjt:  KMD----KSPSSVTGS

Arabidopsis top hitse value%identityAlignment
AT1G54385.1 ARM repeat superfamily protein4.7e-15757.14Show/hide
Query:  LGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        +G + +P+LR+ELANLDKD +SR+SAMKALK+YVK+LDSKAIPGFLAQV E KET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSF
Subjt:  LGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM
        PLQQACSKV+PAIARYGIDPTT +DKK+ +IHSLC PL++SLL SQESLTSGAALCLKALVDSDNWRFASDEMVN+VCQNV  AL+  S QT+  MGLVM
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM

Query:  TLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTARRIAADKGSKM
        +LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLSA+QM+NFLMKCLDP SI+SE++ II+EME CQSDQMAYV+GAA+E + T++RIAA+  SKM
Subjt:  TLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTARRIAADKGSKM

Query:  DKSPSSVTGSNFIDRRRSPWRNGGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEITR
        +K   SVTGSNF        RN  S  P  S SPESQTL SF  Y S V  SP S    S NS FDRRSVNRKLW   ENGG VDISLKDG  LFS +T+
Subjt:  DKSPSSVTGSNFIDRRRSPWRNGGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEITR

Query:  G-TDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRS-CINVEDM-IFKTPRKLVHSLQ---DLNEANSDYASKSCR-RRQR
        G T VSD+  V        ++  E  DEF GF   S       R+TT SP R RS  IN ED  IF TPRKL+ SLQ   D++  +SD  S   R  R++
Subjt:  G-TDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRS-CINVEDM-IFKTPRKLVHSLQ---DLNEANSDYASKSCR-RRQR

Query:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGGQSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLLF
        ++ S                  + KL K           Q P   E++SST  + V  D   T ++     K K  Y+           KLV   SF++ 
Subjt:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGGQSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLLF

Query:  TIF-TSLLWIN-DQDQGSYLVPT
         +F T +L +N D D G Y VPT
Subjt:  TIF-TSLLWIN-DQDQGSYLVPT

AT1G54385.2 ARM repeat superfamily protein4.7e-15757.14Show/hide
Query:  LGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        +G + +P+LR+ELANLDKD +SR+SAMKALK+YVK+LDSKAIPGFLAQV E KET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSF
Subjt:  LGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM
        PLQQACSKV+PAIARYGIDPTT +DKK+ +IHSLC PL++SLL SQESLTSGAALCLKALVDSDNWRFASDEMVN+VCQNV  AL+  S QT+  MGLVM
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM

Query:  TLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTARRIAADKGSKM
        +LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLSA+QM+NFLMKCLDP SI+SE++ II+EME CQSDQMAYV+GAA+E + T++RIAA+  SKM
Subjt:  TLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTARRIAADKGSKM

Query:  DKSPSSVTGSNFIDRRRSPWRNGGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEITR
        +K   SVTGSNF        RN  S  P  S SPESQTL SF  Y S V  SP S    S NS FDRRSVNRKLW   ENGG VDISLKDG  LFS +T+
Subjt:  DKSPSSVTGSNFIDRRRSPWRNGGSRTPS-SESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSY-ENGG-VDISLKDGLSLFSEITR

Query:  G-TDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRS-CINVEDM-IFKTPRKLVHSLQ---DLNEANSDYASKSCR-RRQR
        G T VSD+  V        ++  E  DEF GF   S       R+TT SP R RS  IN ED  IF TPRKL+ SLQ   D++  +SD  S   R  R++
Subjt:  G-TDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQMSPPRRRLSRSTTTSPLRSRS-CINVEDM-IFKTPRKLVHSLQ---DLNEANSDYASKSCR-RRQR

Query:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGGQSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLLF
        ++ S                  + KL K           Q P   E++SST  + V  D   T ++     K K  Y+           KLV   SF++ 
Subjt:  SLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGGQSPGGSESVSSTDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLLF

Query:  TIF-TSLLWIN-DQDQGSYLVPT
         +F T +L +N D D G Y VPT
Subjt:  TIF-TSLLWIN-DQDQGSYLVPT

AT3G03970.1 ARM repeat superfamily protein3.6e-8856.65Show/hide
Query:  LGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        +G++     R+ELANLDKD DS ++AM  L++ VK+LD+K +  F+AQ+S+ KE G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS 
Subjt:  LGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL
         +QQACS+ V A+ARYGIDPTTP+DKK +VIHSLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VCQ++A ALE  S++  SHM L
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL

Query:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTARRIAADKGS
        VM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL AIQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A R+  +   
Subjt:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTARRIAADKGS

Query:  KMD----KSPSSVTGS
          D    K  +S++GS
Subjt:  KMD----KSPSSVTGS

AT3G03970.2 ARM repeat superfamily protein3.6e-8856.65Show/hide
Query:  LGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        +G++     R+ELANLDKD DS ++AM  L++ VK+LD+K +  F+AQ+S+ KE G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS 
Subjt:  LGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL
         +QQACS+ V A+ARYGIDPTTP+DKK +VIHSLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VCQ++A ALE  S++  SHM L
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL

Query:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTARRIAADKGS
        VM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL AIQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A R+  +   
Subjt:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTARRIAADKGS

Query:  KMD----KSPSSVTGS
          D    K  +S++GS
Subjt:  KMD----KSPSSVTGS

AT3G03970.3 ARM repeat superfamily protein3.6e-8856.65Show/hide
Query:  LGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        +G++     R+ELANLDKD DS ++AM  L++ VK+LD+K +  F+AQ+S+ KE G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS 
Subjt:  LGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL
         +QQACS+ V A+ARYGIDPTTP+DKK +VIHSLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VCQ++A ALE  S++  SHM L
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL

Query:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTARRIAADKGS
        VM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL AIQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A R+  +   
Subjt:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTARRIAADKGS

Query:  KMD----KSPSSVTGS
          D    K  +S++GS
Subjt:  KMD----KSPSSVTGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGCAACCTCAGAAACTCACAGGTCTTTTTTGGGCAAAAGTTTCAGTCCAATGCTTAGGCGGGAACTTGCTAATCTCGATAAAGATGCTGACAGTCGCAGATCTGC
GATGAAGGCATTGAAGACTTATGTGAAAGAATTGGACTCCAAGGCTATTCCTGGTTTTCTTGCTCAAGTTTCTGAGAACAAAGAAACTGGTGCATTGACTGGGGAGTGCA
CCATTTCTCTCTATGAGGTTCTCGCTCGTGTTCATGGCGTCAACATCGTGCCACAGATCGATCGGATCATGACTTCCATTATCAAGACTTTGGCTTCGAGTGCTGGCTCT
TTCCCTCTTCAACAGGCCTGTTCTAAAGTTGTTCCGGCGATTGCGAGATATGGGATTGATCCCACCACTCCTGATGATAAGAAGAAGCATGTTATTCATTCTCTTTGTAA
TCCTCTTTCGGAATCTTTGTTGGGTTCTCAAGAGAGCCTCACTTCTGGAGCTGCCCTCTGCTTGAAGGCTCTGGTGGATTCGGATAATTGGCGGTTCGCTTCCGATGAGA
TGGTGAACAAGGTTTGCCAGAATGTTGCTGGAGCTCTGGAGGAGAAGTCTACTCAAACCAATTCACACATGGGGCTTGTTATGACACTAGCAAAGCGAAATCCTCGGATT
GTCGAACCGTATGCTAGATTGTTGCTTCAAGCTGGACTGCGAATATTGAAGTGTGGGATGGTGGAGAAGAATTCTCAGAAAAGATTGTCTGCGATTCAAATGATTAATTT
CTTGATGAAGTGTCTGGATCCTTGGAGTATATTTTCGGAGCTTCAAACTATAATTGAGGAGATGGAGAATTGCCAGTCTGATCAAATGGCTTATGTCAAAGGAGCAGCTT
TTGAGACTCTGCAAACGGCAAGGAGAATAGCTGCAGATAAAGGGTCGAAAATGGACAAATCTCCAAGCTCAGTGACTGGATCGAATTTCATCGATCGCAGGAGAAGTCCA
TGGAGGAATGGTGGAAGCCGAACTCCCTCATCTGAGTCTCCAGAATCCCAGACTCTTGATTCATTCTTTGATTATGGCTCGCTTGTTGGATCGCCCTTTTCATCAAGACA
AGCTTCTCGTAATTCAGGATTCGACCGTAGGAGTGTGAATCGGAAACTTTGGAGTTATGAGAATGGTGGGGTTGATATATCCCTCAAGGATGGCTTGTCATTGTTCTCGG
AAATCACTCGTGGAACCGATGTCTCCGACACCATGTCTGTGCACTCTGGAAGTCACAAATTTGGCCATAATGGTGAAGAATATGCAGATGAGTTTACAGGATTTTTTCAA
ATGAGTCCTCCTAGACGCAGACTATCAAGAAGCACTACAACCAGCCCCCTTAGGTCTCGTAGTTGCATAAATGTCGAAGATATGATCTTCAAAACTCCTCGGAAGCTCGT
CCATTCTCTTCAAGATCTGAACGAAGCAAACTCGGACTATGCTAGCAAAAGTTGCAGACGAAGGCAAAGGAGTTTATCATCAGGAAATTTGGAGTGGAGTCCAAGATCAT
TTCACAATCAAAATGGGTTCCCAGATGATCAGAAACTTAGCAAAGAGGACGGAGTCTTAGACAACGGTAGCGGTGGACAATCACCAGGTGGTTCGGAATCAGTCTCTTCA
ACTGATGGCGTCCCTGTCCATGGTGATGTCCAAGCTACACCTGTGGTGGTGGCTCACCACAGCAAGATCAAAACTCAATATTCTGGCATTGAGATGGCATACAAGAAGAC
TGCTTTGAAACTGGTCTGTGGCTTCTCATTTTTGCTTTTCACAATATTCACTTCATTGCTCTGGATTAATGATCAGGACCAAGGCTCCTATCTTGTACCAACATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGCAACCTCAGAAACTCACAGGTCTTTTTTGGGCAAAAGTTTCAGTCCAATGCTTAGGCGGGAACTTGCTAATCTCGATAAAGATGCTGACAGTCGCAGATCTGC
GATGAAGGCATTGAAGACTTATGTGAAAGAATTGGACTCCAAGGCTATTCCTGGTTTTCTTGCTCAAGTTTCTGAGAACAAAGAAACTGGTGCATTGACTGGGGAGTGCA
CCATTTCTCTCTATGAGGTTCTCGCTCGTGTTCATGGCGTCAACATCGTGCCACAGATCGATCGGATCATGACTTCCATTATCAAGACTTTGGCTTCGAGTGCTGGCTCT
TTCCCTCTTCAACAGGCCTGTTCTAAAGTTGTTCCGGCGATTGCGAGATATGGGATTGATCCCACCACTCCTGATGATAAGAAGAAGCATGTTATTCATTCTCTTTGTAA
TCCTCTTTCGGAATCTTTGTTGGGTTCTCAAGAGAGCCTCACTTCTGGAGCTGCCCTCTGCTTGAAGGCTCTGGTGGATTCGGATAATTGGCGGTTCGCTTCCGATGAGA
TGGTGAACAAGGTTTGCCAGAATGTTGCTGGAGCTCTGGAGGAGAAGTCTACTCAAACCAATTCACACATGGGGCTTGTTATGACACTAGCAAAGCGAAATCCTCGGATT
GTCGAACCGTATGCTAGATTGTTGCTTCAAGCTGGACTGCGAATATTGAAGTGTGGGATGGTGGAGAAGAATTCTCAGAAAAGATTGTCTGCGATTCAAATGATTAATTT
CTTGATGAAGTGTCTGGATCCTTGGAGTATATTTTCGGAGCTTCAAACTATAATTGAGGAGATGGAGAATTGCCAGTCTGATCAAATGGCTTATGTCAAAGGAGCAGCTT
TTGAGACTCTGCAAACGGCAAGGAGAATAGCTGCAGATAAAGGGTCGAAAATGGACAAATCTCCAAGCTCAGTGACTGGATCGAATTTCATCGATCGCAGGAGAAGTCCA
TGGAGGAATGGTGGAAGCCGAACTCCCTCATCTGAGTCTCCAGAATCCCAGACTCTTGATTCATTCTTTGATTATGGCTCGCTTGTTGGATCGCCCTTTTCATCAAGACA
AGCTTCTCGTAATTCAGGATTCGACCGTAGGAGTGTGAATCGGAAACTTTGGAGTTATGAGAATGGTGGGGTTGATATATCCCTCAAGGATGGCTTGTCATTGTTCTCGG
AAATCACTCGTGGAACCGATGTCTCCGACACCATGTCTGTGCACTCTGGAAGTCACAAATTTGGCCATAATGGTGAAGAATATGCAGATGAGTTTACAGGATTTTTTCAA
ATGAGTCCTCCTAGACGCAGACTATCAAGAAGCACTACAACCAGCCCCCTTAGGTCTCGTAGTTGCATAAATGTCGAAGATATGATCTTCAAAACTCCTCGGAAGCTCGT
CCATTCTCTTCAAGATCTGAACGAAGCAAACTCGGACTATGCTAGCAAAAGTTGCAGACGAAGGCAAAGGAGTTTATCATCAGGAAATTTGGAGTGGAGTCCAAGATCAT
TTCACAATCAAAATGGGTTCCCAGATGATCAGAAACTTAGCAAAGAGGACGGAGTCTTAGACAACGGTAGCGGTGGACAATCACCAGGTGGTTCGGAATCAGTCTCTTCA
ACTGATGGCGTCCCTGTCCATGGTGATGTCCAAGCTACACCTGTGGTGGTGGCTCACCACAGCAAGATCAAAACTCAATATTCTGGCATTGAGATGGCATACAAGAAGAC
TGCTTTGAAACTGGTCTGTGGCTTCTCATTTTTGCTTTTCACAATATTCACTTCATTGCTCTGGATTAATGATCAGGACCAAGGCTCCTATCTTGTACCAACATAA
Protein sequenceShow/hide protein sequence
MKATSETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALKTYVKELDSKAIPGFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGS
FPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNPRI
VEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTARRIAADKGSKMDKSPSSVTGSNFIDRRRSP
WRNGGSRTPSSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLSLFSEITRGTDVSDTMSVHSGSHKFGHNGEEYADEFTGFFQ
MSPPRRRLSRSTTTSPLRSRSCINVEDMIFKTPRKLVHSLQDLNEANSDYASKSCRRRQRSLSSGNLEWSPRSFHNQNGFPDDQKLSKEDGVLDNGSGGQSPGGSESVSS
TDGVPVHGDVQATPVVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWINDQDQGSYLVPT