; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005517 (gene) of Snake gourd v1 genome

Gene IDTan0005517
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionARM repeat superfamily protein
Genome locationLG03:61623388..61625750
RNA-Seq ExpressionTan0005517
SyntenyTan0005517
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0032797 - SMN complex (cellular component)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595508.1 Protein SINE1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0091Show/hide
Query:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKATPE HRSF+GKSFSPMLRRELAN DKDADSRR+AMKAL+TYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKK +VIHSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEK 
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILK GMVEKNSQKRLS+IQMINFLMKCLDPWSIFSELQ+I EEMENCQSDQMAYVKGAAFET+QTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA

Query:  RRIAADQGSKMEKSPSSVTGSNFIDRRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDGL
        RRIAAD+GSKM KSPSSVTGSNFIDRRRSPW RNGGS++P SESPESQT DSFFDYGSL GSPFSS+QAS NSGFDRRS+NRKLW +ENGGVDISLKDGL
Subjt:  RRIAADQGSKMEKSPSSVTGSNFIDRRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDGL

Query:  SLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRRQ
        SLFS+I RGTDVSDTLS+HS SHKFGHHGEEYADDFAGFFQMSPPR RLSRSTTTSP+RSRS INVEDMIFKTPRKLVHSLQD NEANS YASKSC+ RQ
Subjt:  SLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRRQ

Query:  RSLSSGNLEWSPRSFHNQNGFPDDQELSKEDGGLD---NNDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLL
        RSLS GNLEWSPRS HNQNG P+DQ+LSK+D   D   NNDN EQSPGGSESVSST GVPVQAMP+VVA HSKIKTQYSGIEMAYKKTALKLVCGFSFLL
Subjt:  RSLSSGNLEWSPRSFHNQNGFPDDQELSKEDGGLD---NNDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLL

Query:  FTIFTSLLWINEQDQDAYLVPT
        FTIFTSLLWINEQDQ  YLVPT
Subjt:  FTIFTSLLWINEQDQDAYLVPT

KAG7027494.1 hypothetical protein SDJN02_11507, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0090.84Show/hide
Query:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKATPE HRSF+GKSFSPMLRRELANLDKDADSRR+AMKAL+TYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKK +VIHSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEK 
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILK GMVEKNSQKRLS+IQMINFLMKCLDPWSIFSELQ+I EEMENCQSDQMAYVKGAAFET+QTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA

Query:  RRIAADQGSKMEKSPSSVTGSNFIDRRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDGL
        RRIAAD+GSKM KSPSSVTGSNFIDRRRSPW RNGGS++P SESPESQT DSFFDYGSL GSPFSS+QAS NSGFDRRS+NRKLW +ENGGVDISLKDGL
Subjt:  RRIAADQGSKMEKSPSSVTGSNFIDRRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDGL

Query:  SLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRRQ
        SLFS+I RGTDVSDTLS+HS SHKFGHHGEEYADDFAGFFQMSPPR RLSRSTTTSP+RSRS INVEDMIFKTPRKLVHSLQD NEANS YASKSC+ RQ
Subjt:  SLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRRQ

Query:  RSLSSGNLEWSPRSFHNQNGFPDDQELSKEDGGLD---NNDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLL
        RSLS GNLEWSPRS HNQNG P+DQ+LSK+D   D   NNDN EQSPGGSESVSST GVPVQAMP+VV  HSKIKTQYSGIEMAYKKTALKLVCGFSFLL
Subjt:  RSLSSGNLEWSPRSFHNQNGFPDDQELSKEDGGLD---NNDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLL

Query:  FTIFTSLLWINEQDQDAYLVPT
        FTIFTSL WINEQDQ  YLVPT
Subjt:  FTIFTSLLWINEQDQDAYLVPT

XP_022925214.1 protein SINE1-like [Cucurbita moschata]0.0e+0090.84Show/hide
Query:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        M+ATPE HRSF+GKSFSPMLRRELANLDKDADSRR+AMKAL+TYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKK +VIHSLCNPLSESLLG QESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEK 
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILK GMVEKNSQKRLS+IQMINFLMKCLDPWSIFSELQ+I EEMENCQSDQMAYVKGAAFET+QTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA

Query:  RRIAADQGSKMEKSPSSVTGSNFIDRRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDGL
        RRIAAD+GSKM KSPSSVTGSNFIDRRRSPW RNGGSRTP SESPESQT DSFFDYGSL GSPFSS+QAS NSGFDRRS+NRKLW +ENGGVDISLKDGL
Subjt:  RRIAADQGSKMEKSPSSVTGSNFIDRRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDGL

Query:  SLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRRQ
        SLFS+I RGTDVSDTLS+HS SHKFGHHGEEYADDFAGFFQM PPR RLSRSTTTSP+RSR+ INVEDMIFKTPRKLVHSLQD NEANSDYASKSC+ RQ
Subjt:  SLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRRQ

Query:  RSLSSGNLEWSPRSFHNQNGFPDDQELSKEDGGLDN---NDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLL
        RSLS GNLEWSPRS HNQNG PDDQ+LSK+D   DN   NDN EQSP GSESVSST GVPVQAMP+VVA H+KIKTQYSGIEMAYKKTALKLVCGFSFLL
Subjt:  RSLSSGNLEWSPRSFHNQNGFPDDQELSKEDGGLDN---NDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLL

Query:  FTIFTSLLWINEQDQDAYLVPT
        FTIFTSLLWINEQDQ  YLVPT
Subjt:  FTIFTSLLWINEQDQDAYLVPT

XP_022966522.1 protein SINE1-like [Cucurbita maxima]0.0e+0090.26Show/hide
Query:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKATPE HRSF+GKSFSPMLRRELANLDKDADSRR+AMKAL+TYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKK +VIHSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEK 
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGL+ILK GMVEKNSQKRLS+IQMINFLMKCLDPWSIFSELQ+IIEEMENCQSDQMAYVKGAAFET+QTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA

Query:  RRIAADQGSKMEKSPSSVTGSNFIDRRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDGL
        RRIAAD+GSKM KSPSSVTGSNFIDRRRSPW RNGGSRTP SESPESQT DSFFDYGSL GSPFSS+QAS NSGFDRRS+NRKLW +ENGGVDISLKDGL
Subjt:  RRIAADQGSKMEKSPSSVTGSNFIDRRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDGL

Query:  SLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRRQ
        SLFS+I RGTDVSDT+S+HS SHKF HHGEEYAD+FAGFFQMSPPR RLSRSTTTSP+RSRS INVEDMIFKTPRKLVHSLQD N+ANSDYASKSC+ RQ
Subjt:  SLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRRQ

Query:  RSLSSGNLEWSPRSFHNQNGFPDDQELSKEDGGLDN-------NDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGF
        RSLS GNLEWSPRS HNQNG PD Q+LSK+D   DN       NDN E+SPGGSESVSST GVPVQAMP+VVA HSKIKTQYSGIEMAYKKTALKLVCGF
Subjt:  RSLSSGNLEWSPRSFHNQNGFPDDQELSKEDGGLDN-------NDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGF

Query:  SFLLFTIFTSLLWINEQDQDAYLVPT
        SFLLFTIFTSLLWINEQDQ  YLVPT
Subjt:  SFLLFTIFTSLLWINEQDQDAYLVPT

XP_023517874.1 protein SINE1-like isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0090.45Show/hide
Query:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKATPE HRSF+GKSFSPMLRRELANLDKDADSRR+AMKAL+TYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKK +VIHSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEK 
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILK GMVEKNSQKRLS+IQMINFLMKCLDPWSIFSELQ+I EEMENCQSDQMAYVKGAAFET+QTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA

Query:  RRIAADQGSKMEKSPSSVTGSNFIDRRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDGL
        RRIAAD+GSKM KSPSSVTGSNFIDRRRSPW RNGGSRTP SESPESQT DSFFDYGSL GSPFSS+QAS NSGFDRRS+NRKLW +ENGGVDISLKDGL
Subjt:  RRIAADQGSKMEKSPSSVTGSNFIDRRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDGL

Query:  SLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRRQ
        SLFS+I RGTDVSDTLS+HS SHKF HHGEEYADDFAGFFQMSPPR RLSRSTTTSP+RSR+ INVEDMIFKTPRKLVHSLQD N+ANSDYASKSC+ RQ
Subjt:  SLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRRQ

Query:  RSLSSGNLEWSPRSFHNQNGFPDDQELSKEDGGLDN---------NDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVC
        RSLS GNLEWSPRS HNQNG PDDQ+LSK+D   DN         NDN EQSPGGSESVSST GVPVQAMP+VVA HSKIKTQYSGIEMAYKKTALKLVC
Subjt:  RSLSSGNLEWSPRSFHNQNGFPDDQELSKEDGGLDN---------NDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVC

Query:  GFSFLLFTIFTSLLWINEQDQDAYLVPT
        GFSFLLFTIFTSLLWINEQDQ  YLVPT
Subjt:  GFSFLLFTIFTSLLWINEQDQDAYLVPT

TrEMBL top hitse value%identityAlignment
A0A0A0KYP2 Uncharacterized protein5.4e-30287.84Show/hide
Query:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA  ET RSF+ K+ SPMLRRE ANLDKDADSRRSAMKAL+TYVKELDSKAIPVFLAQVSENKETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLM+CLDPWSIFSELQSIIEEMENCQSDQM YVKGAAFET+QTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA

Query:  RRIAADQGSKMEKSPSSVTGSNFID-RRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDG
        ++I AD+GSKM+KSPSSVTGSNF+D RRRSPW RNGGSRTP SESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWS+ENGGVDISLKDG
Subjt:  RRIAADQGSKMEKSPSSVTGSNFID-RRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDG

Query:  LSLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRR
        LSLFSE+TRGTDVSDT+SM+SGSHKFGH+GEEYADDF+GFFQMSPPRRRLSRSTTTSPLRSRS+INVEDMIFKTPRKLVHSLQD NE  SDYAS S R R
Subjt:  LSLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRR

Query:  QRSLSSGNLEWS-PRSFHNQNGFPDDQELSKEDGGLDNNDNGEQSPGGSESVSSTDGVP----VQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGFS
         RSLSSGNLEWS PR+F NQNGF D+ +LSKED     N NGEQS G  ES+SS DG P    VQA+P+ VA  SK+K QY G+EMAYKKTALKLVCGFS
Subjt:  QRSLSSGNLEWS-PRSFHNQNGFPDDQELSKEDGGLDNNDNGEQSPGGSESVSSTDGVP----VQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGFS

Query:  FLLFTIFTSLLWINEQDQDAYLVPT
        FLLFTIFTSLLWI++ DQ +YLVPT
Subjt:  FLLFTIFTSLLWINEQDQDAYLVPT

A0A1S3B5D3 uncharacterized protein LOC1034859767.6e-30488.66Show/hide
Query:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA  ET RSF+ K+ SPMLRRE ANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA
        TQTNSHMGLVM+LAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLM+CLDPWSIFSELQSIIEEMENCQSDQM YVKGAAFET+QTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA

Query:  RRIAADQGSKMEKSPSSVTGSNFID-RRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDG
        ++I AD+GSKM+KSPSSVTGSNFID RRRSPW RNGGSRTP SESPESQTLDSFFDYGSLVGSPFSSRQASRNS FDRRSVNRKLWS+ENGGVDISLKDG
Subjt:  RRIAADQGSKMEKSPSSVTGSNFID-RRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDG

Query:  LSLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRR
        LSLFSE+TRGTDVSDT+S+HSGSHKFGH+GEEYADDF+GFFQMSPPRRRLSRSTTTSPLRSRS+I VEDMIFKTPRKLVHSLQD NE NSDYAS S RRR
Subjt:  LSLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRR

Query:  QRSLSSGNLEWS-PRSFHNQNGFPDDQELSKED-GGLDNNDNGEQSPGGSESVSSTDGVP----VQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGF
         RSLSSGNLEWS PR+F N+NG  D+++LSKED  GLD  DNGEQS G SES+SSTDGVP    VQAMP+ V   SKIK QY G+EMAYKKTALKLVCGF
Subjt:  QRSLSSGNLEWS-PRSFHNQNGFPDDQELSKED-GGLDNNDNGEQSPGGSESVSSTDGVP----VQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGF

Query:  SFLLFTIFTSLLWINEQDQDAYLVPT
        SFLLFTIFTSLLWI++ DQ +YLVPT
Subjt:  SFLLFTIFTSLLWINEQDQDAYLVPT

A0A5A7UWA1 ARM repeat superfamily protein7.6e-30488.66Show/hide
Query:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKA  ET RSF+ K+ SPMLRRE ANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVI+SLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA
        TQTNSHMGLVM+LAKRNPRIVEPYARLLLQAGLRILKCG+VEKNSQKRLSAIQMINFLM+CLDPWSIFSELQSIIEEMENCQSDQM YVKGAAFET+QTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA

Query:  RRIAADQGSKMEKSPSSVTGSNFID-RRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDG
        ++I AD+GSKM+KSPSSVTGSNFID RRRSPW RNGGSRTP SESPESQTLDSFFDYGSLVGSPFSSRQASRNS FDRRSVNRKLWS+ENGGVDISLKDG
Subjt:  RRIAADQGSKMEKSPSSVTGSNFID-RRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDG

Query:  LSLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRR
        LSLFSE+TRGTDVSDT+S+HSGSHKFGH+GEEYADDF+GFFQMSPPRRRLSRSTTTSPLRSRS+I VEDMIFKTPRKLVHSLQD NE NSDYAS S RRR
Subjt:  LSLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRR

Query:  QRSLSSGNLEWS-PRSFHNQNGFPDDQELSKED-GGLDNNDNGEQSPGGSESVSSTDGVP----VQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGF
         RSLSSGNLEWS PR+F N+NG  D+++LSKED  GLD  DNGEQS G SES+SSTDGVP    VQAMP+ V   SKIK QY G+EMAYKKTALKLVCGF
Subjt:  QRSLSSGNLEWS-PRSFHNQNGFPDDQELSKED-GGLDNNDNGEQSPGGSESVSSTDGVP----VQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGF

Query:  SFLLFTIFTSLLWINEQDQDAYLVPT
        SFLLFTIFTSLLWI++ DQ +YLVPT
Subjt:  SFLLFTIFTSLLWINEQDQDAYLVPT

A0A6J1EBI1 protein SINE1-like0.0e+0090.84Show/hide
Query:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        M+ATPE HRSF+GKSFSPMLRRELANLDKDADSRR+AMKAL+TYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKK +VIHSLCNPLSESLLG QESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEK 
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILK GMVEKNSQKRLS+IQMINFLMKCLDPWSIFSELQ+I EEMENCQSDQMAYVKGAAFET+QTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA

Query:  RRIAADQGSKMEKSPSSVTGSNFIDRRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDGL
        RRIAAD+GSKM KSPSSVTGSNFIDRRRSPW RNGGSRTP SESPESQT DSFFDYGSL GSPFSS+QAS NSGFDRRS+NRKLW +ENGGVDISLKDGL
Subjt:  RRIAADQGSKMEKSPSSVTGSNFIDRRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDGL

Query:  SLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRRQ
        SLFS+I RGTDVSDTLS+HS SHKFGHHGEEYADDFAGFFQM PPR RLSRSTTTSP+RSR+ INVEDMIFKTPRKLVHSLQD NEANSDYASKSC+ RQ
Subjt:  SLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRRQ

Query:  RSLSSGNLEWSPRSFHNQNGFPDDQELSKEDGGLDN---NDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLL
        RSLS GNLEWSPRS HNQNG PDDQ+LSK+D   DN   NDN EQSP GSESVSST GVPVQAMP+VVA H+KIKTQYSGIEMAYKKTALKLVCGFSFLL
Subjt:  RSLSSGNLEWSPRSFHNQNGFPDDQELSKEDGGLDN---NDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLL

Query:  FTIFTSLLWINEQDQDAYLVPT
        FTIFTSLLWINEQDQ  YLVPT
Subjt:  FTIFTSLLWINEQDQDAYLVPT

A0A6J1HN73 protein SINE1-like0.0e+0090.26Show/hide
Query:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
        MKATPE HRSF+GKSFSPMLRRELANLDKDADSRR+AMKAL+TYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI
Subjt:  MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSI

Query:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS
        IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKK +VIHSLCNPLSESLLGSQESLT+GAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEK 
Subjt:  IKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKS

Query:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA
        TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGL+ILK GMVEKNSQKRLS+IQMINFLMKCLDPWSIFSELQ+IIEEMENCQSDQMAYVKGAAFET+QTA
Subjt:  TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTA

Query:  RRIAADQGSKMEKSPSSVTGSNFIDRRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDGL
        RRIAAD+GSKM KSPSSVTGSNFIDRRRSPW RNGGSRTP SESPESQT DSFFDYGSL GSPFSS+QAS NSGFDRRS+NRKLW +ENGGVDISLKDGL
Subjt:  RRIAADQGSKMEKSPSSVTGSNFIDRRRSPWRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDGL

Query:  SLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRRQ
        SLFS+I RGTDVSDT+S+HS SHKF HHGEEYAD+FAGFFQMSPPR RLSRSTTTSP+RSRS INVEDMIFKTPRKLVHSLQD N+ANSDYASKSC+ RQ
Subjt:  SLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRRQ

Query:  RSLSSGNLEWSPRSFHNQNGFPDDQELSKEDGGLDN-------NDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGF
        RSLS GNLEWSPRS HNQNG PD Q+LSK+D   DN       NDN E+SPGGSESVSST GVPVQAMP+VVA HSKIKTQYSGIEMAYKKTALKLVCGF
Subjt:  RSLSSGNLEWSPRSFHNQNGFPDDQELSKEDGGLDN-------NDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGF

Query:  SFLLFTIFTSLLWINEQDQDAYLVPT
        SFLLFTIFTSLLWINEQDQ  YLVPT
Subjt:  SFLLFTIFTSLLWINEQDQDAYLVPT

SwissProt top hitse value%identityAlignment
Q5XVI1 Protein SINE13.7e-15456.4Show/hide
Query:  LGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        +G + +P+LR+ELANLDKD +SR+SAMKAL++YVK+LDSKAIP FLAQV E KET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSF
Subjt:  LGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM
        PLQQACSKV+PAIARYGIDPTT +DKK+ +IHSLC PL++SLL SQESLTSGAALCLKALVDSDNWRFASDEMVN+VCQNV  AL+  S QT+  MGLVM
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM

Query:  TLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTARRIAADQGSKM
        +LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLSA+QM+NFLMKCLDP SI+SE++ II+EME CQSDQMAYV+GAA+E M T++RIAA+  SKM
Subjt:  TLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTARRIAADQGSKM

Query:  EKSPSSVTGSNFIDRRRSPWRRNGGSRTP-LSESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSH-ENGG-VDISLKDGLSLFSEIT
        EK   SVTGSNF        RRN  S  P  S SPESQTL SF  Y S V  SP S    S NS FDRRSVNRKLW   ENGG VDISLKDG  LFS +T
Subjt:  EKSPSSVTGSNFIDRRRSPWRRNGGSRTP-LSESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSH-ENGG-VDISLKDGLSLFSEIT

Query:  RG-TDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRS-FINVEDM-IFKTPRKLVHSLQDPNEANSDYASKSCRRRQRSLS
        +G T VSD       S    +   E  D+F GF   S       R+TT SP R RS  IN ED  IF TPRKL+ SLQ P++ + D++       Q  + 
Subjt:  RG-TDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRS-FINVEDM-IFKTPRKLVHSLQDPNEANSDYASKSCRRRQRSLS

Query:  SGNLEWSPRSFHNQNGFPDDQELSKEDGGLDNNDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTIFTSL
         G                   E  K  G   N    +Q P   E++SST         + V+  +      +G +   K +  KLV   SF++  +F ++
Subjt:  SGNLEWSPRSFHNQNGFPDDQELSKEDGGLDNNDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTIFTSL

Query:  LWINEQDQDA--YLVPT
        + +  QD D   Y VPT
Subjt:  LWINEQDQDA--YLVPT

Q9SQR5 Protein SINE21.0e-8757.28Show/hide
Query:  LGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        +G++     R+ELANLDKD DS ++AM  LR+ VK+LD+K + VF+AQ+S+ KE G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS 
Subjt:  LGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL
         +QQACS+ V A+ARYGIDPTTP+DKK +VIHSLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VCQ++A ALE  S++  SHM L
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL

Query:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTARRIAADQGS
        VM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL AIQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ETM+ A R+  +   
Subjt:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTARRIAADQGS

Query:  KME----KSPSSVTGS
          +    K  +S++GS
Subjt:  KME----KSPSSVTGS

Arabidopsis top hitse value%identityAlignment
AT1G54385.1 ARM repeat superfamily protein2.6e-15556.4Show/hide
Query:  LGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        +G + +P+LR+ELANLDKD +SR+SAMKAL++YVK+LDSKAIP FLAQV E KET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSF
Subjt:  LGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM
        PLQQACSKV+PAIARYGIDPTT +DKK+ +IHSLC PL++SLL SQESLTSGAALCLKALVDSDNWRFASDEMVN+VCQNV  AL+  S QT+  MGLVM
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM

Query:  TLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTARRIAADQGSKM
        +LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLSA+QM+NFLMKCLDP SI+SE++ II+EME CQSDQMAYV+GAA+E M T++RIAA+  SKM
Subjt:  TLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTARRIAADQGSKM

Query:  EKSPSSVTGSNFIDRRRSPWRRNGGSRTP-LSESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSH-ENGG-VDISLKDGLSLFSEIT
        EK   SVTGSNF        RRN  S  P  S SPESQTL SF  Y S V  SP S    S NS FDRRSVNRKLW   ENGG VDISLKDG  LFS +T
Subjt:  EKSPSSVTGSNFIDRRRSPWRRNGGSRTP-LSESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSH-ENGG-VDISLKDGLSLFSEIT

Query:  RG-TDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRS-FINVEDM-IFKTPRKLVHSLQDPNEANSDYASKSCRRRQRSLS
        +G T VSD       S    +   E  D+F GF   S       R+TT SP R RS  IN ED  IF TPRKL+ SLQ P++ + D++       Q  + 
Subjt:  RG-TDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRS-FINVEDM-IFKTPRKLVHSLQDPNEANSDYASKSCRRRQRSLS

Query:  SGNLEWSPRSFHNQNGFPDDQELSKEDGGLDNNDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTIFTSL
         G                   E  K  G   N    +Q P   E++SST         + V+  +      +G +   K +  KLV   SF++  +F ++
Subjt:  SGNLEWSPRSFHNQNGFPDDQELSKEDGGLDNNDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTIFTSL

Query:  LWINEQDQDA--YLVPT
        + +  QD D   Y VPT
Subjt:  LWINEQDQDA--YLVPT

AT1G54385.2 ARM repeat superfamily protein2.6e-15556.4Show/hide
Query:  LGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        +G + +P+LR+ELANLDKD +SR+SAMKAL++YVK+LDSKAIP FLAQV E KET +L+GE TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSF
Subjt:  LGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM
        PLQQACSKV+PAIARYGIDPTT +DKK+ +IHSLC PL++SLL SQESLTSGAALCLKALVDSDNWRFASDEMVN+VCQNV  AL+  S QT+  MGLVM
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVM

Query:  TLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTARRIAADQGSKM
        +LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLSA+QM+NFLMKCLDP SI+SE++ II+EME CQSDQMAYV+GAA+E M T++RIAA+  SKM
Subjt:  TLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTARRIAADQGSKM

Query:  EKSPSSVTGSNFIDRRRSPWRRNGGSRTP-LSESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSH-ENGG-VDISLKDGLSLFSEIT
        EK   SVTGSNF        RRN  S  P  S SPESQTL SF  Y S V  SP S    S NS FDRRSVNRKLW   ENGG VDISLKDG  LFS +T
Subjt:  EKSPSSVTGSNFIDRRRSPWRRNGGSRTP-LSESPESQTLDSFFDYGSLV-GSPFSSRQASRNSGFDRRSVNRKLWSH-ENGG-VDISLKDGLSLFSEIT

Query:  RG-TDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRS-FINVEDM-IFKTPRKLVHSLQDPNEANSDYASKSCRRRQRSLS
        +G T VSD       S    +   E  D+F GF   S       R+TT SP R RS  IN ED  IF TPRKL+ SLQ P++ + D++       Q  + 
Subjt:  RG-TDVSDTLSMHSGSHKFGHHGEEYADDFAGFFQMSPPRRRLSRSTTTSPLRSRS-FINVEDM-IFKTPRKLVHSLQDPNEANSDYASKSCRRRQRSLS

Query:  SGNLEWSPRSFHNQNGFPDDQELSKEDGGLDNNDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTIFTSL
         G                   E  K  G   N    +Q P   E++SST         + V+  +      +G +   K +  KLV   SF++  +F ++
Subjt:  SGNLEWSPRSFHNQNGFPDDQELSKEDGGLDNNDNGEQSPGGSESVSSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTIFTSL

Query:  LWINEQDQDA--YLVPT
        + +  QD D   Y VPT
Subjt:  LWINEQDQDA--YLVPT

AT3G03970.1 ARM repeat superfamily protein7.2e-8957.28Show/hide
Query:  LGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        +G++     R+ELANLDKD DS ++AM  LR+ VK+LD+K + VF+AQ+S+ KE G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS 
Subjt:  LGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL
         +QQACS+ V A+ARYGIDPTTP+DKK +VIHSLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VCQ++A ALE  S++  SHM L
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL

Query:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTARRIAADQGS
        VM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL AIQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ETM+ A R+  +   
Subjt:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTARRIAADQGS

Query:  KME----KSPSSVTGS
          +    K  +S++GS
Subjt:  KME----KSPSSVTGS

AT3G03970.2 ARM repeat superfamily protein7.2e-8957.28Show/hide
Query:  LGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        +G++     R+ELANLDKD DS ++AM  LR+ VK+LD+K + VF+AQ+S+ KE G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS 
Subjt:  LGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL
         +QQACS+ V A+ARYGIDPTTP+DKK +VIHSLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VCQ++A ALE  S++  SHM L
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL

Query:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTARRIAADQGS
        VM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL AIQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ETM+ A R+  +   
Subjt:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTARRIAADQGS

Query:  KME----KSPSSVTGS
          +    K  +S++GS
Subjt:  KME----KSPSSVTGS

AT3G03970.3 ARM repeat superfamily protein7.2e-8957.28Show/hide
Query:  LGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF
        +G++     R+ELANLDKD DS ++AM  LR+ VK+LD+K + VF+AQ+S+ KE G  +G  T+SL+E LAR HGV I P ID IM +II+TL+SS GS 
Subjt:  LGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSF

Query:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL
         +QQACS+ V A+ARYGIDPTTP+DKK +VIHSLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VCQ++A ALE  S++  SHM L
Subjt:  PLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGS--QESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGL

Query:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTARRIAADQGS
        VM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL AIQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ETM+ A R+  +   
Subjt:  VMTLAKRNPRIVEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTARRIAADQGS

Query:  KME----KSPSSVTGS
          +    K  +S++GS
Subjt:  KME----KSPSSVTGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGCAACTCCAGAAACTCACAGGTCTTTTTTGGGCAAGAGTTTTAGTCCAATGCTCAGGCGGGAACTTGCTAATCTTGATAAAGATGCTGATAGTCGCAGATCTGC
GATGAAGGCATTGAGGACTTATGTGAAGGAATTGGACTCCAAGGCTATTCCTGTGTTTCTTGCTCAAGTTTCTGAGAATAAAGAAACTGGTGCATTGACTGGGGAGTGCA
CAATTTCTCTATATGAAGTTCTTGCTCGTGTTCATGGCGTCAATATCGTGCCACAGATCGATCGAATTATGACTTCTATTATCAAGACTTTGGCTTCAAGTGCTGGCTCT
TTCCCTCTTCAACAGGCCTGCTCGAAAGTTGTTCCTGCTATTGCTAGATATGGGATTGATCCGACTACTCCTGATGATAAGAAGAAGCATGTTATTCATTCTCTTTGTAA
TCCCCTTTCGGAATCTTTGTTGGGTTCTCAAGAGAGCCTCACTTCTGGGGCTGCCCTCTGTTTGAAGGCTCTGGTGGATTCGGATAATTGGCGGTTTGCTTCTGATGAGA
TGGTTAATAAGGTGTGCCAGAATGTTGCTGGAGCTCTGGAGGAGAAGTCTACCCAAACCAATTCACATATGGGGCTTGTTATGACGCTAGCAAAGCGGAATCCTCGGATT
GTCGAACCATATGCTAGATTGTTGCTGCAGGCCGGGTTGCGAATATTGAAGTGTGGGATGGTGGAGAAGAATTCTCAGAAAAGATTGTCTGCCATTCAAATGATTAATTT
CTTGATGAAGTGTTTAGATCCTTGGAGTATATTTTCGGAACTTCAGTCTATAATTGAGGAGATGGAGAATTGCCAGTCTGATCAAATGGCTTATGTCAAAGGTGCAGCTT
TTGAAACTATGCAAACGGCAAGGAGAATAGCTGCCGATCAAGGGTCGAAAATGGAAAAATCACCAAGCTCGGTGACTGGATCAAACTTCATTGATCGCAGGAGAAGTCCA
TGGAGGAGGAATGGTGGAAGCCGAACTCCCTTGTCCGAGTCTCCAGAATCTCAGACCCTTGATTCATTCTTCGATTACGGCTCGCTTGTTGGATCACCCTTTTCATCAAG
ACAAGCTTCTCGTAATTCAGGATTCGACCGTAGGAGCGTGAATCGGAAACTTTGGAGTCATGAGAATGGTGGGGTTGATATATCCCTCAAGGATGGCTTGTCTTTGTTCT
CGGAAATCACTCGTGGAACCGACGTCTCCGACACGCTGTCTATGCACTCTGGAAGTCACAAATTTGGGCATCACGGTGAAGAATATGCAGATGACTTTGCAGGGTTCTTT
CAAATGAGTCCTCCTAGACGCAGACTATCAAGAAGCACTACAACCAGCCCCCTTAGGTCTCGTAGTTTCATAAACGTCGAAGATATGATCTTCAAAACTCCTCGGAAGCT
CGTCCACTCTCTTCAAGATCCAAACGAGGCAAACTCAGACTATGCTAGCAAAAGCTGCAGACGAAGGCAAAGGAGTTTATCATCAGGCAATTTGGAGTGGAGTCCAAGAT
CATTTCATAATCAAAACGGGTTCCCAGATGATCAGGAACTTAGCAAAGAGGACGGCGGCTTAGACAACAACGACAACGGCGAACAATCACCAGGTGGTTCGGAATCAGTC
TCTTCAACTGATGGTGTTCCTGTCCAAGCTATGCCTATGGTGGTGGCTCACCACAGCAAGATCAAAACTCAATATTCTGGCATCGAGATGGCATACAAGAAGACTGCTTT
GAAACTGGTCTGTGGCTTCTCATTTTTGCTTTTCACAATATTCACTTCATTGCTTTGGATTAATGAGCAAGACCAAGATGCCTATCTTGTACCAACATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGCAACTCCAGAAACTCACAGGTCTTTTTTGGGCAAGAGTTTTAGTCCAATGCTCAGGCGGGAACTTGCTAATCTTGATAAAGATGCTGATAGTCGCAGATCTGC
GATGAAGGCATTGAGGACTTATGTGAAGGAATTGGACTCCAAGGCTATTCCTGTGTTTCTTGCTCAAGTTTCTGAGAATAAAGAAACTGGTGCATTGACTGGGGAGTGCA
CAATTTCTCTATATGAAGTTCTTGCTCGTGTTCATGGCGTCAATATCGTGCCACAGATCGATCGAATTATGACTTCTATTATCAAGACTTTGGCTTCAAGTGCTGGCTCT
TTCCCTCTTCAACAGGCCTGCTCGAAAGTTGTTCCTGCTATTGCTAGATATGGGATTGATCCGACTACTCCTGATGATAAGAAGAAGCATGTTATTCATTCTCTTTGTAA
TCCCCTTTCGGAATCTTTGTTGGGTTCTCAAGAGAGCCTCACTTCTGGGGCTGCCCTCTGTTTGAAGGCTCTGGTGGATTCGGATAATTGGCGGTTTGCTTCTGATGAGA
TGGTTAATAAGGTGTGCCAGAATGTTGCTGGAGCTCTGGAGGAGAAGTCTACCCAAACCAATTCACATATGGGGCTTGTTATGACGCTAGCAAAGCGGAATCCTCGGATT
GTCGAACCATATGCTAGATTGTTGCTGCAGGCCGGGTTGCGAATATTGAAGTGTGGGATGGTGGAGAAGAATTCTCAGAAAAGATTGTCTGCCATTCAAATGATTAATTT
CTTGATGAAGTGTTTAGATCCTTGGAGTATATTTTCGGAACTTCAGTCTATAATTGAGGAGATGGAGAATTGCCAGTCTGATCAAATGGCTTATGTCAAAGGTGCAGCTT
TTGAAACTATGCAAACGGCAAGGAGAATAGCTGCCGATCAAGGGTCGAAAATGGAAAAATCACCAAGCTCGGTGACTGGATCAAACTTCATTGATCGCAGGAGAAGTCCA
TGGAGGAGGAATGGTGGAAGCCGAACTCCCTTGTCCGAGTCTCCAGAATCTCAGACCCTTGATTCATTCTTCGATTACGGCTCGCTTGTTGGATCACCCTTTTCATCAAG
ACAAGCTTCTCGTAATTCAGGATTCGACCGTAGGAGCGTGAATCGGAAACTTTGGAGTCATGAGAATGGTGGGGTTGATATATCCCTCAAGGATGGCTTGTCTTTGTTCT
CGGAAATCACTCGTGGAACCGACGTCTCCGACACGCTGTCTATGCACTCTGGAAGTCACAAATTTGGGCATCACGGTGAAGAATATGCAGATGACTTTGCAGGGTTCTTT
CAAATGAGTCCTCCTAGACGCAGACTATCAAGAAGCACTACAACCAGCCCCCTTAGGTCTCGTAGTTTCATAAACGTCGAAGATATGATCTTCAAAACTCCTCGGAAGCT
CGTCCACTCTCTTCAAGATCCAAACGAGGCAAACTCAGACTATGCTAGCAAAAGCTGCAGACGAAGGCAAAGGAGTTTATCATCAGGCAATTTGGAGTGGAGTCCAAGAT
CATTTCATAATCAAAACGGGTTCCCAGATGATCAGGAACTTAGCAAAGAGGACGGCGGCTTAGACAACAACGACAACGGCGAACAATCACCAGGTGGTTCGGAATCAGTC
TCTTCAACTGATGGTGTTCCTGTCCAAGCTATGCCTATGGTGGTGGCTCACCACAGCAAGATCAAAACTCAATATTCTGGCATCGAGATGGCATACAAGAAGACTGCTTT
GAAACTGGTCTGTGGCTTCTCATTTTTGCTTTTCACAATATTCACTTCATTGCTTTGGATTAATGAGCAAGACCAAGATGCCTATCTTGTACCAACATAA
Protein sequenceShow/hide protein sequence
MKATPETHRSFLGKSFSPMLRRELANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGS
FPLQQACSKVVPAIARYGIDPTTPDDKKKHVIHSLCNPLSESLLGSQESLTSGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNPRI
VEPYARLLLQAGLRILKCGMVEKNSQKRLSAIQMINFLMKCLDPWSIFSELQSIIEEMENCQSDQMAYVKGAAFETMQTARRIAADQGSKMEKSPSSVTGSNFIDRRRSP
WRRNGGSRTPLSESPESQTLDSFFDYGSLVGSPFSSRQASRNSGFDRRSVNRKLWSHENGGVDISLKDGLSLFSEITRGTDVSDTLSMHSGSHKFGHHGEEYADDFAGFF
QMSPPRRRLSRSTTTSPLRSRSFINVEDMIFKTPRKLVHSLQDPNEANSDYASKSCRRRQRSLSSGNLEWSPRSFHNQNGFPDDQELSKEDGGLDNNDNGEQSPGGSESV
SSTDGVPVQAMPMVVAHHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWINEQDQDAYLVPT