; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g33760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g33760
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionSWIM-type domain-containing protein
Genome locationchr8:24552266..24559721
RNA-Seq ExpressionMoc08g33760
SyntenyMoc08g33760
Gene Ontology termsGO:0048507 - meristem development (biological process)
InterPro domainsIPR004332 - Transposase, MuDR, plant
IPR018289 - MULE transposase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044308.1 uncharacterized protein E6C27_scaffold46G00290 [Cucumis melo var. makuwa]1.7e-20053.46Show/hide
Query:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ
        MSL  +++VILE GN   SDI+VE+DELF  E         N    NE  PS +F +ID++I +S+CE+GS L      +  +   I+ G  F  KE LQ
Subjt:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ

Query:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV
        LAVKKYC+ QHY+ VV+ESNQ++W V+CKQW++ C WRLR  R KSHG+FEIS LE  HS LY+ LTQDHSQLDSNFM  +IQNIVR DPS  V+ L E+
Subjt:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV

Query:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK
        IK+ +GY V Y+R+WQAKRK L+ VFGDW+KSY EL YWL+A+VHYN  TRV W  +PS+ P   IFGRVFW+FGPAIEGFK+CRPLIQIDGTHLY KYK
Subjt:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK

Query:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELV
        GK+L   +IDANGHIFP+AFAIVE EN S WSWFL ALR  VT++D ICLISDRH GIL A+ N+E GW+EPRA HR+CLRHVASNFN  Y SK LK+LV
Subjt:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELV

Query:  YRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSITALVKFTFFKTITYFQKRRDDINEVVDR-
        +R G+QHQ+ KF R M+E++QLNP+C++FF  I+++KWTQ+HD G+RYGWMT+N AEC+NGVFKGARML +T+LV+ TF++TI YF++RR +I+E VDR 
Subjt:  YRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSITALVKFTFFKTITYFQKRRDDINEVVDR-

Query:  -----------------------------------------------------------------------------VCNHVHLPYLPYVEKYYELSRYQ
                                                                                     VCN++HL Y  Y+++ Y LS ++
Subjt:  -----------------------------------------------------------------------------VCNHVHLPYLPYVEKYYELSRYQ

Query:  LCYASRFYPIKHKDYWSEVKFDVIHPNPNLLREYGCQKTSHIHNEMDWRET
         CYA RF+PI+H DYW E+ F  + PN +LL+  G  +T+ IHNEMDW+E+
Subjt:  LCYASRFYPIKHKDYWSEVKFDVIHPNPNLLREYGCQKTSHIHNEMDWRET

TYK02543.1 uncharacterized protein E5676_scaffold201G00230 [Cucumis melo var. makuwa]7.5e-20153.61Show/hide
Query:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ
        MSL  +++VILE GN   SDI+VE+DELF  E         N    NE  PS +F +ID++I +S+CE+GS L      +  +   I+ G  F  KE LQ
Subjt:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ

Query:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV
        LAVKKYC+ QHY+ VV+ESNQ++W V+CKQW++ C WRLR  R KSHG+FEIS LE  HS LY+ LTQDHSQLDSNFM  EIQNIVR DPS  V+ L E+
Subjt:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV

Query:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK
        IK+ +GY V Y+R+WQAKRK L+ VFGDW+KSY EL YWL+A+VHYN  TRV W  +PS+ P   IFGRVFW+FGPAIEGFK+CRPLIQIDGTHLY KYK
Subjt:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK

Query:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELV
        GK+L   +IDANGHIFP+AFAIVE EN S WSWFL ALR  VT++D ICLISDRH GIL A+ N+E GW+EPRA HR+CLRHVASNFN  Y SK LK+LV
Subjt:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELV

Query:  YRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSITALVKFTFFKTITYFQKRRDDINEVVDR-
        +R G+QHQ+ KF R M+E++QLNP+C++FF  I+++KWTQ+HD G+RYGWMT+N AEC+NGVFKGARML +T+LV+ TF++TI YF++RR +I+E VDR 
Subjt:  YRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSITALVKFTFFKTITYFQKRRDDINEVVDR-

Query:  -----------------------------------------------------------------------------VCNHVHLPYLPYVEKYYELSRYQ
                                                                                     VCN++HL Y  Y+++ Y LS ++
Subjt:  -----------------------------------------------------------------------------VCNHVHLPYLPYVEKYYELSRYQ

Query:  LCYASRFYPIKHKDYWSEVKFDVIHPNPNLLREYGCQKTSHIHNEMDWRET
         CYA RF+PI+H DYW E+ F  + PN +LL+  G  +T+ IHNEMDW+E+
Subjt:  LCYASRFYPIKHKDYWSEVKFDVIHPNPNLLREYGCQKTSHIHNEMDWRET

XP_022153141.1 uncharacterized protein LOC111020709 [Momordica charantia]2.6e-30299.8Show/hide
Query:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ
        MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ
Subjt:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ

Query:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV
        LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVR DPSTYVTSLQEV
Subjt:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV

Query:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK
        IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK
Subjt:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK

Query:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELV
        GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELV
Subjt:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELV

Query:  YRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSITALVKFTFFKTITYFQKRRDDINEVVDR
        YRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSITALVKFTFFKTITYFQKRRDDINEVVDR
Subjt:  YRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSITALVKFTFFKTITYFQKRRDDINEVVDR

XP_022157858.1 uncharacterized protein LOC111024473 [Momordica charantia]4.5e-20689.23Show/hide
Query:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ
        MSLDCDEIVILE GNGS+SDIEVEVDELF TELCLNIPVNLNPPL NEEAPSPMF EIDFE   SICEEGSKLEEGSMSL+M+VDNIRI SKFRTKE+LQ
Subjt:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ

Query:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV
        LAVKKYCI QHYQFVV+ESNQDMWYVKCKQWND CKWRLRARR KSHGMFEISVLEEVHS LY ELTQDHSQLDSNFM +EIQNIVRRDPST + SLQEV
Subjt:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV

Query:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK
        IKK+FGYCVNY+RMWQAKRK LIKVFGDWEKSYEELSYWLNALVHYN DTRVLWDLIPSN P HLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLY  YK
Subjt:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK

Query:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKN
        GKLLI TT DANGHIFPIAFAIVEEENT  WSWFLCALRNLVT++D ICLISDRHGGIL AVK+KENGW+E RAHHRFCLRHVASNFNKN
Subjt:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKN

XP_031737534.1 uncharacterized protein LOC116402427 [Cucumis sativus]2.6e-18550.08Show/hide
Query:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ
        MSL  +++VILE GN   SDI+VEVDELF   +                             VD+ C                   I+ G  F +KE L 
Subjt:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ

Query:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV
        LAVK+YC+ QHY+ VV+ESNQ +W ++CKQW + C WRLR  + KSHG+FEI+ L+  HS LY+ LTQDHSQLDSNFM  +IQN+V+ DP+  V+ L E+
Subjt:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV

Query:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK
        IK+ +GY V Y ++WQAK+K LI VFGDWEKSY EL YWL+A+VHYN  TRV W  +PS+ P   IFGRVFWSFGPAIEGFKHCRPLIQIDGTHLY KYK
Subjt:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK

Query:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELV
        GK+L   +IDANGHIFP+AFAIVE ENTS WSWFL ALR  VT++  ICLISDRH  IL A+ N+E GW+EPRA HR+CLRHVASNFN+ Y SK LK+LV
Subjt:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELV

Query:  YRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSITALVKFTFFKTITYFQKRRDDINEVVDR-
        +R G+QHQ+ KF + M+ELQ+LNP+C++FF  I++EKWTQ+HD G RYGWMT+N AEC+NGVFKGARML IT+LV+ TF++TI YF++RR +I+E +DR 
Subjt:  YRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSITALVKFTFFKTITYFQKRRDDINEVVDR-

Query:  -----------------------------------------------------------------------------VCNHVHLPYLPYVEKYYELSRYQ
                                                                                     +CN++HL Y+PY+++ Y LS ++
Subjt:  -----------------------------------------------------------------------------VCNHVHLPYLPYVEKYYELSRYQ

Query:  LCYASRFYPIKHKDYWSEVKFDVIHPNPNLLREYGCQKTSHIHNEMDWRETDR
         CY  RF+PI+H DYW E+ F  + PN +LL+  G  +T+ I NEMDW+E+ +
Subjt:  LCYASRFYPIKHKDYWSEVKFDVIHPNPNLLREYGCQKTSHIHNEMDWRETDR

TrEMBL top hitse value%identityAlignment
A0A5A7TN56 SWIM-type domain-containing protein8.1e-20153.46Show/hide
Query:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ
        MSL  +++VILE GN   SDI+VE+DELF  E         N    NE  PS +F +ID++I +S+CE+GS L      +  +   I+ G  F  KE LQ
Subjt:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ

Query:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV
        LAVKKYC+ QHY+ VV+ESNQ++W V+CKQW++ C WRLR  R KSHG+FEIS LE  HS LY+ LTQDHSQLDSNFM  +IQNIVR DPS  V+ L E+
Subjt:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV

Query:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK
        IK+ +GY V Y+R+WQAKRK L+ VFGDW+KSY EL YWL+A+VHYN  TRV W  +PS+ P   IFGRVFW+FGPAIEGFK+CRPLIQIDGTHLY KYK
Subjt:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK

Query:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELV
        GK+L   +IDANGHIFP+AFAIVE EN S WSWFL ALR  VT++D ICLISDRH GIL A+ N+E GW+EPRA HR+CLRHVASNFN  Y SK LK+LV
Subjt:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELV

Query:  YRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSITALVKFTFFKTITYFQKRRDDINEVVDR-
        +R G+QHQ+ KF R M+E++QLNP+C++FF  I+++KWTQ+HD G+RYGWMT+N AEC+NGVFKGARML +T+LV+ TF++TI YF++RR +I+E VDR 
Subjt:  YRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSITALVKFTFFKTITYFQKRRDDINEVVDR-

Query:  -----------------------------------------------------------------------------VCNHVHLPYLPYVEKYYELSRYQ
                                                                                     VCN++HL Y  Y+++ Y LS ++
Subjt:  -----------------------------------------------------------------------------VCNHVHLPYLPYVEKYYELSRYQ

Query:  LCYASRFYPIKHKDYWSEVKFDVIHPNPNLLREYGCQKTSHIHNEMDWRET
         CYA RF+PI+H DYW E+ F  + PN +LL+  G  +T+ IHNEMDW+E+
Subjt:  LCYASRFYPIKHKDYWSEVKFDVIHPNPNLLREYGCQKTSHIHNEMDWRET

A0A5D3BS92 SWIM-type domain-containing protein3.6e-20153.61Show/hide
Query:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ
        MSL  +++VILE GN   SDI+VE+DELF  E         N    NE  PS +F +ID++I +S+CE+GS L      +  +   I+ G  F  KE LQ
Subjt:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ

Query:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV
        LAVKKYC+ QHY+ VV+ESNQ++W V+CKQW++ C WRLR  R KSHG+FEIS LE  HS LY+ LTQDHSQLDSNFM  EIQNIVR DPS  V+ L E+
Subjt:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV

Query:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK
        IK+ +GY V Y+R+WQAKRK L+ VFGDW+KSY EL YWL+A+VHYN  TRV W  +PS+ P   IFGRVFW+FGPAIEGFK+CRPLIQIDGTHLY KYK
Subjt:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK

Query:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELV
        GK+L   +IDANGHIFP+AFAIVE EN S WSWFL ALR  VT++D ICLISDRH GIL A+ N+E GW+EPRA HR+CLRHVASNFN  Y SK LK+LV
Subjt:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELV

Query:  YRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSITALVKFTFFKTITYFQKRRDDINEVVDR-
        +R G+QHQ+ KF R M+E++QLNP+C++FF  I+++KWTQ+HD G+RYGWMT+N AEC+NGVFKGARML +T+LV+ TF++TI YF++RR +I+E VDR 
Subjt:  YRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSITALVKFTFFKTITYFQKRRDDINEVVDR-

Query:  -----------------------------------------------------------------------------VCNHVHLPYLPYVEKYYELSRYQ
                                                                                     VCN++HL Y  Y+++ Y LS ++
Subjt:  -----------------------------------------------------------------------------VCNHVHLPYLPYVEKYYELSRYQ

Query:  LCYASRFYPIKHKDYWSEVKFDVIHPNPNLLREYGCQKTSHIHNEMDWRET
         CYA RF+PI+H DYW E+ F  + PN +LL+  G  +T+ IHNEMDW+E+
Subjt:  LCYASRFYPIKHKDYWSEVKFDVIHPNPNLLREYGCQKTSHIHNEMDWRET

A0A6J1CPH0 uncharacterized protein LOC1110133718.1e-18589.46Show/hide
Query:  MFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQLAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEIS
        MF EIDFE VDSICEEGSKLEEGSMSL+M+VDNIRIGSKFRTKE+LQLAVKKYCI QHYQFVV+ESNQDMWYVKCKQWND CKWRLRA R KSHGMFEIS
Subjt:  MFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQLAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEIS

Query:  VLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEVIKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVL
        VLEEVHS LY ELTQDHSQLDSNFM NEIQNIVRRDPS  + SLQEVIKKDFGYCVNY+RMWQAKRK LIKVFGDWEKSYEEL Y LNALVHYN DTRVL
Subjt:  VLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEVIKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVL

Query:  WDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYKGKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISD
        WDLIPSN P HLIFGRVFWSFGPAIEGFKHCRPLI IDGTHLY KYKGKLLI TTIDANGHIFPIAFAIVEEENTS WS FLCALRNLVT++D ICLISD
Subjt:  WDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYKGKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISD

Query:  RHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELVYRGG
        RHGGIL AVKNKENGW+EPRAHHRFCLRHVASNFNK  NSKPLKELVYR G
Subjt:  RHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELVYRGG

A0A6J1DJS7 uncharacterized protein LOC1110207091.2e-30299.8Show/hide
Query:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ
        MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ
Subjt:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ

Query:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV
        LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVR DPSTYVTSLQEV
Subjt:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV

Query:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK
        IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK
Subjt:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK

Query:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELV
        GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELV
Subjt:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELV

Query:  YRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSITALVKFTFFKTITYFQKRRDDINEVVDR
        YRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSITALVKFTFFKTITYFQKRRDDINEVVDR
Subjt:  YRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSITALVKFTFFKTITYFQKRRDDINEVVDR

A0A6J1DUI1 uncharacterized protein LOC1110244732.2e-20689.23Show/hide
Query:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ
        MSLDCDEIVILE GNGS+SDIEVEVDELF TELCLNIPVNLNPPL NEEAPSPMF EIDFE   SICEEGSKLEEGSMSL+M+VDNIRI SKFRTKE+LQ
Subjt:  MSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEEGSMSLKMNVDNIRIGSKFRTKEKLQ

Query:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV
        LAVKKYCI QHYQFVV+ESNQDMWYVKCKQWND CKWRLRARR KSHGMFEISVLEEVHS LY ELTQDHSQLDSNFM +EIQNIVRRDPST + SLQEV
Subjt:  LAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEV

Query:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK
        IKK+FGYCVNY+RMWQAKRK LIKVFGDWEKSYEELSYWLNALVHYN DTRVLWDLIPSN P HLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLY  YK
Subjt:  IKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYK

Query:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKN
        GKLLI TT DANGHIFPIAFAIVEEENT  WSWFLCALRNLVT++D ICLISDRHGGIL AVK+KENGW+E RAHHRFCLRHVASNFNKN
Subjt:  GKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49920.1 MuDR family transposase5.7e-5832.35Show/hide
Query:  DNIRIGSKFRTKEKLQLAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQN
        D +R+G  F+   +++ AV    I +  + ++ E+ +D++ V+C++W+  CKW + A R +  G+FEI+     H   Y E   D    D+  +  +I+ 
Subjt:  DNIRIGSKFRTKEKLQLAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQN

Query:  IVRRDPSTYVTSLQEVIKKDFGYCVNY-------KRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIP-SNKPRHLIFGRVFWSFGP
        +VR  P+     L +  +K FG+ ++          +  AK K + + FGDW++S+  L   L +++H +    V W     ++ P H  F  +FW+F  
Subjt:  IVRRDPSTYVTSLQEVIKKDFGYCVNY-------KRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIP-SNKPRHLIFGRVFWSFGP

Query:  AIEGFKHCRPLIQIDGTHLYDKYKGKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHH
        +I+GF+HCRPLI +D  +L  KYK KL+I +  DA    FP+AFA+ +E +   W WFL  +R  VT++  ICLIS     IL  +    + W EP A+H
Subjt:  AIEGFKHCRPLIQIDGTHLYDKYKGKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHH

Query:  RFCLRHVASNF-----NKNYNSKPLKELVYRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSI
        RFCL H+ S         +YN   +  LV   GS  QK +F+ YM+E+++ NP+   +       +W  AHD G RYG M  +  E +  V K  R +++
Subjt:  RFCLRHVASNF-----NKNYNSKPLKELVYRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSI

Query:  TALVKFTF
           V   F
Subjt:  TALVKFTF

AT1G64255.1 MuDR family transposase1.9e-6135.71Show/hide
Query:  SLKMNVDNIRIGSKFRTKEKLQLAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFM
        SL ++  ++R+G  F+  ++L+ AV    +    + VV E+ +D +  +C +W   CKW L A R K HG+ EI      H+  +  + +D     S F 
Subjt:  SLKMNVDNIRIGSKFRTKEKLQLAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFM

Query:  WNEIQNIVRRDPSTYVTSLQEVIKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLW--DLIPSNKPRHLIFGRVFWSFGP
         +EI+  VR  P+  ++ L++  KK  GY +    +  AK K + +VFGDW++S+E+    ++AL   N    V W  DL P+  P    F  VFW+F  
Subjt:  WNEIQNIVRRDPSTYVTSLQEVIKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLW--DLIPSNKPRHLIFGRVFWSFGP

Query:  AIEGFKHCRPLIQIDGTHLYDKYKGKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHH
        +IEGF+HCRPLI +D  +L  +Y+ KL+I + +DA    FP+AFA+ +E +T  W WFL  +R  VT++  +CLIS  H  I+  V    + W EP A+H
Subjt:  AIEGFKHCRPLIQIDGTHLYDKYKGKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHH

Query:  RFCLRHVASNFNKNYNSKPLKELVYRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTN
        RF L H  S F++ + S  L   + R GS  QK +F  YM ++++ NP+   +       +W  AHD G RYG M  N
Subjt:  RFCLRHVASNFNKNYNSKPLKELVYRGGSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTN

AT1G64260.1 MuDR family transposase5.4e-6431.17Show/hide
Query:  IPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEE--GSMSLK------MNVD---NIRIGSKFRTKEKLQLAVKKYCIVQHYQFVVMESNQDMWY
        +PV  + PL + ++ S        EIVD+       L +   S +LK      + +D   ++ +G  F+ +++L+ AV  +CI +    +V E+ ++M+ 
Subjt:  IPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKLEE--GSMSLK------MNVD---NIRIGSKFRTKEKLQLAVKKYCIVQHYQFVVMESNQDMWY

Query:  VKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEVIKKDFGYCVNYKRMWQAKRKTLIKV
         +C +W   CKW LRA R + HG+ EI+     H+  +    +  +  +S F  +EI+ +VR  P+  +  L++  K+  GY +   +M   K + + +V
Subjt:  VKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQNIVRRDPSTYVTSLQEVIKKDFGYCVNYKRMWQAKRKTLIKV

Query:  FGDWEKSYEELSYWLNALVHYNADTRVLW--DLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYKGKLLIVTTIDANGHIFPIAFAIV
        FGD ++S+  +   ++A  H +    V W  DL P+  P    F  VFWSF  +IEGF+HCRPLI +D   L  KY+ KL+I + +DA    FP+AFA+ 
Subjt:  FGDWEKSYEELSYWLNALVHYNADTRVLW--DLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGTHLYDKYKGKLLIVTTIDANGHIFPIAFAIV

Query:  EEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELVYRGGSQHQKHKFERYMRELQQLN
        +E +T  W WF   +R  VT++ ++CLIS     I+  V    + W EP AHH+FCL H+ S F   +    L+ LV + GS +QK +F+ YM ++++ N
Subjt:  EEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELVYRGGSQHQKHKFERYMRELQQLN

Query:  PDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGAR--MLSITALVKFTFFKTITYFQKRRDDINEVVDRVCNHVHLPYLPYVEKY
        P+   +   I   KW  AHD G RYG +  +  E +  V +G     +++T  V   F +  + F K    I   ++R   +   P++  +E++
Subjt:  PDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGAR--MLSITALVKFTFFKTITYFQKRRDDINEVVDRVCNHVHLPYLPYVEKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCAGAAGCCGCAGATTAGGGCCGATGACGATTCGAAAGCGAAGGGAGAAAGAAAAGGGAAAAAGATATTTGTTTATCTATCTGCCACCGTCGACCAACTGCCTGC
CGTCACTGTACTTTCCGCCTTGTCGGACCTTAGAACCGTCGTTGCACCGTCGTCGTCATCATGCCATCGATGTACTATCTCCTTTGCAGTCACCAACGAGAGGAGGAGGT
CTCTCTCTCTCCCTCCTTCTAGATCTGGAATGAAGGAGAGAGGGAGACCTCCCTCTGTCGTAGTTGCCGGCAGGAGGAAGAGATATGGAATGAAGGAGAGAGGGAGACCT
CATTTCCTCTTTGCGGTCATCAGCGAGAAGAGGTCTCCCTCCCTTTCCTCTCATTGGTGCGAATGGCACGGAACAACAACAAATGGAGAAAGCGCAATGACAATTGTAGT
TATGTCATTAGATTGTGATGAAATTGTGATACTTGAACTTGGAAATGGTTCCGCTTCTGATATTGAGGTGGAAGTGGATGAATTGTTTAGAACTGAATTGTGTTTGAACA
TTCCAGTAAATTTGAATCCTCCACTCCCAAATGAGGAAGCACCTTCACCGATGTTTATTGAAATAGATTTTGAGATTGTAGACTCTATATGTGAAGAAGGTTCAAAGTTG
GAAGAAGGGTCGATGTCTCTAAAAATGAATGTGGATAACATACGTATAGGGTCAAAATTTAGGACCAAAGAAAAACTACAACTTGCTGTCAAGAAATATTGTATTGTACA
ACATTATCAATTTGTTGTAATGGAATCGAATCAAGATATGTGGTATGTAAAATGCAAACAGTGGAATGATTGTTGCAAGTGGAGACTTCGCGCTCGTCGACATAAATCTC
ATGGGATGTTTGAAATTAGTGTACTTGAGGAGGTTCATTCGTACTTATATGCTGAGCTTACACAGGATCACTCACAATTGGACTCAAATTTTATGTGGAATGAAATTCAG
AATATTGTACGACGTGATCCTAGTACGTATGTAACTTCATTACAAGAAGTCATAAAAAAAGATTTTGGATATTGTGTGAACTACAAACGTATGTGGCAAGCTAAACGAAA
AACACTAATTAAAGTTTTTGGAGATTGGGAAAAATCATATGAAGAGCTCTCATACTGGTTGAATGCTTTAGTACATTACAATGCCGACACTCGAGTACTATGGGACTTGA
TACCTTCGAATAAACCTAGACATCTTATTTTTGGGCGAGTGTTTTGGTCATTTGGTCCTGCAATTGAAGGATTTAAACATTGTAGACCATTAATCCAGATTGATGGTACA
CATCTGTATGACAAGTATAAGGGAAAATTACTTATTGTCACCACAATTGATGCGAATGGACATATATTCCCTATTGCTTTTGCTATTGTGGAAGAGGAAAATACTTCTGG
TTGGTCGTGGTTTTTGTGTGCCTTGCGAAATTTGGTTACCGAACAAGATGAAATTTGTCTAATTTCTGATCGACATGGAGGTATTTTGGGGGCTGTGAAAAATAAGGAAA
ATGGCTGGAATGAACCGAGAGCACATCATCGATTTTGTCTCCGTCATGTAGCTAGCAATTTCAACAAAAACTATAATTCGAAGCCCTTAAAAGAGTTGGTGTATCGTGGT
GGGTCTCAACACCAAAAGCATAAATTTGAAAGATATATGAGGGAGTTACAACAATTGAATCCAGATTGTATTGATTTTTTCACTCATATAAACATTGAAAAATGGACACA
AGCACATGATCAAGGGTTTCGATATGGATGGATGACAACAAATCTAGCAGAGTGTATTAATGGTGTTTTCAAGGGAGCTAGAATGTTGTCAATTACTGCATTGGTGAAGT
TTACGTTTTTTAAGACTATTACTTATTTCCAGAAACGTAGAGATGATATCAATGAAGTAGTGGATCGGGTCTGCAATCATGTTCACTTACCATATTTGCCTTATGTTGAG
AAGTATTACGAATTGTCACGGTACCAACTTTGTTATGCATCTCGATTCTACCCTATAAAACATAAGGACTATTGGTCGGAAGTGAAGTTTGATGTTATTCATCCGAACCC
AAATTTGTTGAGGGAATATGGATGCCAAAAAACTTCTCACATTCACAATGAGATGGATTGGCGGGAGACGGATAGGCTCTACTTAGTTAGATTGACATCTTATTACTGTA
CTGGTGGAGCAATGGAGGCAGGAAACTCATTCGTTTCACATGCCTGGTGGAAAGTGTACTATTACACTACAAGATATTGCTGTGCATGTCATGATCGGTGTGCATTTAGT
GATGCGACAGATATGCCGGCCATGTCAGATGATTATCAACAGTGGTATGATTCGATTACAAGAAAGTACATCACATGTGAATCTGTTTATCACAGTTACATGGATTCTTT
TGTTAGCAACGTACAAGATTTTGCAATACAAGAAAATCTACCTGTATTAAAATCAATGTGTGATACTTGCATTAGCATAACTGAATCAAATTTAGTTTATCCAAGAGTAG
GTTCAAACTTCGAAATGAGGCTTCCTCGTCGTAGGCGCATTGAAGCACAGCCAAATGATGAAGTTCAAGAAAAAACACAAGAAGCTGAACAAGTTCAGAAACAAAATGAA
GATGTTATAGGAGATGAACAGATTATGAATACTCTTATACAATTACATTTTCTAATGATTTCTACGCCAGGAGTTCATTTAAACACTGCATCTACTTCAGCTCATCAACA
TGACCCATCAGCTGTTCAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCAGAAGCCGCAGATTAGGGCCGATGACGATTCGAAAGCGAAGGGAGAAAGAAAAGGGAAAAAGATATTTGTTTATCTATCTGCCACCGTCGACCAACTGCCTGC
CGTCACTGTACTTTCCGCCTTGTCGGACCTTAGAACCGTCGTTGCACCGTCGTCGTCATCATGCCATCGATGTACTATCTCCTTTGCAGTCACCAACGAGAGGAGGAGGT
CTCTCTCTCTCCCTCCTTCTAGATCTGGAATGAAGGAGAGAGGGAGACCTCCCTCTGTCGTAGTTGCCGGCAGGAGGAAGAGATATGGAATGAAGGAGAGAGGGAGACCT
CATTTCCTCTTTGCGGTCATCAGCGAGAAGAGGTCTCCCTCCCTTTCCTCTCATTGGTGCGAATGGCACGGAACAACAACAAATGGAGAAAGCGCAATGACAATTGTAGT
TATGTCATTAGATTGTGATGAAATTGTGATACTTGAACTTGGAAATGGTTCCGCTTCTGATATTGAGGTGGAAGTGGATGAATTGTTTAGAACTGAATTGTGTTTGAACA
TTCCAGTAAATTTGAATCCTCCACTCCCAAATGAGGAAGCACCTTCACCGATGTTTATTGAAATAGATTTTGAGATTGTAGACTCTATATGTGAAGAAGGTTCAAAGTTG
GAAGAAGGGTCGATGTCTCTAAAAATGAATGTGGATAACATACGTATAGGGTCAAAATTTAGGACCAAAGAAAAACTACAACTTGCTGTCAAGAAATATTGTATTGTACA
ACATTATCAATTTGTTGTAATGGAATCGAATCAAGATATGTGGTATGTAAAATGCAAACAGTGGAATGATTGTTGCAAGTGGAGACTTCGCGCTCGTCGACATAAATCTC
ATGGGATGTTTGAAATTAGTGTACTTGAGGAGGTTCATTCGTACTTATATGCTGAGCTTACACAGGATCACTCACAATTGGACTCAAATTTTATGTGGAATGAAATTCAG
AATATTGTACGACGTGATCCTAGTACGTATGTAACTTCATTACAAGAAGTCATAAAAAAAGATTTTGGATATTGTGTGAACTACAAACGTATGTGGCAAGCTAAACGAAA
AACACTAATTAAAGTTTTTGGAGATTGGGAAAAATCATATGAAGAGCTCTCATACTGGTTGAATGCTTTAGTACATTACAATGCCGACACTCGAGTACTATGGGACTTGA
TACCTTCGAATAAACCTAGACATCTTATTTTTGGGCGAGTGTTTTGGTCATTTGGTCCTGCAATTGAAGGATTTAAACATTGTAGACCATTAATCCAGATTGATGGTACA
CATCTGTATGACAAGTATAAGGGAAAATTACTTATTGTCACCACAATTGATGCGAATGGACATATATTCCCTATTGCTTTTGCTATTGTGGAAGAGGAAAATACTTCTGG
TTGGTCGTGGTTTTTGTGTGCCTTGCGAAATTTGGTTACCGAACAAGATGAAATTTGTCTAATTTCTGATCGACATGGAGGTATTTTGGGGGCTGTGAAAAATAAGGAAA
ATGGCTGGAATGAACCGAGAGCACATCATCGATTTTGTCTCCGTCATGTAGCTAGCAATTTCAACAAAAACTATAATTCGAAGCCCTTAAAAGAGTTGGTGTATCGTGGT
GGGTCTCAACACCAAAAGCATAAATTTGAAAGATATATGAGGGAGTTACAACAATTGAATCCAGATTGTATTGATTTTTTCACTCATATAAACATTGAAAAATGGACACA
AGCACATGATCAAGGGTTTCGATATGGATGGATGACAACAAATCTAGCAGAGTGTATTAATGGTGTTTTCAAGGGAGCTAGAATGTTGTCAATTACTGCATTGGTGAAGT
TTACGTTTTTTAAGACTATTACTTATTTCCAGAAACGTAGAGATGATATCAATGAAGTAGTGGATCGGGTCTGCAATCATGTTCACTTACCATATTTGCCTTATGTTGAG
AAGTATTACGAATTGTCACGGTACCAACTTTGTTATGCATCTCGATTCTACCCTATAAAACATAAGGACTATTGGTCGGAAGTGAAGTTTGATGTTATTCATCCGAACCC
AAATTTGTTGAGGGAATATGGATGCCAAAAAACTTCTCACATTCACAATGAGATGGATTGGCGGGAGACGGATAGGCTCTACTTAGTTAGATTGACATCTTATTACTGTA
CTGGTGGAGCAATGGAGGCAGGAAACTCATTCGTTTCACATGCCTGGTGGAAAGTGTACTATTACACTACAAGATATTGCTGTGCATGTCATGATCGGTGTGCATTTAGT
GATGCGACAGATATGCCGGCCATGTCAGATGATTATCAACAGTGGTATGATTCGATTACAAGAAAGTACATCACATGTGAATCTGTTTATCACAGTTACATGGATTCTTT
TGTTAGCAACGTACAAGATTTTGCAATACAAGAAAATCTACCTGTATTAAAATCAATGTGTGATACTTGCATTAGCATAACTGAATCAAATTTAGTTTATCCAAGAGTAG
GTTCAAACTTCGAAATGAGGCTTCCTCGTCGTAGGCGCATTGAAGCACAGCCAAATGATGAAGTTCAAGAAAAAACACAAGAAGCTGAACAAGTTCAGAAACAAAATGAA
GATGTTATAGGAGATGAACAGATTATGAATACTCTTATACAATTACATTTTCTAATGATTTCTACGCCAGGAGTTCATTTAAACACTGCATCTACTTCAGCTCATCAACA
TGACCCATCAGCTGTTCAGTAA
Protein sequenceShow/hide protein sequence
MEQKPQIRADDDSKAKGERKGKKIFVYLSATVDQLPAVTVLSALSDLRTVVAPSSSSCHRCTISFAVTNERRRSLSLPPSRSGMKERGRPPSVVVAGRRKRYGMKERGRP
HFLFAVISEKRSPSLSSHWCEWHGTTTNGESAMTIVVMSLDCDEIVILELGNGSASDIEVEVDELFRTELCLNIPVNLNPPLPNEEAPSPMFIEIDFEIVDSICEEGSKL
EEGSMSLKMNVDNIRIGSKFRTKEKLQLAVKKYCIVQHYQFVVMESNQDMWYVKCKQWNDCCKWRLRARRHKSHGMFEISVLEEVHSYLYAELTQDHSQLDSNFMWNEIQ
NIVRRDPSTYVTSLQEVIKKDFGYCVNYKRMWQAKRKTLIKVFGDWEKSYEELSYWLNALVHYNADTRVLWDLIPSNKPRHLIFGRVFWSFGPAIEGFKHCRPLIQIDGT
HLYDKYKGKLLIVTTIDANGHIFPIAFAIVEEENTSGWSWFLCALRNLVTEQDEICLISDRHGGILGAVKNKENGWNEPRAHHRFCLRHVASNFNKNYNSKPLKELVYRG
GSQHQKHKFERYMRELQQLNPDCIDFFTHINIEKWTQAHDQGFRYGWMTTNLAECINGVFKGARMLSITALVKFTFFKTITYFQKRRDDINEVVDRVCNHVHLPYLPYVE
KYYELSRYQLCYASRFYPIKHKDYWSEVKFDVIHPNPNLLREYGCQKTSHIHNEMDWRETDRLYLVRLTSYYCTGGAMEAGNSFVSHAWWKVYYYTTRYCCACHDRCAFS
DATDMPAMSDDYQQWYDSITRKYITCESVYHSYMDSFVSNVQDFAIQENLPVLKSMCDTCISITESNLVYPRVGSNFEMRLPRRRRIEAQPNDEVQEKTQEAEQVQKQNE
DVIGDEQIMNTLIQLHFLMISTPGVHLNTASTSAHQHDPSAVQ