CuGenDBv2

ClCG09G011200 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG09G011200
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Embryo defective 1381 isoform 1
Genome location: CG_Chr09:11113530..11119574
RNA-Seq Expression: ClCG09G011200
Synteny: ClCG09G011200
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment
KAG6572418.1 hypothetical protein SDJN03_29146, partial [Cucurbita argyrosperma subsp. sororia] | 4.4e-303 | 88.34
Query:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL
        MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETI+DHHP++GQSFPWASWSNCP P +S+CRIKL
Subjt:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL

Query:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA
        ESCLESMAEKGVKLGSITSHQIF TMNK HGLNTALR VL GDNASKSVVSRRASSSALWDQA+FALSARCNAAEVDG+LGL +EGRSLSIEEASYFRE+
Subjt:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA

Query:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW
        IVALRLAK++IKIQ GWRA AIADLNRTR  S SLAHSCTDWPCLLIELLSQAAEIDHFQ      PKLIINN+DVLRNAS+SD+ TSVCGSMYHDSLVW
Subjt:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW

Query:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG
        RIIALGANERCLPVILVTSDS         YYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAE    WKLIAEVLGPNPRHLFELYALKQ 
Subjt:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG

Query:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV
        NYFNKTATDHNFGTIEDIVDAYLAYLQ+            +VNPAMDRALALLQA  VDV+NGLVSKD+LRFGAPWRHPPQSDDP LSLDWAKIQLMDFV
Subjt:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV

Query:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEV
        HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSR RLQY WQR+IRGRSYRHLMLEV
Subjt:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEV

XP_022952852.1 uncharacterized protein LOC111455419 [Cucurbita moschata] | 1.4e-304 | 88.24
Query:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL
        MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETI+DHHP++GQSFPWASWSNCP P +S+CRIKL
Subjt:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL

Query:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA
        ESCLESMAEKGVKLGSITSHQIF TM K HGLNTALR VL GDN SKSVVSRRASSSALWDQA+FALSARCNAAEVDG+LGL +EGRSLSIEEASYFRE+
Subjt:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA

Query:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW
        IVALRLAK++IKIQ GWRA AIADLNRTR  S SLAHSCTDWPCLLIELLSQAAE+DHFQ      PKLIINN+DVLRNASLSDD TSVCGSMYHDSLVW
Subjt:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW

Query:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG
        RIIALGANERCLPVILVTSDS         YYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAE    WKLIAEVLGPNPRHLFELYALKQ 
Subjt:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG

Query:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV
        NYFNKTATDHNFGTIEDIVDAYLAYLQ+            +VNPAMDRALALLQA  VDV+NGLVSKD+LRFGAPWRHPPQSDDP LSLDWAKIQLMDFV
Subjt:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV

Query:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK
        HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSR RLQY WQR+IRGRSYRHLMLEVGYK
Subjt:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK

XP_022969343.1 uncharacterized protein LOC111468381 [Cucurbita maxima] | 1.2e-303 | 87.9
Query:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL
        MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETI+DHHP++GQSFPWASWSNCP P +S+CRIKL
Subjt:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL

Query:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA
        ESCLESMAEKGVKLGSITSHQIF TMNK HGLNTALR VL GDNASKS VSRRASSSALWDQA+FALSARCNAAEVDGVLGL +EGR+LSIEEASYFRE+
Subjt:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA

Query:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW
        I+ALRLAK++IKIQ GWRA AIADLNRTR  S SLAHSCTDWPCLLIELLSQAAEIDHFQ      PKLIINN+DVLRNASLSDD TSVCGSMYHDSLVW
Subjt:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW

Query:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG
        RII+LGANERCLPVILVTSDS         YYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAE    WKLIAEVLGPNPRHLFELYALKQ 
Subjt:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG

Query:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV
        NYFNKTATDHNFGTIEDIVDAYLAYLQ+            +VNPAMDRALALLQ   +DV+NGLVSKDRLRFGAPWRHPPQSDDP LSLDWAKIQLMDFV
Subjt:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV

Query:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK
        HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSR RLQY WQR+IRGRSYRHLML+VGYK
Subjt:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK

XP_023554784.1 uncharacterized protein LOC111811943 [Cucurbita pepo subsp. pepo] | 8.0e-305 | 88.4
Query:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL
        MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETI+DHHP++GQSFPWASWSNCP P +S+CRIKL
Subjt:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL

Query:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA
        ESCLESMAEKGVKLGSITSHQIF TMNK HGLNTALR VL GDNASKSVVSRRASSSALWDQA+FALSARCNAAEVDGVLGL +EGRSLSIEEASYFRE+
Subjt:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA

Query:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW
        IVALRLAK++IKIQ GWRA AIADLNRTR  S SLAHSCTDWPCLLIELLSQAAE+DHFQ      PKLIINN+DVLRNASLSDD TSVCGSMYHDSLVW
Subjt:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW

Query:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG
        RIIALGANERCLPVILVTSDS         YYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAE    WKLIAEVLGPNPRHLFELYALKQ 
Subjt:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG

Query:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV
        NYFNKTATDHNFGTIEDIVDAYLAYLQ+            +VNPAMDRALALLQA  VDV+NGLV KD+LRFGAPWRHPP+SDDP LSLDWAKIQLMDFV
Subjt:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV

Query:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK
        HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSR RLQY WQR+IRGRSYRHLMLEVGYK
Subjt:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK

XP_038887701.1 uncharacterized protein LOC120077765 [Benincasa hispida] | 0.0e+00 | 89.92
Query:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL
        MV KPWRIIP+PLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPP LSNCRIKL
Subjt:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL

Query:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA
        E+CLESMAEKGVKLGSITSHQIF TMNK HGLNTALR VLQGDNASKSV SRR+SS+ALWDQA+FALSARCNAAEVDGVLGL EEGRSLSIEEASYFR+A
Subjt:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA

Query:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW
        IVALRLAK+LIKIQQGWRANAIADL+RTRG SPSLAHSCTDWPCLLIELLSQAAEIDHFQ      PKLIINN+DVLRNASLSDD TSVCGSMYHDSLVW
Subjt:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW

Query:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG
        RIIALGANERCLPVILVTSDS         YYSYRAYMDFGFPDIF+SRETFGWTPQEAKLHMVPDYFSNAE    WKLIAEVLGPNPRHLFELYALKQG
Subjt:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG

Query:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV
        NYFNKTATDHNFGTIEDIVDAYLAYLQ+            +VNPAMDRAL+LLQAH VDVRNGLVSKD+LRFGAPWRHPPQSDDP LSLDWAKIQLMDFV
Subjt:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV

Query:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK
        HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQ QLSSR RLQY WQR+IRGRSYRHLMLEVGYK
Subjt:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KHW2 Uncharacterized protein | 1.4e-299 | 87.06
Query:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL
        MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIE HHPIYGQSFPWASWSNCPPP LSNCRIKL
Subjt:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL

Query:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA
        ESCLESMAEKGVKLGSITSHQIF TMNK HGLNTALR VLQ DNASK VVSRRASSSALWDQA+FALSARCNAAE+DGVL L EEGRS+  EEASYFREA
Subjt:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA

Query:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW
         VAL+LAK+LI+IQQGWRANAIADLNRT G S SLAHSCTDWPCLLIELLSQAAEI+HFQ      PKLIINNVDVLRNASLSD  +SVCGSMYHDSLVW
Subjt:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW

Query:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG
        RIIALGANERCLPVILVTSDS         YYSYRAYMDFGFPDIFISRETFGW+PQEAKLHMVPDYFS+AE    WKLIAEVLGPNPRHLFELYALKQG
Subjt:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG

Query:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV
        NYFNK   DHNFGTIEDIVDAYLAYLQ+            +VNPAMDRALALLQAH V+VRNGLVSKDRLRFGAPWRHPPQS DP LSLDWAKIQLMDFV
Subjt:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV

Query:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK
        HSLVDAEFGVNYLADCSLEIFDDPS VAL EVGLLY QRDPSFMRP+SRGIQRCLVRWLVQQQ QLSS+  LQY WQR+IRGRSYRHLMLEVGYK
Subjt:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK

A0A1S3BM46 uncharacterized protein LOC103491354 | 2.4e-286 | 87.11
Query:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL
        MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETI+DHHPIYGQSFPWASWSNC PP LSNCRIKL
Subjt:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL

Query:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA
        ESCLESMAEKGVKLGSITSHQIF TMNK HGL+TALR VLQ DN SK VVSRRASSSALWDQA+ ALSARCNAAE+DGVLGL EEGRSL IEEASYFREA
Subjt:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA

Query:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW
         VALRLAK+LI+IQQGWRANAIADLNRT G S SLAHSCTDWPCLLIELLSQAAEI+HFQ      PKLIINNVDVLRNA LSDD +SVCGSMYHDSLVW
Subjt:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW

Query:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG
        RIIALGANERCLPVILVTSDS         YYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAE    WKLIAEVLG NPRHLFELYALKQG
Subjt:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG

Query:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV
        N+FNKT  DHNFGTIEDI+DAYLAYLQ+            +VNPAMDRALALLQAHA DVRNGLVSKDRLRFGAPWRHPPQS DP LSL WAKIQLMDFV
Subjt:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV

Query:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQY
        HSLVDAEFGVNYLADCSLEIFDDPS VAL EVGLLY QRDPSFMRPIS GIQRCLVRWLVQQQ QLSS+H LQY
Subjt:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQY

A0A6J1D2X8 uncharacterized protein LOC111016816 | 2.6e-301 | 87.73
Query:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL
        MVHK WRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLL DWNK PHLSGYVDFAETIEDHHP+YGQSFPWASWSNCP P LSNCRIKL
Subjt:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL

Query:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA
        ESCLESMAEKGVKLG ITSHQIFATMNK HGLNTALR VLQGDNASKSVVSRRASSSALWDQA+FALSARCNAAEVDGVLGL +EGRSLSIEEASYFREA
Subjt:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA

Query:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW
         VALRLAK++IKIQQGWRANAIADLNR RG SPSLAHSCTDWPCLLIELLSQAAEIDHFQ      PKLIINN++VLRNAS+SDD +SVCGSMYHDSLVW
Subjt:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW

Query:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG
        R+IALGANERCLPVILVTSDS         YYSYRAYMDFGFPDIFISRETFGWT QEAKLHMVPDYFSNAE    WKLIAEVLGPNPRHLFELYALKQG
Subjt:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG

Query:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV
        N++ +TATDHNFGTIEDIVDAYLAYLQ+            +VNPAMDRALALLQA AVD RNGLVSKDRLRFGAPWRHPP+S+DP LSLDWAKIQLMDFV
Subjt:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV

Query:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK
          LVDAEFGVNYLADCSLEIFDDPSAVAL EVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRL Y  QR+IRGRSYRHLMLEVGYK
Subjt:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK

A0A6J1GLI8 uncharacterized protein LOC111455419 | 6.6e-305 | 88.24
Query:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL
        MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETI+DHHP++GQSFPWASWSNCP P +S+CRIKL
Subjt:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL

Query:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA
        ESCLESMAEKGVKLGSITSHQIF TM K HGLNTALR VL GDN SKSVVSRRASSSALWDQA+FALSARCNAAEVDG+LGL +EGRSLSIEEASYFRE+
Subjt:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA

Query:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW
        IVALRLAK++IKIQ GWRA AIADLNRTR  S SLAHSCTDWPCLLIELLSQAAE+DHFQ      PKLIINN+DVLRNASLSDD TSVCGSMYHDSLVW
Subjt:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW

Query:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG
        RIIALGANERCLPVILVTSDS         YYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAE    WKLIAEVLGPNPRHLFELYALKQ 
Subjt:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG

Query:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV
        NYFNKTATDHNFGTIEDIVDAYLAYLQ+            +VNPAMDRALALLQA  VDV+NGLVSKD+LRFGAPWRHPPQSDDP LSLDWAKIQLMDFV
Subjt:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV

Query:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK
        HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSR RLQY WQR+IRGRSYRHLMLEVGYK
Subjt:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK

A0A6J1I0P8 uncharacterized protein LOC111468381 | 5.6e-304 | 87.9
Query:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL
        MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETI+DHHP++GQSFPWASWSNCP P +S+CRIKL
Subjt:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL

Query:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA
        ESCLESMAEKGVKLGSITSHQIF TMNK HGLNTALR VL GDNASKS VSRRASSSALWDQA+FALSARCNAAEVDGVLGL +EGR+LSIEEASYFRE+
Subjt:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA

Query:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW
        I+ALRLAK++IKIQ GWRA AIADLNRTR  S SLAHSCTDWPCLLIELLSQAAEIDHFQ      PKLIINN+DVLRNASLSDD TSVCGSMYHDSLVW
Subjt:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW

Query:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG
        RII+LGANERCLPVILVTSDS         YYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAE    WKLIAEVLGPNPRHLFELYALKQ 
Subjt:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG

Query:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV
        NYFNKTATDHNFGTIEDIVDAYLAYLQ+            +VNPAMDRALALLQ   +DV+NGLVSKDRLRFGAPWRHPPQSDDP LSLDWAKIQLMDFV
Subjt:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV

Query:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK
        HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSR RLQY WQR+IRGRSYRHLML+VGYK
Subjt:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT2G31340.1 embryo defective 1381 | 2.8e-223 | 64.03
Query:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL
        MV+K W+IIP+PLLETVLNNH Q HRVPQPLILHGPRGVGKTTLIL RLL DWNKGPHL+GYVDFA++I +HHP + QS+PW SW++  PP LSNC+ +L
Subjt:  MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKL

Query:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA
        E+CLESM+ K +KLGSI+S QIF TMNK +GLNTALR +LQG N +   V  + S S LW++A++ALS R NA E+DG+L L E+G SLS+EEASY+RE 
Subjt:  ESCLESMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREA

Query:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW
          ALRLAK++IK+ QGW+ANAIA LNRT G S +LA+SCTDWP L++ELLSQAAEI  FQ      PKL++NN+++L+ A  +DD T V  SMYHD+L+W
Subjt:  IVALRLAKQLIKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVW

Query:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG
        RIIALGANERCLPV+ VTSDS         YYSY+A++D+GFPDIFISRETFGW PQEAKLHMVPDYFS +E    W +IA+VLG N RHLFELYALKQ 
Subjt:  RIIALGANERCLPVILVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQG

Query:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV
        N++ ++      GT EDIVDAYLAYLQ+            +VNPAMD+AL  LQ +A DVR G +  ++LRFGA WRHPPQ++DP  + +WAKIQLMDFV
Subjt:  NYFNKTATDHNFGTIEDIVDAYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFV

Query:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK
         +LV+ EF VNYL D SLEIF+DPSA+AL EVG+LY QRDPSF RPIS+GI+RCLVRWL+Q++ Q+S     +Y+WQR+IRGR Y+HLML  GY+
Subjt:  HSLVDAEFGVNYLADCSLEIFDDPSAVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK


Sequences
CDS sequence
ATGGTTCACAAACCATGGAGGATAATACCAAGGCCCTTGCTGGAAACCGTCCTCAACAATCATTCACAGCACCATCGCGTTCCTCAACCTTTGATCCTTCATGGC
CCCAGAGGCGTCGGCAAAACCACTCTCATTCTCGAACGCCTCCTCGCCGACTGGAACAAAGGCCCCCACTTATCTGGATATGTGGACTTTGCTGAAACTATCGAA
GATCATCACCCAATTTACGGCCAGTCCTTTCCCTGGGCATCTTGGTCGAATTGTCCGCCACCACCACTGTCTAATTGCAGAATTAAACTGGAAAGTTGCCTCGAA
TCCATGGCTGAAAAGGGCGTCAAATTAGGCAGTATAACATCCCATCAAATATTCGCCACCATGAACAAGTCCCACGGCCTGAATACGGCCCTTCGCTGCGTACTT
CAGGGGGATAATGCTTCAAAAAGTGTTGTCTCGCGCAGAGCGTCGAGCTCGGCTCTGTGGGATCAAGCAATTTTTGCACTATCTGCTCGATGTAATGCCGCTGAG
GTTGATGGGGTGTTAGGATTGCGTGAGGAAGGGAGGAGTTTATCGATTGAAGAAGCTTCTTATTTTAGAGAGGCTATTGTGGCGCTGAGACTGGCTAAGCAGTTA
ATCAAAATTCAGCAAGGGTGGCGAGCTAATGCCATTGCTGATTTGAATCGGACGCGTGGTTCCTCACCGTCGTTGGCACATTCGTGTACTGATTGGCCTTGTTTG
TTGATAGAACTGCTGTCACAAGCTGCTGAAATTGATCATTTTCAGGTAAACATATTGAAGACACCAAAATTAATTATTAACAATGTAGATGTGTTACGAAATGCA
AGTTTATCTGATGATGGTACATCAGTATGTGGATCAATGTATCATGACAGCCTAGTGTGGAGAATAATTGCTCTTGGGGCGAATGAAAGGTGCCTTCCTGTTATA
CTTGTAACGTCTGATAGTTATTCTGTGTATGCTAATCACATAATCTACTATTCGTATCGCGCTTACATGGATTTCGGATTTCCAGATATATTTATCTCACGTGAG
ACTTTTGGATGGACTCCTCAAGAGGCAAAACTGCATATGGTTCCTGATTATTTTAGCAATGCAGAGAATGCATGTACATGGAAGCTGATTGCTGAGGTGCTTGGC
CCAAACCCTCGACACTTATTTGAGCTTTATGCTCTGAAACAAGGCAATTACTTTAACAAAACGGCAACGGATCATAATTTTGGCACAATCGAGGATATAGTAGAT
GCCTACTTGGCATACTTGCAAATAATGAATAGATATCTAATACTTTACATGCGTGCCTGCTTGGTAAATCCTGCCATGGATAGAGCACTTGCACTCCTGCAAGCA
CATGCTGTTGATGTGCGGAATGGACTGGTTTCAAAAGATAGATTACGCTTTGGTGCACCTTGGAGGCATCCTCCGCAGAGTGATGATCCATGCTTGAGTTTGGAC
TGGGCAAAGATTCAGCTGATGGATTTTGTGCATTCTCTTGTGGATGCGGAATTTGGGGTTAACTATCTTGCTGACTGTAGCCTTGAGATCTTTGACGATCCTTCT
GCTGTTGCCTTGGCTGAGGTTGGTTTGCTTTATGCTCAACGGGATCCATCCTTCATGCGGCCTATATCTAGAGGGATTCAAAGATGTCTAGTGCGATGGCTTGTC
CAACAGCAATTTCAACTTAGTTCCAGACATCGTCTTCAGTATTTCTGGCAGAGACTTATACGTGGACGTAGCTATCGCCATCTCATGCTAGAAGTGGGATATAAA
TAG
mRNA sequence
TCTTGTACCTCCATCAGTTTTCACTCTGCAACCAGAAAACCAATCATCAAGAAGCCAACGAGCTGAAGGAGGCTTCACCCACCAAAAATGGTTCACAAACCATGG
AGGATAATACCAAGGCCCTTGCTGGAAACCGTCCTCAACAATCATTCACAGCACCATCGCGTTCCTCAACCTTTGATCCTTCATGGCCCCAGAGGCGTCGGCAAA
ACCACTCTCATTCTCGAACGCCTCCTCGCCGACTGGAACAAAGGCCCCCACTTATCTGGATATGTGGACTTTGCTGAAACTATCGAAGATCATCACCCAATTTAC
GGCCAGTCCTTTCCCTGGGCATCTTGGTCGAATTGTCCGCCACCACCACTGTCTAATTGCAGAATTAAACTGGAAAGTTGCCTCGAATCCATGGCTGAAAAGGGC
GTCAAATTAGGCAGTATAACATCCCATCAAATATTCGCCACCATGAACAAGTCCCACGGCCTGAATACGGCCCTTCGCTGCGTACTTCAGGGGGATAATGCTTCA
AAAAGTGTTGTCTCGCGCAGAGCGTCGAGCTCGGCTCTGTGGGATCAAGCAATTTTTGCACTATCTGCTCGATGTAATGCCGCTGAGGTTGATGGGGTGTTAGGA
TTGCGTGAGGAAGGGAGGAGTTTATCGATTGAAGAAGCTTCTTATTTTAGAGAGGCTATTGTGGCGCTGAGACTGGCTAAGCAGTTAATCAAAATTCAGCAAGGG
TGGCGAGCTAATGCCATTGCTGATTTGAATCGGACGCGTGGTTCCTCACCGTCGTTGGCACATTCGTGTACTGATTGGCCTTGTTTGTTGATAGAACTGCTGTCA
CAAGCTGCTGAAATTGATCATTTTCAGGTAAACATATTGAAGACACCAAAATTAATTATTAACAATGTAGATGTGTTACGAAATGCAAGTTTATCTGATGATGGT
ACATCAGTATGTGGATCAATGTATCATGACAGCCTAGTGTGGAGAATAATTGCTCTTGGGGCGAATGAAAGGTGCCTTCCTGTTATACTTGTAACGTCTGATAGT
TATTCTGTGTATGCTAATCACATAATCTACTATTCGTATCGCGCTTACATGGATTTCGGATTTCCAGATATATTTATCTCACGTGAGACTTTTGGATGGACTCCT
CAAGAGGCAAAACTGCATATGGTTCCTGATTATTTTAGCAATGCAGAGAATGCATGTACATGGAAGCTGATTGCTGAGGTGCTTGGCCCAAACCCTCGACACTTA
TTTGAGCTTTATGCTCTGAAACAAGGCAATTACTTTAACAAAACGGCAACGGATCATAATTTTGGCACAATCGAGGATATAGTAGATGCCTACTTGGCATACTTG
CAAATAATGAATAGATATCTAATACTTTACATGCGTGCCTGCTTGGTAAATCCTGCCATGGATAGAGCACTTGCACTCCTGCAAGCACATGCTGTTGATGTGCGG
AATGGACTGGTTTCAAAAGATAGATTACGCTTTGGTGCACCTTGGAGGCATCCTCCGCAGAGTGATGATCCATGCTTGAGTTTGGACTGGGCAAAGATTCAGCTG
ATGGATTTTGTGCATTCTCTTGTGGATGCGGAATTTGGGGTTAACTATCTTGCTGACTGTAGCCTTGAGATCTTTGACGATCCTTCTGCTGTTGCCTTGGCTGAG
GTTGGTTTGCTTTATGCTCAACGGGATCCATCCTTCATGCGGCCTATATCTAGAGGGATTCAAAGATGTCTAGTGCGATGGCTTGTCCAACAGCAATTTCAACTT
AGTTCCAGACATCGTCTTCAGTATTTCTGGCAGAGACTTATACGTGGACGTAGCTATCGCCATCTCATGCTAGAAGTGGGATATAAATAG
Protein sequence
MVHKPWRIIPRPLLETVLNNHSQHHRVPQPLILHGPRGVGKTTLILERLLADWNKGPHLSGYVDFAETIEDHHPIYGQSFPWASWSNCPPPPLSNCRIKLESCLE
SMAEKGVKLGSITSHQIFATMNKSHGLNTALRCVLQGDNASKSVVSRRASSSALWDQAIFALSARCNAAEVDGVLGLREEGRSLSIEEASYFREAIVALRLAKQL
IKIQQGWRANAIADLNRTRGSSPSLAHSCTDWPCLLIELLSQAAEIDHFQVNILKTPKLIINNVDVLRNASLSDDGTSVCGSMYHDSLVWRIIALGANERCLPVI
LVTSDSYSVYANHIIYYSYRAYMDFGFPDIFISRETFGWTPQEAKLHMVPDYFSNAENACTWKLIAEVLGPNPRHLFELYALKQGNYFNKTATDHNFGTIEDIVD
AYLAYLQIMNRYLILYMRACLVNPAMDRALALLQAHAVDVRNGLVSKDRLRFGAPWRHPPQSDDPCLSLDWAKIQLMDFVHSLVDAEFGVNYLADCSLEIFDDPS
AVALAEVGLLYAQRDPSFMRPISRGIQRCLVRWLVQQQFQLSSRHRLQYFWQRLIRGRSYRHLMLEVGYK