; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0033314 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0033314
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionAnkyrin repeat family protein
Genome locationchr11:42699236..42701952
RNA-Seq ExpressionLag0033314
SyntenyLag0033314
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002110 - Ankyrin repeat
IPR020683 - Ankyrin repeat-containing domain
IPR026961 - PGG domain
IPR036770 - Ankyrin repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606412.1 NF-kappa-B inhibitor alpha, partial [Cucurbita argyrosperma subsp. sororia]9.5e-19650.44Show/hide
Query:  NNNNYNVREFLYANTKRANWGAVIQKYAEHPEA-----------------------------QGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGN
        N+ N      LY + KR  W  VI KY E+ E                                 KL   GDT LHLAV+ NQE  VE LV         
Subjt:  NNNNYNVREFLYANTKRANWGAVIQKYAEHPEA-----------------------------QGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGN

Query:  QLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFD
         ++ LEI N R+NNPLHLAA MGSVRMC AIAS H GLVD RN DD+TPLFLAA YG +DAF+CLY FC   P RI+ NCRVK +GDTVLH A+ NE+FD
Subjt:  QLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFD

Query:  LAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLH----------RDQEYWKRSWSERKTAIFLRVRTLV
        LAFQL+++  +++ WVNE G TP+H+LASKP SFKSGS+I+GW+++VYHC FV  L+P+SI+ L           RD++  K++  +      +  +T +
Subjt:  LAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLH----------RDQEYWKRSWSERKTAIFLRVRTLV

Query:  N------------------------------KKKSSDEANKLDKDLEK----GARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRET
        +                              K  + ++ +K + DLE+      + +  P  T+ P NY+T +D F I  SAI+ +LG GST I+KIR+ 
Subjt:  N------------------------------KKKSSDEANKLDKDLEK----GARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRET

Query:  KQKHTWSVQVMEKLIDPQFVTPDRHGDCWPDYEDYRMKLAE-----IPFTLEESEVKFSTSTSTCVD----QSKDFTESAMLLAAKNGVVEIVKELYECY
        K+KHTWSVQVMEKL+  +   PD++ +      DY+    E     +P+   ++EV+FS       +    Q  + +ESAMLLAA+NGV+EIVK  +E +
Subjt:  KQKHTWSVQVMEKLIDPQFVTPDRHGDCWPDYEDYRMKLAE-----IPFTLEESEVKFSTSTSTCVD----QSKDFTESAMLLAAKNGVVEIVKELYECY

Query:  PRTIYETRKSDEKNVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKG
        P  I +TRK D+KNVV LAAE+RQ ++Y+FLL +   +Q LFRAVD  GNSALHLAA    PKLW+ITGAALQ+Q EV+WY +VK S+P+HFFP +N++G
Subjt:  PRTIYETRKSDEKNVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKG

Query:  ETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYF
        +TA+ IF +TH +L  KG  WL  TS+SCS+VATLIATVAFATAATIPGGN++ G A L  EQ F IF  SSLIALCLS+TSVIMFL+I+T+RF+I++ F
Subjt:  ETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYF

Query:  TSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD
           LPWKLL+ L  LYFSIIAMLVSFCSGHYFL+ +RL N+A+LLYTLTF PVA +FGIVQLPLYFDLL+ L+K +P  S EV LCD
Subjt:  TSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD

XP_022931018.1 uncharacterized protein LOC111437341 isoform X1 [Cucurbita moschata]2.0e-19350.32Show/hide
Query:  NNNNYNVREFLYANTKRANWGAVIQKYAEHPEA-----------------------------QGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGN
        N+ N      LY + KR  W  VI KY E+ E                                 KL   GDT LHLAV+ NQE IVE LV+++      
Subjt:  NNNNYNVREFLYANTKRANWGAVIQKYAEHPEA-----------------------------QGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGN

Query:  QLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFD
          + LEI N   NN LHLAA MGSVRMC AIAS H GLVD RN DD+TPLFLAA YG +DAF+CLY+FC DD  RI+ NCRVK +GDTVLH A+ +E+FD
Subjt:  QLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFD

Query:  LAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTA--------------IFLRV
        LAFQL+++  +++ WVNE G TP+H+LA KP SFKSGS+I+GW+++VYHC FV  L+P+S E L ++ E  + +   ++TA               F  +
Subjt:  LAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTA--------------IFLRV

Query:  RTLV--------------------NKKKSSDEANKLDKDLEK----GARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTW
        + ++                     K  + ++ +K + DLE+      + +  P  T+ P NY+T +D F I  SAI+ +LG GST I+KIR+ K+KHTW
Subjt:  RTLV--------------------NKKKSSDEANKLDKDLEK----GARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTW

Query:  SVQVMEKLI---DPQFVTPDRHGDCWPDYEDYRMKLAEIPFTLEESEVKFSTSTSTCVD----QSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETR
        SVQVM KL+   +P   T D +      + +       +P+   ++EV+FS       +    Q  +  ESAMLLAA+NGV+EIVK   E +P  I +TR
Subjt:  SVQVMEKLI---DPQFVTPDRHGDCWPDYEDYRMKLAEIPFTLEESEVKFSTSTSTCVD----QSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETR

Query:  KSDEKNVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFE
        K D+KNVV LAAE+RQ ++Y+ LL +   +Q LFRAVD  GNSALHLAA    PKLW+ITGAALQ+Q EV+WYK+VK S+P+HFFP +N++G+TA+ IF 
Subjt:  KSDEKNVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFE

Query:  DTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKL
        +TH +L  KG  WL  TS+SCS+VATLIATVAFATAATIPGGN++ G A L  EQ F IF  SSLIALCLS+TSVIMFL+I+T+RF+I++ F   LPWKL
Subjt:  DTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKL

Query:  LLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD
        L+ L  LYFSIIAMLVSFCSGHYFL+ +RL N+A+LLYTLTF PVA +FGIVQLPLYFDLL+ L+K +P  S EV LCD
Subjt:  LLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD

XP_022931019.1 uncharacterized protein LOC111437341 isoform X2 [Cucurbita moschata]2.6e-19350.58Show/hide
Query:  NNNNYNVREFLYANTKRANWGAVIQKYAEHPEA-----------------------------QGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGN
        N+ N      LY + KR  W  VI KY E+ E                                 KL   GDT LHLAV+ NQE IVE LV+++      
Subjt:  NNNNYNVREFLYANTKRANWGAVIQKYAEHPEA-----------------------------QGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGN

Query:  QLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFD
          + LEI N   NN LHLAA MGSVRMC AIAS H GLVD RN DD+TPLFLAA YG +DAF+CLY+FC DD  RI+ NCRVK +GDTVLH A+ +E+FD
Subjt:  QLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFD

Query:  LAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTA--------------IFLRV
        LAFQL+++  +++ WVNE G TP+H+LA KP SFKSGS+I+GW+++VYHC FV  L+P+S E L ++ E  + +   ++TA               F  +
Subjt:  LAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTA--------------IFLRV

Query:  RTLV--------------------NKKKSSDEANKLDKDLEKGARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTWSVQV
        + ++                     K  + ++ +K + DLE+   + N  + T+ P NY+T +D F I  SAI+ +LG GST I+KIR+ K+KHTWSVQV
Subjt:  RTLV--------------------NKKKSSDEANKLDKDLEKGARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTWSVQV

Query:  MEKLI---DPQFVTPDRHGDCWPDYEDYRMKLAEIPFTLEESEVKFSTSTSTCVD----QSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDE
        M KL+   +P   T D +      + +       +P+   ++EV+FS       +    Q  +  ESAMLLAA+NGV+EIVK   E +P  I +TRK D+
Subjt:  MEKLI---DPQFVTPDRHGDCWPDYEDYRMKLAEIPFTLEESEVKFSTSTSTCVD----QSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDE

Query:  KNVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHM
        KNVV LAAE+RQ ++Y+ LL +   +Q LFRAVD  GNSALHLAA    PKLW+ITGAALQ+Q EV+WYK+VK S+P+HFFP +N++G+TA+ IF +TH 
Subjt:  KNVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHM

Query:  DLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLAL
        +L  KG  WL  TS+SCS+VATLIATVAFATAATIPGGN++ G A L  EQ F IF  SSLIALCLS+TSVIMFL+I+T+RF+I++ F   LPWKLL+ L
Subjt:  DLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLAL

Query:  SSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD
          LYFSIIAMLVSFCSGHYFL+ +RL N+A+LLYTLTF PVA +FGIVQLPLYFDLL+ L+K +P  S EV LCD
Subjt:  SSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD

XP_022995621.1 uncharacterized protein LOC111491104 isoform X2 [Cucurbita maxima]3.3e-18849.93Show/hide
Query:  VREFLYANTKRANWGAVIQKYAEHPEAQGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGNQLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAH
        +R+FLY NTKR  W  VI+KY E+PEAQ LKLT  GDTALHLAV+ N+EE+V+ LV  I ++     + L+  N+R+  PLHLAA MGS  MC+AIASAH
Subjt:  VREFLYANTKRANWGAVIQKYAEHPEAQGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGNQLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAH

Query:  GGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFDLAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFK
          LVD RN  D+TPL+LAA  G RDAF+CLY FC ++  RIT NCR+ +NGDTVLH A+ N++FDLAFQ++HL  ++M+WV E G TPLH+LASKP +FK
Subjt:  GGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFDLAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFK

Query:  SGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQ----------------------EYWKRSW--------------------SERKTAIFLRVRTLVNKK
        SGS IRGWRN+ Y+C  VD+LKPQ I++L RD                       +++   W                    S++ T +   +     + 
Subjt:  SGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQ----------------------EYWKRSW--------------------SERKTAIFLRVRTLVNKK

Query:  KSSDEAN-KLDKDLEKGARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTWSVQVMEKLIDPQFVTPDRH---GDCWPDYE
          +DE++ +LD +L K   ++   +ITN P NY+T +  FQI+ SAI+  LG GS   +KIR  K+KHTWSVQVMEKL+  ++  PD +   G    D  
Subjt:  KSSDEAN-KLDKDLEKGARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTWSVQVMEKLIDPQFVTPDRH---GDCWPDYE

Query:  DYRMKLAEI--PFTLEESEVKFSTSTSTC---VDQSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQWNIYKFLLDRKYS
              AE+  P++L   EV+ S S  +     ++ K+  E+AMLLAAKNGV+EIVK ++  +P +I + RK D+KNVV LAAEY Q ++Y+FLL  K  
Subjt:  DYRMKLAEI--PFTLEESEVKFSTSTSTC---VDQSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQWNIYKFLLDRKYS

Query:  LQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIA
         + LFRAVDDNGNSALHLAAA S   +W+ITGAALQ+Q E++WYKFV+ES+P++FF  YN +G+ AT IF +THMDL +K  +WL  TS+SCSVV TLI 
Subjt:  LQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIA

Query:  TVAFATAATIPGGNNDH-GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTR
        TVAF + A+IPGG N H G   LK  +AF  F + SLIALCLS+TSV MFL+ILT RF+  ++ T+ LPWKL +  SSL+ SII+MLVSFC+GHYFL+ +
Subjt:  TVAFATAATIPGGNNDH-GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTR

Query:  RLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMPT-SVEVDLCD
         + + A LLYT+  +PVA +F I +LPLY D+++ + KI+P  S  V L D
Subjt:  RLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMPT-SVEVDLCD

XP_023534309.1 uncharacterized protein LOC111795905 [Cucurbita pepo subsp. pepo]3.1e-19452.45Show/hide
Query:  NNNNYNVREFLYANTKRANWGAVIQKYAE-------------------HPEAQGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGNQLQR------
        N+ N      LYA  KR  W  VI+KY E                   +P A+  KL   GDT LHLAV+ NQE  VE LVR I E P N L        
Subjt:  NNNNYNVREFLYANTKRANWGAVIQKYAE-------------------HPEAQGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGNQLQR------

Query:  -------------LEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLH
                     LE  N R NNPLHLAA MGSVRMC AI S H GLVD RN DD+TPLFLAA YG +DAF+CLY+FC DD  RI+ NCRVK +GDTVLH
Subjt:  -------------LEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLH

Query:  CAIGNEYFDLAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKP---QSIETLHRDQEYWKRSWSERKTAIFLR-VR
         A+ NE+ DLAFQL+++  +++ WVNE G TP+HILASKP SFKSGS+I+GW+++VYHC FV  LK     S++   ++        + R    F R +R
Subjt:  CAIGNEYFDLAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKP---QSIETLHRDQEYWKRSWSERKTAIFLR-VR

Query:  TLV---------NK--------KKSSDEANKLDK---DLEKGARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTWSVQVM
         ++         NK        KK +D  N  DK   DLE+   + N  + TN P NY+T +D F I  SAI+ +LG GST I+KIR+ K+KHTWSVQVM
Subjt:  TLV---------NK--------KKSSDEANKLDK---DLEKGARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTWSVQVM

Query:  EKLIDPQFVTPDRHGDCWPDYEDYRMKLAE-----IPFTLEESEVKFSTSTSTCVD----QSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSD
        EKL+  +   PD++ +      D + +  E     +P+   ++EV+FS       +    Q  +  ESAMLLAAKNGV+EIVK   E +P  I +T K D
Subjt:  EKLIDPQFVTPDRHGDCWPDYEDYRMKLAE-----IPFTLEESEVKFSTSTSTCVD----QSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSD

Query:  EKNVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTH
        +KNVV LAAE+RQ ++Y+F L R+  +  LFRAVD  GNSALHLAA    PKLW+ITGAALQ+Q EV+WY +VK S+P+HFFP +N++G+TA+ IF +TH
Subjt:  EKNVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTH

Query:  MDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLA
         +L  KG  WL  TS+SCS+VATLIATVAFATAATIPGGN++ G A L  EQ F IF  SSLIALCLS+TSVIMFL+I+T+RF+I++ F   LPWKLL+ 
Subjt:  MDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLA

Query:  LSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD
        L  LYFSIIAMLVSFCSGHYFL+ +RL N+A+LLYTLTF PVA +FGIVQLPLYFDLL+ L+K +P  S EV LCD
Subjt:  LSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD

TrEMBL top hitse value%identityAlignment
A0A6J1ET50 uncharacterized protein LOC111437341 isoform X21.3e-19350.58Show/hide
Query:  NNNNYNVREFLYANTKRANWGAVIQKYAEHPEA-----------------------------QGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGN
        N+ N      LY + KR  W  VI KY E+ E                                 KL   GDT LHLAV+ NQE IVE LV+++      
Subjt:  NNNNYNVREFLYANTKRANWGAVIQKYAEHPEA-----------------------------QGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGN

Query:  QLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFD
          + LEI N   NN LHLAA MGSVRMC AIAS H GLVD RN DD+TPLFLAA YG +DAF+CLY+FC DD  RI+ NCRVK +GDTVLH A+ +E+FD
Subjt:  QLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFD

Query:  LAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTA--------------IFLRV
        LAFQL+++  +++ WVNE G TP+H+LA KP SFKSGS+I+GW+++VYHC FV  L+P+S E L ++ E  + +   ++TA               F  +
Subjt:  LAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTA--------------IFLRV

Query:  RTLV--------------------NKKKSSDEANKLDKDLEKGARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTWSVQV
        + ++                     K  + ++ +K + DLE+   + N  + T+ P NY+T +D F I  SAI+ +LG GST I+KIR+ K+KHTWSVQV
Subjt:  RTLV--------------------NKKKSSDEANKLDKDLEKGARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTWSVQV

Query:  MEKLI---DPQFVTPDRHGDCWPDYEDYRMKLAEIPFTLEESEVKFSTSTSTCVD----QSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDE
        M KL+   +P   T D +      + +       +P+   ++EV+FS       +    Q  +  ESAMLLAA+NGV+EIVK   E +P  I +TRK D+
Subjt:  MEKLI---DPQFVTPDRHGDCWPDYEDYRMKLAEIPFTLEESEVKFSTSTSTCVD----QSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDE

Query:  KNVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHM
        KNVV LAAE+RQ ++Y+ LL +   +Q LFRAVD  GNSALHLAA    PKLW+ITGAALQ+Q EV+WYK+VK S+P+HFFP +N++G+TA+ IF +TH 
Subjt:  KNVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHM

Query:  DLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLAL
        +L  KG  WL  TS+SCS+VATLIATVAFATAATIPGGN++ G A L  EQ F IF  SSLIALCLS+TSVIMFL+I+T+RF+I++ F   LPWKLL+ L
Subjt:  DLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLAL

Query:  SSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD
          LYFSIIAMLVSFCSGHYFL+ +RL N+A+LLYTLTF PVA +FGIVQLPLYFDLL+ L+K +P  S EV LCD
Subjt:  SSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD

A0A6J1EX64 uncharacterized protein LOC111437341 isoform X19.6e-19450.32Show/hide
Query:  NNNNYNVREFLYANTKRANWGAVIQKYAEHPEA-----------------------------QGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGN
        N+ N      LY + KR  W  VI KY E+ E                                 KL   GDT LHLAV+ NQE IVE LV+++      
Subjt:  NNNNYNVREFLYANTKRANWGAVIQKYAEHPEA-----------------------------QGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGN

Query:  QLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFD
          + LEI N   NN LHLAA MGSVRMC AIAS H GLVD RN DD+TPLFLAA YG +DAF+CLY+FC DD  RI+ NCRVK +GDTVLH A+ +E+FD
Subjt:  QLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFD

Query:  LAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTA--------------IFLRV
        LAFQL+++  +++ WVNE G TP+H+LA KP SFKSGS+I+GW+++VYHC FV  L+P+S E L ++ E  + +   ++TA               F  +
Subjt:  LAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTA--------------IFLRV

Query:  RTLV--------------------NKKKSSDEANKLDKDLEK----GARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTW
        + ++                     K  + ++ +K + DLE+      + +  P  T+ P NY+T +D F I  SAI+ +LG GST I+KIR+ K+KHTW
Subjt:  RTLV--------------------NKKKSSDEANKLDKDLEK----GARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTW

Query:  SVQVMEKLI---DPQFVTPDRHGDCWPDYEDYRMKLAEIPFTLEESEVKFSTSTSTCVD----QSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETR
        SVQVM KL+   +P   T D +      + +       +P+   ++EV+FS       +    Q  +  ESAMLLAA+NGV+EIVK   E +P  I +TR
Subjt:  SVQVMEKLI---DPQFVTPDRHGDCWPDYEDYRMKLAEIPFTLEESEVKFSTSTSTCVD----QSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETR

Query:  KSDEKNVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFE
        K D+KNVV LAAE+RQ ++Y+ LL +   +Q LFRAVD  GNSALHLAA    PKLW+ITGAALQ+Q EV+WYK+VK S+P+HFFP +N++G+TA+ IF 
Subjt:  KSDEKNVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFE

Query:  DTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKL
        +TH +L  KG  WL  TS+SCS+VATLIATVAFATAATIPGGN++ G A L  EQ F IF  SSLIALCLS+TSVIMFL+I+T+RF+I++ F   LPWKL
Subjt:  DTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKL

Query:  LLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD
        L+ L  LYFSIIAMLVSFCSGHYFL+ +RL N+A+LLYTLTF PVA +FGIVQLPLYFDLL+ L+K +P  S EV LCD
Subjt:  LLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD

A0A6J1JZF8 uncharacterized protein LOC111491104 isoform X32.3e-18749.67Show/hide
Query:  VREFLYANTKRANWGAVIQKYAEHPEAQGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGNQLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAH
        +R+FLY NTKR  W  VI+KY E+PEAQ LKLT  GDTALHLAV+ N+EE+V+ LV  I ++     + L+  N+R+  PLHLAA MGS  MC+AIASAH
Subjt:  VREFLYANTKRANWGAVIQKYAEHPEAQGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGNQLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAH

Query:  GGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFDLAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFK
          LVD RN  D+TPL+LAA  G RDAF+CLY FC ++  RIT NCR+ +NGDTVLH A+ N++FDLAFQ++HL  ++M+WV E G TPLH+LASKP +FK
Subjt:  GGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFDLAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFK

Query:  SGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQ----------------------EYWKRSW--------------------SERKTAIFLRVRTLVNKK
        SGS IRGWRN+ Y+C  VD+LKPQ I++L RD                       +++   W                    S++ T +   +     + 
Subjt:  SGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQ----------------------EYWKRSW--------------------SERKTAIFLRVRTLVNKK

Query:  KSSDEAN-KLDKDLEKGARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTWSVQVMEKLIDPQFVTPDRH---GDCWPDYE
          +DE++ +LD +L K   ++   +ITN P NY+T +  FQI+ SAI+  LG GS   +KIR  K+KHTWSVQVMEKL+  ++  PD +   G    D  
Subjt:  KSSDEAN-KLDKDLEKGARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTWSVQVMEKLIDPQFVTPDRH---GDCWPDYE

Query:  DYRMKLAEI--PFTLEESEVKFSTSTSTCVDQSK-----DFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQWNIYKFLLDRK
              AE+  P++L   EV+ S S  +   +++        E+AMLLAAKNGV+EIVK ++  +P +I + RK D+KNVV LAAEY Q ++Y+FLL  K
Subjt:  DYRMKLAEI--PFTLEESEVKFSTSTSTCVDQSK-----DFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQWNIYKFLLDRK

Query:  YSLQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATL
           + LFRAVDDNGNSALHLAAA S   +W+ITGAALQ+Q E++WYKFV+ES+P++FF  YN +G+ AT IF +THMDL +K  +WL  TS+SCSVV TL
Subjt:  YSLQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATL

Query:  IATVAFATAATIPGGNNDH-GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLL
        I TVAF + A+IPGG N H G   LK  +AF  F + SLIALCLS+TSV MFL+ILT RF+  ++ T+ LPWKL +  SSL+ SII+MLVSFC+GHYFL+
Subjt:  IATVAFATAATIPGGNNDH-GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLL

Query:  TRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMPT-SVEVDLCD
         + + + A LLYT+  +PVA +F I +LPLY D+++ + KI+P  S  V L D
Subjt:  TRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMPT-SVEVDLCD

A0A6J1K2F1 uncharacterized protein LOC111491104 isoform X21.6e-18849.93Show/hide
Query:  VREFLYANTKRANWGAVIQKYAEHPEAQGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGNQLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAH
        +R+FLY NTKR  W  VI+KY E+PEAQ LKLT  GDTALHLAV+ N+EE+V+ LV  I ++     + L+  N+R+  PLHLAA MGS  MC+AIASAH
Subjt:  VREFLYANTKRANWGAVIQKYAEHPEAQGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGNQLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAH

Query:  GGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFDLAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFK
          LVD RN  D+TPL+LAA  G RDAF+CLY FC ++  RIT NCR+ +NGDTVLH A+ N++FDLAFQ++HL  ++M+WV E G TPLH+LASKP +FK
Subjt:  GGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFDLAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFK

Query:  SGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQ----------------------EYWKRSW--------------------SERKTAIFLRVRTLVNKK
        SGS IRGWRN+ Y+C  VD+LKPQ I++L RD                       +++   W                    S++ T +   +     + 
Subjt:  SGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQ----------------------EYWKRSW--------------------SERKTAIFLRVRTLVNKK

Query:  KSSDEAN-KLDKDLEKGARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTWSVQVMEKLIDPQFVTPDRH---GDCWPDYE
          +DE++ +LD +L K   ++   +ITN P NY+T +  FQI+ SAI+  LG GS   +KIR  K+KHTWSVQVMEKL+  ++  PD +   G    D  
Subjt:  KSSDEAN-KLDKDLEKGARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTWSVQVMEKLIDPQFVTPDRH---GDCWPDYE

Query:  DYRMKLAEI--PFTLEESEVKFSTSTSTC---VDQSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQWNIYKFLLDRKYS
              AE+  P++L   EV+ S S  +     ++ K+  E+AMLLAAKNGV+EIVK ++  +P +I + RK D+KNVV LAAEY Q ++Y+FLL  K  
Subjt:  DYRMKLAEI--PFTLEESEVKFSTSTSTC---VDQSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQWNIYKFLLDRKYS

Query:  LQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIA
         + LFRAVDDNGNSALHLAAA S   +W+ITGAALQ+Q E++WYKFV+ES+P++FF  YN +G+ AT IF +THMDL +K  +WL  TS+SCSVV TLI 
Subjt:  LQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIA

Query:  TVAFATAATIPGGNNDH-GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTR
        TVAF + A+IPGG N H G   LK  +AF  F + SLIALCLS+TSV MFL+ILT RF+  ++ T+ LPWKL +  SSL+ SII+MLVSFC+GHYFL+ +
Subjt:  TVAFATAATIPGGNNDH-GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTR

Query:  RLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMPT-SVEVDLCD
         + + A LLYT+  +PVA +F I +LPLY D+++ + KI+P  S  V L D
Subjt:  RLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMPT-SVEVDLCD

A0A6J1K4G3 uncharacterized protein LOC111491104 isoform X12.3e-18749.67Show/hide
Query:  VREFLYANTKRANWGAVIQKYAEHPEAQGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGNQLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAH
        +R+FLY NTKR  W  VI+KY E+PEAQ LKLT  GDTALHLAV+ N+EE+V+ LV  I ++     + L+  N+R+  PLHLAA MGS  MC+AIASAH
Subjt:  VREFLYANTKRANWGAVIQKYAEHPEAQGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGNQLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAH

Query:  GGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFDLAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFK
          LVD RN  D+TPL+LAA  G RDAF+CLY FC ++  RIT NCR+ +NGDTVLH A+ N++FDLAFQ++HL  ++M+WV E G TPLH+LASKP +FK
Subjt:  GGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFDLAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFK

Query:  SGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQ----------------------EYWKRSW--------------------SERKTAIFLRVRTLVNKK
        SGS IRGWRN+ Y+C  VD+LKPQ I++L RD                       +++   W                    S++ T +   +     + 
Subjt:  SGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQ----------------------EYWKRSW--------------------SERKTAIFLRVRTLVNKK

Query:  KSSDEAN-KLDKDLEKGARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTWSVQVMEKLIDPQFVTPDRH---GDCWPDYE
          +DE++ +LD +L K   ++   +ITN P NY+T +  FQI+ SAI+  LG GS   +KIR  K+KHTWSVQVMEKL+  ++  PD +   G    D  
Subjt:  KSSDEAN-KLDKDLEKGARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTWSVQVMEKLIDPQFVTPDRH---GDCWPDYE

Query:  DYRMKLAEI--PFTLEESEVKFSTSTSTCVDQSK-----DFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQWNIYKFLLDRK
              AE+  P++L   EV+ S S  +   +++        E+AMLLAAKNGV+EIVK ++  +P +I + RK D+KNVV LAAEY Q ++Y+FLL  K
Subjt:  DYRMKLAEI--PFTLEESEVKFSTSTSTCVDQSK-----DFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQWNIYKFLLDRK

Query:  YSLQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATL
           + LFRAVDDNGNSALHLAAA S   +W+ITGAALQ+Q E++WYKFV+ES+P++FF  YN +G+ AT IF +THMDL +K  +WL  TS+SCSVV TL
Subjt:  YSLQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATL

Query:  IATVAFATAATIPGGNNDH-GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLL
        I TVAF + A+IPGG N H G   LK  +AF  F + SLIALCLS+TSV MFL+ILT RF+  ++ T+ LPWKL +  SSL+ SII+MLVSFC+GHYFL+
Subjt:  IATVAFATAATIPGGNNDH-GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLL

Query:  TRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMPT-SVEVDLCD
         + + + A LLYT+  +PVA +F I +LPLY D+++ + KI+P  S  V L D
Subjt:  TRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMPT-SVEVDLCD

SwissProt top hitse value%identityAlignment
A2CIR5 Ankyrin repeat-containing protein NPR44.7e-1225.3Show/hide
Query:  VDQSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDEK--NVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARSSPKLWQI
        V+ +KD  ++++  AA+ G VEIVK L E  P+     R++D+K    +H+A +    ++ + L+D   ++  L    D NGN+ALH+A  +   K  +I
Subjt:  VDQSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDEK--NVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARSSPKLWQI

Query:  TGAALQ--------IQLEVEWYKFVKESLPI-----HFFPLYNNKG---------------ETATRIFEDTHMDLEQKGK-----------------DWL
            L+        +  + +    + E+LP+         + +  G               +T T I +D H  LEQ  K                 + +
Subjt:  TGAALQ--------IQLEVEWYKFVKESLPI-----HFFPLYNNKG---------------ETATRIFEDTHMDLEQKGK-----------------DWL

Query:  CVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAM
           + S +VVA L ATVAFA   T+PGGN ++G+A + Q  +F IF I + IAL  S   V++ ++++    +       ++          ++ + +  
Subjt:  CVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAM

Query:  LVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFG
         +SF +  Y +L R  Q  ALL+  +  + +A + G
Subjt:  LVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFG

O44997 Death-associated protein kinase dapk-17.7e-0733.13Show/hide
Query:  GDTALHLAVVANQEEIVETLVRLICENPGNQLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCG
        GDT LHLA  +    IV  LV   C  P      L ++N R+  PLH+AAA G V    A+  A+   +D    D +T L +A   G  D    L     
Subjt:  GDTALHLAVVANQEEIVETLVRLICENPGNQLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCG

Query:  DDPVRITINCRVK---NNGDTVLHCAIGNEYFDLAFQLMH--LIVDSMYWVNEHGRTPLHILA
             IT  C +    ++GDT LH A  +        L H  + VDS   VN + +T LH+ A
Subjt:  DDPVRITINCRVK---NNGDTVLHCAIGNEYFDLAFQLMH--LIVDSMYWVNEHGRTPLHILA

P25963 NF-kappa-B inhibitor alpha2.7e-0729.81Show/hide
Query:  KLTPEGDTALHLAVVANQEEIVETLVRLICENPGNQLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCL
        +LT +GD+ LHLA++  ++ +   ++R +       L  L  +NN Q  PLHLA       +  A+  A G   + R+    TPL LA   G   +   L
Subjt:  KLTPEGDTALHLAVVANQEEIVETLVRLICENPGNQLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCL

Query:  YSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFDLAFQLMHLIVDSMYWVNEHGRTPLHI
           C   P   +I      NG T LH A  + Y  +   L+ L  D       +GRT LH+
Subjt:  YSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFDLAFQLMHLIVDSMYWVNEHGRTPLHI

Q91974 NF-kappa-B inhibitor alpha1.2e-0728.57Show/hide
Query:  PEAQGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGNQLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKR
        P A   +LT +GDT LHLA++  ++ +   ++R            L  +NN    PLHLA       +   +  A G  +D R+    TPL +A   G  
Subjt:  PEAQGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGNQLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIASAHGGLVDHRNIDDQTPLFLAALYGKR

Query:  DAFYCLYSFCGDDPVRITINCRVKN-NGDTVLHCAIGNEYFDLAFQLMHLIVDSMYWVNEHGRTPLHI
         +   L   C   P  +    +  N NG T LH A    Y  +   L+ L  D       +GRT LH+
Subjt:  DAFYCLYSFCGDDPVRITINCRVKN-NGDTVLHCAIGNEYFDLAFQLMHLIVDSMYWVNEHGRTPLHI

Q9C7A2 Ankyrin repeat-containing protein ITN11.6e-0725.59Show/hide
Query:  SKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDEK--NVVHLAAEYRQWNIYKFLLDRKYS--LQPLFRAVDDNGNSALHLAAARSSPKLWQI-
        S+   ++A+ LAA+ G VE++K L    P+     R+ D+K    +H+A + +   + K LLD   +  +QP     D + N+ALH+A  +   ++ ++ 
Subjt:  SKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDEK--NVVHLAAEYRQWNIYKFLLDRKYS--LQPLFRAVDDNGNSALHLAAARSSPKLWQI-

Query:  -----TGA---------ALQIQLEV---EWYKFVKESL----PIHFFPLYNNKGE---TATRIFEDTHMDLEQK-----------------GKDWLCVTS
             T A         AL I   +   E   ++KE L     +    L   + E   T T+I  D H+ LEQ                   ++ +   +
Subjt:  -----TGA---------ALQIQLEV---EWYKFVKESL----PIHFFPLYNNKGE---TATRIFEDTHMDLEQK-----------------GKDWLCVTS

Query:  QSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSF
         S +VVA L ATVAFA   T+PGG+N+ G A +    +F IF I + +AL  S   V++ ++++    +  +    ++    L+ L+S+  S     V+F
Subjt:  QSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSF

Query:  CSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYF
         +  Y ++ R+ +  A L   +T +    M G++    Y+
Subjt:  CSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYF

Arabidopsis top hitse value%identityAlignment
AT3G18670.1 Ankyrin repeat family protein1.6e-3625.14Show/hide
Query:  NNNYNVREFLYANTKRANWGAVIQKYAEHPEAQGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGNQLQRLEIKNNRQNNPLHLAAAMGSVRMCHA
        N+ Y V   L+ N       A       +PEA    LT  GDT +H AV++   +IVE ++R I +      Q L+IKN+     L  AA  G VR+   
Subjt:  NNNYNVREFLYANTKRANWGAVIQKYAEHPEAQGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGNQLQRLEIKNNRQNNPLHLAAAMGSVRMCHA

Query:  IASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCLYS---FCGDDPVRITINCRVKNNGDTVLHCAIGNEYFDLAFQLMHLIVDSMYWVNEHGRTPLHIL
        + +   GLV  RN  +  P+ +A+LYG +     LYS       DP   +   + KN    V +C                IVD +Y             
Subjt:  IASAHGGLVDHRNIDDQTPLFLAALYGKRDAFYCLYS---FCGDDPVRITINCRVKNNGDTVLHCAIGNEYFDLAFQLMHLIVDSMYWVNEHGRTPLHIL

Query:  ASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAIFLRVRTLVNKKKSSDEANKLDKDLEKGARISNPPTITNSPENYST
                              CI +D ++        RD        S+  TAI    +T                          P    + P     
Subjt:  ASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAIFLRVRTLVNKKKSSDEANKLDKDLEKGARISNPPTITNSPENYST

Query:  FVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTWSVQVMEKLIDPQFVTPDRHGDCWPDYEDYRMKLAEIPFTLEESEVKFSTSTSTCVDQSKDFTESA
                              IR++ + K  H  + ++++ +                          EIP        KF  +      Q       A
Subjt:  FVDLFQIILSAIITVLGHGSTTIRKIRETKQKHTWSVQVMEKLIDPQFVTPDRHGDCWPDYEDYRMKLAEIPFTLEESEVKFSTSTSTCVDQSKDFTESA

Query:  MLLAAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARS-SPKLWQITGAALQIQLEVE
        +  A +NG+VE ++E+   YP  ++ ++ S   N+   A   RQ  I+  + +       L    D   N+ LH AA R+ + +L  I GAALQ+Q E++
Subjt:  MLLAAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARS-SPKLWQITGAALQIQLEVE

Query:  WYKFVKESL-PIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCL
        W+K V++ + P H   +   + +T   +F D H DL ++G+ W+  T+ SC+VVA LI T+ F++A T+PGG    G+     +  F IF+IS  I+L  
Subjt:  WYKFVKESL-PIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCL

Query:  STTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLR
        S  S++MFL IL +R+   E F   LP KL++ L +L+ S+  M+V+F      L+  ++   +     L  +P+  MF ++Q P+  ++ R
Subjt:  STTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLR

AT3G54070.1 Ankyrin repeat family protein3.9e-3033.67Show/hide
Query:  AAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDD--NGNSALHLAAARSSPKLWQI-TGAALQIQLEVEW
        AA+ G VEI+  L   +   ++    ++ + + H+AA YR  NI+  + +    ++ L  +  +  + ++ LHL A        Q+ +GAAL +Q E+ W
Subjt:  AAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQWNIYKFLLDRKYSLQPLFRAVDD--NGNSALHLAAARSSPKLWQI-TGAALQIQLEVEW

Query:  YKFVKESLPIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDH-------GLATLKQEQAFSIFVISSL
        +K VKE +P  +    N KGE A  IF + H +L ++G+ W+  T+ +C + ATLIATV FA A TIPGGN+D        G    ++   F IF +S  
Subjt:  YKFVKESLPIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDH-------GLATLKQEQAFSIFVISSL

Query:  IALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLR
        +AL  S  S+++FLSI T+R+   E F   LP KL+  LS+L+ SII+M+++F      +   +     +L+  L  L  A  F  +   L+F+ LR
Subjt:  IALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLR

AT5G04700.1 Ankyrin repeat family protein2.4e-3533.63Show/hide
Query:  EDYRMKLAEIPFTLEESEVKFSTSTSTC---VDQSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQWNIYKFLL---DRK
        E YR+K+      L+  ++    S  T    + +  +  + A+L A + G V+ + E+       ++ TR S    +  LA E+RQ  ++  L    DRK
Subjt:  EDYRMKLAEIPFTLEESEVKFSTSTSTC---VDQSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQWNIYKFLL---DRK

Query:  YSLQPLFRAVDDNGNSALHLAAARSSP-KLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVAT
        Y    L    D +GN  LHLA   S P KL  + GA LQ+Q E++W+K V+   P       N + +T   IF   H  L Q+ + W+  T+ SCS+VA 
Subjt:  YSLQPLFRAVDDNGNSALHLAAARSSP-KLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVAT

Query:  LIATVAFATAATIPGGNNDH--GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYF
        LI TV FA   T+PGG +D+  G     +++ F IF++S LI+   S TSV++FL ILTAR+   + F   LP K++  LS L+ SI AML++F S  + 
Subjt:  LIATVAFATAATIPGGNNDH--GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYF

Query:  LLTRRLQNQALLLYTLTF--LPVAFMFGIVQLPLYFDLL
        ++ +  + + ++  T+ F  LP A +F ++Q PL  +++
Subjt:  LLTRRLQNQALLLYTLTF--LPVAFMFGIVQLPLYFDLL

AT5G04730.1 Ankyrin-repeat containing protein1.6e-3132.46Show/hide
Query:  KDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQ---WNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARSSP-KLWQITG
        K+    A+L AAK+G  +   E+ +C  + ++    +  +N+  LA E+++   +N+   L DRK +   L R+ D   N+ LH+A   S+P +L +I+G
Subjt:  KDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQ---WNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARSSP-KLWQITG

Query:  AALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGG-NNDHGLATLKQEQAFSIF
        AAL++Q E +W+K V+  +        N   +T  +IFE  H  L ++G++W+  T+ +CS VA LIATV F    T+PGG +   G   +  +  F  F
Subjt:  AALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGG-NNDHGLATLKQEQAFSIF

Query:  VISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDL
        + +  +A   S  SV++FLSILT+R+   ++  S LP K++L  S L+ SI +MLV+F +     L+  ++++  L+Y L   P+A    ++ L L + L
Subjt:  VISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMFGIVQLPLYFDL

Query:  LRVLL
        L+ ++
Subjt:  LRVLL

AT5G35810.1 Ankyrin repeat family protein6.9e-3535.14Show/hide
Query:  VDQSKDFTESAMLL---AAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQWNIYK--FLLDRKYSLQPLFRAVDDNGNSALHLAAARSSP-K
        V++   F  S+ +L   AA++G +E++  L   YP  I+ T     +++ H+AA  R   I+   + L     L  +++  + N N  LHL A    P +
Subjt:  VDQSKDFTESAMLL---AAKNGVVEIVKELYECYPRTIYETRKSDEKNVVHLAAEYRQWNIYK--FLLDRKYSLQPLFRAVDDNGNSALHLAAARSSP-K

Query:  LWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDH------GLA
        L  ++GAALQ+Q E+ WYK VKE +P  +    N K E A  +F   H +L ++G+ W+  T+ +C +V+TLIATV FA A T+PGGN+        G  
Subjt:  LWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDH------GLA

Query:  TLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMF
        T ++E  F +F+IS  +AL  S TS+++FLSILT+R+     F + LP KL+L L +L+ SII+M+++F +    L+  +    +L+L        A  F
Subjt:  TLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALLLYTLTFLPVAFMF

Query:  GIVQLPLYFDLLR
         ++   L+FD LR
Subjt:  GIVQLPLYFDLLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCAACGTCGTCCAACAATAACAACTACAATGTACGAGAGTTTCTGTATGCAAACACCAAGAGAGCAAACTGGGGTGCAGTGATTCAAAAGTACGCCGAACACCC
AGAAGCTCAGGGGTTGAAGCTGACTCCAGAGGGCGACACGGCGTTGCATTTGGCTGTGGTTGCCAACCAAGAAGAAATAGTGGAAACGCTTGTGAGACTCATTTGCGAAA
ACCCAGGAAACCAGCTGCAACGTCTTGAGATTAAGAACAACAGACAAAACAACCCTCTCCACCTTGCCGCAGCGATGGGAAGCGTGCGAATGTGCCATGCCATTGCTTCA
GCCCATGGGGGCTTAGTGGACCACAGAAACATAGACGACCAAACGCCACTCTTCCTGGCGGCGTTGTACGGCAAAAGGGACGCCTTTTATTGCCTTTACTCCTTCTGCGG
AGACGACCCAGTTCGAATCACCATCAACTGCAGAGTCAAGAACAATGGCGACACCGTGCTACATTGTGCCATCGGAAACGAGTACTTTGATTTGGCGTTTCAATTAATGC
ACTTGATCGTTGACAGTATGTATTGGGTCAATGAGCACGGCAGAACCCCTCTCCATATTCTAGCTAGCAAGCCAGCTTCCTTCAAAAGTGGATCCAACATCAGGGGATGG
CGGAACCTCGTCTATCACTGCATATTTGTGGATAAACTGAAGCCTCAATCCATTGAAACCCTTCACCGTGACCAAGAGTACTGGAAGAGAAGTTGGTCCGAAAGAAAGAC
AGCTATATTTTTACGAGTCCGCACTTTGGTCAACAAGAAGAAAAGTAGTGATGAAGCCAACAAGCTGGATAAAGATTTGGAGAAAGGTGCGAGAATATCAAATCCACCGA
CGATTACAAATTCCCCGGAAAACTATTCTACCTTCGTCGACCTTTTTCAAATAATTTTATCGGCCATCATAACCGTTCTTGGGCACGGATCTACGACGATCAGAAAGATT
CGAGAGACGAAACAAAAACACACTTGGTCAGTTCAAGTGATGGAGAAACTTATTGATCCACAATTTGTTACACCGGACAGACATGGCGACTGTTGGCCCGATTATGAAGA
TTATAGGATGAAATTAGCAGAAATTCCATTCACGTTGGAAGAATCCGAAGTGAAGTTCAGTACAAGTACAAGTACTTGTGTGGATCAGTCGAAGGATTTCACAGAGTCGG
CGATGTTATTGGCAGCGAAGAACGGCGTGGTTGAGATAGTGAAGGAACTTTACGAATGTTATCCAAGGACAATCTACGAGACTAGGAAATCAGATGAGAAGAATGTAGTG
CATTTGGCTGCAGAGTACAGGCAGTGGAACATCTACAAGTTTTTACTTGACAGAAAATATTCCTTACAGCCCCTGTTTCGAGCCGTTGATGATAACGGCAACAGCGCCTT
ACATCTGGCGGCTGCCAGATCAAGTCCTAAGCTTTGGCAAATCACTGGAGCTGCACTCCAGATCCAATTGGAAGTTGAGTGGTATAAGTTCGTCAAGGAGTCCTTGCCAA
TCCATTTTTTCCCTCTTTATAATAACAAAGGAGAAACTGCAACTAGAATCTTCGAAGATACCCACATGGATCTCGAGCAAAAGGGCAAGGATTGGCTTTGCGTCACCTCA
CAGTCATGCTCTGTGGTCGCCACTCTGATTGCAACCGTGGCTTTCGCAACTGCAGCCACCATCCCAGGCGGCAACAACGACCACGGTCTCGCAACACTTAAACAAGAGCA
AGCCTTTTCTATCTTCGTCATCTCTTCCCTCATTGCTCTATGCCTCTCCACAACCTCTGTCATCATGTTTCTCTCCATCTTGACCGCCAGATTCGAAATCAGAGAGTACT
TCACATCAATGTTGCCCTGGAAACTGCTCCTCGCCTTATCCTCTCTTTACTTTTCCATCATCGCCATGTTGGTTTCCTTTTGCAGCGGCCACTACTTTCTCCTCACTCGC
CGCCTCCAAAATCAAGCCCTTCTGCTCTACACACTTACTTTTCTCCCTGTCGCATTCATGTTCGGTATTGTGCAGCTTCCTCTCTACTTCGATCTCCTAAGGGTTCTTCT
CAAAATCATGCCTACCAGCGTCGAAGTCGACCTATGCGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATCCAACGTCGTCCAACAATAACAACTACAATGTACGAGAGTTTCTGTATGCAAACACCAAGAGAGCAAACTGGGGTGCAGTGATTCAAAAGTACGCCGAACACCC
AGAAGCTCAGGGGTTGAAGCTGACTCCAGAGGGCGACACGGCGTTGCATTTGGCTGTGGTTGCCAACCAAGAAGAAATAGTGGAAACGCTTGTGAGACTCATTTGCGAAA
ACCCAGGAAACCAGCTGCAACGTCTTGAGATTAAGAACAACAGACAAAACAACCCTCTCCACCTTGCCGCAGCGATGGGAAGCGTGCGAATGTGCCATGCCATTGCTTCA
GCCCATGGGGGCTTAGTGGACCACAGAAACATAGACGACCAAACGCCACTCTTCCTGGCGGCGTTGTACGGCAAAAGGGACGCCTTTTATTGCCTTTACTCCTTCTGCGG
AGACGACCCAGTTCGAATCACCATCAACTGCAGAGTCAAGAACAATGGCGACACCGTGCTACATTGTGCCATCGGAAACGAGTACTTTGATTTGGCGTTTCAATTAATGC
ACTTGATCGTTGACAGTATGTATTGGGTCAATGAGCACGGCAGAACCCCTCTCCATATTCTAGCTAGCAAGCCAGCTTCCTTCAAAAGTGGATCCAACATCAGGGGATGG
CGGAACCTCGTCTATCACTGCATATTTGTGGATAAACTGAAGCCTCAATCCATTGAAACCCTTCACCGTGACCAAGAGTACTGGAAGAGAAGTTGGTCCGAAAGAAAGAC
AGCTATATTTTTACGAGTCCGCACTTTGGTCAACAAGAAGAAAAGTAGTGATGAAGCCAACAAGCTGGATAAAGATTTGGAGAAAGGTGCGAGAATATCAAATCCACCGA
CGATTACAAATTCCCCGGAAAACTATTCTACCTTCGTCGACCTTTTTCAAATAATTTTATCGGCCATCATAACCGTTCTTGGGCACGGATCTACGACGATCAGAAAGATT
CGAGAGACGAAACAAAAACACACTTGGTCAGTTCAAGTGATGGAGAAACTTATTGATCCACAATTTGTTACACCGGACAGACATGGCGACTGTTGGCCCGATTATGAAGA
TTATAGGATGAAATTAGCAGAAATTCCATTCACGTTGGAAGAATCCGAAGTGAAGTTCAGTACAAGTACAAGTACTTGTGTGGATCAGTCGAAGGATTTCACAGAGTCGG
CGATGTTATTGGCAGCGAAGAACGGCGTGGTTGAGATAGTGAAGGAACTTTACGAATGTTATCCAAGGACAATCTACGAGACTAGGAAATCAGATGAGAAGAATGTAGTG
CATTTGGCTGCAGAGTACAGGCAGTGGAACATCTACAAGTTTTTACTTGACAGAAAATATTCCTTACAGCCCCTGTTTCGAGCCGTTGATGATAACGGCAACAGCGCCTT
ACATCTGGCGGCTGCCAGATCAAGTCCTAAGCTTTGGCAAATCACTGGAGCTGCACTCCAGATCCAATTGGAAGTTGAGTGGTATAAGTTCGTCAAGGAGTCCTTGCCAA
TCCATTTTTTCCCTCTTTATAATAACAAAGGAGAAACTGCAACTAGAATCTTCGAAGATACCCACATGGATCTCGAGCAAAAGGGCAAGGATTGGCTTTGCGTCACCTCA
CAGTCATGCTCTGTGGTCGCCACTCTGATTGCAACCGTGGCTTTCGCAACTGCAGCCACCATCCCAGGCGGCAACAACGACCACGGTCTCGCAACACTTAAACAAGAGCA
AGCCTTTTCTATCTTCGTCATCTCTTCCCTCATTGCTCTATGCCTCTCCACAACCTCTGTCATCATGTTTCTCTCCATCTTGACCGCCAGATTCGAAATCAGAGAGTACT
TCACATCAATGTTGCCCTGGAAACTGCTCCTCGCCTTATCCTCTCTTTACTTTTCCATCATCGCCATGTTGGTTTCCTTTTGCAGCGGCCACTACTTTCTCCTCACTCGC
CGCCTCCAAAATCAAGCCCTTCTGCTCTACACACTTACTTTTCTCCCTGTCGCATTCATGTTCGGTATTGTGCAGCTTCCTCTCTACTTCGATCTCCTAAGGGTTCTTCT
CAAAATCATGCCTACCAGCGTCGAAGTCGACCTATGCGATTAG
Protein sequenceShow/hide protein sequence
MNPTSSNNNNYNVREFLYANTKRANWGAVIQKYAEHPEAQGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGNQLQRLEIKNNRQNNPLHLAAAMGSVRMCHAIAS
AHGGLVDHRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRVKNNGDTVLHCAIGNEYFDLAFQLMHLIVDSMYWVNEHGRTPLHILASKPASFKSGSNIRGW
RNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAIFLRVRTLVNKKKSSDEANKLDKDLEKGARISNPPTITNSPENYSTFVDLFQIILSAIITVLGHGSTTIRKI
RETKQKHTWSVQVMEKLIDPQFVTPDRHGDCWPDYEDYRMKLAEIPFTLEESEVKFSTSTSTCVDQSKDFTESAMLLAAKNGVVEIVKELYECYPRTIYETRKSDEKNVV
HLAAEYRQWNIYKFLLDRKYSLQPLFRAVDDNGNSALHLAAARSSPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNNKGETATRIFEDTHMDLEQKGKDWLCVTS
QSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTR
RLQNQALLLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMPTSVEVDLCD