; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005521 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005521
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionMyb_DNA-bind_3 domain-containing protein
Genome locationscaffold7:27153613..27160476
RNA-Seq ExpressionSpg005521
SyntenySpg005521
Gene Ontology termsNA
InterPro domainsIPR024752 - Myb/SANT-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PON69808.1 Myb/SANT-like domain containing protein [Parasponia andersonii]1.3e-5641.09Show/hide
Query:  MDDSEQSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSG-FGW
        MD  + S +   +K  W+  ED KLVECLL+++N G WKA+NGTFK GYL Q+EK + EKIP+C LKAQPHI+S++K+LKKQ++AISEMLG +C G F W
Subjt:  MDDSEQSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSG-FGW

Query:  NDVDKCIEAEKHIFDEWVK-----------PFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPD---------------PPIVGG
        N  DKCI A+K++++EWVK            FP+ D+L +VFGKDRA G GA G +DM E   +E+ N++        D                P+   
Subjt:  NDVDKCIEAEKHIFDEWVK-----------PFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPD---------------PPIVGG

Query:  DEVDGPTSMSETPINTQSTTHTSNRSSKKRARSVDPLIAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVR
        D++DG  S+S    +TQS    S + S KR+RS D LI  +           + + E I +IA  ++  A+  +    RR  +  E++KV+GL+  QRVR
Subjt:  DEVDGPTSMSETPINTQSTTHTSNRSSKKRARSVDPLIAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVR

Query:  AGRIITKDQSQIDYFFNLPADERYEFLMEVL
         G ++ ++Q+  DYFF L  + + EFL ++L
Subjt:  AGRIITKDQSQIDYFFNLPADERYEFLMEVL

PON98555.1 Myb/SANT-like domain containing protein [Trema orientale]9.8e-6040.88Show/hide
Query:  MDDSEQSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSGFGWN
        MD  + S +  ++K  W+  ED KLVECLL+++N G WKA+NGTFK GYL Q+EK + EKIP+C LKAQPHI+S++K+LKKQ++AISEMLG    GF WN
Subjt:  MDDSEQSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSGFGWN

Query:  DVDKCIEAEKHIFDEWVK-----------PFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPDP---------------------
          DKCI A+K+++DEWVK            FP+ D+L +VFGKDRA G GA G +DM E   +E+ N++        DP                     
Subjt:  DVDKCIEAEKHIFDEWVK-----------PFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPDP---------------------

Query:  ----PIVGGDEVDGPTSMSETPINTQSTTHTSNRSSKKRARSVDPLIAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVD
            P+   D++DG  S+S    +TQS    S + SKKR+RS D LI  +           + + E I +IA  ++  A+  +    RR  +  E++KV+
Subjt:  ----PIVGGDEVDGPTSMSETPINTQSTTHTSNRSSKKRARSVDPLIAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVD

Query:  GLSVRQRVRAGRIITKDQSQIDYFFNLPADERYEFLMEVL
        GL+  QRVR GR++ ++Q+  DYFF L  + + EFL ++L
Subjt:  GLSVRQRVRAGRIITKDQSQIDYFFNLPADERYEFLMEVL

XP_024022021.1 uncharacterized protein LOC112091787 [Morus notabilis]2.0e-6545.39Show/hide
Query:  RKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSGFGWNDVDKCIEAEKHI
        RK  W+  ED KLVECLL+++N G WKADNGTFKPGYL Q+EK M EKIP+C LKAQPHI+S+VK+LKKQY+AISEMLGP  SGFGWND DKC+  EK +
Subjt:  RKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSGFGWNDVDKCIEAEKHI

Query:  FDEWV-----------KPFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPDPPIVGGDEVDGPTSMSETPINTQSTTHTSNRSSK
        FDEWV           KPFP+ D+L +VFG DRANG GA G  DM + +++E  N+     DY    P++  DE     S+    + T ST     R  +
Subjt:  FDEWV-----------KPFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPDPPIVGGDEVDGPTSMSETPINTQSTTHTSNRSSK

Query:  KRARSVDPLIAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRIITKDQSQIDYFFNLPADERYEFLM
        KR++  D L+ A+  + +  S   + A ENI  +A  ++  A+  +    RR  +  E++KV+GL+  QRVR G+++ ++    +YFF L  + + +FL+
Subjt:  KRARSVDPLIAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRIITKDQSQIDYFFNLPADERYEFLM

Query:  EVLQ
         +L+
Subjt:  EVLQ

XP_030483301.1 uncharacterized protein LOC115699898 [Cannabis sativa]3.2e-6341.46Show/hide
Query:  MDDSEQSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSGFGWN
        M+ + QS     RK  W+  +D KLVECL+++ N G WKADNGTFKPGYL Q+EK M ++IP   +KAQPHI+S++K+LK+QY AIS+MLGP+ SGFGWN
Subjt:  MDDSEQSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSGFGWN

Query:  DVDKCIEAEKHIFDEWV-----------KPFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPDPPIVGGDEVDGPTSMSETPINT
        +  KC+ A+K +FDEWV           KPFP+ D+LAIV+GKDRA G GA G ++  + +  E+ N   W  D+ P  P+   DE++   SM+ +  ++
Subjt:  DVDKCIEAEKHIFDEWV-----------KPFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPDPPIVGGDEVDGPTSMSETPINT

Query:  QSTTHTSNRSSKKRARSVDPLIAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRIITKDQSQIDYFF
        Q+T     R +K+++ + DPL+  ++      S+  ++A+++I+++A       + E+    RR  L  EI+KVDGL+  QR++ G+++  +Q  IDYFF
Subjt:  QSTTHTSNRSSKKRARSVDPLIAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRIITKDQSQIDYFF

Query:  NLPADERYEFLMEVLQ
         L  + + +FL+ +L+
Subjt:  NLPADERYEFLMEVLQ

XP_030495170.1 uncharacterized protein LOC115710958 [Cannabis sativa]8.8e-6140.51Show/hide
Query:  MDDSEQSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSGFGWN
        M+ + QS     +K  W+  ED KLVECL+E+ N G WK DNGTFKPGYL Q+EK M ++IP   +KAQPHI+S++K+LK+QY AIS+MLGP+ SGFGW+
Subjt:  MDDSEQSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSGFGWN

Query:  DVDKCIEAEKHIFDEWV-----------KPFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPDPPIVGGDEVDGPTSMSETPINT
        +  KC+ A+K +FDEWV           KPFP+ ++LAIV+GKDRA G G  G ++  + +  E+ N   W  DY P   +   DE++   SM+    ++
Subjt:  DVDKCIEAEKHIFDEWV-----------KPFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPDPPIVGGDEVDGPTSMSETPINT

Query:  QSTTHTSNRSSKKRARSVDPLIAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRIITKDQSQIDYFF
        Q+      R +K+++ S DPL+  ++      S+  ++A+++I+++A       + E+    RR +L  EI+KVDGL+  QR++ G+++  +Q  IDYFF
Subjt:  QSTTHTSNRSSKKRARSVDPLIAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRIITKDQSQIDYFF

Query:  NLPADERYEFLMEVLQ
         L  + + +FL+ +L+
Subjt:  NLPADERYEFLMEVLQ

TrEMBL top hitse value%identityAlignment
A0A2P5D960 Myb/SANT-like domain containing protein6.4e-5741.09Show/hide
Query:  MDDSEQSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSG-FGW
        MD  + S +   +K  W+  ED KLVECLL+++N G WKA+NGTFK GYL Q+EK + EKIP+C LKAQPHI+S++K+LKKQ++AISEMLG +C G F W
Subjt:  MDDSEQSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSG-FGW

Query:  NDVDKCIEAEKHIFDEWVK-----------PFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPD---------------PPIVGG
        N  DKCI A+K++++EWVK            FP+ D+L +VFGKDRA G GA G +DM E   +E+ N++        D                P+   
Subjt:  NDVDKCIEAEKHIFDEWVK-----------PFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPD---------------PPIVGG

Query:  DEVDGPTSMSETPINTQSTTHTSNRSSKKRARSVDPLIAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVR
        D++DG  S+S    +TQS    S + S KR+RS D LI  +           + + E I +IA  ++  A+  +    RR  +  E++KV+GL+  QRVR
Subjt:  DEVDGPTSMSETPINTQSTTHTSNRSSKKRARSVDPLIAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVR

Query:  AGRIITKDQSQIDYFFNLPADERYEFLMEVL
         G ++ ++Q+  DYFF L  + + EFL ++L
Subjt:  AGRIITKDQSQIDYFFNLPADERYEFLMEVL

A0A2P5FL77 Myb/SANT-like domain containing protein4.7e-6040.88Show/hide
Query:  MDDSEQSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSGFGWN
        MD  + S +  ++K  W+  ED KLVECLL+++N G WKA+NGTFK GYL Q+EK + EKIP+C LKAQPHI+S++K+LKKQ++AISEMLG    GF WN
Subjt:  MDDSEQSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSGFGWN

Query:  DVDKCIEAEKHIFDEWVK-----------PFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPDP---------------------
          DKCI A+K+++DEWVK            FP+ D+L +VFGKDRA G GA G +DM E   +E+ N++        DP                     
Subjt:  DVDKCIEAEKHIFDEWVK-----------PFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPDP---------------------

Query:  ----PIVGGDEVDGPTSMSETPINTQSTTHTSNRSSKKRARSVDPLIAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVD
            P+   D++DG  S+S    +TQS    S + SKKR+RS D LI  +           + + E I +IA  ++  A+  +    RR  +  E++KV+
Subjt:  ----PIVGGDEVDGPTSMSETPINTQSTTHTSNRSSKKRARSVDPLIAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVD

Query:  GLSVRQRVRAGRIITKDQSQIDYFFNLPADERYEFLMEVL
        GL+  QRVR GR++ ++Q+  DYFF L  + + EFL ++L
Subjt:  GLSVRQRVRAGRIITKDQSQIDYFFNLPADERYEFLMEVL

A0A6V7P4Z9 Myb_DNA-bind_3 domain-containing protein1.9e-5640.76Show/hide
Query:  DMDDSEQSKA-GSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSGFG
        ++   + SKA     K  W+K EDEKL+ECLL+LSN G W+ADNGTF+ GYL Q+E+WM EKIP C+LK  PHIES+ KL K+QYNAI EMLGP+ SGFG
Subjt:  DMDDSEQSKA-GSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSGFG

Query:  WNDVDKCIEAEKHIFDEWV-----------KPFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPDPPIVGGDEVDGPTSMSETPI
        W+D +KC+E +K+++D WV           K FP+ ++L+IVFGKDRA G+ AE PAD    +E E  N         PD  +   DE  G         
Subjt:  WNDVDKCIEAEKHIFDEWV-----------KPFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPDPPIVGGDEVDGPTSMSETPI

Query:  NTQSTTHTSNRSSKKRARSVDPLI-AAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRIITKDQSQID
         T S+   S ++ K+R+ + D  I  + N     + S   NA+E+I ++A  ++ +A    T   R+  L  E+ KV+GL  + R+ A  I+  D +++ 
Subjt:  NTQSTTHTSNRSSKKRARSVDPLI-AAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRIITKDQSQID

Query:  YFFNLPADERYEFL
         F+ +PA+ R +++
Subjt:  YFFNLPADERYEFL

A0A6V7QY21 Myb_DNA-bind_3 domain-containing protein1.9e-5640.95Show/hide
Query:  DMDDSEQSKA-GSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSGFG
        ++   + SKA     K  W+K EDEKL+ECLL+LSN G W+ADNGTF+ GYL Q+E+WM EKIP C+LK  PHIES+ KL K+QYNAI EMLGP+ SGFG
Subjt:  DMDDSEQSKA-GSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSGFG

Query:  WNDVDKCIEAEKHIFDEWVKP----------FPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPDPPIVGGDEVD--GPTSMSETP
        W+D +KC+E +K+++D WVK           FP+ ++L+IVFGKDRA G+ AE PAD    +E E  N         PD  +   DE    GPT++S   
Subjt:  WNDVDKCIEAEKHIFDEWVKP----------FPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPDPPIVGGDEVD--GPTSMSETP

Query:  INTQSTTHTSNRSSKKRARSVDPLI-AAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRIITKDQSQI
                 S ++ K+R+ + D  I  + N     + S   NA+E+I ++A  ++ +A    T   R+  L  E+ KV+GL  + R+ A  I+  D +++
Subjt:  INTQSTTHTSNRSSKKRARSVDPLI-AAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRIITKDQSQI

Query:  DYFFNLPADERYEFL
          F+ +PA+ R +++
Subjt:  DYFFNLPADERYEFL

A0A803QNC5 Uncharacterized protein3.2e-6441.3Show/hide
Query:  LILPPDMDDSEQSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNC
        +IL   M+ + QS     RK  W+  +D KLVECL+++ N G WKADNGTFKPGYL Q+EK M ++IP   +KAQPHI+S++K+LK+QY AIS+MLGP+ 
Subjt:  LILPPDMDDSEQSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNC

Query:  SGFGWNDVDKCIEAEKHIFDEWV-----------KPFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPDPPIVGGDEVDGPTSMS
        SGFGWN+  KC+ A+K +FDEWV           KPFP+ D+LAIV+GKDRA G GA G ++  + +  E+ N   W  D+ P  P+   DE++   SM+
Subjt:  SGFGWNDVDKCIEAEKHIFDEWV-----------KPFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPDPPIVGGDEVDGPTSMS

Query:  ETPINTQSTTHTSNRSSKKRARSVDPLIAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRIITKDQS
         +  ++Q+T     R +K+++ + DPL+  ++      S+  ++A+++I+++A       + E+    RR  L  EI+KVDGL+  QR++ G+++  +Q 
Subjt:  ETPINTQSTTHTSNRSSKKRARSVDPLIAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRIITKDQS

Query:  QIDYFFNLPADERYEFLMEVLQ
         IDYFF L  + + +FL+ +L+
Subjt:  QIDYFFNLPADERYEFLMEVLQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAACCACCCATGACAGCACGATTTCAGTGGATAGAGTAATGCTACTCTATTGCCTTATGAAGGGGTTGGAGATCAACGTGGGGAGCATTATCAGGGATGAGATTTT
AGCCTGTGGGAGAAAGCGAGCAGGCAAGCTTTTCTTTGGATCACTCATCACCCAACTTTGCCAAAGGAACAGTATCCAGAGGAAAGACAAAGCCTCTACATCTCAGGCTA
CTCCACCTACAGGGCCGAACGTAGCTTTTCCATCCCAACACACTCCTTTCACAGGGCCATCACCATCATCGGAGGCCCTAGCCATTGCCTACCGTCAGATAGATCAACTG
AGGGACAACCTGAGAACGTATTGGAAAAACAGAGGAAAAGTTGGACTTCCCCAGAAATGCGACCGCATTTCTGGGAAGGCAAAAGTGAAATACGACCGCATTTCTGGAAA
ACCGAGACCGTTCCGAGTCGTCCGTGGCACCTATTTCGCAGCTCCTCGACCATTTTCTACACTATATAAGCTCGAGTTTCGAAGTCAAATCCAAGCAGAACTCACTTGGA
AGAAAAGGGTGGTGACAGAACCCTGGCTCGACGCCAATGAGTACGGATGCCCGAGGCAAAGTGGAGGGTTTCTTATACCCATTCAAAAGGGTGGGTTTTCTCTAGGGTTT
TCTCTCATTCTCCCTCCAGATATGGATGATAGTGAACAATCTAAAGCTGGTTCAAGTAGGAAACGCATGTGGAGTAAGGCAGAAGATGAGAAACTAGTGGAATGTTTACT
AGAGTTATCTAATATTGGCACATGGAAGGCTGATAATGGGACTTTCAAACCAGGATATCTCATTCAAATAGAAAAGTGGATGACTGAAAAAATTCCTAAATGCGATCTTA
AGGCTCAACCACACATAGAATCCAAAGTTAAATTGTTGAAGAAGCAGTATAATGCAATATCGGAAATGTTAGGCCCAAATTGTAGCGGCTTTGGATGGAATGATGTGGAC
AAGTGCATTGAAGCAGAGAAACATATATTTGATGAATGGGTAAAGCCTTTCCCATTTCTTGATCAACTAGCCATTGTTTTTGGAAAAGATAGGGCCAATGGACTTGGTGC
AGAAGGCCCAGCTGATATGTTTGAAGCAGTGGAACGAGAAATGGGCAACAATGAATTTTGGGGAGGAGATTATATGCCTGACCCACCAATAGTTGGAGGGGACGAAGTGG
ATGGACCAACATCAATGAGTGAGACACCAATAAATACACAATCTACTACGCATACATCCAATAGGTCCAGTAAGAAAAGAGCAAGGAGTGTGGACCCATTAATAGCAGCA
GTGAATGGGCTTGAGAATGTTATGAGTAGTCATCTTTCAAATGCTAATGAAAATATTCAAGAGATTGCATTGTTTTATCGTCAAGTAGCTGAACGAGAATCTACAAGAGA
GGAACGTCGAAATTCGTTAGTAAGTGAAATTAGAAAGGTAGATGGATTGAGTGTTCGACAGAGAGTTCGGGCTGGTAGGATTATCACCAAGGATCAATCCCAAATTGATT
ACTTCTTTAATCTTCCTGCTGATGAAAGATATGAATTTCTGATGGAGGTTCTGCAAGAGAATGTTGATCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCAACCACCCATGACAGCACGATTTCAGTGGATAGAGTAATGCTACTCTATTGCCTTATGAAGGGGTTGGAGATCAACGTGGGGAGCATTATCAGGGATGAGATTTT
AGCCTGTGGGAGAAAGCGAGCAGGCAAGCTTTTCTTTGGATCACTCATCACCCAACTTTGCCAAAGGAACAGTATCCAGAGGAAAGACAAAGCCTCTACATCTCAGGCTA
CTCCACCTACAGGGCCGAACGTAGCTTTTCCATCCCAACACACTCCTTTCACAGGGCCATCACCATCATCGGAGGCCCTAGCCATTGCCTACCGTCAGATAGATCAACTG
AGGGACAACCTGAGAACGTATTGGAAAAACAGAGGAAAAGTTGGACTTCCCCAGAAATGCGACCGCATTTCTGGGAAGGCAAAAGTGAAATACGACCGCATTTCTGGAAA
ACCGAGACCGTTCCGAGTCGTCCGTGGCACCTATTTCGCAGCTCCTCGACCATTTTCTACACTATATAAGCTCGAGTTTCGAAGTCAAATCCAAGCAGAACTCACTTGGA
AGAAAAGGGTGGTGACAGAACCCTGGCTCGACGCCAATGAGTACGGATGCCCGAGGCAAAGTGGAGGGTTTCTTATACCCATTCAAAAGGGTGGGTTTTCTCTAGGGTTT
TCTCTCATTCTCCCTCCAGATATGGATGATAGTGAACAATCTAAAGCTGGTTCAAGTAGGAAACGCATGTGGAGTAAGGCAGAAGATGAGAAACTAGTGGAATGTTTACT
AGAGTTATCTAATATTGGCACATGGAAGGCTGATAATGGGACTTTCAAACCAGGATATCTCATTCAAATAGAAAAGTGGATGACTGAAAAAATTCCTAAATGCGATCTTA
AGGCTCAACCACACATAGAATCCAAAGTTAAATTGTTGAAGAAGCAGTATAATGCAATATCGGAAATGTTAGGCCCAAATTGTAGCGGCTTTGGATGGAATGATGTGGAC
AAGTGCATTGAAGCAGAGAAACATATATTTGATGAATGGGTAAAGCCTTTCCCATTTCTTGATCAACTAGCCATTGTTTTTGGAAAAGATAGGGCCAATGGACTTGGTGC
AGAAGGCCCAGCTGATATGTTTGAAGCAGTGGAACGAGAAATGGGCAACAATGAATTTTGGGGAGGAGATTATATGCCTGACCCACCAATAGTTGGAGGGGACGAAGTGG
ATGGACCAACATCAATGAGTGAGACACCAATAAATACACAATCTACTACGCATACATCCAATAGGTCCAGTAAGAAAAGAGCAAGGAGTGTGGACCCATTAATAGCAGCA
GTGAATGGGCTTGAGAATGTTATGAGTAGTCATCTTTCAAATGCTAATGAAAATATTCAAGAGATTGCATTGTTTTATCGTCAAGTAGCTGAACGAGAATCTACAAGAGA
GGAACGTCGAAATTCGTTAGTAAGTGAAATTAGAAAGGTAGATGGATTGAGTGTTCGACAGAGAGTTCGGGCTGGTAGGATTATCACCAAGGATCAATCCCAAATTGATT
ACTTCTTTAATCTTCCTGCTGATGAAAGATATGAATTTCTGATGGAGGTTCTGCAAGAGAATGTTGATCCTTAG
Protein sequenceShow/hide protein sequence
MPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRNSIQRKDKASTSQATPPTGPNVAFPSQHTPFTGPSPSSEALAIAYRQIDQL
RDNLRTYWKNRGKVGLPQKCDRISGKAKVKYDRISGKPRPFRVVRGTYFAAPRPFSTLYKLEFRSQIQAELTWKKRVVTEPWLDANEYGCPRQSGGFLIPIQKGGFSLGF
SLILPPDMDDSEQSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGYLIQIEKWMTEKIPKCDLKAQPHIESKVKLLKKQYNAISEMLGPNCSGFGWNDVD
KCIEAEKHIFDEWVKPFPFLDQLAIVFGKDRANGLGAEGPADMFEAVEREMGNNEFWGGDYMPDPPIVGGDEVDGPTSMSETPINTQSTTHTSNRSSKKRARSVDPLIAA
VNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRIITKDQSQIDYFFNLPADERYEFLMEVLQENVDP