; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022335 (gene) of Snake gourd v1 genome

Gene IDTan0022335
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionBromodomain-containing protein 4-like protein isoform X3
Genome locationLG08:51708553..51711695
RNA-Seq ExpressionTan0022335
SyntenyTan0022335
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_020086045.1 uncharacterized protein LOC109708647 isoform X1 [Ananas comosus]1.2e-4135.57Show/hide
Query:  TESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKT-RGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNLCPLNYDDWR
        ++S P  E   +  ++ +  + N+  D  V++ +  G  +T RG TR  DVW+L  G+KIVV  N   QP+K    +LG  LGT+ R  NLCP++Y  W+
Subjt:  TESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKT-RGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNLCPLNYDDWR

Query:  KMPNIYKDEIWEII--------KKTHRKWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDIKR----------SARNTEIRKKQKF
        KMPN +K  I  +         K+    W L S+SRKW GYK  L+ +Y     TE +     P+ I  +QW+D+ R          S      R   K 
Subjt:  KMPNIYKDEIWEII--------KKTHRKWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDIKR----------SARNTEIRKKQKF

Query:  THSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSE----VMPSEMEVFEQVMGNEHSGRVRGMGLGPT
         H+ G  S ARK  E ++E  R+P+R++ F + HK KDG+YI++ S ++  K    +   + SSE    +   E EVF +++G E  GRVRG GLGP+
Subjt:  THSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSE----VMPSEMEVFEQVMGNEHSGRVRGMGLGPT

XP_020086046.1 uncharacterized protein LOC109708647 isoform X2 [Ananas comosus]1.2e-4135.57Show/hide
Query:  TESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKT-RGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNLCPLNYDDWR
        ++S P  E   +  ++ +  + N+  D  V++ +  G  +T RG TR  DVW+L  G+KIVV  N   QP+K    +LG  LGT+ R  NLCP++Y  W+
Subjt:  TESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKT-RGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNLCPLNYDDWR

Query:  KMPNIYKDEIWEII--------KKTHRKWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDIKR----------SARNTEIRKKQKF
        KMPN +K  I  +         K+    W L S+SRKW GYK  L+ +Y     TE +     P+ I  +QW+D+ R          S      R   K 
Subjt:  KMPNIYKDEIWEII--------KKTHRKWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDIKR----------SARNTEIRKKQKF

Query:  THSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSE----VMPSEMEVFEQVMGNEHSGRVRGMGLGPT
         H+ G  S ARK  E ++E  R+P+R++ F + HK KDG+YI++ S ++  K    +   + SSE    +   E EVF +++G E  GRVRG GLGP+
Subjt:  THSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSE----VMPSEMEVFEQVMGNEHSGRVRGMGLGPT

XP_020086047.1 uncharacterized protein LOC109708647 isoform X3 [Ananas comosus]1.2e-4135.57Show/hide
Query:  TESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKT-RGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNLCPLNYDDWR
        ++S P  E   +  ++ +  + N+  D  V++ +  G  +T RG TR  DVW+L  G+KIVV  N   QP+K    +LG  LGT+ R  NLCP++Y  W+
Subjt:  TESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKT-RGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNLCPLNYDDWR

Query:  KMPNIYKDEIWEII--------KKTHRKWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDIKR----------SARNTEIRKKQKF
        KMPN +K  I  +         K+    W L S+SRKW GYK  L+ +Y     TE +     P+ I  +QW+D+ R          S      R   K 
Subjt:  KMPNIYKDEIWEII--------KKTHRKWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDIKR----------SARNTEIRKKQKF

Query:  THSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSE----VMPSEMEVFEQVMGNEHSGRVRGMGLGPT
         H+ G  S ARK  E ++E  R+P+R++ F + HK KDG+YI++ S ++  K    +   + SSE    +   E EVF +++G E  GRVRG GLGP+
Subjt:  THSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSE----VMPSEMEVFEQVMGNEHSGRVRGMGLGPT

XP_020088912.1 uncharacterized protein LOC109710609 [Ananas comosus]3.6e-4134.52Show/hide
Query:  STRSRRHVVVTESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKTRGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNL
        S+++  +V  + +S  + N   L +A+ D    L  + +   E      + RG T ++++WNL + ++I V FN   QPI     VL + LG + R  NL
Subjt:  STRSRRHVVVTESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKTRGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNL

Query:  CPLNYDDWRKMPNIYKDEIWEIIKKTH------RKWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDI----------KRSARNTE
         PL+  DWR  P   K  ++++++          KW L SL +KW  YKC L+G Y  KY   D+ L+HKP+ +PR+QW  +          KRS +N E
Subjt:  CPLNYDDWRKMPNIYKDEIWEIIKKTH------RKWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDI----------KRSARNTE

Query:  IRKKQKFTHSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHK-CKDGSYIDEESAKIAGKFQEYMNHNADSSEV----MPSEMEVFEQVMGNEHSGRVRG
         R KQK  H+ G KS AR   E K + G +P+R  +F+  HK  KDG  +DEESA+     ++ M    +S E     +  E ++F QV+G E  G+VRG
Subjt:  IRKKQKFTHSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHK-CKDGSYIDEESAKIAGKFQEYMNHNADSSEV----MPSEMEVFEQVMGNEHSGRVRG

Query:  MGLGPTQLKFLKLNIVNLHALRSI--LEQVWKTIKL
        +GLGPT       +  N  + R    +EQ+ K IK+
Subjt:  MGLGPTQLKFLKLNIVNLHALRSI--LEQVWKTIKL

XP_038699673.1 uncharacterized protein LOC119996968 [Tripterygium wilfordii]1.2e-4134.73Show/hide
Query:  MPSTRSRRHVVV----------------TESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKTRGQTRMSDVWNLKEGQKIVVEFNNLNQPIKV
        M  TR RR V +                   + +EEN  +   +  +    + +DG VD  ++ G  +TRG+TR++DVW L   Q+I+V FN+L QPIK 
Subjt:  MPSTRSRRHVVV----------------TESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKTRGQTRMSDVWNLKEGQKIVVEFNNLNQPIKV

Query:  GSNVLGNLLGTIGRTCNLCPLNYDDWRKMPNIYKDEIWEIIKKTHRK---------WCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQW
           +LG  L +I +   LC L  DDWRK     K EI E++K    K         W L +++R+W  YKC+L+      ++  +E ++++P ++   QW
Subjt:  GSNVLGNLLGTIGRTCNLCPLNYDDWRKMPNIYKDEIWEIIKKTHRK---------WCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQW

Query:  VDIKRSARNTEIRKKQKFTHSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSEVMPSEMEVFEQVMGNEH
           KRS  NT  RKKQK  H+ G KS ARK  EM+E+ G +PTR ++FL  HK K G  +  E    + K           +  + SE + +  + G EH
Subjt:  VDIKRSARNTEIRKKQKFTHSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSEVMPSEMEVFEQVMGNEH

Query:  SGRVRGMGLGP
        +GRVRG+G GP
Subjt:  SGRVRGMGLGP

TrEMBL top hitse value%identityAlignment
A0A445ARQ5 Uncharacterized protein8.7e-4135.57Show/hide
Query:  ESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKTRGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNLCPLNYDDWRKM
        E  P  E  +N E+  +   PN++   + + +    A + RG T +  +WN+  G+ I V+FNN NQ I+     L + LG + RT ++ PLN DDWR  
Subjt:  ESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKTRGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNLCPLNYDDWRKM

Query:  PNIYKDEIWEIIKKTHRKWCLP---------SLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDI----------KRSARNTEIRKKQKFT
            K+++ +I++K   K+ +P         S+ +KW  YKC+L+G Y  +Y T+D  LK++P+ IPR+QW+ +          KR+  N   R KQK  
Subjt:  PNIYKDEIWEIIKKTHRKWCLP---------SLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDI----------KRSARNTEIRKKQKFT

Query:  HSCGRKSLARKTYEMKEELGRDPTRVEVFLAAH-KCKDGSYIDEESAKIAGKFQEYMNH----NADSSEVMPSEMEVFEQVMGNEHSGRVRGMGLGPT
        H+ G KS+A    E+ ++ G +P+R E+FL  H + KDG  +DEESAK     +E +N+    N +S   +  E +++ QV+G++ SG VRG+GLGPT
Subjt:  HSCGRKSLARKTYEMKEELGRDPTRVEVFLAAH-KCKDGSYIDEESAKIAGKFQEYMNH----NADSSEVMPSEMEVFEQVMGNEHSGRVRGMGLGPT

A0A6P5EQR3 uncharacterized protein LOC109708647 isoform X26.0e-4235.57Show/hide
Query:  TESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKT-RGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNLCPLNYDDWR
        ++S P  E   +  ++ +  + N+  D  V++ +  G  +T RG TR  DVW+L  G+KIVV  N   QP+K    +LG  LGT+ R  NLCP++Y  W+
Subjt:  TESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKT-RGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNLCPLNYDDWR

Query:  KMPNIYKDEIWEII--------KKTHRKWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDIKR----------SARNTEIRKKQKF
        KMPN +K  I  +         K+    W L S+SRKW GYK  L+ +Y     TE +     P+ I  +QW+D+ R          S      R   K 
Subjt:  KMPNIYKDEIWEII--------KKTHRKWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDIKR----------SARNTEIRKKQKF

Query:  THSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSE----VMPSEMEVFEQVMGNEHSGRVRGMGLGPT
         H+ G  S ARK  E ++E  R+P+R++ F + HK KDG+YI++ S ++  K    +   + SSE    +   E EVF +++G E  GRVRG GLGP+
Subjt:  THSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSE----VMPSEMEVFEQVMGNEHSGRVRGMGLGPT

A0A6P5ERA3 uncharacterized protein LOC109708647 isoform X36.0e-4235.57Show/hide
Query:  TESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKT-RGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNLCPLNYDDWR
        ++S P  E   +  ++ +  + N+  D  V++ +  G  +T RG TR  DVW+L  G+KIVV  N   QP+K    +LG  LGT+ R  NLCP++Y  W+
Subjt:  TESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKT-RGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNLCPLNYDDWR

Query:  KMPNIYKDEIWEII--------KKTHRKWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDIKR----------SARNTEIRKKQKF
        KMPN +K  I  +         K+    W L S+SRKW GYK  L+ +Y     TE +     P+ I  +QW+D+ R          S      R   K 
Subjt:  KMPNIYKDEIWEII--------KKTHRKWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDIKR----------SARNTEIRKKQKF

Query:  THSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSE----VMPSEMEVFEQVMGNEHSGRVRGMGLGPT
         H+ G  S ARK  E ++E  R+P+R++ F + HK KDG+YI++ S ++  K    +   + SSE    +   E EVF +++G E  GRVRG GLGP+
Subjt:  THSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSE----VMPSEMEVFEQVMGNEHSGRVRGMGLGPT

A0A6P5EY17 uncharacterized protein LOC109708647 isoform X16.0e-4235.57Show/hide
Query:  TESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKT-RGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNLCPLNYDDWR
        ++S P  E   +  ++ +  + N+  D  V++ +  G  +T RG TR  DVW+L  G+KIVV  N   QP+K    +LG  LGT+ R  NLCP++Y  W+
Subjt:  TESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKT-RGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNLCPLNYDDWR

Query:  KMPNIYKDEIWEII--------KKTHRKWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDIKR----------SARNTEIRKKQKF
        KMPN +K  I  +         K+    W L S+SRKW GYK  L+ +Y     TE +     P+ I  +QW+D+ R          S      R   K 
Subjt:  KMPNIYKDEIWEII--------KKTHRKWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDIKR----------SARNTEIRKKQKF

Query:  THSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSE----VMPSEMEVFEQVMGNEHSGRVRGMGLGPT
         H+ G  S ARK  E ++E  R+P+R++ F + HK KDG+YI++ S ++  K    +   + SSE    +   E EVF +++G E  GRVRG GLGP+
Subjt:  THSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSE----VMPSEMEVFEQVMGNEHSGRVRGMGLGPT

A0A6P5EZG9 uncharacterized protein LOC1097106091.7e-4134.52Show/hide
Query:  STRSRRHVVVTESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKTRGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNL
        S+++  +V  + +S  + N   L +A+ D    L  + +   E      + RG T ++++WNL + ++I V FN   QPI     VL + LG + R  NL
Subjt:  STRSRRHVVVTESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKTRGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNL

Query:  CPLNYDDWRKMPNIYKDEIWEIIKKTH------RKWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDI----------KRSARNTE
         PL+  DWR  P   K  ++++++          KW L SL +KW  YKC L+G Y  KY   D+ L+HKP+ +PR+QW  +          KRS +N E
Subjt:  CPLNYDDWRKMPNIYKDEIWEIIKKTH------RKWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDI----------KRSARNTE

Query:  IRKKQKFTHSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHK-CKDGSYIDEESAKIAGKFQEYMNHNADSSEV----MPSEMEVFEQVMGNEHSGRVRG
         R KQK  H+ G KS AR   E K + G +P+R  +F+  HK  KDG  +DEESA+     ++ M    +S E     +  E ++F QV+G E  G+VRG
Subjt:  IRKKQKFTHSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHK-CKDGSYIDEESAKIAGKFQEYMNHNADSSEV----MPSEMEVFEQVMGNEHSGRVRG

Query:  MGLGPTQLKFLKLNIVNLHALRSI--LEQVWKTIKL
        +GLGPT       +  N  + R    +EQ+ K IK+
Subjt:  MGLGPTQLKFLKLNIVNLHALRSI--LEQVWKTIKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G40087.1 Plant transposase (Ptta/En/Spm family)1.2e-2628.25Show/hide
Query:  IDGDVDVENSFGACKTRGQTRMSDVWNLKEGQKIVVEFNN-LNQPIKVGSNVLGNLLGTIGRTCNLCPLNYDDWRKMPNIYKDEIWEIIKKTHR------
        ID D ++E      K        DVW + +  +++V F+   NQPI     +LG+ L  +     L P+NY DWR +    KD  W +I+   R      
Subjt:  IDGDVDVENSFGACKTRGQTRMSDVWNLKEGQKIVVEFNN-LNQPIKVGSNVLGNLLGTIGRTCNLCPLNYDDWRKMPNIYKDEIWEIIKKTHR------

Query:  --KWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDI----------KRSARNTEIRKKQKFTHSCGRKSLARKTYEMKEELGRDPT
           + + +L  K    K      Y  K N  +E L+++P+ +P +QW  +          K   RNT+ +K     H CGRKS +RK  E+K + G+ P 
Subjt:  --KWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDI----------KRSARNTEIRKKQKFTHSCGRKSLARKTYEMKEELGRDPT

Query:  RVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSEVMPSEM-EVFEQVMGNEHSGRVRGMGLGPTQLKFLK-LNIVNLHALRSILEQVWKTIKLR
        R E F+ + K  DGS++ +E+   A      +N N   +    + + + + QV G E  GRVR +G GPT  + ++  N      + +   ++   +K +
Subjt:  RVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSEVMPSEM-EVFEQVMGNEHSGRVRGMGLGPTQLKFLK-LNIVNLHALRSILEQVWKTIKLR

Query:  RKILNDQI
         K L DQ+
Subjt:  RKILNDQI

AT3G30200.1 Plant transposase (Ptta/En/Spm family)2.0e-2130Show/hide
Query:  LLGTIGRTCNLCPLNYDDWRKMPNIYKDEIWEIIKKTHRKWCLPSLSRKWG----GYKC---NLRGRYASKYNTEDECLKHKPDNIPREQWVDI------
        ++G +     L P+NY DWR +    KD  W +I+   R +  P + + +     G +C    LR     K N  +E L+++P+ +P +QW  +      
Subjt:  LLGTIGRTCNLCPLNYDDWRKMPNIYKDEIWEIIKKTHRKWCLPSLSRKWG----GYKC---NLRGRYASKYNTEDECLKHKPDNIPREQWVDI------

Query:  ----KRSARNTEIRKKQKFTHSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSEVMPSEM-EVFEQVMGN
            K   RNT+ +K     H CGRKS +RK  E+K + G+ P RVE F+ + K  DGS++ +E+   A      +N N   +    + + + + QV G 
Subjt:  ----KRSARNTEIRKKQKFTHSCGRKSLARKTYEMKEELGRDPTRVEVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSEVMPSEM-EVFEQVMGN

Query:  EHSGRVRGMGLGPTQLKFLK
        E  GRV  +G GPT  + ++
Subjt:  EHSGRVRGMGLGPTQLKFLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTTCAACTAGAAGTCGTCGCCATGTGGTGGTCACTGAGTCTAGTCCTTTAGAAGAAAATTCTATCAATTTGGAGGAAGCTAATGTTGATACAGAACCTAATTTAGC
TATTGATGGCGATGTTGACGTTGAAAATTCATTTGGGGCATGTAAGACACGTGGTCAAACACGCATGTCTGATGTTTGGAATTTAAAAGAAGGTCAGAAGATAGTGGTGG
AGTTCAACAATTTAAACCAACCTATAAAAGTTGGTTCAAATGTTTTGGGTAATCTCTTAGGGACTATTGGAAGAACTTGTAACCTCTGTCCCCTAAATTATGATGATTGG
AGAAAGATGCCCAATATATATAAAGATGAGATTTGGGAGATCATTAAGAAAACACATAGAAAATGGTGTTTGCCCTCATTATCTAGAAAGTGGGGTGGCTATAAATGTAA
TTTACGAGGACGATATGCTTCAAAGTATAATACTGAAGATGAATGCTTGAAACATAAACCTGATAATATACCTAGAGAACAATGGGTTGACATAAAACGTAGTGCAAGAA
ATACAGAGATACGTAAGAAACAAAAATTTACTCATTCATGCGGAAGAAAAAGCCTAGCACGAAAAACATATGAGATGAAAGAAGAATTGGGAAGGGACCCCACCAGAGTA
GAAGTCTTTTTAGCCGCTCATAAATGTAAGGATGGAAGTTACATTGATGAGGAATCTGCTAAAATTGCTGGAAAATTTCAAGAATACATGAATCACAATGCAGATAGCTC
AGAAGTTATGCCTTCTGAGATGGAGGTGTTTGAACAAGTCATGGGTAACGAACATTCAGGACGTGTTCGTGGAATGGGATTAGGACCTACCCAACTGAAGTTTTTAAAAC
TAAATATCGTCAATTTGCATGCATTGAGATCAATTCTGGAACAAGTATGGAAGACCATAAAACTTAGAAGGAAGATCTTAAATGACCAAATTAGAGTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTTCAACTAGAAGTCGTCGCCATGTGGTGGTCACTGAGTCTAGTCCTTTAGAAGAAAATTCTATCAATTTGGAGGAAGCTAATGTTGATACAGAACCTAATTTAGC
TATTGATGGCGATGTTGACGTTGAAAATTCATTTGGGGCATGTAAGACACGTGGTCAAACACGCATGTCTGATGTTTGGAATTTAAAAGAAGGTCAGAAGATAGTGGTGG
AGTTCAACAATTTAAACCAACCTATAAAAGTTGGTTCAAATGTTTTGGGTAATCTCTTAGGGACTATTGGAAGAACTTGTAACCTCTGTCCCCTAAATTATGATGATTGG
AGAAAGATGCCCAATATATATAAAGATGAGATTTGGGAGATCATTAAGAAAACACATAGAAAATGGTGTTTGCCCTCATTATCTAGAAAGTGGGGTGGCTATAAATGTAA
TTTACGAGGACGATATGCTTCAAAGTATAATACTGAAGATGAATGCTTGAAACATAAACCTGATAATATACCTAGAGAACAATGGGTTGACATAAAACGTAGTGCAAGAA
ATACAGAGATACGTAAGAAACAAAAATTTACTCATTCATGCGGAAGAAAAAGCCTAGCACGAAAAACATATGAGATGAAAGAAGAATTGGGAAGGGACCCCACCAGAGTA
GAAGTCTTTTTAGCCGCTCATAAATGTAAGGATGGAAGTTACATTGATGAGGAATCTGCTAAAATTGCTGGAAAATTTCAAGAATACATGAATCACAATGCAGATAGCTC
AGAAGTTATGCCTTCTGAGATGGAGGTGTTTGAACAAGTCATGGGTAACGAACATTCAGGACGTGTTCGTGGAATGGGATTAGGACCTACCCAACTGAAGTTTTTAAAAC
TAAATATCGTCAATTTGCATGCATTGAGATCAATTCTGGAACAAGTATGGAAGACCATAAAACTTAGAAGGAAGATCTTAAATGACCAAATTAGAGTCTGA
Protein sequenceShow/hide protein sequence
MPSTRSRRHVVVTESSPLEENSINLEEANVDTEPNLAIDGDVDVENSFGACKTRGQTRMSDVWNLKEGQKIVVEFNNLNQPIKVGSNVLGNLLGTIGRTCNLCPLNYDDW
RKMPNIYKDEIWEIIKKTHRKWCLPSLSRKWGGYKCNLRGRYASKYNTEDECLKHKPDNIPREQWVDIKRSARNTEIRKKQKFTHSCGRKSLARKTYEMKEELGRDPTRV
EVFLAAHKCKDGSYIDEESAKIAGKFQEYMNHNADSSEVMPSEMEVFEQVMGNEHSGRVRGMGLGPTQLKFLKLNIVNLHALRSILEQVWKTIKLRRKILNDQIRV