CuGenDBv2

Spg008205 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg008205
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: ATP-dependent helicase ATRX
Genome location: scaffold2:19191047..19194525
RNA-Seq Expression: Spg008205
Synteny: Spg008205
Gene Ontology terms:
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0140658 - ATP-dependent chromatin remodeler activity (molecular function)
InterPro domains:
IPR000330 - SNF2, N-terminal
IPR025558 - Domain of unknown function DUF4283
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR038718 - SNF2-like, N-terminal domain superfamily
IPR044574 - ATPase ARIP4-like
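
The genome location string above can be split into a scaffold name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); the function names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end = parse_location("scaffold2:19191047..19194525")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the locus spans 3479 bp on scaffold2; with 0-based half-open coordinates the length would instead be `end - start`.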


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7020999.1 Protein CHROMATIN REMODELING 20 [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 1.1e-48 | % identity: 61.63
Query:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----
        + +SGIRF+WENIIQS+RKVKSGDK LGCILAHTMGLGKTFQVIAF YTAMRNVDLGLRT + VTPVNVLHNW QEFF+W+PSELKPLRVFMLED     
Subjt:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----

Query:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR
                                                     DGPDI VCDEAHMIKNTKAD+TQALK+
Subjt:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR

XP_022149859.1 uncharacterized protein LOC111018186 [Momordica charantia] | e-value: 6.6e-57 | % identity: 50.49
Query:  VINPFLPDKALLKCPSRQLANLLAKNKGWVSFGPIILKVEKWDKKLHSRISCIPCYGGWVKLRNLPLHLWNLKTFKAIGDSLGGFIEYDEVNSLLIECVE
        +INPF  DKAL+KCPS+ LA LL  NKGWV+FGP+ +K+E W+  LH R    P YG WVK+RN+PLHLW+L TFKAIG++LGGFI+YD+ NS  IEC +
Subjt:  VINPFLPDKALLKCPSRQLANLLAKNKGWVSFGPIILKVEKWDKKLHSRISCIPCYGGWVKLRNLPLHLWNLKTFKAIGDSLGGFIEYDEVNSLLIECVE

Query:  VKLKIKDNYCGFIPAEVRIKDGDDHFNVQIVTFQDGNLLIDKVAGLHSSFSPTAAQGFHRGPLDPIFCTMDRWRIEDSMDYPVVKTLHSE----EDLRTG
        V +K+K NYCGFIPAE+   DG   F  ++V+F+D   L  K  G+H  FS  AA+ FH+G  +    ++DRWR+E+  +YP V   +      +  R G
Subjt:  VKLKIKDNYCGFIPAEVRIKDGDDHFNVQIVTFQDGNLLIDKVAGLHSSFSPTAAQGFHRGPLDPIFCTMDRWRIEDSMDYPVVKTLHSE----EDLRTG

Query:  GGQL
        G +L
Subjt:  GGQL

XP_022937760.1 protein CHROMATIN REMODELING 20 isoform X3 [Cucurbita moschata] | e-value: 1.1e-48 | % identity: 61.63
Query:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----
        + +SGIRF+WENIIQS+RKVKSGDK LGCILAHTMGLGKTFQVIAF YTAMRNVDLGLRT + VTPVNVLHNW QEFF+W+PSELKPLRVFMLED     
Subjt:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----

Query:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR
                                                     DGPDI VCDEAHMIKNTKAD+TQALK+
Subjt:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR

XP_022965424.1 protein CHROMATIN REMODELING 20 isoform X1 [Cucurbita maxima] | e-value: 1.1e-48 | % identity: 61.63
Query:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----
        + +SGIRF+WENIIQS+RKVKSGDK LGCILAHTMGLGKTFQVIAF YTAMRNVDLGLRT + VTPVNVLHNW QEFF+W+PSELKPLRVFMLED     
Subjt:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----

Query:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR
                                                     DGPDI VCDEAHMIKNTKAD+TQALK+
Subjt:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR

XP_022965425.1 protein CHROMATIN REMODELING 20 isoform X2 [Cucurbita maxima] | e-value: 1.1e-48 | % identity: 61.63
Query:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----
        + +SGIRF+WENIIQS+RKVKSGDK LGCILAHTMGLGKTFQVIAF YTAMRNVDLGLRT + VTPVNVLHNW QEFF+W+PSELKPLRVFMLED     
Subjt:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----

Query:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR
                                                     DGPDI VCDEAHMIKNTKAD+TQALK+
Subjt:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1D6X4 uncharacterized protein LOC111018186 | e-value: 3.2e-57 | % identity: 50.49
Query:  VINPFLPDKALLKCPSRQLANLLAKNKGWVSFGPIILKVEKWDKKLHSRISCIPCYGGWVKLRNLPLHLWNLKTFKAIGDSLGGFIEYDEVNSLLIECVE
        +INPF  DKAL+KCPS+ LA LL  NKGWV+FGP+ +K+E W+  LH R    P YG WVK+RN+PLHLW+L TFKAIG++LGGFI+YD+ NS  IEC +
Subjt:  VINPFLPDKALLKCPSRQLANLLAKNKGWVSFGPIILKVEKWDKKLHSRISCIPCYGGWVKLRNLPLHLWNLKTFKAIGDSLGGFIEYDEVNSLLIECVE

Query:  VKLKIKDNYCGFIPAEVRIKDGDDHFNVQIVTFQDGNLLIDKVAGLHSSFSPTAAQGFHRGPLDPIFCTMDRWRIEDSMDYPVVKTLHSE----EDLRTG
        V +K+K NYCGFIPAE+   DG   F  ++V+F+D   L  K  G+H  FS  AA+ FH+G  +    ++DRWR+E+  +YP V   +      +  R G
Subjt:  VKLKIKDNYCGFIPAEVRIKDGDDHFNVQIVTFQDGNLLIDKVAGLHSSFSPTAAQGFHRGPLDPIFCTMDRWRIEDSMDYPVVKTLHSE----EDLRTG

Query:  GGQL
        G +L
Subjt:  GGQL

A0A6J1FGU8 ATP-dependent helicase ATRX | e-value: 5.4e-49 | % identity: 61.63
Query:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----
        + +SGIRF+WENIIQS+RKVKSGDK LGCILAHTMGLGKTFQVIAF YTAMRNVDLGLRT + VTPVNVLHNW QEFF+W+PSELKPLRVFMLED     
Subjt:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----

Query:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR
                                                     DGPDI VCDEAHMIKNTKAD+TQALK+
Subjt:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR

A0A6J1HLN1 ATP-dependent helicase ATRX | e-value: 5.4e-49 | % identity: 61.63
Query:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----
        + +SGIRF+WENIIQS+RKVKSGDK LGCILAHTMGLGKTFQVIAF YTAMRNVDLGLRT + VTPVNVLHNW QEFF+W+PSELKPLRVFMLED     
Subjt:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----

Query:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR
                                                     DGPDI VCDEAHMIKNTKAD+TQALK+
Subjt:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR

A0A6J1HNN1 ATP-dependent helicase ATRX | e-value: 5.4e-49 | % identity: 61.63
Query:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----
        + +SGIRF+WENIIQS+RKVKSGDK LGCILAHTMGLGKTFQVIAF YTAMRNVDLGLRT + VTPVNVLHNW QEFF+W+PSELKPLRVFMLED     
Subjt:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----

Query:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR
                                                     DGPDI VCDEAHMIKNTKAD+TQALK+
Subjt:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR

A0A6J1HQY8 ATP-dependent helicase ATRX | e-value: 5.4e-49 | % identity: 61.63
Query:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----
        + +SGIRF+WENIIQS+RKVKSGDK LGCILAHTMGLGKTFQVIAF YTAMRNVDLGLRT + VTPVNVLHNW QEFF+W+PSELKPLRVFMLED     
Subjt:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----

Query:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR
                                                     DGPDI VCDEAHMIKNTKAD+TQALK+
Subjt:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR

SwissProt top hits (e-value, % identity, alignment)
F4HW51 Protein CHROMATIN REMODELING 20 | e-value: 1.2e-45 | % identity: 56.98
Query:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----
        + ++GIRF+WENIIQSI +VKSGDK LGCILAHTMGLGKTFQVIAF YTAMR VDLGL+T L VTPVNVLHNW  EF KW PSE+KPLR+FML D     
Subjt:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----

Query:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR
                                                     DGPDI VCDEAH+IKNTKAD TQALK+
Subjt:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR

P46100 Transcriptional regulator ATRX | e-value: 1.3e-15 | % identity: 30.46
Query:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKW-------------------R
        + + G++F+W+   +S++K K    S GCILAH MGLGKT QV++F +T +    L   T L V P+N   NW  EF KW                   R
Subjt:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKW-------------------R

Query:  PSE-----------------------------------LKPLRVFMLEDDGPDIPVCDEAHMIKNTKADLTQAL
        P E                                   LK +    L D GPD  VCDE H++KN  + +++A+
Subjt:  PSE-----------------------------------LKPLRVFMLEDDGPDIPVCDEAHMIKNTKADLTQAL

Q61687 Transcriptional regulator ATRX | e-value: 2.2e-15 | % identity: 30.46
Query:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKW-------------------R
        + + G++F+W+   +S+ K K    S GCILAH MGLGKT QV++F +T +    L   T L V P+N   NW  EF KW                   R
Subjt:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKW-------------------R

Query:  PSE-----------------------------------LKPLRVFMLEDDGPDIPVCDEAHMIKNTKADLTQAL
        P E                                   LK +    L D GPD  VCDE H++KN  + +++A+
Subjt:  PSE-----------------------------------LKPLRVFMLEDDGPDIPVCDEAHMIKNTKADLTQAL

Q7YQM3 Transcriptional regulator ATRX | e-value: 1.3e-15 | % identity: 30.46
Query:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKW-------------------R
        + + G++F+W+   +S++K K    S GCILAH MGLGKT QV++F +T +    L   T L V P+N   NW  EF KW                   R
Subjt:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKW-------------------R

Query:  PSE-----------------------------------LKPLRVFMLEDDGPDIPVCDEAHMIKNTKADLTQAL
        P E                                   LK +    L D GPD  VCDE H++KN  + +++A+
Subjt:  PSE-----------------------------------LKPLRVFMLEDDGPDIPVCDEAHMIKNTKADLTQAL

Q7YQM4 Transcriptional regulator ATRX | e-value: 1.3e-15 | % identity: 30.46
Query:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKW-------------------R
        + + G++F+W+   +S++K K    S GCILAH MGLGKT QV++F +T +    L   T L V P+N   NW  EF KW                   R
Subjt:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKW-------------------R

Query:  PSE-----------------------------------LKPLRVFMLEDDGPDIPVCDEAHMIKNTKADLTQAL
        P E                                   LK +    L D GPD  VCDE H++KN  + +++A+
Subjt:  PSE-----------------------------------LKPLRVFMLEDDGPDIPVCDEAHMIKNTKADLTQAL

Arabidopsis top hits (e-value, % identity, alignment)
AT1G08600.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein | e-value: 3.1e-49 | % identity: 64.9
Query:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----
        + ++GIRF+WENIIQSI +VKSGDK LGCILAHTMGLGKTFQVIAF YTAMR VDLGL+T L VTPVNVLHNW  EF KW PSE+KPLR+FML D     
Subjt:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----

Query:  ------------------------DGPDIPVCDEAHMIKNTKADLTQALKR
                                DGPDI VCDEAH+IKNTKAD TQALK+
Subjt:  ------------------------DGPDIPVCDEAHMIKNTKADLTQALKR

AT1G08600.2 P-loop containing nucleoside triphosphate hydrolases superfamily protein | e-value: 8.6e-47 | % identity: 56.98
Query:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----
        + ++GIRF+WENIIQSI +VKSGDK LGCILAHTMGLGKTFQVIAF YTAMR VDLGL+T L VTPVNVLHNW  EF KW PSE+KPLR+FML D     
Subjt:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----

Query:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR
                                                     DGPDI VCDEAH+IKNTKAD TQALK+
Subjt:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR

AT1G08600.3 P-loop containing nucleoside triphosphate hydrolases superfamily protein | e-value: 8.6e-47 | % identity: 56.98
Query:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----
        + ++GIRF+WENIIQSI +VKSGDK LGCILAHTMGLGKTFQVIAF YTAMR VDLGL+T L VTPVNVLHNW  EF KW PSE+KPLR+FML D     
Subjt:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----

Query:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR
                                                     DGPDI VCDEAH+IKNTKAD TQALK+
Subjt:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR

AT1G08600.4 P-loop containing nucleoside triphosphate hydrolases superfamily protein | e-value: 8.6e-47 | % identity: 56.98
Query:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----
        + ++GIRF+WENIIQSI +VKSGDK LGCILAHTMGLGKTFQVIAF YTAMR VDLGL+T L VTPVNVLHNW  EF KW PSE+KPLR+FML D     
Subjt:  YSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSELKPLRVFMLED-----

Query:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR
                                                     DGPDI VCDEAH+IKNTKAD TQALK+
Subjt:  ---------------------------------------------DGPDIPVCDEAHMIKNTKADLTQALKR

AT3G42670.1 chromatin remodeling 38 | e-value: 2.7e-08 | % identity: 32.67
Query:  WDIISRFKERMQGKDSWDYSISGIRFLWENIIQSIRKV---KSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFK
        W +I + K ++       +      FLW+N+  S+       S DK  GC+++HT G GKTF +IAF  + ++ +  G R  L + P   L+ W +EF K
Subjt:  WDIISRFKERMQGKDSWDYSISGIRFLWENIIQSIRKV---KSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFK

Query:  W
        W
Subjt:  W


Sequences
CDS sequence
ATGATGACCAAGAGAGTCTCGACCAACGTGGGAGGAGATTCATCTTCGGTTGGAAAAGAGGCAGAGCCTACCACCGCCCTCTCACCACGAGCAACAACCGTACGCTTGCT
GTCGGTTGAACAAGATGAGAAAATCTTGAAGAATGATGTGGGAGCTGAATTCAAAACAAGACAGTGGCAAGAAAAGCTAGGAATGGAACAAAGAGTGGTCCAAGAAGTAG
ATTTGCCTCCAAGAATGGTCCAAGAAGTAGATTCTCCTCCAAGAATCCATCAAGAACGTCGACCAATGCTCCAAGAATTCAACAATAATCCCTTATATAGTGGGAAGATG
GAGATTGAAACTTTCTTGGAATGGGACATAATTTCAAGATTTAAGGAGAGAATGCAAGGGAAAGATTCATGGGACTATTCGATTTCTGGAATAAGATTTCTGTGGGAAAA
TATCATTCAATCAATAAGGAAAGTGAAGTCTGGTGATAAAAGTCTTGGATGCATTCTTGCTCACACAATGGGTCTTGGTAAAACTTTTCAGGTTATAGCATTCTTCTACA
CTGCCATGAGAAATGTGGACTTGGGTTTACGAACAGAACTTAGAGTCACCCCTGTTAATGTGTTGCATAATTGGCCGCAAGAATTCTTCAAGTGGAGGCCTTCAGAATTA
AAGCCCCTTCGTGTTTTCATGCTGGAAGACGATGGGCCCGATATTCCTGTATGTGATGAGGCCCACATGATTAAGAACACCAAGGCTGATTTAACCCAGGCATTGAAAAG
AGATGGAAACTTTGGAAGAAGGGTGAGGGAGAGCCGGGTGTTTAGGGATCTGGAGGACTTTCCAGAGGGGAGGGTGGGGTTTGTCTTGGGTTTTCGTCGGAGTTTTCATG
GAGAAGAAGTAGTTATCGTCTGGATTATTGACTCGATCGAGGATCTGCTCCACGCCCCAGCTACTCACAAGTTCTTCAGAAAGGTCGACTGCAAAAACGGTTTCATTTGG
ATCCAAAAAATCTCCAACAAACGAGGGAGTTTCTTGGAGATTACAAAGGTCAATTCCTCGGGCGGTAAACATAACCTGGTTGTCCCAGCTGGGGATGATTACAAGGGGTG
GAGAAACTTCTCTGATTTACTAAAAAAATTTCTCAACGGTGGGGATGAGCTTCTGGAAGAGAAAAAAGATATAGAACAAGGGGATAGGTTAATCAGAAAAGGGAAATCGT
TTGTTGATATTGAACTCCTTGTTATTAACCCCTTTCTTCCGGACAAGGCTCTTCTTAAATGTCCTTCCCGTCAGCTAGCTAATTTGCTCGCTAAAAATAAAGGGTGGGTG
AGTTTTGGTCCGATTATATTGAAGGTGGAGAAATGGGACAAAAAACTTCACTCTAGAATTTCCTGTATACCCTGCTATGGTGGTTGGGTCAAGCTTCGAAACCTCCCTCT
GCATTTATGGAACCTTAAGACATTTAAAGCTATCGGGGACAGTCTTGGAGGGTTCATTGAATATGATGAGGTTAATTCCTTGCTCATTGAATGCGTGGAGGTTAAATTGA
AAATCAAGGATAATTATTGTGGGTTCATTCCAGCCGAGGTGAGAATAAAGGACGGGGACGACCACTTCAATGTGCAAATCGTCACCTTCCAAGACGGGAACTTGTTGATA
GACAAAGTGGCTGGGCTTCACAGCAGCTTCTCACCAACGGCAGCCCAGGGATTCCACAGAGGGCCTCTCGATCCCATCTTTTGCACAATGGATAGATGGAGGATTGAGGA
CAGTATGGATTACCCAGTGGTCAAAACTCTGCATTCGGAAGAAGACCTCAGGACAGGTGGAGGACAGCTGGTGGGCAATATTACAGATTTTGAAATTTCCCGCTCAAAAA
ACAAAGGAAAAGCCCTAGTCGAAGCTGGGTCCTCAGAACCCTTTGATGCGATGGCCCGAGTGGAAGCCCATTTTCAAGTTGCCCAGTCAGAATTAGAAGAAGGAGATTGG
GTCGTAAAGCGGCCCAAAAAGAAAAATAAAAAAGGGGTTTCCTTTGCCAAAGAGACCCAGATTTCCGTTTTTAAAAAGGGGGAGGTCCACTCCACTAGTACATCGAAAAG
CCCACAAAGAAAGGATGGATTGGCCGAAGTCTGGAATGGCGAACAAGGTATTGAATCCGACCTCTCCTTCTCGAGCCCAGTGTCACAATCGCGGTCTTTAGGGTCTCGTA
AAGCGACTGTGCGGCACTTACTTTCGCTCACAAGTAAGTCAGCCAACTCGAAAACGGATCGCCGGTGGCACAACGGGCGAATATCGCAACCTCTACCCCCGTTCTTGGAA
AACGGTTTTAGAAAACGGGGTTAG
mRNA sequence
ATGATGACCAAGAGAGTCTCGACCAACGTGGGAGGAGATTCATCTTCGGTTGGAAAAGAGGCAGAGCCTACCACCGCCCTCTCACCACGAGCAACAACCGTACGCTTGCT
GTCGGTTGAACAAGATGAGAAAATCTTGAAGAATGATGTGGGAGCTGAATTCAAAACAAGACAGTGGCAAGAAAAGCTAGGAATGGAACAAAGAGTGGTCCAAGAAGTAG
ATTTGCCTCCAAGAATGGTCCAAGAAGTAGATTCTCCTCCAAGAATCCATCAAGAACGTCGACCAATGCTCCAAGAATTCAACAATAATCCCTTATATAGTGGGAAGATG
GAGATTGAAACTTTCTTGGAATGGGACATAATTTCAAGATTTAAGGAGAGAATGCAAGGGAAAGATTCATGGGACTATTCGATTTCTGGAATAAGATTTCTGTGGGAAAA
TATCATTCAATCAATAAGGAAAGTGAAGTCTGGTGATAAAAGTCTTGGATGCATTCTTGCTCACACAATGGGTCTTGGTAAAACTTTTCAGGTTATAGCATTCTTCTACA
CTGCCATGAGAAATGTGGACTTGGGTTTACGAACAGAACTTAGAGTCACCCCTGTTAATGTGTTGCATAATTGGCCGCAAGAATTCTTCAAGTGGAGGCCTTCAGAATTA
AAGCCCCTTCGTGTTTTCATGCTGGAAGACGATGGGCCCGATATTCCTGTATGTGATGAGGCCCACATGATTAAGAACACCAAGGCTGATTTAACCCAGGCATTGAAAAG
AGATGGAAACTTTGGAAGAAGGGTGAGGGAGAGCCGGGTGTTTAGGGATCTGGAGGACTTTCCAGAGGGGAGGGTGGGGTTTGTCTTGGGTTTTCGTCGGAGTTTTCATG
GAGAAGAAGTAGTTATCGTCTGGATTATTGACTCGATCGAGGATCTGCTCCACGCCCCAGCTACTCACAAGTTCTTCAGAAAGGTCGACTGCAAAAACGGTTTCATTTGG
ATCCAAAAAATCTCCAACAAACGAGGGAGTTTCTTGGAGATTACAAAGGTCAATTCCTCGGGCGGTAAACATAACCTGGTTGTCCCAGCTGGGGATGATTACAAGGGGTG
GAGAAACTTCTCTGATTTACTAAAAAAATTTCTCAACGGTGGGGATGAGCTTCTGGAAGAGAAAAAAGATATAGAACAAGGGGATAGGTTAATCAGAAAAGGGAAATCGT
TTGTTGATATTGAACTCCTTGTTATTAACCCCTTTCTTCCGGACAAGGCTCTTCTTAAATGTCCTTCCCGTCAGCTAGCTAATTTGCTCGCTAAAAATAAAGGGTGGGTG
AGTTTTGGTCCGATTATATTGAAGGTGGAGAAATGGGACAAAAAACTTCACTCTAGAATTTCCTGTATACCCTGCTATGGTGGTTGGGTCAAGCTTCGAAACCTCCCTCT
GCATTTATGGAACCTTAAGACATTTAAAGCTATCGGGGACAGTCTTGGAGGGTTCATTGAATATGATGAGGTTAATTCCTTGCTCATTGAATGCGTGGAGGTTAAATTGA
AAATCAAGGATAATTATTGTGGGTTCATTCCAGCCGAGGTGAGAATAAAGGACGGGGACGACCACTTCAATGTGCAAATCGTCACCTTCCAAGACGGGAACTTGTTGATA
GACAAAGTGGCTGGGCTTCACAGCAGCTTCTCACCAACGGCAGCCCAGGGATTCCACAGAGGGCCTCTCGATCCCATCTTTTGCACAATGGATAGATGGAGGATTGAGGA
CAGTATGGATTACCCAGTGGTCAAAACTCTGCATTCGGAAGAAGACCTCAGGACAGGTGGAGGACAGCTGGTGGGCAATATTACAGATTTTGAAATTTCCCGCTCAAAAA
ACAAAGGAAAAGCCCTAGTCGAAGCTGGGTCCTCAGAACCCTTTGATGCGATGGCCCGAGTGGAAGCCCATTTTCAAGTTGCCCAGTCAGAATTAGAAGAAGGAGATTGG
GTCGTAAAGCGGCCCAAAAAGAAAAATAAAAAAGGGGTTTCCTTTGCCAAAGAGACCCAGATTTCCGTTTTTAAAAAGGGGGAGGTCCACTCCACTAGTACATCGAAAAG
CCCACAAAGAAAGGATGGATTGGCCGAAGTCTGGAATGGCGAACAAGGTATTGAATCCGACCTCTCCTTCTCGAGCCCAGTGTCACAATCGCGGTCTTTAGGGTCTCGTA
AAGCGACTGTGCGGCACTTACTTTCGCTCACAAGTAAGTCAGCCAACTCGAAAACGGATCGCCGGTGGCACAACGGGCGAATATCGCAACCTCTACCCCCGTTCTTGGAA
AACGGTTTTAGAAAACGGGGTTAG
Protein sequence
MMTKRVSTNVGGDSSSVGKEAEPTTALSPRATTVRLLSVEQDEKILKNDVGAEFKTRQWQEKLGMEQRVVQEVDLPPRMVQEVDSPPRIHQERRPMLQEFNNNPLYSGKM
EIETFLEWDIISRFKERMQGKDSWDYSISGIRFLWENIIQSIRKVKSGDKSLGCILAHTMGLGKTFQVIAFFYTAMRNVDLGLRTELRVTPVNVLHNWPQEFFKWRPSEL
KPLRVFMLEDDGPDIPVCDEAHMIKNTKADLTQALKRDGNFGRRVRESRVFRDLEDFPEGRVGFVLGFRRSFHGEEVVIVWIIDSIEDLLHAPATHKFFRKVDCKNGFIW
IQKISNKRGSFLEITKVNSSGGKHNLVVPAGDDYKGWRNFSDLLKKFLNGGDELLEEKKDIEQGDRLIRKGKSFVDIELLVINPFLPDKALLKCPSRQLANLLAKNKGWV
SFGPIILKVEKWDKKLHSRISCIPCYGGWVKLRNLPLHLWNLKTFKAIGDSLGGFIEYDEVNSLLIECVEVKLKIKDNYCGFIPAEVRIKDGDDHFNVQIVTFQDGNLLI
DKVAGLHSSFSPTAAQGFHRGPLDPIFCTMDRWRIEDSMDYPVVKTLHSEEDLRTGGGQLVGNITDFEISRSKNKGKALVEAGSSEPFDAMARVEAHFQVAQSELEEGDW
VVKRPKKKNKKGVSFAKETQISVFKKGEVHSTSTSKSPQRKDGLAEVWNGEQGIESDLSFSSPVSQSRSLGSRKATVRHLLSLTSKSANSKTDRRWHNGRISQPLPPFLE
NGFRKRG
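
As a quick consistency check between the CDS and protein sequences listed above, the CDS can be translated codon by codon and compared with the protein. The following is a minimal sketch, assuming the standard nuclear genetic code; the `translate` helper is hypothetical and written for illustration only, and just the first and last 24 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code, built from the usual compact TCAG-ordered string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 24 nt of the CDS above: should match the protein N-terminus.
print(translate("ATGATGACCAAGAGAGTCTCGACC"))  # MMTKRVST
# Last 24 nt of the CDS above: should match the protein C-terminus plus stop.
print(translate("AACGGTTTTAGAAAACGGGGTTAG"))  # NGFRKRG*
```

Both fragments agree with the ends of the listed protein sequence (MMTKRVST... and ...NGFRKRG), with the final TAG codon acting as the stop. Translating the full CDS the same way would require joining the sequence lines into one string first.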