CuGenDBv2

HG10001381 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10001381
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: DNA polymerase epsilon subunit C
Genome location: Chr09:16590888..16593678
RNA-Seq Expression: HG10001381
Synteny: HG10001381
Gene Ontology terms:
  GO:0006355 - regulation of transcription, DNA-templated (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domains:
  IPR003958 - Transcription factor CBF/NF-Y/archaeal histone domain
  IPR009072 - Histone-fold
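
As a quick sanity check on the genome location listed for this gene, the sketch below parses a `Chr:start..end` string and reports the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state. `parse_location` and `span_length` are hypothetical helper names used only for illustration; they are not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a string such as 'Chr09:16590888..16593678' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Number of bases covered by the interval.

    Assumption: coordinates are 1-based and inclusive, so both ends count.
    """
    return abs(end - start) + 1


chrom, start, end = parse_location("Chr09:16590888..16593678")
print(chrom, span_length(start, end))  # Chr09 2791
```

Under that assumption the gene spans 2791 bp on Chr09, which is longer than the CDS given in the Sequences section and so would be consistent with introns or untranslated regions in the gene model.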


Homology
GenBank top hits (e value, % identity, alignment)
KAG7010622.1 Chromatin accessibility complex protein 1, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.0e-75; % identity: 74.79)
Query:  MASSKKSSTEAKSKEAGTSKPAPKTKTH-NNARKSDKDDSKKKKKKKKKISTNNGSIKDNETLAISAPASASEEDDADEDRAETTSKKPNTSKSKKFKRN
        MASSKKSS+EAKS+EAGTSKPAPK KTH NNAR  DK  SKKKK       T+NGS+K+NE L ISAP       DA+EDRAETTSK PNTSKSKK KRN
Subjt:  MASSKKSSTEAKSKEAGTSKPAPKTKTH-NNARKSDKDDSKKKKKKKKKISTNNGSIKDNETLAISAPASASEEDDADEDRAETTSKKPNTSKSKKFKRN

Query:  HVKKEENGYAAE-----------EEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVA
        H KKE+NGY  E           EEE EEKI KFPM+RIKKIMRDENSDLRINQEALFLVNKA+EMFLVQFCKDAYACCAQDRKKSLAYKHL        
Subjt:  HVKKEENGYAAE-----------EEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVA

Query:  ASVACKRKRYDFLSDFVPEKLKFEDALKERSMAESGKN
        +SV CKRKRYDFLSDFVPEKLKFEDALKERSMAESGK+
Subjt:  ASVACKRKRYDFLSDFVPEKLKFEDALKERSMAESGKN

XP_004149539.2 chromatin accessibility complex protein 1 isoform X1 [Cucumis sativus] (e value: 4.5e-83; % identity: 82.68)
Query:  MASSKKSSTEAKSKEAGTSKPAPKTKTHNNARKSDKDDSKKKKKKKK---KISTNNGSIKDNETLAISAP-ASASEE-DDADEDRAETTSKKPNTSKSKK
        MASSKKSSTEAKSKE GTSKP+PK KTHNNAR SDKD SKKKKKKKK    ISTNNGS+KDN  LA+SAP ASASE+ D ADED+AETTS+K NTSKSKK
Subjt:  MASSKKSSTEAKSKEAGTSKPAPKTKTHNNARKSDKDDSKKKKKKKK---KISTNNGSIKDNETLAISAP-ASASEE-DDADEDRAETTSKKPNTSKSKK

Query:  FKRNHVKKEENGYAAEEEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKR
         KRNH K+E+N YAA EEE EEKI KFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHL        +SV  KR
Subjt:  FKRNHVKKEENGYAAEEEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKR

Query:  KRYDFLSDFVPEKLKFEDALKERSMAESGKN
        KRYDFLSDFVPEKLKFEDALKERSMAESGK+
Subjt:  KRYDFLSDFVPEKLKFEDALKERSMAESGKN

XP_008463933.1 PREDICTED: DNA polymerase epsilon subunit C [Cucumis melo] (e value: 1.2e-83; % identity: 83.04)
Query:  MASSKKSSTEAKSKEAGTSKPAPKTKTHNNARKSDKDDSKKKKKKKK-KISTNNGSIKDNETLAISAP-ASASEEDD-ADEDRAETTSKKPNTSKSKKFK
        MASSKKSSTEAK KEAGTSKPAPK KTHNNAR SDK+ SKKKKKKKK  ISTNNGS+KDNE LA+S P  SASE+DD ADED+AETTS+K  TSKSKK K
Subjt:  MASSKKSSTEAKSKEAGTSKPAPKTKTHNNARKSDKDDSKKKKKKKK-KISTNNGSIKDNETLAISAP-ASASEEDD-ADEDRAETTSKKPNTSKSKKFK

Query:  RNHVKKEENGYA-AEEEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKRK
        RNH K+E+NGYA  EEEE EEKI KFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHL        +SV  KRK
Subjt:  RNHVKKEENGYA-AEEEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKRK

Query:  RYDFLSDFVPEKLKFEDALKERSMAESGKN
        RYDFLSDFVPEKLKFEDALKERSMAESGK+
Subjt:  RYDFLSDFVPEKLKFEDALKERSMAESGKN

XP_022140741.1 DNA polymerase epsilon subunit C [Momordica charantia] (e value: 1.4e-76; % identity: 77.22)
Query:  MASSKKSSTEAKSKEAGTSKPA------PKTKTHNNARKSDKDDSKKKKKKKK---KISTNNGSIKDNETLAISAPASASEEDDADEDRAETTSKKPNTS
        MASSKKSSTEAKSKEAGTSK A       KTKT +NA KSDKD SKK   KKK   K   NNGS ++N+   ISAP SAS EDDADEDR ETTS+K N+S
Subjt:  MASSKKSSTEAKSKEAGTSKPA------PKTKTHNNARKSDKDDSKKKKKKKK---KISTNNGSIKDNETLAISAPASASEEDDADEDRAETTSKKPNTS

Query:  KSKKFKRNHVKKEENGYAAEEEE---VEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVA
        KSKK KRNH KKEENGY  EEEE   VEEKI KFPMHRIKKI RDENSDLRINQEALFLVNKA+EMFLVQFCKDAYACCAQDRKKSLAYKHL        
Subjt:  KSKKFKRNHVKKEENGYAAEEEE---VEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVA

Query:  ASVACKRKRYDFLSDFVPEKLKFEDALKERSMAESGK
        +SV CKRKRYDFLSDFVPEKLKFEDALKERSMAESGK
Subjt:  ASVACKRKRYDFLSDFVPEKLKFEDALKERSMAESGK

XP_038902902.1 DNA polymerase epsilon subunit C [Benincasa hispida] (e value: 4.5e-91; % identity: 87.77)
Query:  MASSKKSSTEAKSKEAGTSKPAPKTKTHNNARKSDKDDSKKKKKKKKKISTNNGSIKDNETLAISAP--ASASEEDDADEDRAETTSKKPNTSKSKKFKR
        MASSKKSSTEAKSKEAGTSKP PKTKTHNNARKSDKDDS KKKKKKKKIST NGS+KDNE LA S P  ASASEE DADEDRAETTS KPNTSKSKK +R
Subjt:  MASSKKSSTEAKSKEAGTSKPAPKTKTHNNARKSDKDDSKKKKKKKKKISTNNGSIKDNETLAISAP--ASASEEDDADEDRAETTSKKPNTSKSKKFKR

Query:  NHVKKEENGYAA-EEEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKRKR
        NHVKKEENGYAA EEEE EEKI KFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHL        +SV CKRKR
Subjt:  NHVKKEENGYAA-EEEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKRKR

Query:  YDFLSDFVPEKLKFEDALKERSMAESGKN
        YDFLSDFVPEKLKFEDALKER+MAESGK+
Subjt:  YDFLSDFVPEKLKFEDALKERSMAESGKN

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KCQ2 CBFD_NFYB_HMF domain-containing protein (e value: 2.2e-83; % identity: 82.68)
Query:  MASSKKSSTEAKSKEAGTSKPAPKTKTHNNARKSDKDDSKKKKKKKK---KISTNNGSIKDNETLAISAP-ASASEE-DDADEDRAETTSKKPNTSKSKK
        MASSKKSSTEAKSKE GTSKP+PK KTHNNAR SDKD SKKKKKKKK    ISTNNGS+KDN  LA+SAP ASASE+ D ADED+AETTS+K NTSKSKK
Subjt:  MASSKKSSTEAKSKEAGTSKPAPKTKTHNNARKSDKDDSKKKKKKKK---KISTNNGSIKDNETLAISAP-ASASEE-DDADEDRAETTSKKPNTSKSKK

Query:  FKRNHVKKEENGYAAEEEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKR
         KRNH K+E+N YAA EEE EEKI KFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHL        +SV  KR
Subjt:  FKRNHVKKEENGYAAEEEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKR

Query:  KRYDFLSDFVPEKLKFEDALKERSMAESGKN
        KRYDFLSDFVPEKLKFEDALKERSMAESGK+
Subjt:  KRYDFLSDFVPEKLKFEDALKERSMAESGKN

A0A1S3CKU7 DNA polymerase epsilon subunit C (e value: 5.7e-84; % identity: 83.04)
Query:  MASSKKSSTEAKSKEAGTSKPAPKTKTHNNARKSDKDDSKKKKKKKK-KISTNNGSIKDNETLAISAP-ASASEEDD-ADEDRAETTSKKPNTSKSKKFK
        MASSKKSSTEAK KEAGTSKPAPK KTHNNAR SDK+ SKKKKKKKK  ISTNNGS+KDNE LA+S P  SASE+DD ADED+AETTS+K  TSKSKK K
Subjt:  MASSKKSSTEAKSKEAGTSKPAPKTKTHNNARKSDKDDSKKKKKKKK-KISTNNGSIKDNETLAISAP-ASASEEDD-ADEDRAETTSKKPNTSKSKKFK

Query:  RNHVKKEENGYA-AEEEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKRK
        RNH K+E+NGYA  EEEE EEKI KFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHL        +SV  KRK
Subjt:  RNHVKKEENGYA-AEEEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKRK

Query:  RYDFLSDFVPEKLKFEDALKERSMAESGKN
        RYDFLSDFVPEKLKFEDALKERSMAESGK+
Subjt:  RYDFLSDFVPEKLKFEDALKERSMAESGKN

A0A6J1CGZ0 DNA polymerase epsilon subunit C (e value: 6.8e-77; % identity: 77.22)
Query:  MASSKKSSTEAKSKEAGTSKPA------PKTKTHNNARKSDKDDSKKKKKKKK---KISTNNGSIKDNETLAISAPASASEEDDADEDRAETTSKKPNTS
        MASSKKSSTEAKSKEAGTSK A       KTKT +NA KSDKD SKK   KKK   K   NNGS ++N+   ISAP SAS EDDADEDR ETTS+K N+S
Subjt:  MASSKKSSTEAKSKEAGTSKPA------PKTKTHNNARKSDKDDSKKKKKKKK---KISTNNGSIKDNETLAISAPASASEEDDADEDRAETTSKKPNTS

Query:  KSKKFKRNHVKKEENGYAAEEEE---VEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVA
        KSKK KRNH KKEENGY  EEEE   VEEKI KFPMHRIKKI RDENSDLRINQEALFLVNKA+EMFLVQFCKDAYACCAQDRKKSLAYKHL        
Subjt:  KSKKFKRNHVKKEENGYAAEEEE---VEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVA

Query:  ASVACKRKRYDFLSDFVPEKLKFEDALKERSMAESGK
        +SV CKRKRYDFLSDFVPEKLKFEDALKERSMAESGK
Subjt:  ASVACKRKRYDFLSDFVPEKLKFEDALKERSMAESGK

A0A6J1FVY8 DNA polymerase epsilon subunit C-like isoform X1 (e value: 2.2e-75; % identity: 76.62)
Query:  MASSKKSSTEAKSKEAGTSKPAPKTKTH-NNARKSDKDDSKKKKKKKKKISTNNGSIKDNETLAISAPASASEEDDADEDRAETTSKKPNTSKSKKFKRN
        MASSKKSS+EAKSKEAGTSKPAP  KTH NNAR  DK  SKKKK       T+NGS+K+NE L ISAP       DA+EDRAETTSK PNTSKSKK KRN
Subjt:  MASSKKSSTEAKSKEAGTSKPAPKTKTH-NNARKSDKDDSKKKKKKKKKISTNNGSIKDNETLAISAPASASEEDDADEDRAETTSKKPNTSKSKKFKRN

Query:  HVKKEENGYAAE----EEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKR
          KKE+NGY  E    EEE EEKI KFPM+RIKKIMRDENSDLRINQEALFLVNKA+EMFLVQFCKDAYACCAQDRKKSLAYKHL        +SV CKR
Subjt:  HVKKEENGYAAE----EEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKR

Query:  KRYDFLSDFVPEKLKFEDALKERSMAESGKN
        KRYDFLSDFVPEKLKFEDALKERSMAESGK+
Subjt:  KRYDFLSDFVPEKLKFEDALKERSMAESGKN

A0A6J1JF10 DNA polymerase epsilon subunit C-like isoform X1 (e value: 1.3e-75; % identity: 74.79)
Query:  MASSKKSSTEAKSKEAGTSKPAPKTKTH-------NNARKSDKDDSKKKK-KKKKKISTNNGSIKDNETLAISAPASASEEDDADEDRAETTSKKPNTSK
        MASSKKSS+EAKSKEAGTSKPAPK KTH       NNARK DK  SKKKK        T+NGS+K+NE L ISAP       DA+EDR ETTSK PNTSK
Subjt:  MASSKKSSTEAKSKEAGTSKPAPKTKTH-------NNARKSDKDDSKKKK-KKKKKISTNNGSIKDNETLAISAPASASEEDDADEDRAETTSKKPNTSK

Query:  SKKFKRNHVKKEENGYAAE--------EEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFF
        SKK KRNH KKEENGYA E        EEE EEKI KFPM+RIKKIMRDENSDLRINQEALFLVNKA+EMFLVQFCKDAYACCAQDRKKSLAYKHL    
Subjt:  SKKFKRNHVKKEENGYAAE--------EEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFF

Query:  FVVAASVACKRKRYDFLSDFVPEKLKFEDALKERSMAESGKN
            +SV CKRKRYDFLSDFVPEKLKFEDALKERSMAESGK+
Subjt:  FVVAASVACKRKRYDFLSDFVPEKLKFEDALKERSMAESGKN

SwissProt top hits (e value, % identity, alignment)
A6QQ14 DNA polymerase epsilon subunit 4 (e value: 3.3e-04; % identity: 30.16)
Query:  KICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHL
        ++ + P+ R+K +++ +       QEA+F++ +A+E+F+    KDAY C  Q ++K+L  + L
Subjt:  KICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHL

Q9CQ36 DNA polymerase epsilon subunit 4 (e value: 3.3e-04; % identity: 30.16)
Query:  KICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHL
        ++ + P+ R+K +++ +       QEA+F++ +A+E+F+    KDAY C  Q ++K+L  + L
Subjt:  KICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHL

Q9JKP8 Chromatin accessibility complex protein 1 (e value: 6.7e-05; % identity: 31.82)
Query:  EEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKRKRYDFLSDFVPEKL---KFE
        ++++   P+ RI+ IM+       INQEAL L  KA+E+F+      +Y   +   KK+L Y  L        AS A   +   FL+D +P+K+   K+ 
Subjt:  EEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKRKRYDFLSDFVPEKL---KFE

Query:  DALKERSMAE
          LKE+   E
Subjt:  DALKERSMAE

Q9NR33 DNA polymerase epsilon subunit 4 (e value: 3.3e-04; % identity: 30.16)
Query:  KICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHL
        ++ + P+ R+K +++ +       QEA+F++ +A+E+F+    KDAY C  Q ++K+L  + L
Subjt:  KICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHL

Q9NRG0 Chromatin accessibility complex protein 1 (e value: 1.0e-05; % identity: 31.58)
Query:  EEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKRKRYDFLSDFVPEKL---KFE
        E+++   P+ RI+ IM+       INQEAL L  KA+E+F+      +Y   +   KK L Y  L        A+ A + + + FL+D +P+K+   K+ 
Subjt:  EEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKRKRYDFLSDFVPEKL---KFE

Query:  DALKERSMAESGKN
          LKE    E  +N
Subjt:  DALKERSMAESGKN
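
The %identity figures reported for these hits can be cross-checked against the alignment midlines. The sketch below assumes the usual BLAST midline convention: a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap. Identity is then the number of letters divided by the alignment length. `percent_identity` is a hypothetical helper, and the example midline is the one shown for the Q9NR33 hit, rebuilt piecewise so that the runs of blanks are explicit.

```python
def percent_identity(midline):
    """Percent identity from a BLAST-style midline.

    Assumption: letters are identities, '+' are positives, blanks are
    mismatches or gaps, and the midline spans the full alignment width
    (trailing blanks must not be stripped).
    """
    if not midline:
        raise ValueError("empty midline")
    identities = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identities / len(midline), 2)


# Midline of the Q9NR33 alignment (63 columns, 19 identical residues).
midline = (
    "++ + P+ R+K +++ +"
    + " " * 7
    + "QEA+F++ +A+E+F+"
    + " " * 4
    + "KDAY C  Q ++K+L  + L"
)
print(len(midline), percent_identity(midline))  # 63 30.16
```

For alignments that wrap over several lines, the midline segments would need to be concatenated, each padded to the width of its Query line, before calling the helper.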

Arabidopsis top hits (e value, % identity, alignment)
AT1G07980.1 nuclear factor Y, subunit C10 (e value: 4.3e-15; % identity: 33.19)
Query:  MASSKKSSTEAKSKEAGTSKPAPKTKTHNNARKSDKDDSKKKKKKKKKISTNNGSIKDNETLAISAPASASEEDDADEDRAETTSKKPNTSKSKKFKRNH
        M SSKK   +    +   +K + ++K  + +R     +     KKK +I       + +E+ +  +   A   D+A +     + +    S     K + 
Subjt:  MASSKKSSTEAKSKEAGTSKPAPKTKTHNNARKSDKDDSKKKKKKKKKISTNNGSIKDNETLAISAPASASEEDDADEDRAETTSKKPNTSKSKKFKRNH

Query:  VKKEENGYAAEEEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKRKRYDF
         ++E++G A      E+   KFPM+RI++IMR +NS  +I Q+A+FLVNKA+EMF+ +F ++AY    +D+KK + YKHL        +SV    +RY+F
Subjt:  VKKEENGYAAEEEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKRKRYDF

Query:  LSDFVPEKLKFEDALK--ERSMAESG
        L+D VPEKLK E AL+  ER M ++G
Subjt:  LSDFVPEKLKFEDALK--ERSMAESG

AT5G43250.1 nuclear factor Y, subunit C13 (e value: 7.6e-04; % identity: 30.93)
Query:  EEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKRKR--YDFLSDFVP
        EEE      +FP+ R+KKIM+ +    +IN EAL ++  ++E+FL    + +    A+ ++K++   HL          +A KR +   DFL D +P
Subjt:  EEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKRKR--YDFLSDFVP


Sequences
CDS sequence
ATGGCTTCATCCAAAAAATCAAGTACTGAAGCCAAGAGCAAAGAAGCGGGAACCTCAAAGCCAGCCCCCAAAACCAAGACTCACAACAATGCCAGAAAATCCGATAAGGA
TGATTCAAAGAAGAAGAAGAAGAAGAAGAAGAAGATTAGCACCAACAATGGCTCCATCAAGGACAACGAAACCTTAGCTATTTCTGCTCCAGCCTCAGCCTCTGAGGAGG
ACGATGCCGATGAAGACAGAGCCGAAACCACTAGTAAAAAACCTAACACCTCGAAATCGAAGAAATTTAAACGGAACCATGTCAAGAAAGAAGAGAATGGTTATGCTGCC
GAAGAAGAAGAAGTCGAGGAGAAGATTTGTAAATTCCCTATGCATCGGATCAAGAAAATCATGAGGGACGAAAATTCTGATTTGCGCATCAATCAGGAAGCTTTGTTTCT
CGTCAACAAAGCTTCGGAGATGTTTCTCGTACAATTTTGCAAAGATGCATATGCGTGCTGTGCTCAGGATCGCAAGAAGTCTCTTGCTTACAAGCATCTATTAAGATTCT
TCTTTGTTGTTGCAGCATCAGTAGCTTGCAAAAGGAAGAGATACGACTTTCTTTCAGATTTTGTTCCAGAGAAATTAAAATTCGAAGACGCGTTAAAGGAAAGAAGCATG
GCAGAATCAGGGAAAAACTAG
mRNA sequence
ATGGCTTCATCCAAAAAATCAAGTACTGAAGCCAAGAGCAAAGAAGCGGGAACCTCAAAGCCAGCCCCCAAAACCAAGACTCACAACAATGCCAGAAAATCCGATAAGGA
TGATTCAAAGAAGAAGAAGAAGAAGAAGAAGAAGATTAGCACCAACAATGGCTCCATCAAGGACAACGAAACCTTAGCTATTTCTGCTCCAGCCTCAGCCTCTGAGGAGG
ACGATGCCGATGAAGACAGAGCCGAAACCACTAGTAAAAAACCTAACACCTCGAAATCGAAGAAATTTAAACGGAACCATGTCAAGAAAGAAGAGAATGGTTATGCTGCC
GAAGAAGAAGAAGTCGAGGAGAAGATTTGTAAATTCCCTATGCATCGGATCAAGAAAATCATGAGGGACGAAAATTCTGATTTGCGCATCAATCAGGAAGCTTTGTTTCT
CGTCAACAAAGCTTCGGAGATGTTTCTCGTACAATTTTGCAAAGATGCATATGCGTGCTGTGCTCAGGATCGCAAGAAGTCTCTTGCTTACAAGCATCTATTAAGATTCT
TCTTTGTTGTTGCAGCATCAGTAGCTTGCAAAAGGAAGAGATACGACTTTCTTTCAGATTTTGTTCCAGAGAAATTAAAATTCGAAGACGCGTTAAAGGAAAGAAGCATG
GCAGAATCAGGGAAAAACTAG
Protein sequence
MASSKKSSTEAKSKEAGTSKPAPKTKTHNNARKSDKDDSKKKKKKKKKISTNNGSIKDNETLAISAPASASEEDDADEDRAETTSKKPNTSKSKKFKRNHVKKEENGYAA
EEEEVEEKICKFPMHRIKKIMRDENSDLRINQEALFLVNKASEMFLVQFCKDAYACCAQDRKKSLAYKHLLRFFFVVAASVACKRKRYDFLSDFVPEKLKFEDALKERSM
AESGKN
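
The protein sequence is expected to be the conceptual translation of the CDS. A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame. `translate` is a hypothetical helper written for illustration; the two inputs are the first and last 21 bases of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGCTTCATCCAAAAAATCA"))  # MASSKKS
print(translate("GCAGAATCAGGGAAAAACTAG"))  # AESGKN
```

Both fragments agree with the start (`MASSKKS`) and end (`AESGKN`) of the protein sequence. Passing the full CDS, with its line breaks, would allow a complete comparison.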