; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg03067 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg03067
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionTranscription initiation factor IIE subunit alpha-like
Genome locationCarg_Chr09:3302995..3303723
RNA-Seq ExpressionCarg03067
SyntenyCarg03067
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591755.1 hypothetical protein SDJN03_14101, partial [Cucurbita argyrosperma subsp. sororia]5.6e-13299.17Show/hide
Query:  MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD
        MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD
Subjt:  MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD

Query:  IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV
        IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGV+GSDGSRTAFSKWFMVLQGSG RRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV
Subjt:  IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV

Query:  EEGCSEEEEETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV
        EEGCSEEEEETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV
Subjt:  EEGCSEEEEETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV

KAG7024639.1 hypothetical protein SDJN02_13457, partial [Cucurbita argyrosperma subsp. argyrosperma]3.9e-133100Show/hide
Query:  MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD
        MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD
Subjt:  MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD

Query:  IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV
        IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV
Subjt:  IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV

Query:  EEGCSEEEEETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV
        EEGCSEEEEETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV
Subjt:  EEGCSEEEEETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV

XP_022935869.1 uncharacterized protein LOC111442647 [Cucurbita moschata]2.4e-13098.35Show/hide
Query:  MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD
        MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD
Subjt:  MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD

Query:  IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV
        IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGV+GSDGSRTAFSKWFMVLQGSG RRD NGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV
Subjt:  IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV

Query:  EEGCSEEEEETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV
        EEGCSEE EETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV
Subjt:  EEGCSEEEEETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV

XP_023535963.1 uncharacterized protein LOC111797241 isoform X1 [Cucurbita pepo subsp. pepo]5.8e-12997.11Show/hide
Query:  MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD
        MPNPLCSPARASDSNKLRRYHRRR+SAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD
Subjt:  MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD

Query:  IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV
        IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEE+V V+GSDGSRTAFSKWFMVLQGSG RRD NGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV
Subjt:  IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV

Query:  EEGCSEEEEETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV
        EEGCSEEEEETEVKVKKSLKWLMEEENRESR LVTRSQSWKV
Subjt:  EEGCSEEEEETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV

XP_023535965.1 uncharacterized protein LOC111797241 isoform X3 [Cucurbita pepo subsp. pepo]2.6e-12997.52Show/hide
Query:  MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD
        MPNPLCSPARASDSNKLRRYHRRR+SAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD
Subjt:  MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD

Query:  IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV
        IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEE+V VQGSDGSRTAFSKWFMVLQGSG RRD NGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV
Subjt:  IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV

Query:  EEGCSEEEEETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV
        EEGCSEEEEETEVKVKKSLKWLMEEENRESR LVTRSQSWKV
Subjt:  EEGCSEEEEETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV

TrEMBL top hitse value%identityAlignment
A0A0A0L1Z4 Uncharacterized protein8.9e-9166.43Show/hide
Query:  MPNPLCSPARASDSNKL----RRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG
        MPNPLCSPAR SDS+K     RRYHRRRKSAESPVVWAKAKTM GSE+SEPSSPKVTCAGQIK+RPK+ KSW+SVMEEIERIHNRR+LRRRRFNW+ES G
Subjt:  MPNPLCSPARASDSNKL----RRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG

Query:  FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEE-----------VGVQGSDGSRTAFSKWFMVLQGSG---FRRDRNGLCTVDDASIGPPMAPP
        FKKDIMQFLTCLR++RFDF CF AFPE +FT+E+EEEEE           VG++ ++ SRTAFSKWFMVLQ +G    +RD N  C  DD SI   MAPP
Subjt:  FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEE-----------VGVQGSDGSRTAFSKWFMVLQGSG---FRRDRNGLCTVDDASIGPPMAPP

Query:  RNALLLMRCRSAPAKSWVEEGCSEEEEETE-------VKVKKSLKWLMEEENRE----------------SRDLVTRSQSWKV
        RNALLLMRC+SAPA+ W+EE   EE++E E       VKVKKSLKWLMEEENRE                +    TRSQSWKV
Subjt:  RNALLLMRCRSAPAKSWVEEGCSEEEEETE-------VKVKKSLKWLMEEENRE----------------SRDLVTRSQSWKV

A0A1S3B949 uncharacterized protein LOC1034875512.8e-8967.64Show/hide
Query:  MPNPLCSPARASDSNKL----RRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG
        MPNPLCSPAR SDS+K     RR+HRRRKSAESPVVWAKAKTM GSE+SEPSSPKVTCAGQIK+RPK+ KSW+SVMEEIERIHNRR+LRRRRF WVES G
Subjt:  MPNPLCSPARASDSNKL----RRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG

Query:  FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEE---------VGVQGSDGSRTAFSKWFMVLQGSG---FRRDRNGLCTVDDASIGPPMAPPRN
        FKKDIMQFLTCLR++RFDF CF AFPE +FT+E+EEEEE         VG++ ++ SRTAFSKWFMVLQ +G    +RD   LC  DD SI   MAPP N
Subjt:  FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEE---------VGVQGSDGSRTAFSKWFMVLQGSG---FRRDRNGLCTVDDASIGPPMAPPRN

Query:  ALLLMRCRSAPAKSWVEEGCSEEEEETE-VKVKKSLKWLMEEENRE----------------SRDLVTRSQSWKV
        ALLLMRCRSAPA+ W+EE   E ++E E VKVKKSLKWLMEEENRE                +    TRSQSWKV
Subjt:  ALLLMRCRSAPAKSWVEEGCSEEEEETE-VKVKKSLKWLMEEENRE----------------SRDLVTRSQSWKV

A0A5D3D503 Transcription initiation factor IIE subunit alpha-like2.8e-8967.64Show/hide
Query:  MPNPLCSPARASDSNKL----RRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG
        MPNPLCSPAR SDS+K     RR+HRRRKSAESPVVWAKAKTM GSE+SEPSSPKVTCAGQIK+RPK+ KSW+SVMEEIERIHNRR+LRRRRF WVES G
Subjt:  MPNPLCSPARASDSNKL----RRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG

Query:  FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEE---------VGVQGSDGSRTAFSKWFMVLQGSG---FRRDRNGLCTVDDASIGPPMAPPRN
        FKKDIMQFLTCLR++RFDF CF AFPE +FT+E+EEEEE         VG++ ++ SRTAFSKWFMVLQ +G    +RD   LC  DD SI   MAPP N
Subjt:  FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEE---------VGVQGSDGSRTAFSKWFMVLQGSG---FRRDRNGLCTVDDASIGPPMAPPRN

Query:  ALLLMRCRSAPAKSWVEEGCSEEEEETE-VKVKKSLKWLMEEENRE----------------SRDLVTRSQSWKV
        ALLLMRCRSAPA+ W+EE   E ++E E VKVKKSLKWLMEEENRE                +    TRSQSWKV
Subjt:  ALLLMRCRSAPAKSWVEEGCSEEEEETE-VKVKKSLKWLMEEENRE----------------SRDLVTRSQSWKV

A0A6J1FBW7 uncharacterized protein LOC1114426471.1e-13098.35Show/hide
Query:  MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD
        MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD
Subjt:  MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD

Query:  IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV
        IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGV+GSDGSRTAFSKWFMVLQGSG RRD NGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV
Subjt:  IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV

Query:  EEGCSEEEEETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV
        EEGCSEE EETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV
Subjt:  EEGCSEEEEETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV

A0A6J1IQQ3 uncharacterized protein LOC1114773339.1e-12895.87Show/hide
Query:  MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD
        MPNPLCSP RASDSNKLRRYHRRRKSAESPVVWAKAKT+GGSEVSEPSSPKVTCAGQIKMR KSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD
Subjt:  MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKD

Query:  IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV
        IMQFLTCLRS+RFDFGCFGAFPEAEFTSEDEEEEEVGV+GSDGSRTAFSKWFMVLQGSG RRD NGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV
Subjt:  IMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWV

Query:  EEGCSEEEEETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV
        EE CSEEEE+TEVKVKKSLKWLMEEENRESRDLVTRS+SWKV
Subjt:  EEGCSEEEEETEVKVKKSLKWLMEEENRESRDLVTRSQSWKV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G22230.1 unknown protein3.4e-3440.94Show/hide
Query:  LCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSE--VSEPSSPKVTCAGQIKMRPKSR----KSWESVMEEIERIHNRRELRRRRFNWVESLGFK
        + SP+ + +  +   +HRR  S  S       +  GG    V EP+SPKVTCAGQIK+R   R    K+W+S+M EIE+IH  R     +F      G K
Subjt:  LCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSE--VSEPSSPKVTCAGQIKMRPKSR----KSWESVMEEIERIHNRRELRRRRFNWVESLGFK

Query:  KDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQ------GSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPM---APPRNALLLM
        +D+M FLTCLR   FDF CFGAFP  +  S+DEEE+E   +        + S T FSKW MVL      +  N  C     ++   +    PP NALLLM
Subjt:  KDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQ------GSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPM---APPRNALLLM

Query:  RCRSAPAKSWVEE---------------GCSEEEEETEVKVKKSLKWLMEEENR
        RCRSAP K+W EE               G  EEEE+  V  KK L+ LMEEE +
Subjt:  RCRSAPAKSWVEE---------------GCSEEEEETEVKVKKSLKWLMEEENR

AT1G78110.1 unknown protein4.0e-5146.04Show/hide
Query:  PNPLCSPARASDSNKLRRYHRRRKSAE----------SPVVWAK---AKTMGGSEVSEPSSPKVTCAGQIKMRPKS----RKSWESVMEEIERIHNRREL
        P P+CSP+R SDS+  RR H RR+ ++          SPV+WAK   +K MGG E++EP+SPKVTCAGQIK+RP       K+W+SVMEEIERIH+ R  
Subjt:  PNPLCSPARASDSNKLRRYHRRRKSAE----------SPVVWAK---AKTMGGSEVSEPSSPKVTCAGQIKMRPKS----RKSWESVMEEIERIHNRREL

Query:  RRRRFNWVESLGFKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEE---------VGVQGSDGSRTAFSKWFMVLQGSGFRRD---RNGLC----
         +         G KKD+M FLTCLR+++FDF CFG F  A+ TS+D+EEE+         V  +  + S+T FSKWFMVLQ     +D    N  C    
Subjt:  RRRRFNWVESLGFKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEE---------VGVQGSDGSRTAFSKWFMVLQGSGFRRD---RNGLC----

Query:  TVDDASIGPPMAPPRNALLLMRCRSAPAKSWVEEGC----------------SEEEEETEVKV-KKSLKWLMEEENRE
         ++D    P + PP NALLLMRCRSAPAKSW+EE                    E++ET +K  KK L+ LMEEE  E
Subjt:  TVDDASIGPPMAPPRNALLLMRCRSAPAKSWVEEGC----------------SEEEEETEVKV-KKSLKWLMEEENRE

AT2G37100.1 protamine P1 family protein7.8e-0725.35Show/hide
Query:  PLCSPARASDSNKLRRY------HRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGF
        P+ SP R  +   L R+       R R  +  P+ + +      +E  EP+SPKVTC GQ+++    +   E+          RR+   RR  WV++   
Subjt:  PLCSPARASDSNKLRRY------HRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGF

Query:  KKDIMQFL--TCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAP
               +  TC   +   +  + +F  A F+ + E+        S  S   F +  +  +     R          +       PPRNA LL RCRSAP
Subjt:  KKDIMQFL--TCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAP

Query:  AKS-WVEEGCSEEEEET
         +S        E++EET
Subjt:  AKS-WVEEGCSEEEEET


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAACCCACTCTGTAGTCCTGCCAGAGCGTCTGATTCGAACAAGCTCCGCCGTTATCACCGGCGGAGGAAGTCGGCGGAGAGTCCGGTGGTTTGGGCCAAAGCGAA
GACGATGGGGGGGTCTGAGGTGTCGGAACCGTCGTCGCCGAAAGTGACCTGTGCAGGGCAGATTAAGATGAGGCCGAAGAGCAGGAAGAGCTGGGAATCGGTGATGGAGG
AGATAGAGAGAATTCATAATAGGAGGGAATTACGGCGGAGGAGGTTCAATTGGGTCGAATCTTTAGGGTTCAAGAAGGATATAATGCAATTCTTAACGTGTTTACGGAGC
TTACGGTTTGATTTTGGGTGTTTCGGAGCTTTCCCTGAAGCAGAGTTCACCTCTGAAGACGAGGAAGAAGAGGAAGTGGGTGTCCAGGGGAGCGATGGCTCTAGAACGGC
GTTTTCTAAATGGTTCATGGTTTTACAGGGAAGTGGGTTCCGGAGAGACCGCAACGGTCTCTGTACAGTTGATGATGCATCGATTGGGCCGCCGATGGCGCCGCCCAGAA
ACGCGCTTTTGCTTATGCGCTGCAGGTCTGCTCCGGCGAAGAGTTGGGTGGAGGAAGGATGTTCAGAGGAGGAAGAAGAAACAGAGGTGAAGGTGAAGAAGAGCTTGAAA
TGGCTAATGGAGGAAGAGAACAGAGAGAGCAGGGATTTGGTTACGAGGAGCCAGAGTTGGAAGGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAAACCCACTCTGTAGTCCTGCCAGAGCGTCTGATTCGAACAAGCTCCGCCGTTATCACCGGCGGAGGAAGTCGGCGGAGAGTCCGGTGGTTTGGGCCAAAGCGAA
GACGATGGGGGGGTCTGAGGTGTCGGAACCGTCGTCGCCGAAAGTGACCTGTGCAGGGCAGATTAAGATGAGGCCGAAGAGCAGGAAGAGCTGGGAATCGGTGATGGAGG
AGATAGAGAGAATTCATAATAGGAGGGAATTACGGCGGAGGAGGTTCAATTGGGTCGAATCTTTAGGGTTCAAGAAGGATATAATGCAATTCTTAACGTGTTTACGGAGC
TTACGGTTTGATTTTGGGTGTTTCGGAGCTTTCCCTGAAGCAGAGTTCACCTCTGAAGACGAGGAAGAAGAGGAAGTGGGTGTCCAGGGGAGCGATGGCTCTAGAACGGC
GTTTTCTAAATGGTTCATGGTTTTACAGGGAAGTGGGTTCCGGAGAGACCGCAACGGTCTCTGTACAGTTGATGATGCATCGATTGGGCCGCCGATGGCGCCGCCCAGAA
ACGCGCTTTTGCTTATGCGCTGCAGGTCTGCTCCGGCGAAGAGTTGGGTGGAGGAAGGATGTTCAGAGGAGGAAGAAGAAACAGAGGTGAAGGTGAAGAAGAGCTTGAAA
TGGCTAATGGAGGAAGAGAACAGAGAGAGCAGGGATTTGGTTACGAGGAGCCAGAGTTGGAAGGTTTGA
Protein sequenceShow/hide protein sequence
MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKDIMQFLTCLRS
LRFDFGCFGAFPEAEFTSEDEEEEEVGVQGSDGSRTAFSKWFMVLQGSGFRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEEEEETEVKVKKSLK
WLMEEENRESRDLVTRSQSWKV