; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC07G132000 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC07G132000
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionB box-type domain-containing protein
Genome locationCmU531Chr07:7350588..7353396
RNA-Seq ExpressionCmUC07G132000
SyntenyCmUC07G132000
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR006734 - PLATZ transcription factor


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6599925.1 hypothetical protein SDJN03_05158, partial [Cucurbita argyrosperma subsp. sororia]7.5e-8575.85Show/hide
Query:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT
        ME  W G+LLNTKF+ SCDLHPNLR N+K+ FCIDCSVSFCKNCT+HDLHRQV+IWKY YHEVVRVQD EK+F CSEIH YK NG+++VHLNSR QSVD 
Subjt:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT

Query:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKAI
        K  KAKSG  CE+CGR+VQ P+RFCSIACKVSVNSK KDQS+ T+++ SPD  NLSFK KTS ETN SELESTISIAES EETK SPSSSQPRKR RK  
Subjt:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKAI

Query:  PHRAPFF
        PHR+P F
Subjt:  PHRAPFF

KAG7030608.1 hypothetical protein SDJN02_04645 [Cucurbita argyrosperma subsp. argyrosperma]7.5e-8575.85Show/hide
Query:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT
        ME  W G+LLNTKF+ SCDLHPNLR N+K+ FCIDCSVSFCKNCT+HDLHRQV+IWKY YHEVVRVQD EK+F CSEIH YK NG+++VHLNSR QSVD 
Subjt:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT

Query:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKAI
        K  KAKSG  CE+CGR+VQ P+RFCSIACKVSVNSK KDQS+ T+++ SPD  NLSFK KTS ETN SELESTISIAES EETK SPSSSQPRKR RK  
Subjt:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKAI

Query:  PHRAPFF
        PHR+P F
Subjt:  PHRAPFF

XP_022146741.1 uncharacterized protein LOC111015876 isoform X1 [Momordica charantia]2.0e-8575.85Show/hide
Query:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT
        ME  W G+LLNTKF+TSCDLHP LR N+K+ FCIDCS+SFCKNC +HDLHRQVNIWKY YH+VVRVQDMEK F CSEIH YK NG+V+VHLNSR QSVDT
Subjt:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT

Query:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNS-KPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKA
        KS KAKS   CE+CGRHVQDP+RFCS+ACKVSVNS K K QS+GT+++PS D  NLS+K +TSPETN SELESTISIAES EE K SPSSS+PRKR RK 
Subjt:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNS-KPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKA

Query:  IPHRAPF
        IPHR+PF
Subjt:  IPHRAPF

XP_031741091.1 uncharacterized protein LOC101214401 isoform X1 [Cucumis sativus]5.9e-9080.77Show/hide
Query:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT
        ME +W G+LLNTKFYTSCDLHPNL  NKKS FCIDCSVSFCKNCTIHDLHRQVNIWKY Y EVVRVQDMEK+FCCSEIHPYK NGK+AVH+NS  QSVDT
Subjt:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT

Query:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFK-GKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKA
        KS K KS NPCEECG+H+ DPHRFCSIACKV VNSK KD SVGTV++ S D  NLSFK  K SPETN SELESTISIAES EETKTS SS QPRKR  K+
Subjt:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFK-GKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKA

Query:  IPHRAPFF
        IPHRAPFF
Subjt:  IPHRAPFF

XP_038893018.1 uncharacterized protein LOC120081909 [Benincasa hispida]5.6e-9683.57Show/hide
Query:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT
        ME +W  +LLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCT+HDLHRQV IWKY YHEVVRVQDMEK FCCSEIHPYK NGKVAVHLN+RSQS+DT
Subjt:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT

Query:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKT-SPSSSQPRKRGRKA
        KS K KSG+PCE+CGRH+ DPHRFCSIACKVSVNSK KDQS+GTV++PS +  NLS K K SPETN SELESTISIAESTE TKT SPSSSQPRKR RK 
Subjt:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKT-SPSSSQPRKRGRKA

Query:  IPHRAPF
        IPHRAPF
Subjt:  IPHRAPF

TrEMBL top hitse value%identityAlignment
A0A0A0KL89 Uncharacterized protein2.9e-9080.77Show/hide
Query:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT
        ME +W G+LLNTKFYTSCDLHPNL  NKKS FCIDCSVSFCKNCTIHDLHRQVNIWKY Y EVVRVQDMEK+FCCSEIHPYK NGK+AVH+NS  QSVDT
Subjt:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT

Query:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFK-GKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKA
        KS K KS NPCEECG+H+ DPHRFCSIACKV VNSK KD SVGTV++ S D  NLSFK  K SPETN SELESTISIAES EETKTS SS QPRKR  K+
Subjt:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFK-GKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKA

Query:  IPHRAPFF
        IPHRAPFF
Subjt:  IPHRAPFF

A0A6J1D096 uncharacterized protein LOC111015876 isoform X19.6e-8675.85Show/hide
Query:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT
        ME  W G+LLNTKF+TSCDLHP LR N+K+ FCIDCS+SFCKNC +HDLHRQVNIWKY YH+VVRVQDMEK F CSEIH YK NG+V+VHLNSR QSVDT
Subjt:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT

Query:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNS-KPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKA
        KS KAKS   CE+CGRHVQDP+RFCS+ACKVSVNS K K QS+GT+++PS D  NLS+K +TSPETN SELESTISIAES EE K SPSSS+PRKR RK 
Subjt:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNS-KPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKA

Query:  IPHRAPF
        IPHR+PF
Subjt:  IPHRAPF

A0A6J1D0E9 uncharacterized protein LOC111015876 isoform X21.3e-6664.73Show/hide
Query:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT
        ME  W G+LLNTKF+TSCDLHP LR N+K+ FCIDCS+SFCKNC +HDLHRQ                            YK NG+V+VHLNSR QSVDT
Subjt:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT

Query:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNS-KPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKA
        KS KAKS   CE+CGRHVQDP+RFCS+ACKVSVNS K K QS+GT+++PS D  NLS+K +TSPETN SELESTISIAES EE K SPSSS+PRKR RK 
Subjt:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNS-KPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKA

Query:  IPHRAPF
        IPHR+PF
Subjt:  IPHRAPF

A0A6J1FR06 uncharacterized protein LOC1114476623.6e-8575.85Show/hide
Query:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT
        ME  W G+LLNTKF+ SCDLHPNLR N+K+ FCIDCSVSFCKNCT+HDLHRQV+IWKY YHEVVRVQD EK+F CSEIH YK NG+++VHLNSR QSVD 
Subjt:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT

Query:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKAI
        K  KAKSG  CE+CGR+VQ P+RFCSIACKVSVNSK KDQS+ T+++ SPD  NLSFK KTS ETN SELESTISIAES EETK SPSSSQPRKR RK  
Subjt:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKAI

Query:  PHRAPFF
        PHR+P F
Subjt:  PHRAPFF

A0A6J1K8W8 uncharacterized protein LOC1114917311.4e-8475.36Show/hide
Query:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT
        ME  W G+LLNTKF+ SCDLHPNLR N+K+ FCIDCSVSFCKNCT+HDLHRQV+IWKY YHEVVRVQD EK+F CSEIH YK NG+++VHLNSR QSVD 
Subjt:  MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT

Query:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKAI
        K  KAKSG  CE+CGR+VQ P+RFCSIACKVSVNSK KDQS+ T+++ SPD  NLSFK K S ETN SELESTISIAES EETK SPSSSQPRKR RK  
Subjt:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKAI

Query:  PHRAPFF
        PHR+P F
Subjt:  PHRAPFF

SwissProt top hitse value%identityAlignment
Q1G3Q4 Protein RGF1 INDUCIBLE TRANSCRIPTION FACTOR 16.1e-2134.35Show/hide
Query:  EPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCT-IHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT
        +P+W  +L   KF+  C  H   + N+++  C+DC  S C +C   H  HR + + +Y YH+VVR++D++K   CS +  Y  N    V +  R Q+   
Subjt:  EPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCT-IHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT

Query:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKV
        K     +GN C  C R +Q+P+  CS+ CKV
Subjt:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKV

Arabidopsis top hitse value%identityAlignment
AT1G31040.1 PLATZ transcription factor family protein1.1e-2230.04Show/hide
Query:  EPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNC-TIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT
        +P+W   L+   F++SC +H   R ++K+ FC+ C +S C +C   H  H  + + +Y YH+VVR+ D+EK   CS + PY  NG   + LN R QS   
Subjt:  EPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNC-TIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT

Query:  KSLKAK-SGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKG-------KTSPETNVSELESTISIAESTEETKTSPSSSQP
           +AK S N C  C R +Q+P  FCS++CKV   S   D  + ++L    D  + +F+G       +    + + + E  + I++ +E+   S    + 
Subjt:  KSLKAK-SGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKG-------KTSPETNVSELESTISIAESTEETKTSPSSSQP

Query:  RKR-------------------GRKAIPHRAPF
        + +                    RK  PHRAPF
Subjt:  RKR-------------------GRKAIPHRAPF

AT1G32700.1 PLATZ transcription factor family protein2.8e-2131.73Show/hide
Query:  PSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVS-FCKNC-TIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT
        P W   LL  KF+  C LH +   ++ + +C+DC+    C  C + H  H  + I + +YH+V+RV +++K    + +  Y  N    V LN R Q    
Subjt:  PSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVS-FCKNC-TIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT

Query:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKTSPSSSQPRK--RGRK
        K +     N CE C R + D  RFCS+ CK+S  SK K +             NLS    +   T++  L+    I  ++    T P S+  R+  + RK
Subjt:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKTSPSSSQPRK--RGRK

Query:  AIPHRAPF
         IPHRAPF
Subjt:  AIPHRAPF

AT2G01818.1 PLATZ transcription factor family protein1.2e-3239.45Show/hide
Query:  EPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNC-----TIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQ
        E  W  +LLN++F+  C  H  LR N+K+ FCIDC+V  C++C       H LHR++ I KY Y +V+R+ +++ +F CSEI  YK NG+ A+HLNSR Q
Subjt:  EPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNC-----TIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQ

Query:  SVDTK-SLKAKSGNPCEECGRHVQD-PHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELES-TISIAESTEETKT--SPSSSQ
        + D + S KAK+G  C  C R++QD P+ FCSI+CK+S  SK         L  S     +  K  ++ E ++ E +S T S+ + +E+++   S  S +
Subjt:  SVDTK-SLKAKSGNPCEECGRHVQD-PHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELES-TISIAESTEETKT--SPSSSQ

Query:  PRKR--GRKAIPHRAPFF
        P  R   RK I  R+PF+
Subjt:  PRKR--GRKAIPHRAPFF

AT2G12646.1 PLATZ transcription factor family protein4.3e-2234.35Show/hide
Query:  EPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCT-IHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT
        +P+W  +L   KF+  C  H   + N+++  C+DC  S C +C   H  HR + + +Y YH+VVR++D++K   CS +  Y  N    V +  R Q+   
Subjt:  EPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCT-IHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDT

Query:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKV
        K     +GN C  C R +Q+P+  CS+ CKV
Subjt:  KSLKAKSGNPCEECGRHVQDPHRFCSIACKV

AT3G60670.1 PLATZ transcription factor family protein1.6e-2134.07Show/hide
Query:  PSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNC-TIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDTK
        P+W   LL  KF+ +C  H + + N+K+  CIDC ++ C +C + H  HR + I +Y Y +V+RV+D  K   CS I PY  N    V +N R QS   +
Subjt:  PSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNC-TIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDTK

Query:  SLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEET
             SGN C  C R +Q P+ FC ++CK+S +   + + +   L     C  L    + +  T  S LE T S   S+E +
Subjt:  SLKAKSGNPCEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEET


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACCTAGCTGGTTTGGTTCGCTTTTGAATACGAAATTCTACACTTCTTGCGATCTACATCCTAATCTCCGGGGAAATAAGAAAAGTGGATTCTGTATTGATTGTAG
TGTTAGCTTCTGCAAGAATTGTACAATCCATGATCTTCATCGGCAGGTTAACATCTGGAAATATGCCTATCATGAAGTTGTGCGCGTTCAGGACATGGAGAAACACTTTT
GCTGTTCAGAGATTCATCCATATAAAGCCAATGGTAAAGTAGCTGTTCATCTAAACTCCCGTAGTCAATCCGTCGACACCAAATCACTGAAGGCGAAGTCCGGTAATCCT
TGTGAAGAATGTGGTAGACATGTACAAGATCCTCATCGCTTCTGTTCAATTGCTTGCAAGGTTTCAGTGAACTCAAAGCCCAAGGACCAGAGTGTTGGAACTGTCTTAAC
TCCGAGCCCGGATTGCCGGAACTTATCATTCAAGGGAAAAACCAGCCCAGAAACAAATGTAAGCGAATTGGAATCAACCATATCAATTGCAGAGTCCACAGAAGAGACTA
AAACTAGCCCTTCATCTTCACAACCAAGAAAACGCGGAAGGAAAGCCATCCCTCACAGAGCTCCATTTTTCTGA
mRNA sequenceShow/hide mRNA sequence
GAAAAATCCTAAAGATTTCTGTTTCCATTTTCACTTTAATCTTCTTTTTTCTTCTTCGTTCTTCCTCCATTTGCAGAGCTTAATCCAATTTATTCGCCGCTTCTTCTGCC
TTGCTCTTCAATCGGAAGAATCTGTTGTAATTATGCGAGTTGTTTAGTGAAGAAAATGGAACCTAGCTGGTTTGGTTCGCTTTTGAATACGAAATTCTACACTTCTTGCG
ATCTACATCCTAATCTCCGGGGAAATAAGAAAAGTGGATTCTGTATTGATTGTAGTGTTAGCTTCTGCAAGAATTGTACAATCCATGATCTTCATCGGCAGGTTAACATC
TGGAAATATGCCTATCATGAAGTTGTGCGCGTTCAGGACATGGAGAAACACTTTTGCTGTTCAGAGATTCATCCATATAAAGCCAATGGTAAAGTAGCTGTTCATCTAAA
CTCCCGTAGTCAATCCGTCGACACCAAATCACTGAAGGCGAAGTCCGGTAATCCTTGTGAAGAATGTGGTAGACATGTACAAGATCCTCATCGCTTCTGTTCAATTGCTT
GCAAGGTTTCAGTGAACTCAAAGCCCAAGGACCAGAGTGTTGGAACTGTCTTAACTCCGAGCCCGGATTGCCGGAACTTATCATTCAAGGGAAAAACCAGCCCAGAAACA
AATGTAAGCGAATTGGAATCAACCATATCAATTGCAGAGTCCACAGAAGAGACTAAAACTAGCCCTTCATCTTCACAACCAAGAAAACGCGGAAGGAAAGCCATCCCTCA
CAGAGCTCCATTTTTCTGATAAGATAAGATGCAACATTTGCCATCTTCTTTTTAACTTCCGCTGTGGCTCTCAAATGGTATGCTGCTTAATTTTTATTTACCTTGACTCA
TAATGCATAAAATGTCCAGTAGTGAAATGAAATCGTCAATATAAAAGTCAAATTTTTAGATCAACTCACTTCTGCAATCTCTCTACTGCATGATTTTTAGAATATCTTTT
ACAGATTTGACGAGATTAGTTGGTAGAGAAAGCTTACTTAAACAGATAAGGAAAATACGTGAGTGCATCTTAGGCTAGGCACATCTTTATGCATTCTACTCAATTGTCCG
GCCATAGTTTACCACTTCTTTATGTATAATATATATTGCGGGTGCATCTCATTATTAAAAAAAAAAAAAAAAAGAAATGTATTTGTGATTACATTTTCGGTCTGAAGTGG
TCAAATATGGATATTCTTTGATCACTGCAATTAGCATTTGGGTATGGGTCTTATTATTGACGAGAATGTTTGTGGGGATGTGTTCACATGAAAGTTTGCTTTCCCACTTT
CTCTTTTCTTTTTTTCTTTTTCTCTTTTTGTCTTATGGTGGCCATAGATTTAGAGCTTCTTTTTGTCACAATCTCCGTGGGAATGGTGATTTGCCATTTATTTTTTCTTT
TTTTAATATTTTCTTGGGGGGTTTCATCTCAATCTTTGGTAGTTGTTAAGGAGGGGCCCTCTATCTCTCACCTTTATACATCATGAAAACAGCCTTTTTCTTTGTGTTTT
CCCCCATGAAATATGGCAGACAGAAATCCCACAACTGAAATAGGAAATAAAGCTTAAAGGAAACATAGTCGGCACAATTTTTTCTTTCTTTCTTTCTTTCTTTCTTTCTT
TCTTTCTTTTTTATCAACCACAAACAACAAACGGGTGAGGAATGCGAACATCTGACCAAGGTCAAGCTTGCTTCCTTTTTCATAGGTGGGAGGTTCAACTTTTTACATTT
TGGCTGAAAGCATATGTTTCAACCAATTGAGTTATCTTCGATCTTTCAACCAATTGACTTATTACTACAATTTTGCCCTTAGGACATTTGAATTTATTTGAAGAAATTGC
TAAATTTTCTTTTTTAATTCAACTTTATGGTTATAACATGTAATCCCAGTAGTGATTGGATTTAATTATTACGGTTTATGTTACAGATGTGATAATTAGAATTTTGGGGT
GTGTTTAAAACCCCATAGGTGAGACGAGGTGAG
Protein sequenceShow/hide protein sequence
MEPSWFGSLLNTKFYTSCDLHPNLRGNKKSGFCIDCSVSFCKNCTIHDLHRQVNIWKYAYHEVVRVQDMEKHFCCSEIHPYKANGKVAVHLNSRSQSVDTKSLKAKSGNP
CEECGRHVQDPHRFCSIACKVSVNSKPKDQSVGTVLTPSPDCRNLSFKGKTSPETNVSELESTISIAESTEETKTSPSSSQPRKRGRKAIPHRAPFF