; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr016618 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr016618
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionC2H2-type domain-containing protein
Genome locationtig00152977:875382..892143
RNA-Seq ExpressionSgr016618
SyntenySgr016618
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR013087 - Zinc finger C2H2-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7022689.1 hypothetical protein SDJN02_16423, partial [Cucurbita argyrosperma subsp. argyrosperma]6.0e-12983.7Show/hide
Query:  FKSARGGAMTSTMGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSE
        F SA GG++TSTMGRRS+WVALLV+SLL  +RPQ VSST PLQSL QGSKDSAT S PKQDW++A EVHCSRERSRAAWKIIEEYLMPFVDKKK+KIS+E
Subjt:  FKSARGGAMTSTMGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSE

Query:  CRLHPDNDMFRDQEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQ-KSKCNPAAVARNKHMCEALAD
        CRLHPDNDMFRDQEQHK+ LDFN+WKCGYCRKRFYEEKYLDQHFDNRHY+LLNVSR+KC+AD+CGAL CDHV+D+VSQ KSKCNPAA ARNKHMCEALAD
Subjt:  CRLHPDNDMFRDQEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQ-KSKCNPAAVARNKHMCEALAD

Query:  SCFPVAEGALASQLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR
        SCFP+ +GA AS LHEFFLRQFCDAHTCSGKPKPFSRGR+VRTSW YVVIS++T+L V+FFYVF+YLYKR
Subjt:  SCFPVAEGALASQLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR

XP_022154197.1 uncharacterized protein LOC111021515 [Momordica charantia]6.6e-12886.97Show/hide
Query:  MTSTMGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDND
        MT+TMGRRS+WVALLV  LLL MRP  VSSTLPLQ L QGSKDSA  STPKQD++NAHEVHCSRERSR AWKIIEEYLMPFV+K+KYKIS++CRLHPDND
Subjt:  MTSTMGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDND

Query:  MFRDQEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQKSKCNPAAVARNKHMCEALADSCFPVAEGA
        MFRDQEQHKS LDFNEWKCGYCRKRFYEEKYLDQHFDNRHY+LLN SR KCLADVCGALHCD V++TVSQKSKCNPAA AR KHMCE LADSCFPVAEGA
Subjt:  MFRDQEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQKSKCNPAAVARNKHMCEALADSCFPVAEGA

Query:  LASQLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR
         AS LHEFFLRQFCDAHTCSGKPKPFS+G+EVRTSWFY+VISI+TVLFVLFFYVFIYLYKR
Subjt:  LASQLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR

XP_022928324.1 uncharacterized protein LOC111435184 [Cucurbita moschata]3.3e-12785.5Show/hide
Query:  MTSTMGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDND
        MTSTMGRRS+WVALLV+SLL  +RPQ VSSTLPLQSL QGSKDSAT S PKQDW++A EVHCSRERSRAAWKIIEEYLMPFVDKKK+KIS+ECRLHPDND
Subjt:  MTSTMGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDND

Query:  MFRDQEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQ-KSKCNPAAVARNKHMCEALADSCFPVAEG
        MFRDQEQHK+ LDFN+WKCGYCRKRFYEEKYLDQHFDNRHY+LLNVSR+KC+AD+CGALHCDHV+D+VSQ KSKCNPAA ARNKHMCEALADSCFP+ +G
Subjt:  MFRDQEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQ-KSKCNPAAVARNKHMCEALADSCFPVAEG

Query:  ALASQLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR
         +AS LHEFFLRQFCDAHTCSGKPKPFSRGR+VRTSW YVVIS++T+L VLFFYVF+YLYKR
Subjt:  ALASQLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR

XP_023532028.1 uncharacterized protein LOC111794116 [Cucurbita pepo subsp. pepo]7.3e-12786.05Show/hide
Query:  MGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDNDMFRD
        MGRRS+WVALLV+SLL  +RPQ VSSTLPLQSL QGSKDSAT S PKQDW++A EVHCSRERSRAAWKIIEEYLMPFVDKKK+KIS+ECRLHPDNDMFRD
Subjt:  MGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDNDMFRD

Query:  QEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQ-KSKCNPAAVARNKHMCEALADSCFPVAEGALAS
        QEQHK+ LDFN+WKCGYCRKRFYEEKYLDQHFDNRHY+LLNVSR+KC+AD+CGALHCDHV+D+VSQ KSKCNPAA ARNKHMCEALADSCFP+ EGA AS
Subjt:  QEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQ-KSKCNPAAVARNKHMCEALADSCFPVAEGALAS

Query:  QLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR
         LHEFFLRQFCDAHTCSGKPKPFSRGR+VRTSW YVVIS++T+L VLFFYVF+YLYKR
Subjt:  QLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR

XP_038891552.1 uncharacterized protein LOC120080934 [Benincasa hispida]3.9e-12887.55Show/hide
Query:  MGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDNDMFRD
        MGRRS+WVALLV  LLL +RP +VSS LPLQ+L QG KDSA+ STPKQDW+NAHEVHCSRERSRAAWKIIEEYLMPFV+KKKYKISS+CRLHPDNDMFRD
Subjt:  MGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDNDMFRD

Query:  QEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQKSKCNPAAVARNKHMCEALADSCFPVAEGALASQ
        QEQHKS LDFN+WKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRS+CLAD+CGALHCDHVIDTVSQKSKCNPAA ARNKHMCE LADSCFPV EGA AS 
Subjt:  QEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQKSKCNPAAVARNKHMCEALADSCFPVAEGALASQ

Query:  LHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR
        LHEFFLRQFCDAHTCSGKPKPFSRGR+VR + FY+VIS++TVLFVLFFYVFIYLYKR
Subjt:  LHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR

TrEMBL top hitse value%identityAlignment
A0A5A7URM7 Zinc finger family protein5.1e-12685.33Show/hide
Query:  MGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQG--SKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDNDMF
        MGRRS+ VALLV SLLL +RP +VSSTLPLQSL QG  SKDSA+ STPKQDW+NAHEVHCSRERSRAAWK+IEEYLMPFV+KKKY+IS++CRLHPDNDMF
Subjt:  MGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQG--SKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDNDMF

Query:  RDQEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQKSKCNPAAVARNKHMCEALADSCFPVAEGALA
        RDQEQHKS LDFN+WKCGYCRKRFYEEKY+DQHFDNRHYNLLNVSR++CLAD+CGALHCDHVID VSQKSKCNPAA ARNKHMCE LADSCFPV EGALA
Subjt:  RDQEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQKSKCNPAAVARNKHMCEALADSCFPVAEGALA

Query:  SQLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR
        S LHEFFLRQFCDAHTCSGKPKPFSRGR+VRTS FY+VIS++T+LFV+FFYVF YLY R
Subjt:  SQLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR

A0A6J1DL16 uncharacterized protein LOC1110215153.2e-12886.97Show/hide
Query:  MTSTMGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDND
        MT+TMGRRS+WVALLV  LLL MRP  VSSTLPLQ L QGSKDSA  STPKQD++NAHEVHCSRERSR AWKIIEEYLMPFV+K+KYKIS++CRLHPDND
Subjt:  MTSTMGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDND

Query:  MFRDQEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQKSKCNPAAVARNKHMCEALADSCFPVAEGA
        MFRDQEQHKS LDFNEWKCGYCRKRFYEEKYLDQHFDNRHY+LLN SR KCLADVCGALHCD V++TVSQKSKCNPAA AR KHMCE LADSCFPVAEGA
Subjt:  MFRDQEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQKSKCNPAAVARNKHMCEALADSCFPVAEGA

Query:  LASQLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR
         AS LHEFFLRQFCDAHTCSGKPKPFS+G+EVRTSWFY+VISI+TVLFVLFFYVFIYLYKR
Subjt:  LASQLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR

A0A6J1EKH9 uncharacterized protein LOC1114351841.6e-12785.5Show/hide
Query:  MTSTMGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDND
        MTSTMGRRS+WVALLV+SLL  +RPQ VSSTLPLQSL QGSKDSAT S PKQDW++A EVHCSRERSRAAWKIIEEYLMPFVDKKK+KIS+ECRLHPDND
Subjt:  MTSTMGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDND

Query:  MFRDQEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQ-KSKCNPAAVARNKHMCEALADSCFPVAEG
        MFRDQEQHK+ LDFN+WKCGYCRKRFYEEKYLDQHFDNRHY+LLNVSR+KC+AD+CGALHCDHV+D+VSQ KSKCNPAA ARNKHMCEALADSCFP+ +G
Subjt:  MFRDQEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQ-KSKCNPAAVARNKHMCEALADSCFPVAEG

Query:  ALASQLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR
         +AS LHEFFLRQFCDAHTCSGKPKPFSRGR+VRTSW YVVIS++T+L VLFFYVF+YLYKR
Subjt:  ALASQLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR

A0A6J1F459 uncharacterized protein LOC1114419963.0e-12685.99Show/hide
Query:  MGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDNDMFRD
        MGRRS+WVALLVLSLLL  RP  VSSTLPLQSL QGSKDSAT STPKQD +NAHEVHCSRERSRAAWKIIEEYLMPF+DKK+YK+S++CRLHPDNDMFRD
Subjt:  MGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDNDMFRD

Query:  QEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQKSKCNPAAVARNKHMCEALADSCFPVAEGALASQ
        QEQHKS LDFN+WKCGYC+K+FYEEKYLDQHF NRHY+LLNVSRSKCLAD CGALHCD VIDT+SQKSKCNPAA ARNKHMCE LADSCFP+ +GA AS 
Subjt:  QEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQKSKCNPAAVARNKHMCEALADSCFPVAEGALASQ

Query:  LHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR
        LHEFFLRQFCDAHTCSGKPKPFSRGR+VRTS FY+VIS++TV+FVLFFYVFIYLYKR
Subjt:  LHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR

A0A6J1J0Y1 uncharacterized protein LOC1114824593.9e-12686.38Show/hide
Query:  MGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDNDMFRD
        MGRRS+WVALLVLSLLL  RP  VSSTLPLQSL QGSKDSAT STPKQD +NAHEVHCSRERSRAAWKIIEEYLMPFVDKK+YKIS++CRLHPDNDMFRD
Subjt:  MGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDNDMFRD

Query:  QEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQKSKCNPAAVARNKHMCEALADSCFPVAEGALASQ
        QEQHKS LDFN+WKCGYC+K+FYEEKYLDQHF NRHY+LLNVSRSKCLAD CGALHCD VID++SQKSKCNPAA ARNKHMCE LADSCFP+ +GA AS 
Subjt:  QEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQKSKCNPAAVARNKHMCEALADSCFPVAEGALASQ

Query:  LHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR
        LHEFFLRQFCDAHTCSGKPKPFSRGR+VRTS FY+VIS++TV+FVLFFYVFIYLYKR
Subjt:  LHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G40710.1 zinc finger (C2H2 type) family protein2.1e-7962.62Show/hide
Query:  HEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDNDMFRDQEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCG
        HE+HCSRERSR AWKII+EYLMP+V+K++Y++ S CR+H DND++R+QE+HK R D NEW+CG+C+K FYEEKYLD+HFD+RHYNLLN S  KCL+D+CG
Subjt:  HEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDNDMFRDQEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCG

Query:  ALHCDHVIDTVSQKSKCNPAAVARNKHMCEALADSCFPVAEGALASQLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIY
        ALHCD V+DT   KSKCNPAA A+N+H+CE+LA+SCFPV +G+ A++LH+FFLRQFCDAHTCSG  KP S+ +  + S  Y++ SI+ ++ +L +Y F+Y
Subjt:  ALHCDHVIDTVSQKSKCNPAAVARNKHMCEALADSCFPVAEGALASQLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIY

Query:  LYKRYL
        L++R L
Subjt:  LYKRYL

AT5G63280.1 C2H2-like zinc finger protein2.3e-7851.61Show/hide
Query:  MGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQ--DWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDNDMF
        MGR  +W +L V+ LLL +        L  QS+ QG ++S +     +  + SNA E+HCSRERSRAAW+II++YL PFV++++Y+I   CRLHPDND++
Subjt:  MGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQGSKDSATASTPKQ--DWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDNDMF

Query:  RDQEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQKSKCNPAAVARNKHMCEALADSCFPVAEGALA
        RDQE HK  +D  EWKCGYC+K F +EK+LD+HF  RHYNLLN + +KCLAD+CGALHCD V+ +   KSKCNP AVA+N+H+CE++A+SCFPV++G  A
Subjt:  RDQEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSKCLADVCGALHCDHVIDTVSQKSKCNPAAVARNKHMCEALADSCFPVAEGALA

Query:  SQLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKRYLSSAYLHLYQFMRVSTLTR
        S+LHE FLRQFCDAHTC+G  KPF RG + ++  FY+ ISI+T++ +  FY+ ++L++R   S    L + ++    T+
Subjt:  SQLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYKRYLSSAYLHLYQFMRVSTLTR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGTTGGTTTCAAAAGTTACAGTTAGGGTTTGACGAAGTTTTCGGTGGTGAAGAAGAAGAAGGTGCTCCGCCTTGGTTTTTCTCTCCGGTCTGTGCTCTTCCTTG
GAATCTCGAAAAAATCTGTACTCTTTCGGGGCTGCTCATATTGGTCTGTACTCTTTCTGGCGCTGTATCGTTGGGAAGGAGATTTCTCAACAGTTGCTACGAACTTCTTC
AGATGTCTAATATACTCGGGATAGAGCTCAAGATTACAGTGATTAACGCGCAGGTGAATGCTCAGCTCGATGAAGAGCTCATACATCTGACCCAGATCGGTGGCGTTGCC
GTGCGAGTAGAGCAGAGTCGAAGTCGCCATGGGATGCCGGACATAGATTGCGACGATATCGGTGCCCCGGCGAGTGGGCAGCTTGAGAACTTCAACATTTTCGCGGTGAG
GGAAGGGACTGAGAAGCAAAAGACCCGTAAGTTCATCGGTGATGAGCTTATACGAAGAGAATCTGGATTTGCCTCACGGATTTCTTTGCTTGGTTCTGAAGGTTCAGGGG
TTACAGAATACATTGAGCGCATCGATGAACGGAACAAAGTGTTACATCTTGTACACAGACTAATTAATGATAATGCAAGGCATGCATTTCTTGCGTTCCGGGCTACCTTT
GTGATCACTTTCAAGATCCCATGGAAAAAGGAACTGCAGTTAAAATTCAGAAACTCCAAATGCAATCCTCTAGGGGATTCTTTTTCTTTTTCATTTTTTTTTTTTTTTTT
TTTGTGGGGAGCGGCCGTGGCGGCTGTGGCTAGAAGAGAGACAAAAGGTCGACGTCCAAAAGAGAACCCACAAATTCCGAGAATCATCTTGCTGAACGTAATCCGGCTCG
AATGGACCCAACCCAGTCGAGGAGATTTGCGGCAAGCAAAGCCCAAATGGAAAGCTAGGACTGATTTCTTGATGGAGGTGGCAAAGAGAGGGATGAAGGTTGAGAGAGGA
AGTTATCTGTGCAGAAGATGGGAGAAGAACCAACAGCGCAGAAACCCTAATGCAGACGTGAAGGACAGAGAGAAGAGTGAAAATGAATTTGGAAAATACAAAATTGCCCC
AGATCTGTCCGAAATTAAGAAAATGGCCCTCGCAGTCTTCGAAGTTTTACTTGTGTTCCTCCGCGAGGCGCCTGTGTTCAAGTCCGCTCGTGGAGGTGCCATGACATCGA
CGATGGGAAGGCGATCCATGTGGGTTGCTCTTCTCGTCCTCTCTCTTCTGCTTTGTATGCGACCGCAGAGCGTTTCTTCCACTCTTCCTCTCCAATCATTGAAACAGGGT
TCTAAAGATTCTGCCACTGCAAGTACTCCGAAGCAGGATTGGAGCAATGCTCATGAGGTACATTGCTCAAGAGAAAGGAGTAGGGCGGCATGGAAAATAATCGAGGAGTA
TCTAATGCCCTTTGTGGATAAAAAGAAATACAAGATTTCTAGTGAGTGTAGGCTTCACCCTGATAACGACATGTTCAGAGATCAAGAACAGCACAAGAGCCGACTAGATT
TTAATGAATGGAAATGTGGATATTGTAGAAAAAGGTTCTACGAAGAGAAGTATCTTGATCAGCATTTTGACAATAGGCACTATAATTTGCTCAATGTGAGTCGAAGCAAG
TGCTTGGCAGATGTTTGTGGTGCATTGCATTGTGACCATGTGATAGATACAGTATCACAGAAAAGTAAATGCAATCCTGCTGCTGTTGCAAGAAATAAACATATGTGTGA
GGCTCTAGCTGACAGTTGTTTTCCTGTTGCTGAGGGTGCCTTGGCCAGCCAACTTCACGAATTCTTCTTGCGCCAATTCTGTGACGCACACACTTGTAGCGGGAAGCCAA
AACCTTTCTCCAGAGGTCGGGAGGTTAGGACGAGTTGGTTCTATGTTGTTATTTCGATAATGACAGTCCTATTTGTGCTGTTTTTCTATGTCTTCATTTATTTGTATAAG
AGGTATCTTTCTTCTGCCTACCTTCATTTGTATCAGTTCATGAGAGTTTCAACTTTGACTCGGACAATCATGTTCATCTTACAGGGGAATGAGAACAAGACCGCAGGGGC
TGAAGCGCCTCTCACAAAGCGGAAGAAAGAAAAAACCGTCATAGATGGAACCTTTTGCCCTTTCAATAACAATATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGTTGGTTTCAAAAGTTACAGTTAGGGTTTGACGAAGTTTTCGGTGGTGAAGAAGAAGAAGGTGCTCCGCCTTGGTTTTTCTCTCCGGTCTGTGCTCTTCCTTG
GAATCTCGAAAAAATCTGTACTCTTTCGGGGCTGCTCATATTGGTCTGTACTCTTTCTGGCGCTGTATCGTTGGGAAGGAGATTTCTCAACAGTTGCTACGAACTTCTTC
AGATGTCTAATATACTCGGGATAGAGCTCAAGATTACAGTGATTAACGCGCAGGTGAATGCTCAGCTCGATGAAGAGCTCATACATCTGACCCAGATCGGTGGCGTTGCC
GTGCGAGTAGAGCAGAGTCGAAGTCGCCATGGGATGCCGGACATAGATTGCGACGATATCGGTGCCCCGGCGAGTGGGCAGCTTGAGAACTTCAACATTTTCGCGGTGAG
GGAAGGGACTGAGAAGCAAAAGACCCGTAAGTTCATCGGTGATGAGCTTATACGAAGAGAATCTGGATTTGCCTCACGGATTTCTTTGCTTGGTTCTGAAGGTTCAGGGG
TTACAGAATACATTGAGCGCATCGATGAACGGAACAAAGTGTTACATCTTGTACACAGACTAATTAATGATAATGCAAGGCATGCATTTCTTGCGTTCCGGGCTACCTTT
GTGATCACTTTCAAGATCCCATGGAAAAAGGAACTGCAGTTAAAATTCAGAAACTCCAAATGCAATCCTCTAGGGGATTCTTTTTCTTTTTCATTTTTTTTTTTTTTTTT
TTTGTGGGGAGCGGCCGTGGCGGCTGTGGCTAGAAGAGAGACAAAAGGTCGACGTCCAAAAGAGAACCCACAAATTCCGAGAATCATCTTGCTGAACGTAATCCGGCTCG
AATGGACCCAACCCAGTCGAGGAGATTTGCGGCAAGCAAAGCCCAAATGGAAAGCTAGGACTGATTTCTTGATGGAGGTGGCAAAGAGAGGGATGAAGGTTGAGAGAGGA
AGTTATCTGTGCAGAAGATGGGAGAAGAACCAACAGCGCAGAAACCCTAATGCAGACGTGAAGGACAGAGAGAAGAGTGAAAATGAATTTGGAAAATACAAAATTGCCCC
AGATCTGTCCGAAATTAAGAAAATGGCCCTCGCAGTCTTCGAAGTTTTACTTGTGTTCCTCCGCGAGGCGCCTGTGTTCAAGTCCGCTCGTGGAGGTGCCATGACATCGA
CGATGGGAAGGCGATCCATGTGGGTTGCTCTTCTCGTCCTCTCTCTTCTGCTTTGTATGCGACCGCAGAGCGTTTCTTCCACTCTTCCTCTCCAATCATTGAAACAGGGT
TCTAAAGATTCTGCCACTGCAAGTACTCCGAAGCAGGATTGGAGCAATGCTCATGAGGTACATTGCTCAAGAGAAAGGAGTAGGGCGGCATGGAAAATAATCGAGGAGTA
TCTAATGCCCTTTGTGGATAAAAAGAAATACAAGATTTCTAGTGAGTGTAGGCTTCACCCTGATAACGACATGTTCAGAGATCAAGAACAGCACAAGAGCCGACTAGATT
TTAATGAATGGAAATGTGGATATTGTAGAAAAAGGTTCTACGAAGAGAAGTATCTTGATCAGCATTTTGACAATAGGCACTATAATTTGCTCAATGTGAGTCGAAGCAAG
TGCTTGGCAGATGTTTGTGGTGCATTGCATTGTGACCATGTGATAGATACAGTATCACAGAAAAGTAAATGCAATCCTGCTGCTGTTGCAAGAAATAAACATATGTGTGA
GGCTCTAGCTGACAGTTGTTTTCCTGTTGCTGAGGGTGCCTTGGCCAGCCAACTTCACGAATTCTTCTTGCGCCAATTCTGTGACGCACACACTTGTAGCGGGAAGCCAA
AACCTTTCTCCAGAGGTCGGGAGGTTAGGACGAGTTGGTTCTATGTTGTTATTTCGATAATGACAGTCCTATTTGTGCTGTTTTTCTATGTCTTCATTTATTTGTATAAG
AGGTATCTTTCTTCTGCCTACCTTCATTTGTATCAGTTCATGAGAGTTTCAACTTTGACTCGGACAATCATGTTCATCTTACAGGGGAATGAGAACAAGACCGCAGGGGC
TGAAGCGCCTCTCACAAAGCGGAAGAAAGAAAAAACCGTCATAGATGGAACCTTTTGCCCTTTCAATAACAATATGTAG
Protein sequenceShow/hide protein sequence
MEGWFQKLQLGFDEVFGGEEEEGAPPWFFSPVCALPWNLEKICTLSGLLILVCTLSGAVSLGRRFLNSCYELLQMSNILGIELKITVINAQVNAQLDEELIHLTQIGGVA
VRVEQSRSRHGMPDIDCDDIGAPASGQLENFNIFAVREGTEKQKTRKFIGDELIRRESGFASRISLLGSEGSGVTEYIERIDERNKVLHLVHRLINDNARHAFLAFRATF
VITFKIPWKKELQLKFRNSKCNPLGDSFSFSFFFFFFLWGAAVAAVARRETKGRRPKENPQIPRIILLNVIRLEWTQPSRGDLRQAKPKWKARTDFLMEVAKRGMKVERG
SYLCRRWEKNQQRRNPNADVKDREKSENEFGKYKIAPDLSEIKKMALAVFEVLLVFLREAPVFKSARGGAMTSTMGRRSMWVALLVLSLLLCMRPQSVSSTLPLQSLKQG
SKDSATASTPKQDWSNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISSECRLHPDNDMFRDQEQHKSRLDFNEWKCGYCRKRFYEEKYLDQHFDNRHYNLLNVSRSK
CLADVCGALHCDHVIDTVSQKSKCNPAAVARNKHMCEALADSCFPVAEGALASQLHEFFLRQFCDAHTCSGKPKPFSRGREVRTSWFYVVISIMTVLFVLFFYVFIYLYK
RYLSSAYLHLYQFMRVSTLTRTIMFILQGNENKTAGAEAPLTKRKKEKTVIDGTFCPFNNNM