; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027525 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027525
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionC2H2-type domain-containing protein
Genome locationchr8:1720814..1725030
RNA-Seq ExpressionLag0027525
SyntenyLag0027525
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR013087 - Zinc finger C2H2-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7022689.1 hypothetical protein SDJN02_16423, partial [Cucurbita argyrosperma subsp. argyrosperma]9.2e-13687.23Show/hide
Query:  ESLAFNSACGGSIASTMGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYK
        + LAFNSA GGSI STMGRRSIWVALLV+SLL SL+PQRVSST PLQSL QGSKDSATES PKQDWN+A EVHCSRERSRAAWKIIEEYLMPFVDKKK+K
Subjt:  ESLAFNSACGGSIASTMGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYK

Query:  ISTACRLHPDNDMFRDQEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQ-KSKCNPAAAARNKHMCE
        IST CRLHPDNDMFRDQEQHK+HLDFNDWKCGYCRKRFYEEKYLDQHFDNRHY LLNVSR+KC+AD+CGAL CDHV+D++SQ KSKCNPAA ARNKHMCE
Subjt:  ISTACRLHPDNDMFRDQEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQ-KSKCNPAAAARNKHMCE

Query:  ALADSCFPVNEGALASHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR
        ALADSCFP+N+GA ASHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTS  Y+VISVLT+L V+FFYVF+YLYKR
Subjt:  ALADSCFPVNEGALASHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR

XP_022928324.1 uncharacterized protein LOC111435184 [Cucurbita moschata]1.8e-13188.17Show/hide
Query:  IASTMGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDND
        + STMGRRSIWVALLV+SLL SL+PQRVSSTLPLQSL QGSKDSATES PKQDWN+A EVHCSRERSRAAWKIIEEYLMPFVDKKK+KIST CRLHPDND
Subjt:  IASTMGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDND

Query:  MFRDQEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQ-KSKCNPAAAARNKHMCEALADSCFPVNEG
        MFRDQEQHK+HLDFNDWKCGYCRKRFYEEKYLDQHFDNRHY LLNVSR+KC+AD+CGALHCDHV+D++SQ KSKCNPAA ARNKHMCEALADSCFP+N+G
Subjt:  MFRDQEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQ-KSKCNPAAAARNKHMCEALADSCFPVNEG

Query:  ALASHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR
         +ASHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTS  Y+VISVLT+L VLFFYVF+YLYKR
Subjt:  ALASHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR

XP_023526204.1 uncharacterized protein LOC111789755 [Cucurbita pepo subsp. pepo]1.1e-13190.66Show/hide
Query:  MGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDNDMFRD
        MGRRS+WVALLVLSLLL  +P  VSSTLPLQSL QGSKDSATESTPKQD NNAHEVHCSRERSRAAWKIIEEYLMPFVDKK+YKIST CRLHPDNDMFRD
Subjt:  MGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDNDMFRD

Query:  QEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQKSKCNPAAAARNKHMCEALADSCFPVNEGALASH
        QEQHKSHLDFNDWKCGYC+K+FYEEKYLDQHF NRHY LLNVSRSKCLAD CGALHCD VIDTISQKSKCNPAAAARNKHMCE LADSCFP+++GA ASH
Subjt:  QEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQKSKCNPAAAARNKHMCEALADSCFPVNEGALASH

Query:  LHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR
        LHEFFLRQFCDAHTCSGKPKPFSRGRQVRTS FY+VISVLTV+FVLFFYVFIYLYKR
Subjt:  LHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR

XP_023532028.1 uncharacterized protein LOC111794116 [Cucurbita pepo subsp. pepo]1.2e-13289.92Show/hide
Query:  MGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDNDMFRD
        MGRRSIWVALLV+SLL SL+PQRVSSTLPLQSL QGSKDSATES PKQDWN+A EVHCSRERSRAAWKIIEEYLMPFVDKKK+KIST CRLHPDNDMFRD
Subjt:  MGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDNDMFRD

Query:  QEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQ-KSKCNPAAAARNKHMCEALADSCFPVNEGALAS
        QEQHK+HLDFNDWKCGYCRKRFYEEKYLDQHFDNRHY LLNVSR+KC+AD+CGALHCDHV+D++SQ KSKCNPAAAARNKHMCEALADSCFP+NEGA AS
Subjt:  QEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQ-KSKCNPAAAARNKHMCEALADSCFPVNEGALAS

Query:  HLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR
        HLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTS  Y+VISVLT+L VLFFYVF+YLYKR
Subjt:  HLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR

XP_038891552.1 uncharacterized protein LOC120080934 [Benincasa hispida]3.3e-13390.66Show/hide
Query:  MGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDNDMFRD
        MGRRS+WVALLV  LLLSL+P  VSS LPLQ+L QG KDSA+ESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFV+KKKYKIS+ CRLHPDNDMFRD
Subjt:  MGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDNDMFRD

Query:  QEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQKSKCNPAAAARNKHMCEALADSCFPVNEGALASH
        QEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHY LLNVSRS+CLAD+CGALHCDHVIDT+SQKSKCNPAAAARNKHMCE LADSCFPV+EGA ASH
Subjt:  QEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQKSKCNPAAAARNKHMCEALADSCFPVNEGALASH

Query:  LHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR
        LHEFFLRQFCDAHTCSGKPKPFSRGRQVR + FYIVISVLTVLFVLFFYVFIYLYKR
Subjt:  LHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR

TrEMBL top hitse value%identityAlignment
A0A0A0L8D2 C2H2-type domain-containing protein5.6e-13188.8Show/hide
Query:  MGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQG--SKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDNDMF
        MGRRSI VALL+ SLLLSL+P  VSSTLPLQSL QG  SKDSA+ESTPKQDWNNAHEVHCSRERSRAAWK+IEEYLMPFV+KKKYKIST CRLHPDNDMF
Subjt:  MGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQG--SKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDNDMF

Query:  RDQEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQKSKCNPAAAARNKHMCEALADSCFPVNEGALA
        RDQEQHKSHLDFNDWKCGYCRKRFYEEKY+DQHFDNRHY LLNVSR++CLAD+CGALHCDHVID +SQKSKCNPAAAARNKHMCE LADSCFPV+EGALA
Subjt:  RDQEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQKSKCNPAAAARNKHMCEALADSCFPVNEGALA

Query:  SHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR
        SHLHEFFL QFCDAHTCSGKPKPFSRGRQVR S FYIVISVLT+LFV+FFYVF YLY R
Subjt:  SHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR

A0A5A7URM7 Zinc finger family protein4.3e-13189.19Show/hide
Query:  MGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQG--SKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDNDMF
        MGRRSI VALLV SLLLSL+P  VSSTLPLQSL QG  SKDSA+ STPKQDWNNAHEVHCSRERSRAAWK+IEEYLMPFV+KKKY+IST CRLHPDNDMF
Subjt:  MGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQG--SKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDNDMF

Query:  RDQEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQKSKCNPAAAARNKHMCEALADSCFPVNEGALA
        RDQEQHKSHLDFNDWKCGYCRKRFYEEKY+DQHFDNRHY LLNVSR++CLAD+CGALHCDHVID +SQKSKCNPAAAARNKHMCE LADSCFPV+EGALA
Subjt:  RDQEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQKSKCNPAAAARNKHMCEALADSCFPVNEGALA

Query:  SHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR
        SHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTS FYIVISVLT+LFV+FFYVF YLY R
Subjt:  SHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR

A0A6J1EKH9 uncharacterized protein LOC1114351848.7e-13288.17Show/hide
Query:  IASTMGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDND
        + STMGRRSIWVALLV+SLL SL+PQRVSSTLPLQSL QGSKDSATES PKQDWN+A EVHCSRERSRAAWKIIEEYLMPFVDKKK+KIST CRLHPDND
Subjt:  IASTMGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDND

Query:  MFRDQEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQ-KSKCNPAAAARNKHMCEALADSCFPVNEG
        MFRDQEQHK+HLDFNDWKCGYCRKRFYEEKYLDQHFDNRHY LLNVSR+KC+AD+CGALHCDHV+D++SQ KSKCNPAA ARNKHMCEALADSCFP+N+G
Subjt:  MFRDQEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQ-KSKCNPAAAARNKHMCEALADSCFPVNEG

Query:  ALASHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR
         +ASHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTS  Y+VISVLT+L VLFFYVF+YLYKR
Subjt:  ALASHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR

A0A6J1F459 uncharacterized protein LOC1114419961.1e-13190.27Show/hide
Query:  MGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDNDMFRD
        MGRRS+WVALLVLSLLL  +P  VSSTLPLQSL QGSKDSATESTPKQD NNAHEVHCSRERSRAAWKIIEEYLMPF+DKK+YK+ST CRLHPDNDMFRD
Subjt:  MGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDNDMFRD

Query:  QEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQKSKCNPAAAARNKHMCEALADSCFPVNEGALASH
        QEQHKSHLDFNDWKCGYC+K+FYEEKYLDQHF NRHY LLNVSRSKCLAD CGALHCD VIDTISQKSKCNPAAAARNKHMCE LADSCFP+++GA ASH
Subjt:  QEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQKSKCNPAAAARNKHMCEALADSCFPVNEGALASH

Query:  LHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR
        LHEFFLRQFCDAHTCSGKPKPFSRGRQVRTS FYIVISVLTV+FVLFFYVFIYLYKR
Subjt:  LHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR

A0A6J1J0Y1 uncharacterized protein LOC1114824591.5e-13190.66Show/hide
Query:  MGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDNDMFRD
        MGRRS+WVALLVLSLLL  +P  VSSTLPLQSL QGSKDSATESTPKQD NNAHEVHCSRERSRAAWKIIEEYLMPFVDKK+YKIST CRLHPDNDMFRD
Subjt:  MGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDNDMFRD

Query:  QEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQKSKCNPAAAARNKHMCEALADSCFPVNEGALASH
        QEQHKSHLDFNDWKCGYC+K+FYEEKYLDQHF NRHY LLNVSRSKCLAD CGALHCD VID+ISQKSKCNPAAAARNKHMCE LADSCFP+++GA ASH
Subjt:  QEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQKSKCNPAAAARNKHMCEALADSCFPVNEGALASH

Query:  LHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR
        LHEFFLRQFCDAHTCSGKPKPFSRGRQVRTS FYIVISVLTV+FVLFFYVFIYLYKR
Subjt:  LHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G40710.1 zinc finger (C2H2 type) family protein9.9e-8061.65Show/hide
Query:  HEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDNDMFRDQEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCG
        HE+HCSRERSR AWKII+EYLMP+V+K++Y++ + CR+H DND++R+QE+HK   D N+W+CG+C+K FYEEKYLD+HFD+RHY LLN S  KCL+D+CG
Subjt:  HEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDNDMFRDQEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCG

Query:  ALHCDHVIDTISQKSKCNPAAAARNKHMCEALADSCFPVNEGALASHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIY
        ALHCD V+DT   KSKCNPAAAA+N+H+CE+LA+SCFPVN+G+ A+ LH+FFLRQFCDAHTCSG  KP S+ +  + S  YI+ S++ ++ +L +Y F+Y
Subjt:  ALHCDHVIDTISQKSKCNPAAAARNKHMCEALADSCFPVNEGALASHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIY

Query:  LYKRYL
        L++R L
Subjt:  LYKRYL

AT5G63280.1 C2H2-like zinc finger protein1.4e-7852.54Show/hide
Query:  MGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATEST----PKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDND
        MGR  +W +L V+ LLL L        L  QS+ QG ++S  EST     + + +NA E+HCSRERSRAAW+II++YL PFV++++Y+I   CRLHPDND
Subjt:  MGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATEST----PKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKISTACRLHPDND

Query:  MFRDQEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQKSKCNPAAAARNKHMCEALADSCFPVNEGA
        ++RDQE HK H+D  +WKCGYC+K F +EK+LD+HF  RHY LLN + +KCLAD+CGALHCD V+ +   KSKCNP A A+N+H+CE++A+SCFPV++G 
Subjt:  MFRDQEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQKSKCNPAAAARNKHMCEALADSCFPVNEGA

Query:  LASHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKRYLSFGLPSFVSVHKS
         AS LHE FLRQFCDAHTC+G  KPF RG + ++  FY+ IS+LT++ +  FY+ ++L++R    G      + KS
Subjt:  LASHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKRYLSFGLPSFVSVHKS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGGTCGGATATAGCAGCTTCCCGGTTCTCTCTCCTTCTGCTGGTTGTCCTCTTGGGTTAGACCACCACCGATGTGCGATCCGAACTGGCAAGCATTCTTCCCTACA
GACGGCGCCAACTGTTGATGCCAAAATTTGGCCAAGCAATCCACCTGCCCTTCACGGTTTGGAGGCGTTCTGGGACGAAGCAGGCTGGACCGAGGCAGCTGGAAGCGGTA
GGGGCCGAGCGGAGGCAAACGGACTCCTCCCGGTCCTATCTCCTCCCGGTTGTCCTGTCAGCTCCTCATCTTTTCAAAATTTCAAGGTGGCCCTCACAGCTTCGCAGGTT
TTCTCGAGCTTGTTCCGCGACCGCGAGTCGCTTGCGTTCAATTCCGCCTGTGGAGGTTCCATAGCATCAACGATGGGAAGGCGATCCATATGGGTTGCTCTTCTCGTCCT
CTCTCTTCTTCTTTCGTTGCAACCGCAGAGAGTTTCTTCAACTCTTCCTCTTCAATCATTGATTCAGGGTTCTAAAGATTCTGCCACTGAAAGTACTCCGAAGCAGGATT
GGAACAATGCTCATGAGGTACATTGCTCGAGAGAAAGGAGTAGAGCAGCATGGAAAATAATTGAGGAGTATCTAATGCCCTTTGTGGATAAAAAGAAATACAAGATTTCG
ACAGCGTGTAGGCTTCACCCTGATAATGACATGTTCAGAGATCAAGAACAGCACAAGAGCCATCTAGATTTTAACGACTGGAAGTGTGGATATTGTAGAAAAAGATTCTA
TGAAGAGAAGTATCTCGATCAGCACTTTGACAATAGACACTATACTTTACTCAATGTGAGTCGAAGCAAGTGCTTGGCAGATGTTTGTGGTGCATTGCACTGTGATCATG
TGATAGATACAATATCACAGAAAAGTAAATGCAATCCTGCTGCTGCGGCAAGAAATAAACACATGTGTGAGGCTCTTGCTGACAGTTGTTTTCCTGTTAATGAGGGTGCT
TTGGCCAGCCATCTTCACGAATTCTTCTTGCGTCAGTTTTGTGATGCACACACTTGTAGCGGAAAGCCCAAACCTTTCTCCAGAGGTCGACAGGTTAGGACGAGTGGGTT
CTACATTGTTATTTCAGTTTTGACAGTCCTTTTCGTGCTGTTTTTCTATGTCTTCATTTACTTGTATAAGAGGTATCTTTCCTTTGGCCTACCATCATTTGTATCGGTTC
ATAAGAGTTTCAACTTTAACTTGACAATCGTGTTCGTCTTAAAGGGGAATGAGAACGAGACCGCAAGTGTTGAAGCGCCTCTCCGAAAGTGGAAGAAAGAAAAAACCGTC
ATAGATAGAACCTGTTGCCCTTTCAATGATAATATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTGGTCGGATATAGCAGCTTCCCGGTTCTCTCTCCTTCTGCTGGTTGTCCTCTTGGGTTAGACCACCACCGATGTGCGATCCGAACTGGCAAGCATTCTTCCCTACA
GACGGCGCCAACTGTTGATGCCAAAATTTGGCCAAGCAATCCACCTGCCCTTCACGGTTTGGAGGCGTTCTGGGACGAAGCAGGCTGGACCGAGGCAGCTGGAAGCGGTA
GGGGCCGAGCGGAGGCAAACGGACTCCTCCCGGTCCTATCTCCTCCCGGTTGTCCTGTCAGCTCCTCATCTTTTCAAAATTTCAAGGTGGCCCTCACAGCTTCGCAGGTT
TTCTCGAGCTTGTTCCGCGACCGCGAGTCGCTTGCGTTCAATTCCGCCTGTGGAGGTTCCATAGCATCAACGATGGGAAGGCGATCCATATGGGTTGCTCTTCTCGTCCT
CTCTCTTCTTCTTTCGTTGCAACCGCAGAGAGTTTCTTCAACTCTTCCTCTTCAATCATTGATTCAGGGTTCTAAAGATTCTGCCACTGAAAGTACTCCGAAGCAGGATT
GGAACAATGCTCATGAGGTACATTGCTCGAGAGAAAGGAGTAGAGCAGCATGGAAAATAATTGAGGAGTATCTAATGCCCTTTGTGGATAAAAAGAAATACAAGATTTCG
ACAGCGTGTAGGCTTCACCCTGATAATGACATGTTCAGAGATCAAGAACAGCACAAGAGCCATCTAGATTTTAACGACTGGAAGTGTGGATATTGTAGAAAAAGATTCTA
TGAAGAGAAGTATCTCGATCAGCACTTTGACAATAGACACTATACTTTACTCAATGTGAGTCGAAGCAAGTGCTTGGCAGATGTTTGTGGTGCATTGCACTGTGATCATG
TGATAGATACAATATCACAGAAAAGTAAATGCAATCCTGCTGCTGCGGCAAGAAATAAACACATGTGTGAGGCTCTTGCTGACAGTTGTTTTCCTGTTAATGAGGGTGCT
TTGGCCAGCCATCTTCACGAATTCTTCTTGCGTCAGTTTTGTGATGCACACACTTGTAGCGGAAAGCCCAAACCTTTCTCCAGAGGTCGACAGGTTAGGACGAGTGGGTT
CTACATTGTTATTTCAGTTTTGACAGTCCTTTTCGTGCTGTTTTTCTATGTCTTCATTTACTTGTATAAGAGGTATCTTTCCTTTGGCCTACCATCATTTGTATCGGTTC
ATAAGAGTTTCAACTTTAACTTGACAATCGTGTTCGTCTTAAAGGGGAATGAGAACGAGACCGCAAGTGTTGAAGCGCCTCTCCGAAAGTGGAAGAAAGAAAAAACCGTC
ATAGATAGAACCTGTTGCCCTTTCAATGATAATATGTAG
Protein sequenceShow/hide protein sequence
MLVGYSSFPVLSPSAGCPLGLDHHRCAIRTGKHSSLQTAPTVDAKIWPSNPPALHGLEAFWDEAGWTEAAGSGRGRAEANGLLPVLSPPGCPVSSSSFQNFKVALTASQV
FSSLFRDRESLAFNSACGGSIASTMGRRSIWVALLVLSLLLSLQPQRVSSTLPLQSLIQGSKDSATESTPKQDWNNAHEVHCSRERSRAAWKIIEEYLMPFVDKKKYKIS
TACRLHPDNDMFRDQEQHKSHLDFNDWKCGYCRKRFYEEKYLDQHFDNRHYTLLNVSRSKCLADVCGALHCDHVIDTISQKSKCNPAAAARNKHMCEALADSCFPVNEGA
LASHLHEFFLRQFCDAHTCSGKPKPFSRGRQVRTSGFYIVISVLTVLFVLFFYVFIYLYKRYLSFGLPSFVSVHKSFNFNLTIVFVLKGNENETASVEAPLRKWKKEKTV
IDRTCCPFNDNM