CuGenDBv2

Carg13397 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg13397
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: SWR1-complex protein 4-like
Genome location: Carg_Chr08:2541646..2550645
RNA-Seq Expression: Carg13397
Synteny: Carg13397
Gene Ontology terms:
GO:0000122 - negative regulation of transcription by RNA polymerase II (biological process)
GO:0006281 - DNA repair (biological process)
GO:0043486 - histone exchange (biological process)
GO:0043967 - histone H4 acetylation (biological process)
GO:0043968 - histone H2A acetylation (biological process)
GO:0000812 - Swr1 complex (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0003714 - transcription corepressor activity (molecular function)
InterPro domains:
IPR008468 - DNA methyltransferase 1-associated 1
IPR027109 - SWR1-complex protein 4/DNA methyltransferase 1-associated protein 1
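
For readers who want to use the genome location programmatically, the following is a minimal sketch of splitting the `Carg_Chr08:2541646..2550645` string into its parts. The `Location` class and `parse_location` helper are hypothetical names introduced here, and the span length assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into its three fields."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("Carg_Chr08:2541646..2550645")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 9000 bases on Carg_Chr08.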


Homology
GenBank top hits (e value, % identity, alignment)
KAG6593255.1 SWR1-complex protein 4, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.9e-131; identity: 100%)
Query:  MPRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAA
        MPRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAA
Subjt:  MPRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAA

Query:  APSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE
        APSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE
Subjt:  APSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE

Query:  GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
        GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
Subjt:  GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL

KAG7025607.1 SWR1-complex protein 4 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 3.9e-131; identity: 100%)
Query:  MPRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAA
        MPRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAA
Subjt:  MPRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAA

Query:  APSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE
        APSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE
Subjt:  APSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE

Query:  GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
        GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
Subjt:  GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL

XP_022959680.1 SWR1-complex protein 4-like [Cucurbita moschata] (e value: 1.5e-127; identity: 98.08%)
Query:  PRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAAA
        PRESSGN LAKDPY+VSQEIER+RALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDS+PSVSNVQPP PAAA
Subjt:  PRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAAA

Query:  PSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFREG
        PSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFREG
Subjt:  PSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFREG

Query:  PYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
        PYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
Subjt:  PYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL

XP_023004539.1 SWR1-complex protein 4-like [Cucurbita maxima] (e value: 7.4e-130; identity: 99.23%)
Query:  MPRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAA
        MPRESSGNPLAKDPYNVSQE ERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVT NAVPGATERAVVPGDSVPSVSNVQPPTPAA
Subjt:  MPRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAA

Query:  APSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE
        APSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE
Subjt:  APSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE

Query:  GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
        GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
Subjt:  GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL

XP_023515250.1 SWR1-complex protein 4-like [Cucurbita pepo subsp. pepo] (e value: 1.2e-127; identity: 98.08%)
Query:  MPRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAA
        MPRESSGN LAKDPY+VSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDS+PSVSNVQPP PAA
Subjt:  MPRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAA

Query:  APSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE
        APSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE
Subjt:  APSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE

Query:  GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
        GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPS PAQTKRPRKQKGSDL
Subjt:  GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K9K0 SANT domain-containing protein (e value: 7.3e-115; identity: 89.96%)
Query:  RESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAAAP
        RESSGN  AKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITE+R+ ERV E+SELPVTSNAVP  TER VVPGD+VPS+SNVQPP PAA P
Subjt:  RESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAAAP

Query:  STLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFREGP
        ST+VADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTK+VCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+ P
Subjt:  STLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFREGP

Query:  YNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
        Y EAPGTPKDR+FI DS+S GGER GKRDQKRKATGRLSEAPS PAQ+KRPRKQKGSDL
Subjt:  YNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL

A0A1S3BSL6 SWR1-complex protein 4 (e value: 3.3e-115; identity: 90.35%)
Query:  RESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAAAP
        RESSGN  AKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESR+ ERV E+SELPVTSNAVP  TER VVPGD+VPS+SNVQPP PAA P
Subjt:  RESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAAAP

Query:  STLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFREGP
        ST+VADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTK+VCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+ P
Subjt:  STLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFREGP

Query:  YNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
        Y EAPGTPKDR+FI DS+S GGER GKRDQKRKATGRLSEAPS PAQ+KRPRKQKGSDL
Subjt:  YNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL

A0A6J1DID3 SWR1-complex protein 4 (e value: 6.8e-113; identity: 89.58%)
Query:  RESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAAAP
        RE SGN L KDPYNVS EIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRR ERV E+SELPV SN VP   ERAVVPGDSVPS+SNVQP  PAAAP
Subjt:  RESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAAAP

Query:  STLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFREGP
        STL ADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTK+VCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+ P
Subjt:  STLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFREGP

Query:  YNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
        YN+APGTPKDRSFIPDS++ GGER  KRDQKRKATGRLSEAPS PAQ+KRPRKQKGSDL
Subjt:  YNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL

A0A6J1H708 SWR1-complex protein 4-like (e value: 7.5e-128; identity: 98.08%)
Query:  PRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAAA
        PRESSGN LAKDPY+VSQEIER+RALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDS+PSVSNVQPP PAAA
Subjt:  PRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAAA

Query:  PSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFREG
        PSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFREG
Subjt:  PSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFREG

Query:  PYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
        PYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
Subjt:  PYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL

A0A6J1KWJ6 SWR1-complex protein 4-like (e value: 3.6e-130; identity: 99.23%)
Query:  MPRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAA
        MPRESSGNPLAKDPYNVSQE ERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVT NAVPGATERAVVPGDSVPSVSNVQPPTPAA
Subjt:  MPRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAA

Query:  APSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE
        APSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE
Subjt:  APSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE

Query:  GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
        GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
Subjt:  GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL

SwissProt top hits (e value, % identity, alignment)
Q8VZL6 SWR1-complex protein 4 (e value: 1.0e-73; identity: 64.75%)
Query:  PRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAAA
        P + + +PL K+PY+++++ ERKRALSMVLSQ++ QE+KDAE+LAEAK+ITE R   R  E+ ++    NA     +  VVPG SV   SN Q P  A A
Subjt:  PRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAAA

Query:  PSTL-VADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE
        PSTL +AD ASTLASLRML VYLRTY LEQMVQAASS+ GLRTIKRVEQTLQDL VNLKP+VPTK+VC EHLELRKEILTLLNLQKQLQ KE+EGSS RE
Subjt:  PSTL-VADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE

Query:  GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
        G Y   P TPKDR F PD  S G ER  K++QKRK  GR ++ PSP    KRPRK K SDL
Subjt:  GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL

Arabidopsis top hits (e value, % identity, alignment)
AT2G47210.1 myb-like transcription factor family protein (e value: 7.3e-75; identity: 64.75%)
Query:  PRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAAA
        P + + +PL K+PY+++++ ERKRALSMVLSQ++ QE+KDAE+LAEAK+ITE R   R  E+ ++    NA     +  VVPG SV   SN Q P  A A
Subjt:  PRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAAA

Query:  PSTL-VADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE
        PSTL +AD ASTLASLRML VYLRTY LEQMVQAASS+ GLRTIKRVEQTLQDL VNLKP+VPTK+VC EHLELRKEILTLLNLQKQLQ KE+EGSS RE
Subjt:  PSTL-VADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRE

Query:  GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
        G Y   P TPKDR F PD  S G ER  K++QKRK  GR ++ PSP    KRPRK K SDL
Subjt:  GPYNEAPGTPKDRSFIPDSMSSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
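
The % identity column in the hit tables can be related to the alignment midlines (the unlabeled middle row of each Query/Subjt block). In BLAST-style pairwise output, a letter in the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts identities in a midline string. The function names are hypothetical, and it assumes the midline is passed without the leading indentation and with internal spaces preserved.

```python
from typing import Tuple


def identity_from_midline(midline: str) -> Tuple[int, int]:
    """Return (identical columns, total columns) for one midline string.

    Letters count as identities; '+' and spaces do not.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return identities, len(midline)


def percent_identity(midline: str) -> float:
    """Percent identity over the aligned columns of a midline."""
    identities, columns = identity_from_midline(midline)
    if columns == 0:
        raise ValueError("empty midline")
    return 100.0 * identities / columns


# First 28 midline columns of the Cucurbita maxima hit (XP_023004539.1).
fragment = "MPRESSGNPLAKDPYNVSQE ERKRALS"
print(identity_from_midline(fragment))
print(round(percent_identity(fragment), 2))
```

For a full hit, the midline rows of all blocks would be concatenated before counting. Trailing spaces lost during page extraction would shorten the column count, so results from scraped text should be treated as approximate.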


Sequences
CDS sequence
ATGCCTCGGGAAAGTTCAGGGAATCCTCTCGCCAAGGATCCTTACAATGTCTCACAAGAGATTGAGCGCAAACGGGCATTGTCCATGGTTCTCTCCCAAACAAAACAGCA
AGAACGAAAAGATGCAGAGGTTCTAGCTGAAGCAAAAAAGATAACTGAATCTCGCAGAGATGAAAGAGTGCCTGAAAAATCTGAGCTGCCTGTCACTTCAAATGCTGTCC
CTGGAGCTACTGAAAGGGCTGTCGTTCCTGGAGATTCTGTACCATCTGTATCCAATGTGCAGCCCCCTACTCCAGCAGCTGCACCTTCAACTTTAGTGGCGGATAATGCT
TCTACTCTTGCTTCTCTTCGCATGCTTCCTGTATACTTGAGAACGTATGCACTTGAGCAAATGGTACAAGCTGCAAGTTCATCTGCAGGGCTTCGGACTATCAAGCGAGT
TGAACAAACATTACAAGATCTTTCGGTTAATCTAAAGCCGAGGGTTCCAACAAAATCAGTGTGTGCAGAACATCTTGAATTGAGGAAAGAAATATTGACTCTACTGAATC
TTCAAAAGCAGTTGCAAAATAAGGAGGCAGAAGGTTCATCTTTCCGTGAGGGTCCCTACAACGAGGCACCTGGCACACCCAAGGATCGTAGTTTTATTCCTGATTCTATG
AGTTCTGGAGGGGAAAGGTCTGGCAAACGGGATCAGAAACGTAAGGCAACTGGGAGATTATCTGAAGCTCCATCACCACCAGCTCAAACTAAAAGGCCAAGAAAACAGAA
AGGATCTGATCTGTGA
mRNA sequence
TTTGGCTGTTCGTCACCTACACGTCAATTAAGAAATTCCGATAATAACAACTCTGCTCCACTGTGCTTTCGCGCTCGAATGATGCTCAAAAAATGAAACAAGAACGATTT
TCTTCCAATTGGTCGTATCGAAATCCTGTTCGTATGGTGTCCGTTTGATGAAATTAGGATCCAGAGTAACAAATTCGACTCTGTGAACTCGAGTTATCCGGCATTGCGAA
GTAGAAAGAAGCCCTAGAATTCAAGGGAAGTCTGATATGGATGCCAAGGATATCTTGGGCTTGCCCAAAAATACGCTTCATATACCCCACGAGAAGAAATCTAGGGCTCA
GAAAGATGCCCAGAGAAAGCGAGATGGAATTTCCCGCGAGGTTTATGCGCTTACTGGTGGTCTGGCTCCTATTATGCCGGCAATCGATACATCTGAGCTGAAGAAACAAC
CTCCATCAGATGAAAAGGTTACGTTCGGCACGAAATAGAGGCTGTTGCTTCATGCTAGAGGAGTTCTATGGCAGGCCTGTTTCATAGGTTGTTTGCCTCGAGAGTAGGAG
AATTTTTAGAGGTGTCGAGCGATCTTTCGTTGAGATTACGTGGCAGTGGCTTCCTTTTGCAAATTCTGCTAGAAAAGATAACTTGCAGCTTTACCATTGGGTTAGAGTTG
TAAATGGCATTCCACCAACAGGTGATTATTCCTTTGCAAAGTACAACAAGTCTGTTGAAATTGTCAAATACACGGATGAGGATTACGAAAAGTATTTGAATGAACCTTCA
TGGACAAAGGAGGAGACGGATCAATTATTTGACTTGTGTGAACGGTTTGATCTTCGCTTCGTTGTGATCGCTGACAGGTTTCCATCAACAAGGACAGTGGAGGAACTAAA
GGAGCGATATTATCGTGCATCAAGAGCAATTTTGGCAGCTAGAGAACCAATGCCTCGGGAAAGTTCAGGGAATCCTCTCGCCAAGGATCCTTACAATGTCTCACAAGAGA
TTGAGCGCAAACGGGCATTGTCCATGGTTCTCTCCCAAACAAAACAGCAAGAACGAAAAGATGCAGAGGTTCTAGCTGAAGCAAAAAAGATAACTGAATCTCGCAGAGAT
GAAAGAGTGCCTGAAAAATCTGAGCTGCCTGTCACTTCAAATGCTGTCCCTGGAGCTACTGAAAGGGCTGTCGTTCCTGGAGATTCTGTACCATCTGTATCCAATGTGCA
GCCCCCTACTCCAGCAGCTGCACCTTCAACTTTAGTGGCGGATAATGCTTCTACTCTTGCTTCTCTTCGCATGCTTCCTGTATACTTGAGAACGTATGCACTTGAGCAAA
TGGTACAAGCTGCAAGTTCATCTGCAGGGCTTCGGACTATCAAGCGAGTTGAACAAACATTACAAGATCTTTCGGTTAATCTAAAGCCGAGGGTTCCAACAAAATCAGTG
TGTGCAGAACATCTTGAATTGAGGAAAGAAATATTGACTCTACTGAATCTTCAAAAGCAGTTGCAAAATAAGGAGGCAGAAGGTTCATCTTTCCGTGAGGGTCCCTACAA
CGAGGCACCTGGCACACCCAAGGATCGTAGTTTTATTCCTGATTCTATGAGTTCTGGAGGGGAAAGGTCTGGCAAACGGGATCAGAAACGTAAGGCAACTGGGAGATTAT
CTGAAGCTCCATCACCACCAGCTCAAACTAAAAGGCCAAGAAAACAGAAAGGATCTGATCTGTGATTCTCCAGGCGTGTAAGAAATTTATGGTTGTATCTCTAGATTGTA
TGCGCCTAACCACCCTTTTTTAATAGAAAGAAAAATGAAAAGAACACAAGTCCATGAAGTGGAACCTCCTTTTGTCCTTGTTTCTATTGAATCATTGATGGTAGGTTTAA
CATATGTAAATCTATGATGGCTGCATATGGTTTCATCATCTTACGATATCAATTACAATTGTATGCATCTAAAAGGCTCAAGCTCGAGCAGGTCACTTACGCTGGAAGGC
ACATCTTCATCACAAAGCCAAAGCTCTGGTGTTGTAGGTCTATTAGCTAGACTTGTCTTTCTTCTGCTGGCACAGTTAGCAGGATAGCTCCTCTAGTATGATGGTGCATT
TTCTGTATATCTATCCAGAAATGAGGAAAAATTACTTTCTTGTATATTACTTTCATTTTTATGGTGCATTTTTGTGTTTTTTACAATCACTTCCTTCACATAAGCACATT
GAGATCTCTACATTCCTATTCTGAAAAGATGAGCACCAGCAACATTTCATAAATGCAGCAGCTATACATAATTCCCCTGCTTCAATAACTTAGGAAAAGGAGAGTTTGTA
TTATCTGGAGTTCTTTCTTGAATAGCTGCTTTAATTCCTGTGCCGATTAGATTCAAGGCCCTCATACTTCAAGAGTCAGTCAGCCACTGATGATCAATATTACTTCCACG
CTCTTGCTGTAGTTCCTTATGGCGAGCCTCAACATGCCAAGGTGACTATTTATTGCCTCTGTTTATGCAACATTCAGCATTAATACTAAATGCCCGACTCATTGCAGATA
ACTCACCTTGAATGTCGTCTCATTTGGTTCTTCATCATATGCTTGGCGTATTTCCTTCACGAAATCTCCGAACCTTTCCTTCCATCCCCGTTCAATTTCCATCTTCAGGG
TGCTTTTTGAACTGGGGCAGCTATCGCTTGCGCCTGAAGATCGGTTCAGACTTGGTTAAGTCTTTAGTTACGAAAAGGTTATCGTTGTTTTCACTTTCAACAACTTACTT
GTTCGAAGGTGAGAATAAGTGTCATATTAATCGCATTACGAGTCTAGAAAGAGTGTAGTTGGGACATAAACTGGATTAACTCGGATTAACTTCTATACTATAAGTTCAAT
TTATCTCAAGTTAGTA
Protein sequence
MPRESSGNPLAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRDERVPEKSELPVTSNAVPGATERAVVPGDSVPSVSNVQPPTPAAAPSTLVADNA
STLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKSVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFREGPYNEAPGTPKDRSFIPDSM
SSGGERSGKRDQKRKATGRLSEAPSPPAQTKRPRKQKGSDL
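
As a consistency check between the CDS and protein sequences listed above, the following minimal sketch translates a CDS with the standard genetic code. Using the standard code is an assumption (reasonable for a plant nuclear gene, but not stated in this record), and the `translate` helper is a hypothetical name. Only the first 30 and last 24 nucleotides of the CDS are used here to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGCCTCGGGAAAGTTCAGGGAATCCTCTC"  # first 30 nt of the CDS
cds_end = "AAACAGAAAGGATCTGATCTGTGA"          # last 24 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten and last seven residues of the protein sequence (MPRESSGNPL and KQKGSDL). Passing the full CDS, with its line breaks, to `translate` would allow the same check over the whole sequence.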