; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh09G006290 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh09G006290
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionProtein of unknown function (DUF1677)
Genome locationCma_Chr09:2961800..2965542
RNA-Seq ExpressionCmaCh09G006290
SyntenyCmaCh09G006290
Gene Ontology termsNA
InterPro domainsIPR012876 - Protein of unknown function DUF1677, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591728.1 hypothetical protein SDJN03_14074, partial [Cucurbita argyrosperma subsp. sororia]4.3e-19784.88Show/hide
Query:  NEQQVFLRRAVSDLSEEIERHKLSINEEVEEIKMSRI-----------EEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERAL
        NEQQVFLRRAVSDLS+EIERHKLSINEEVEE+KMSRI           EEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERAL
Subjt:  NEQQVFLRRAVSDLSEEIERHKLSINEEVEEIKMSRI-----------EEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERAL

Query:  KFPSISIGQAVEFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKIQLKPFPSQAIMGDPSNRSMEENGPTDEF
        KFP ISIGQAVEFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRS    P         +    D  N + EENGPTD F
Subjt:  KFPSISIGQAVEFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKIQLKPFPSQAIMGDPSNRSMEENGPTDEF

Query:  NKAQEKNGPTDEFNKAQEENGPTDGFNKAQEENGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEDNGPTDGFDKAQEENGPTDEFNKAQEENGP
        N+AQE+NGPTD FNKAQEENGPTDGFNKAQEE GPTDGF+KAQEEN PTD FN+AQEENGPTDGFNKAQE+NGPTDGF+KAQEENGPTD FN+AQEENGP
Subjt:  NKAQEKNGPTDEFNKAQEENGPTDGFNKAQEENGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEDNGPTDGFDKAQEENGPTDEFNKAQEENGP

Query:  TDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDGFNNAQE
        TDGF++AQEENGPTD FNKAQEENGPTDGF+KAQEENGPTD FN+AQEENGPTDGF++AQEENGPTD FNKAQEENGPTDGF+KAQEE GPTDGFN AQE
Subjt:  TDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDGFNNAQE

Query:  KNGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEENGP
        +NGPTDGF++AQEEN PTD FNKAQEENGPTDGFNKAQEENGP
Subjt:  KNGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEENGP

XP_022975749.1 RNA polymerase-associated protein LEO1-like [Cucurbita maxima]2.0e-138100Show/hide
Query:  MGDPSNRSMEENGPTDEFNKAQEKNGPTDEFNKAQEENGPTDGFNKAQEENGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEDNGPTDGFDKAQ
        MGDPSNRSMEENGPTDEFNKAQEKNGPTDEFNKAQEENGPTDGFNKAQEENGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEDNGPTDGFDKAQ
Subjt:  MGDPSNRSMEENGPTDEFNKAQEKNGPTDEFNKAQEENGPTDGFNKAQEENGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEDNGPTDGFDKAQ

Query:  EENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGF
        EENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGF
Subjt:  EENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGF

Query:  DKAQEENGPTDGFNNAQEKNGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEENGPK
        DKAQEENGPTDGFNNAQEKNGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEENGPK
Subjt:  DKAQEENGPTDGFNNAQEKNGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEENGPK

XP_022976231.1 uncharacterized protein LOC111476691 [Cucurbita maxima]1.6e-82100Show/hide
Query:  NEQQVFLRRAVSDLSEEIERHKLSINEEVEEIKMSRIEEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQAV
        NEQQVFLRRAVSDLSEEIERHKLSINEEVEEIKMSRIEEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQAV
Subjt:  NEQQVFLRRAVSDLSEEIERHKLSINEEVEEIKMSRIEEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQAV

Query:  EFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKIQ
        EFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKIQ
Subjt:  EFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKIQ

XP_023534820.1 uncharacterized protein LOC111796448 [Cucurbita pepo subsp. pepo]2.1e-7996.34Show/hide
Query:  NEQQVFLRRAVSDLSEEIERHKLSINEEVEEIKMSRI--EEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQ
        NEQQVFLRRAVSDLS+EIERHKLSINEEVEE+KMSRI  EEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFP ISIGQ
Subjt:  NEQQVFLRRAVSDLSEEIERHKLSINEEVEEIKMSRI--EEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQ

Query:  AVEFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKIQ
        AVEFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKI+
Subjt:  AVEFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKIQ

XP_023536145.1 uncharacterized protein LOC111797391 [Cucurbita pepo subsp. pepo]1.8e-9986.61Show/hide
Query:  MGDPSNRSMEENGPTDEFNKAQEKNGPTDEFNKAQEENGPTDGFNKAQEENGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEDNGPTDGFDKAQ
        MGD SNRSMEENGPTD FNKAQE+NGPTD FNKAQEENGPTDGFNKAQEENGPTDGF+KAQEEN PTD FNKAQEENGPTDGFNKAQE+NGPTDGF+KAQ
Subjt:  MGDPSNRSMEENGPTDEFNKAQEKNGPTDEFNKAQEENGPTDGFNKAQEENGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEDNGPTDGFDKAQ

Query:  EENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGF
        E+NGPTD FNKAQE+NGPTDGF+KAQE+NGPTD FNKAQE+NGPTDGF+KAQE+NGPTD FNKAQE+NGPTDGF+KAQE+NGPTD FNKAQE+NGPTDGF
Subjt:  EENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGF

Query:  DKAQEENGPTDGFNNAQEKNGPTD
        +KAQE+NGPTDGFN AQE NGP++
Subjt:  DKAQEENGPTDGFNNAQEKNGPTD

TrEMBL top hitse value%identityAlignment
A0A1S3CRY2 uncharacterized protein LOC1035040071.5e-4662.57Show/hide
Query:  EQQVFLRRAV-SDLSEEIERHKLSI----NEEVEEI--KMSRI--------EEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSG-KWVCGLCSEAVK
        ++ V +R+AV SDLS+EIE++K+ I    N EVEE+   M+R         EEEEE  +E+A+C CCGLKEECTK YILEVQ  FSG KWVCGLCSEAVK
Subjt:  EQQVFLRRAV-SDLSEEIERHKLSI----NEEVEEI--KMSRI--------EEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSG-KWVCGLCSEAVK

Query:  ERALKFPSISIGQAVEFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKIQLK
        ER LKFP+ +I +A+E HREFCD+FN TTRLNPKLSLTTSMR+IARKSFE RT++     +  N+LSRSVSCDP+I L+
Subjt:  ERALKFPSISIGQAVEFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKIQLK

A0A6J1C661 uncharacterized protein LOC1110077388.5e-5067.47Show/hide
Query:  NLNEQQVFLRRAVSDLSEEIERHKLSINEEVEEIKMSRIEEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQ
        N N+Q+V +R+AVSDLS+EIER  L    EVEE+  SR+EEEEEV   Q EC CCGLKEECT  YILE+Q  F+GKWVCGLCSEAVKER L+FP+  I +
Subjt:  NLNEQQVFLRRAVSDLSEEIERHKLSINEEVEEIKMSRIEEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQ

Query:  AVEFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKR-THDSTDFGSSSNRLSRSVSCDPKIQL
        A+ FHREF  AFN TTRLNPKLSLTTSMR+IARKSF+KR +   +DFGSS+ +LSRS+SCDP+I+L
Subjt:  AVEFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKR-THDSTDFGSSSNRLSRSVSCDPKIQL

A0A6J1FAH1 uncharacterized protein LOC1114435616.7e-7996.32Show/hide
Query:  NEQQVFLRRAVSDLSEEIERHKLSINEEVEEIKMSRI-EEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQA
        NEQQVFL RAVSDLS+EIERHKLSINEEVEE+KMSRI EEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFP ISIGQA
Subjt:  NEQQVFLRRAVSDLSEEIERHKLSINEEVEEIKMSRI-EEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQA

Query:  VEFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKIQ
        VEFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKI+
Subjt:  VEFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKIQ

A0A6J1IGD0 uncharacterized protein LOC1114766917.6e-83100Show/hide
Query:  NEQQVFLRRAVSDLSEEIERHKLSINEEVEEIKMSRIEEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQAV
        NEQQVFLRRAVSDLSEEIERHKLSINEEVEEIKMSRIEEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQAV
Subjt:  NEQQVFLRRAVSDLSEEIERHKLSINEEVEEIKMSRIEEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQAV

Query:  EFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKIQ
        EFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKIQ
Subjt:  EFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKIQ

A0A6J1ILH4 RNA polymerase-associated protein LEO1-like9.8e-139100Show/hide
Query:  MGDPSNRSMEENGPTDEFNKAQEKNGPTDEFNKAQEENGPTDGFNKAQEENGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEDNGPTDGFDKAQ
        MGDPSNRSMEENGPTDEFNKAQEKNGPTDEFNKAQEENGPTDGFNKAQEENGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEDNGPTDGFDKAQ
Subjt:  MGDPSNRSMEENGPTDEFNKAQEKNGPTDEFNKAQEENGPTDGFNKAQEENGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEDNGPTDGFDKAQ

Query:  EENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGF
        EENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGF
Subjt:  EENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGF

Query:  DKAQEENGPTDGFNNAQEKNGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEENGPK
        DKAQEENGPTDGFNNAQEKNGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEENGPK
Subjt:  DKAQEENGPTDGFNNAQEKNGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEENGPK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G72510.1 Protein of unknown function (DUF1677)8.2e-1334.97Show/hide
Query:  VEEIKMSRIEEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPS-ISIGQAVEFHREFCDAFNATT-RLNPKLSLTTSM
        V  + +++I  +E    +   C CCGL EECT+ YI  V+  + GKW+CGLCSEAVK   ++    ++  +A+  H   C+ F +++   NP   L ++M
Subjt:  VEEIKMSRIEEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPS-ISIGQAVEFHREFCDAFNATT-RLNPKLSLTTSM

Query:  RRIARKSFE----------KRTHDSTDFGSS--SNRLSRSVSC
        R+I RKS +            + D  D      SN LSRS SC
Subjt:  RRIARKSFE----------KRTHDSTDFGSS--SNRLSRSVSC

AT1G79770.1 Protein of unknown function (DUF1677)6.1e-1636.55Show/hide
Query:  KLSINEEVEEIKMSRIEEEEE-----VVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQAVEFHREFCDAFNATTRL
        K S++  + +I +    EEE+     V +E+A+C CCG++EECT  YI  V+  F GKW+CGLCSEAVKE   K     +  A++ H   C  FN   R 
Subjt:  KLSINEEVEEIKMSRIEEEEE-----VVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQAVEFHREFCDAFNATTRL

Query:  NPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKI
         P L    +MR + R+S   ++            +SR+ SC P I
Subjt:  NPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKI

AT2G25780.1 Protein of unknown function (DUF1677)1.8e-3145.57Show/hide
Query:  LRRAVSDLSEEIERHKLSINEEVEEIKMSRIEEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIG--QAVEFHR
        LR+A SD+S E +R+ +S N+E         E+E EV + Q +C CCG++EECT  YI +V+  +SG WVCGLC E V ER  K P I+ G  +A ++H+
Subjt:  LRRAVSDLSEEIERHKLSINEEVEEIKMSRIEEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIG--QAVEFHR

Query:  EFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKIQ
          CDAFN+TTR+NPKL  T SMR IA++S + R       GS   +++R++SCDP+++
Subjt:  EFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKIQ

AT3G22540.1 Protein of unknown function (DUF1677)1.6e-1639.62Show/hide
Query:  IEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQAVEFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTD-
        IE   C CCGL E+CT+ YI EV++ F  KW+CGLCSEAV++   +    ++ +AV+ H  FC  F    + NP + +   MR++ R+   + T+ ST  
Subjt:  IEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQAVEFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTD-

Query:  -FGSSS
         FG S+
Subjt:  -FGSSS

AT4G14819.1 Protein of unknown function (DUF1677)2.3e-1542.86Show/hide
Query:  IEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQAVEFHREFCDAFNATTRLNPKLSLTTSMRR-IARKSFEKRTHDSTD
        IE   C CCGL E+CT+ YI +V+  F+GKW+CGLCSEAV +   +  S ++ +AV  H  FC  FNA    NP   +   MR+ + R+S E     S  
Subjt:  IEQAECRCCGLKEECTKPYILEVQTFFSGKWVCGLCSEAVKERALKFPSISIGQAVEFHREFCDAFNATTRLNPKLSLTTSMRR-IARKSFEKRTHDSTD

Query:  FGSSS
        FG S+
Subjt:  FGSSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCCATATGGCATCTCCATCAATATCCAAATTGCAACCCCAAATTCATTGCGTAATCCGATGGAGGCTAGTGGTGCCAATTTGAACGAGCAGCAGGTATTT
CTTCGTAGAGCGGTATCGGATTTAAGCGAGGAGATCGAACGGCACAAACTGAGTATCAACGAAGAAGTAGAAGAGATCAAGATGAGCAGAATTGAAGAAGAAGAA
GAAGTAGTGATAGAACAGGCTGAATGCAGGTGCTGTGGGCTCAAAGAGGAGTGCACCAAACCCTACATTTTAGAGGTCCAAACCTTCTTCTCAGGGAAATGGGTT
TGTGGGCTCTGCTCTGAAGCTGTTAAAGAGAGAGCCTTGAAGTTTCCATCCATAAGCATTGGGCAAGCTGTGGAGTTTCACAGGGAGTTTTGTGATGCATTCAAC
GCCACCACAAGGCTTAACCCTAAACTCTCCTTGACCACTTCCATGAGAAGGATTGCTAGAAAAAGCTTTGAGAAACGAACCCATGATTCCACTGATTTTGGCTCT
TCTTCCAATAGGCTCTCTCGAAGTGTTAGCTGTGATCCTAAGATTCAATTGAAACCCTTCCCTTCCCAAGCAATCATGGGTGATCCTTCCAATAGGAGTATGGAA
GAGAATGGGCCTACTGATGAATTTAATAAGGCCCAGGAAAAGAATGGGCCTACTGATGAATTTAATAAGGCCCAGGAAGAGAATGGGCCTACTGATGGATTTAAT
AAAGCCCAGGAAGAGAATGGGCCTACTGATGGATTTGATAAAGCCCAGGAAGAGAATGAGCCTACTGATGAATTTAATAAGGCCCAGGAAGAGAATGGGCCTACT
GATGGGTTTAATAAAGCCCAGGAAGATAATGGGCCTACTGATGGATTTGATAAAGCCCAGGAAGAGAATGGGCCTACTGATGAATTTAATAAGGCCCAGGAAGAG
AATGGGCCTACTGATGGATTTGATAAAGCCCAGGAAGAGAATGGGCCTACTGATGAATTTAATAAGGCCCAGGAAGAGAATGGGCCTACTGATGGATTTGATAAA
GCCCAGGAAGAGAATGGGCCTACTGATGAATTTAATAAGGCCCAGGAAGAGAATGGGCCTACTGATGGATTTGATAAAGCCCAGGAAGAGAATGGGCCTACTGAT
GAATTTAATAAGGCCCAGGAAGAGAATGGGCCTACTGATGGATTTGATAAAGCCCAGGAAGAGAATGGGCCTACTGATGGATTTAATAACGCCCAGGAAAAGAAT
GGGCCTACTGATGGATTTGATAAAGCCCAGGAAGAGAATGAGCCTACTGATGAATTTAATAAGGCCCAGGAAGAGAATGGGCCTACTGATGGGTTTAATAAAGCC
CAGGAAGAGAACGGGCCTAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCCCATATGGCATCTCCATCAATATCCAAATTGCAACCCCAAATTCATTGCGTAATCCGATGGAGGCTAGTGGTGCCAATTTGAACGAGCAGCAGGTATTT
CTTCGTAGAGCGGTATCGGATTTAAGCGAGGAGATCGAACGGCACAAACTGAGTATCAACGAAGAAGTAGAAGAGATCAAGATGAGCAGAATTGAAGAAGAAGAA
GAAGTAGTGATAGAACAGGCTGAATGCAGGTGCTGTGGGCTCAAAGAGGAGTGCACCAAACCCTACATTTTAGAGGTCCAAACCTTCTTCTCAGGGAAATGGGTT
TGTGGGCTCTGCTCTGAAGCTGTTAAAGAGAGAGCCTTGAAGTTTCCATCCATAAGCATTGGGCAAGCTGTGGAGTTTCACAGGGAGTTTTGTGATGCATTCAAC
GCCACCACAAGGCTTAACCCTAAACTCTCCTTGACCACTTCCATGAGAAGGATTGCTAGAAAAAGCTTTGAGAAACGAACCCATGATTCCACTGATTTTGGCTCT
TCTTCCAATAGGCTCTCTCGAAGTGTTAGCTGTGATCCTAAGATTCAATTGAAACCCTTCCCTTCCCAAGCAATCATGGGTGATCCTTCCAATAGGAGTATGGAA
GAGAATGGGCCTACTGATGAATTTAATAAGGCCCAGGAAAAGAATGGGCCTACTGATGAATTTAATAAGGCCCAGGAAGAGAATGGGCCTACTGATGGATTTAAT
AAAGCCCAGGAAGAGAATGGGCCTACTGATGGATTTGATAAAGCCCAGGAAGAGAATGAGCCTACTGATGAATTTAATAAGGCCCAGGAAGAGAATGGGCCTACT
GATGGGTTTAATAAAGCCCAGGAAGATAATGGGCCTACTGATGGATTTGATAAAGCCCAGGAAGAGAATGGGCCTACTGATGAATTTAATAAGGCCCAGGAAGAG
AATGGGCCTACTGATGGATTTGATAAAGCCCAGGAAGAGAATGGGCCTACTGATGAATTTAATAAGGCCCAGGAAGAGAATGGGCCTACTGATGGATTTGATAAA
GCCCAGGAAGAGAATGGGCCTACTGATGAATTTAATAAGGCCCAGGAAGAGAATGGGCCTACTGATGGATTTGATAAAGCCCAGGAAGAGAATGGGCCTACTGAT
GAATTTAATAAGGCCCAGGAAGAGAATGGGCCTACTGATGGATTTGATAAAGCCCAGGAAGAGAATGGGCCTACTGATGGATTTAATAACGCCCAGGAAAAGAAT
GGGCCTACTGATGGATTTGATAAAGCCCAGGAAGAGAATGAGCCTACTGATGAATTTAATAAGGCCCAGGAAGAGAATGGGCCTACTGATGGGTTTAATAAAGCC
CAGGAAGAGAACGGGCCTAAGTGA
Protein sequenceShow/hide protein sequence
MVPYGISINIQIATPNSLRNPMEASGANLNEQQVFLRRAVSDLSEEIERHKLSINEEVEEIKMSRIEEEEEVVIEQAECRCCGLKEECTKPYILEVQTFFSGKWV
CGLCSEAVKERALKFPSISIGQAVEFHREFCDAFNATTRLNPKLSLTTSMRRIARKSFEKRTHDSTDFGSSSNRLSRSVSCDPKIQLKPFPSQAIMGDPSNRSME
ENGPTDEFNKAQEKNGPTDEFNKAQEENGPTDGFNKAQEENGPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEDNGPTDGFDKAQEENGPTDEFNKAQEE
NGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDEFNKAQEENGPTDGFDKAQEENGPTDGFNNAQEKN
GPTDGFDKAQEENEPTDEFNKAQEENGPTDGFNKAQEENGPK