; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh06G006070 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh06G006070
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionProtein CYCLOPS
Genome locationCmo_Chr06:2988239..2996174
RNA-Seq ExpressionCmoCh06G006070
SyntenyCmoCh06G006070
Gene Ontology termsGO:0036377 - arbuscular mycorrhizal association (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR040036 - Protein CYCLOPS


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596693.1 Protein CYCLOPS, partial [Cucurbita argyrosperma subsp. sororia]1.3e-27799.81Show/hide
Query:  MLKFSINNSATKPQSKDDVKNPNQKPHNSFVDRTEILAIGPFHVAITIWLYLNVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQS
        MLKFSINNSATKPQSKDDVKNPNQKPHNSFVDRTEILAIGPFHVAITIWLYLNVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQS
Subjt:  MLKFSINNSATKPQSKDDVKNPNQKPHNSFVDRTEILAIGPFHVAITIWLYLNVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQS

Query:  ASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQ
        ASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQ
Subjt:  ASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQ

Query:  PVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEVSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSA
        PVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEVSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSA
Subjt:  PVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEVSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSA

Query:  ELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTHSNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRR
        ELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTHSNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRR
Subjt:  ELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTHSNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRR

Query:  SVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLL
        SVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLL
Subjt:  SVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLL

Query:  EEIEKILSETGRI
        EEIEKILSETGR+
Subjt:  EEIEKILSETGRI

XP_022961436.1 uncharacterized protein LOC111461994 isoform X1 [Cucurbita moschata]2.6e-24699.78Show/hide
Query:  NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR
        NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR
Subjt:  NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR

Query:  YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE
        YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE
Subjt:  YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE

Query:  VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH
        VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH
Subjt:  VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH

Query:  SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK
        SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK
Subjt:  SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK

Query:  RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI
        RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGR+
Subjt:  RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI

XP_022961508.1 uncharacterized protein LOC111461994 isoform X2 [Cucurbita moschata]2.5e-24499.57Show/hide
Query:  NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR
        NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR
Subjt:  NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR

Query:  YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE
        YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE
Subjt:  YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE

Query:  VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH
        VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH
Subjt:  VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH

Query:  SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK
        SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRK RSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK
Subjt:  SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK

Query:  RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI
        RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGR+
Subjt:  RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI

XP_023540450.1 uncharacterized protein LOC111800819 isoform X1 [Cucurbita pepo subsp. pepo]4.8e-24097.84Show/hide
Query:  NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR
        NVNNSSGLAYRTRQ SRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGE QGSNLYLAKAWFHSSQPMTRSRSSELRRR
Subjt:  NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR

Query:  YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE
        YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQ+FIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVD+ISSVVNMLKGTLERKKLNNQIEKVAPE
Subjt:  YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE

Query:  VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH
        VSSNARFGNLSFDRSSDSYIHQ+PNSLEEFSTVQVVDNRILQTVEGSAEL FDAFVYPVNPIQSGRVSQEP QSESSAAAAVVSSGFEACDGLSNSTLTH
Subjt:  VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH

Query:  SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAAS-VDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVL
        SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGS TSAAS VDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVL
Subjt:  SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAAS-VDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVL

Query:  KRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI
        KRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLA+EKECLLEEIEKILSETGRI
Subjt:  KRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI

XP_023540452.1 uncharacterized protein LOC111800819 isoform X2 [Cucurbita pepo subsp. pepo]2.0e-24198.05Show/hide
Query:  NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR
        NVNNSSGLAYRTRQ SRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGE QGSNLYLAKAWFHSSQPMTRSRSSELRRR
Subjt:  NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR

Query:  YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE
        YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQ+FIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVD+ISSVVNMLKGTLERKKLNNQIEKVAPE
Subjt:  YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE

Query:  VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH
        VSSNARFGNLSFDRSSDSYIHQ+PNSLEEFSTVQVVDNRILQTVEGSAEL FDAFVYPVNPIQSGRVSQEP QSESSAAAAVVSSGFEACDGLSNSTLTH
Subjt:  VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH

Query:  SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK
        SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGS TSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK
Subjt:  SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK

Query:  RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI
        RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLA+EKECLLEEIEKILSETGRI
Subjt:  RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI

TrEMBL top hitse value%identityAlignment
A0A6J1E5R2 uncharacterized protein LOC111430970 isoform X51.9e-19477.67Show/hide
Query:  DVKNPNQKPHNSFVDRTEIL---AIGPFHVAITIWLYLNVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQT
        DVKNPN+KP NS VDRT  L   AIG FHVA+TIWLYLNVNNSSG AYR RQ SRRI+SE V+LSSPQYV KHQKRISNDIL PQSASVADDV+G NQQ 
Subjt:  DVKNPNQKPHNSFVDRTEIL---AIGPFHVAITIWLYLNVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQT

Query:  FRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASM
        FR++V+GE QGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGI+S++DLSGH   AMKLD+  SQ F D STCE+PNQPV F S SNSSAS 
Subjt:  FRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASM

Query:  FNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEVSSNARF------GNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAF-
        FNA  MYDVDKISSVVNMLK T+ERKKLNNQI+K A E SSNA F      G+ SF+RSS++Y+HQ PN L E S VQV DNRIL+TV  S ++ F AF 
Subjt:  FNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEVSSNARF------GNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAF-

Query:  --VYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTHSNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRY
          V PVNP+QSGRVSQEP QSESSAAAAVVSSGFEA DG SNS  THSN  SSRKQ+      ENG SRSK       DFRERIIDNLKDDRKR S++R 
Subjt:  --VYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTHSNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRY

Query:  GSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEK
        GSVTSA SVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLKRCD LEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLA+EKE LLEEIE+
Subjt:  GSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEK

Query:  ILSETG
        ILSETG
Subjt:  ILSETG

A0A6J1E8X4 uncharacterized protein LOC111430970 isoform X46.7e-19577.36Show/hide
Query:  DVKNPNQKPHNSFVDRT-----EILAIGPFHVAITIWLYLNVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQ
        DVKNPN+KP NS VDRT     E +AIG FHVA+TIWLYLNVNNSSG AYR RQ SRRI+SE V+LSSPQYV KHQKRISNDIL PQSASVADDV+G NQ
Subjt:  DVKNPNQKPHNSFVDRT-----EILAIGPFHVAITIWLYLNVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQ

Query:  QTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSA
        Q FR++V+GE QGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGI+S++DLSGH   AMKLD+  SQ F D STCE+PNQPV F S SNSSA
Subjt:  QTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSA

Query:  SMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEVSSNARF------GNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDA
        S FNA  MYDVDKISSVVNMLK T+ERKKLNNQI+K A E SSNA F      G+ SF+RSS++Y+HQ PN L E S VQV DNRIL+TV  S ++ F A
Subjt:  SMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEVSSNARF------GNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDA

Query:  F---VYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTHSNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVI
        F   V PVNP+QSGRVSQEP QSESSAAAAVVSSGFEA DG SNS  THSN  SSRKQ+      ENG SRSK       DFRERIIDNLKDDRKR S++
Subjt:  F---VYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTHSNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVI

Query:  RYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEI
        R GSVTSA SVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLKRCD LEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLA+EKE LLEEI
Subjt:  RYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEI

Query:  EKILSETG
        E+ILSETG
Subjt:  EKILSETG

A0A6J1HAD0 uncharacterized protein LOC111461994 isoform X11.3e-24699.78Show/hide
Query:  NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR
        NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR
Subjt:  NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR

Query:  YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE
        YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE
Subjt:  YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE

Query:  VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH
        VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH
Subjt:  VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH

Query:  SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK
        SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK
Subjt:  SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK

Query:  RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI
        RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGR+
Subjt:  RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI

A0A6J1HC08 uncharacterized protein LOC111461994 isoform X21.2e-24499.57Show/hide
Query:  NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR
        NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR
Subjt:  NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR

Query:  YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE
        YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE
Subjt:  YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE

Query:  VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH
        VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH
Subjt:  VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH

Query:  SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK
        SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRK RSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK
Subjt:  SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK

Query:  RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI
        RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGR+
Subjt:  RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI

A0A6J1L1E0 uncharacterized protein LOC111498273 isoform X19.5e-23495.66Show/hide
Query:  NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR
        NVNNSSGLAYRTRQ SRRIASE+VELSSPQYVAKHQKRISNDIL+PQSASVADDVAG NQQTFRDVV+GE QGSNLYLAKAWFHSSQPMTRSRSSELRRR
Subjt:  NVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRR

Query:  YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE
        YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQ+FIDPSTCEIPNQPV FVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE
Subjt:  YAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE

Query:  VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH
         SSNA FGNLSFDRSSDSY+HQ+PNSLEEFSTVQVVDNRILQTVEGSAEL FDAFVYPVNPIQSGRVSQEP QSESSAAAAVVSSGFEAC GLSNSTLTH
Subjt:  VSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTH

Query:  SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK
        SNGASSRKQVAGNQS+ENGSSRSKVSSSGTADFRERIIDNLKDDRK RSVIR GSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK
Subjt:  SNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK

Query:  RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI
        RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLA+EKECLLEEIEKILSETGR+
Subjt:  RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI

SwissProt top hitse value%identityAlignment
A7TUE1 Protein CYCLOPS1.0e-10754.21Show/hide
Query:  NSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAA
        NSS +   +R  S+RI++E   +S+ Q++    +  +ND    Q+  +A+DV+       RD VD E Q SNL+LAKAWF + Q MTRSRSSELRRRY  
Subjt:  NSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAA

Query:  MQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEVSS
        MQ++Q+  G++SM  +  H  N +K ++ N   F   S CE+P+Q   F+S SNSS+S FN   + DVDK+SS V+MLKGTL+RKKL  Q+EK A E   
Subjt:  MQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEVSS

Query:  NARFG-----NLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTL
        N  F        S     +S+  Q   +++   T QV D  ++QT+EG+     D F    N IQ    S EP QSESSAAA V+SSG +AC+G SNS  
Subjt:  NARFG-----NLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTL

Query:  THSNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSV
        T   G SS KQV         S+++KV        RE+I+DNLKDDRKR+S+ RYGSVTSA S  K D TKKRRVERSRKMAEAKERN+TPTIPSDMQ++
Subjt:  THSNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSV

Query:  LKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI
        LKRC+NLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLA+EKE LLEEIE+ILSETG+I
Subjt:  LKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI

A9XMT3 Protein CYCLOPS1.1e-10953.66Show/hide
Query:  NSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAA
        NSS + + +R  S+RI++E V  S+   V       +ND    Q+  +A+DV+G      RD VD E Q SNL+LAKAWF S Q MTRSRSSELRRRY+ 
Subjt:  NSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAA

Query:  MQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEVSS
        MQ+  ++ GIES+     HGA A K ++ N   +   S CE+P+Q   F+S SNS +S FN P   D+DK+SS V+MLKGTL+R++L++Q+EK A E   
Subjt:  MQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEVSS

Query:  NARF------GNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNST
        N  F          FD+  +++ +Q P +++  S  +V D+ +LQT+EGS     D F   +N I  G  S EP QSESS AA V+SSG + C+G  NS 
Subjt:  NARF------GNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNST

Query:  LTHSNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQS
         T     SS KQV  ++S EN  +R K        FRE+I+DNLKDD+KR+S+ RYGS+TSA S DKGD TKKRRVERSRKMAEAKERN TP++PSDMQ+
Subjt:  LTHSNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQS

Query:  VLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI
        VLKRC+NLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNE+LA+EKE LLEEIE+ILSET ++
Subjt:  VLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI

A9XMT4 Protein CYCLOPS9.2e-10953.02Show/hide
Query:  NSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAA
        NS+ +   +R  S+RI++E V +S+ Q+V    +  +ND    Q++ + +DV+G      R+ VD E Q  NL+LAKAWF + Q MTRSRSSELRRRY  
Subjt:  NSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAA

Query:  MQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEVSS
        MQ+ Q+  G++SM     H AN +K ++ +   F   S CEIP+Q   F+S SNSS+S FN   + DVDK+SS V+MLKGTL+RK+L  Q+EK A E   
Subjt:  MQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEVSS

Query:  NARFG------NLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNST
        N  FG         F+   +++ HQ   +++   T QV D  +++T+EG+A    + F    + I  G  S EP QSESSAAA V+SSG +AC+G SNS+
Subjt:  NARFG------NLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNST

Query:  LTHSNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQS
         T  +  SS KQV      E+  +R+K         RE+I+DNLKDDRKR+ + RYGSVTSA S DK D TKKRRVERSRKMAEAKERN+TPTIPSDMQ+
Subjt:  LTHSNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQS

Query:  VLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI
        V+KRC+NLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNE+LA+EKE LLEEIE++LSETG+I
Subjt:  VLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRI

A9XMT5 Protein CYCLOPS2.7e-8448.03Show/hide
Query:  SGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASV-ADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAM
        S +  R RQ S R++SE+    + Q+    Q+    D L PQ+ +V ++     NQQ  ++  +   Q S+L LAKAWFHS+QPMTRSRSSELR+RYAAM
Subjt:  SGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASV-ADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAM

Query:  QSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEVSSN
        QSN      E++       AN ++ D+TN+           P+Q   FVS S+SS S  + P++   D I+SVV+MLK TLERKKL++       + SS 
Subjt:  QSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEVSSN

Query:  ARFGNLSFDRSSDSYIHQVPNSLEEF---STVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTHS
          FG       S  +   +    + F   +T Q+ D+ +L  VE   E     FV P N +  G  S+EP QS SS A    S+GFE CD L       +
Subjt:  ARFGNLSFDRSSDSYIHQVPNSLEEF---STVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSAAAAVVSSGFEACDGLSNSTLTHS

Query:  NGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERII-DNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK
           S+R   A      NG+  +   S G  DFRERI+ +NLKDDRK+ S+ R GS+ S+   DKGDPTKKRRVERSRKMAEAKER+ TP IPSD+Q VLK
Subjt:  NGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERII-DNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMTPTIPSDMQSVLK

Query:  RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSET
        RC+ LEKEVRSLKLNLSFMNRKDSEQTKQIE+LQKQNEDL  EKE LLEEIE+I+S+T
Subjt:  RCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSET

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAAATTTTCTATAAACAATTCAGCAACAAAACCTCAATCGAAGGATGATGTGAAGAATCCGAATCAGAAACCGCATAATTCGTTCGTTGATCGAACAGAGATACT
TGCTATTGGTCCTTTTCACGTCGCCATCACGATCTGGCTCTATCTCAATGTTAACAACTCATCAGGCCTGGCTTATCGCACTCGACAGACATCAAGAAGAATTGCCTCGG
AGTCTGTTGAACTGTCTAGTCCTCAGTATGTGGCAAAACATCAAAAGAGAATAAGCAATGATATATTGCATCCACAAAGTGCCTCCGTGGCTGATGATGTCGCAGGAGCT
AATCAACAAACTTTCAGGGATGTTGTTGATGGTGAAGGGCAAGGTAGTAACCTTTATCTTGCGAAGGCATGGTTCCACAGTTCTCAACCTATGACAAGAAGTCGATCATC
TGAGCTAAGGAGGAGGTATGCTGCAATGCAAAGCAATCAAAGCTCATTCGGTATAGAGTCCATGCATGACTTGTCAGGGCATGGAGCCAACGCGATGAAACTAGATATCA
CAAATTCACAGAATTTCATTGACCCTTCTACTTGTGAGATTCCAAACCAGCCTGTTCAATTTGTATCCACATCCAATTCATCGGCATCAATGTTCAATGCACCAAACATG
TATGATGTAGATAAAATTTCTTCTGTTGTAAACATGCTAAAGGGCACATTAGAACGGAAGAAACTAAATAACCAGATTGAAAAAGTGGCACCGGAGGTTAGTTCAAATGC
ACGTTTTGGCAACCTTAGTTTCGATCGAAGCAGTGATAGTTATATACACCAAGTACCGAACAGTCTCGAGGAATTTTCTACTGTTCAAGTCGTGGATAATAGAATTTTAC
AAACGGTTGAGGGATCAGCAGAGCTCAGTTTCGATGCTTTCGTATATCCTGTAAATCCCATTCAGTCAGGTAGAGTTTCTCAAGAACCTCCTCAAAGTGAATCTTCTGCT
GCTGCAGCAGTAGTTTCATCCGGTTTTGAGGCGTGTGATGGTCTTAGCAACTCAACTCTAACTCATAGCAACGGTGCAAGCTCAAGGAAACAAGTTGCAGGCAATCAGAG
TTTAGAAAATGGATCATCACGATCTAAAGTTTCATCCAGTGGGACAGCAGACTTCAGAGAAAGAATAATAGACAACTTAAAAGATGATAGAAAGAGGAGGAGTGTAATTC
GTTACGGGTCCGTAACATCAGCTGCTTCAGTGGACAAAGGAGATCCCACAAAGAAACGCCGGGTGGAACGATCACGAAAAATGGCAGAGGCGAAGGAGAGGAATATGACA
CCAACTATTCCATCAGATATGCAATCAGTTCTGAAGCGGTGTGACAATCTTGAGAAGGAAGTGCGGTCACTAAAACTCAATTTGTCCTTTATGAATAGGAAGGATTCTGA
GCAGACTAAGCAGATAGAGGATCTTCAGAAGCAGAATGAGGACTTAGCAAATGAAAAAGAATGCCTACTTGAAGAGATTGAGAAGATTCTCTCAGAAACTGGAAGAATTA
ACCAGGGACAGCCAAATTTAATAGATTCTAACAAACCAGTTATCTGGTACCGATCCACTGCAAAAAAAAAGGCTACATTTACTTTCTATTGTTTCAAATGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGAAATTTTCTATAAACAATTCAGCAACAAAACCTCAATCGAAGGATGATGTGAAGAATCCGAATCAGAAACCGCATAATTCGTTCGTTGATCGAACAGAGATACT
TGCTATTGGTCCTTTTCACGTCGCCATCACGATCTGGCTCTATCTCAATGTTAACAACTCATCAGGCCTGGCTTATCGCACTCGACAGACATCAAGAAGAATTGCCTCGG
AGTCTGTTGAACTGTCTAGTCCTCAGTATGTGGCAAAACATCAAAAGAGAATAAGCAATGATATATTGCATCCACAAAGTGCCTCCGTGGCTGATGATGTCGCAGGAGCT
AATCAACAAACTTTCAGGGATGTTGTTGATGGTGAAGGGCAAGGTAGTAACCTTTATCTTGCGAAGGCATGGTTCCACAGTTCTCAACCTATGACAAGAAGTCGATCATC
TGAGCTAAGGAGGAGGTATGCTGCAATGCAAAGCAATCAAAGCTCATTCGGTATAGAGTCCATGCATGACTTGTCAGGGCATGGAGCCAACGCGATGAAACTAGATATCA
CAAATTCACAGAATTTCATTGACCCTTCTACTTGTGAGATTCCAAACCAGCCTGTTCAATTTGTATCCACATCCAATTCATCGGCATCAATGTTCAATGCACCAAACATG
TATGATGTAGATAAAATTTCTTCTGTTGTAAACATGCTAAAGGGCACATTAGAACGGAAGAAACTAAATAACCAGATTGAAAAAGTGGCACCGGAGGTTAGTTCAAATGC
ACGTTTTGGCAACCTTAGTTTCGATCGAAGCAGTGATAGTTATATACACCAAGTACCGAACAGTCTCGAGGAATTTTCTACTGTTCAAGTCGTGGATAATAGAATTTTAC
AAACGGTTGAGGGATCAGCAGAGCTCAGTTTCGATGCTTTCGTATATCCTGTAAATCCCATTCAGTCAGGTAGAGTTTCTCAAGAACCTCCTCAAAGTGAATCTTCTGCT
GCTGCAGCAGTAGTTTCATCCGGTTTTGAGGCGTGTGATGGTCTTAGCAACTCAACTCTAACTCATAGCAACGGTGCAAGCTCAAGGAAACAAGTTGCAGGCAATCAGAG
TTTAGAAAATGGATCATCACGATCTAAAGTTTCATCCAGTGGGACAGCAGACTTCAGAGAAAGAATAATAGACAACTTAAAAGATGATAGAAAGAGGAGGAGTGTAATTC
GTTACGGGTCCGTAACATCAGCTGCTTCAGTGGACAAAGGAGATCCCACAAAGAAACGCCGGGTGGAACGATCACGAAAAATGGCAGAGGCGAAGGAGAGGAATATGACA
CCAACTATTCCATCAGATATGCAATCAGTTCTGAAGCGGTGTGACAATCTTGAGAAGGAAGTGCGGTCACTAAAACTCAATTTGTCCTTTATGAATAGGAAGGATTCTGA
GCAGACTAAGCAGATAGAGGATCTTCAGAAGCAGAATGAGGACTTAGCAAATGAAAAAGAATGCCTACTTGAAGAGATTGAGAAGATTCTCTCAGAAACTGGAAGAATTA
ACCAGGGACAGCCAAATTTAATAGATTCTAACAAACCAGTTATCTGGTACCGATCCACTGCAAAAAAAAAGGCTACATTTACTTTCTATTGTTTCAAATGTTGA
Protein sequenceShow/hide protein sequence
MLKFSINNSATKPQSKDDVKNPNQKPHNSFVDRTEILAIGPFHVAITIWLYLNVNNSSGLAYRTRQTSRRIASESVELSSPQYVAKHQKRISNDILHPQSASVADDVAGA
NQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNM
YDVDKISSVVNMLKGTLERKKLNNQIEKVAPEVSSNARFGNLSFDRSSDSYIHQVPNSLEEFSTVQVVDNRILQTVEGSAELSFDAFVYPVNPIQSGRVSQEPPQSESSA
AAAVVSSGFEACDGLSNSTLTHSNGASSRKQVAGNQSLENGSSRSKVSSSGTADFRERIIDNLKDDRKRRSVIRYGSVTSAASVDKGDPTKKRRVERSRKMAEAKERNMT
PTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRINQGQPNLIDSNKPVIWYRSTAKKKATFTFYCFKC