; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg16287 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg16287
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionProtein CYCLOPS
Genome locationCarg_Chr06:3003021..3012260
RNA-Seq ExpressionCarg16287
SyntenyCarg16287
Gene Ontology termsGO:0036377 - arbuscular mycorrhizal association (biological process)
GO:0005634 - nucleus (cellular component)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR040036 - Protein CYCLOPS


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596693.1 Protein CYCLOPS, partial [Cucurbita argyrosperma subsp. sororia]1.9e-22171.75Show/hide
Query:  MLQLINCLHSRNEDEDFFHPATPNLRSRRLMLKFSINNSATKPQSKDDVKNPNQKPHNSFVDRTGRLVSGPQKAFSSSVRNIMKAPFQRLHRDTCYWSFS
        MLQLINCLHSRNEDEDFFHPATPNLRSRRLMLKFSINNSATKPQSKDDVKNPNQKPHNSFVDRT  L  GP                             
Subjt:  MLQLINCLHSRNEDEDFFHPATPNLRSRRLMLKFSINNSATKPQSKDDVKNPNQKPHNSFVDRTGRLVSGPQKAFSSSVRNIMKAPFQRLHRDTCYWSFS

Query:  RRHHDLALSPTLDTHFLITFVVSSKVLTGLAGCDIAWTNLGIFLSRNDPNIGASIIIKMEGRGVSELYRNASEELFLKSWVENAIGIIASESVELSSSQY
                      H  IT                    + ++L+ N+ +            G++   R  S               IASESVELSS QY
Subjt:  RRHHDLALSPTLDTHFLITFVVSSKVLTGLAGCDIAWTNLGIFLSRNDPNIGASIIIKMEGRGVSELYRNASEELFLKSWVENAIGIIASESVELSSSQY

Query:  VAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDI
        VAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDI
Subjt:  VAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDI

Query:  TNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEFS
        TNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE SSNARFGNLSFDRSSDSYIHQ+PNSLEEFS
Subjt:  TNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEFS

Query:  TVQVVDNRILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQSLKNGSSRPKEEECNS-
        TVQVVDNRILQ  EGSAEL FDAFVYPVNPIQSGRVSQEP QSESSAAAA +SSGFEACDGLSNSTLTHSNGASSR QV GNQSL+NGSSR K     + 
Subjt:  TVQVVDNRILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQSLKNGSSRPKEEECNS-

Query:  -------------------LRVRNISCF------------------RMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIE
                           +R  +++                    +MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIE
Subjt:  -------------------LRVRNISCF------------------RMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIE

Query:  DLQKQNEDLANEKECLLEEIEKILSETGRM
        DLQKQNEDLANEKECLLEEIEKILSETGRM
Subjt:  DLQKQNEDLANEKECLLEEIEKILSETGRM

XP_022961436.1 uncharacterized protein LOC111461994 isoform X1 [Cucurbita moschata]2.2e-20177.44Show/hide
Query:  MEGRGVSELYRNASEELFLKSWVENAIGI---------------------------------------------------IASESVELSSSQYVAKHQKR
        MEGRGVSELYRNASEELFLKSWVENAIG+                                                   IASESVELSS QYVAKHQKR
Subjt:  MEGRGVSELYRNASEELFLKSWVENAIGI---------------------------------------------------IASESVELSSSQYVAKHQKR

Query:  ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI
        ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI
Subjt:  ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI

Query:  DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEFSTVQVVDN
        DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE SSNARFGNLSFDRSSDSYIHQ+PNSLEEFSTVQVVDN
Subjt:  DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEFSTVQVVDN

Query:  RILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQSLKNGSSRPKEEECNS--------
        RILQ  EGSAEL FDAFVYPVNPIQSGRVSQEP QSESSAAAA +SSGFEACDGLSNSTLTHSNGASSR QV GNQSL+NGSSR K     +        
Subjt:  RILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQSLKNGSSRPKEEECNS--------

Query:  ------------LRVRNISCF------------------RMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNE
                    +R  +++                    +MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNE
Subjt:  ------------LRVRNISCF------------------RMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNE

Query:  DLANEKECLLEEIEKILSETGRM
        DLANEKECLLEEIEKILSETGRM
Subjt:  DLANEKECLLEEIEKILSETGRM

XP_022961508.1 uncharacterized protein LOC111461994 isoform X2 [Cucurbita moschata]7.5e-20277.78Show/hide
Query:  MEGRGVSELYRNASEELFLKSWVENAIGI---------------------------------------------------IASESVELSSSQYVAKHQKR
        MEGRGVSELYRNASEELFLKSWVENAIG+                                                   IASESVELSS QYVAKHQKR
Subjt:  MEGRGVSELYRNASEELFLKSWVENAIGI---------------------------------------------------IASESVELSSSQYVAKHQKR

Query:  ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI
        ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI
Subjt:  ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI

Query:  DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEFSTVQVVDN
        DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE SSNARFGNLSFDRSSDSYIHQ+PNSLEEFSTVQVVDN
Subjt:  DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEFSTVQVVDN

Query:  RILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQSLKNGSSRPKEEECNSL-------
        RILQ  EGSAEL FDAFVYPVNPIQSGRVSQEP QSESSAAAA +SSGFEACDGLSNSTLTHSNGASSR QV GNQSL+NGSSR K     +        
Subjt:  RILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQSLKNGSSRPKEEECNSL-------

Query:  ------RVRNISCF------------------------RMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNED
              R R++  +                        +MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNED
Subjt:  ------RVRNISCF------------------------RMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNED

Query:  LANEKECLLEEIEKILSETGRM
        LANEKECLLEEIEKILSETGRM
Subjt:  LANEKECLLEEIEKILSETGRM

XP_023540450.1 uncharacterized protein LOC111800819 isoform X1 [Cucurbita pepo subsp. pepo]3.0e-19876.72Show/hide
Query:  MEGRGVSELYRNASEELFLKSWVENAIGI---------------------------------------------------IASESVELSSSQYVAKHQKR
        MEGR VSE+YRNASEELFLKSWVEN+IG+                                                   IASESVELSS QYVAKHQKR
Subjt:  MEGRGVSELYRNASEELFLKSWVENAIGI---------------------------------------------------IASESVELSSSQYVAKHQKR

Query:  ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI
        ISNDILHPQSASVADDVAGANQQTFRDVVDGE QGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQ+FI
Subjt:  ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI

Query:  DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEFSTVQVVDN
        DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVD+ISSVVNMLKGTLERKKLNNQIEKVAPE SSNARFGNLSFDRSSDSYIHQIPNSLEEFSTVQVVDN
Subjt:  DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEFSTVQVVDN

Query:  RILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQSLKNGSSRPKEEECNS--LRVRNI
        RILQ  EGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA +SSGFEACDGLSNSTLTHSNGASSR QV GNQSL+NGSSR K     +   R R I
Subjt:  RILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQSLKNGSSRPKEEECNS--LRVRNI

Query:  SCF-------------------------------------RMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQN
                                                +MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQN
Subjt:  SCF-------------------------------------RMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQN

Query:  EDLANEKECLLEEIEKILSETGRM
        EDLA+EKECLLEEIEKILSETGR+
Subjt:  EDLANEKECLLEEIEKILSETGRM

XP_023540452.1 uncharacterized protein LOC111800819 isoform X2 [Cucurbita pepo subsp. pepo]2.3e-19876.86Show/hide
Query:  MEGRGVSELYRNASEELFLKSWVENAIGI---------------------------------------------------IASESVELSSSQYVAKHQKR
        MEGR VSE+YRNASEELFLKSWVEN+IG+                                                   IASESVELSS QYVAKHQKR
Subjt:  MEGRGVSELYRNASEELFLKSWVENAIGI---------------------------------------------------IASESVELSSSQYVAKHQKR

Query:  ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI
        ISNDILHPQSASVADDVAGANQQTFRDVVDGE QGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQ+FI
Subjt:  ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI

Query:  DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEFSTVQVVDN
        DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVD+ISSVVNMLKGTLERKKLNNQIEKVAPE SSNARFGNLSFDRSSDSYIHQIPNSLEEFSTVQVVDN
Subjt:  DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEFSTVQVVDN

Query:  RILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQSLKNGSSRPKEEECNS--LRVRNI
        RILQ  EGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA +SSGFEACDGLSNSTLTHSNGASSR QV GNQSL+NGSSR K     +   R R I
Subjt:  RILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQSLKNGSSRPKEEECNS--LRVRNI

Query:  SCF------------------------------------RMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNE
                                               +MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNE
Subjt:  SCF------------------------------------RMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNE

Query:  DLANEKECLLEEIEKILSETGRM
        DLA+EKECLLEEIEKILSETGR+
Subjt:  DLANEKECLLEEIEKILSETGRM

TrEMBL top hitse value%identityAlignment
A0A0A0L144 Uncharacterized protein8.5e-16757.01Show/hide
Query:  TKPQSKDDVKNPNQKPHNSFVDRTGRLVSGPQKAFSSSVR--NIMKAPFQRLHRDTCYWSFSRRHHDLALSPTLDTHFLITFVVSSKVLTGLAGCDIAWT
        TKPQSK+D++N  +KP NS V+R  RL+SG + AF  + R  +I     + L +D C+                     + +V+ +K        D+   
Subjt:  TKPQSKDDVKNPNQKPHNSFVDRTGRLVSGPQKAFSSSVR--NIMKAPFQRLHRDTCYWSFSRRHHDLALSPTLDTHFLITFVVSSKVLTGLAGCDIAWT

Query:  NLGIFLS-RNDPNIGASIIIKMEGRGVSELYRNASEELFLKSWVENAIGI--------------------------------------------------
        N  + L  R   N+G  II+ MEGRGVSELYRNASEELFLKSWVEN+IG+                                                  
Subjt:  NLGIFLS-RNDPNIGASIIIKMEGRGVSELYRNASEELFLKSWVENAIGI--------------------------------------------------

Query:  -IASESVELSSSQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMH
         I+SE V+LSS QYV KHQKRISN     +  S  DD+ G +QQ FR+V + E QGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQS FG+ES++
Subjt:  -IASESVELSSSQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMH

Query:  DLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARF------GNLS
        DLSGHG N+MKLDI NSQ+F D STCE+PNQP  FVS SNSS+S+FN PNMYDVDKISSVVNMLKGTLERKKLNNQI+K APEDSSNA F      GN S
Subjt:  DLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARF------GNLS

Query:  FDRSSDSYIHQIPNSLEEFSTVQVVDNRILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAAI-SSGFEACDGLSNSTLTHSNGASSRTQVT
        F+R SD+YIH IPNS  +FS VQV D+RIL+  E SA+LGF+AFV PVNPIQSGRVSQEPSQSESSAAAAI SSGFEACDG SNS  THSNG SSR QV 
Subjt:  FDRSSDSYIHQIPNSLEEFSTVQVVDNRILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAAI-SSGFEACDGLSNSTLTHSNGASSRTQVT

Query:  GNQSLKNGS-----------------------------SRPKEEECNSLRVRNISCFR-MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFM
        G++ L+N S                             S    ++ +  + R +   R MAEAKERNMTPTIPSD+QSV+KRC+ LEKEVRSLKLNLSFM
Subjt:  GNQSLKNGS-----------------------------SRPKEEECNSLRVRNISCFR-MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFM

Query:  NRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRM
        NRKDSEQTKQIEDLQKQN+DLA+EKE LLEEIE+I+SETGRM
Subjt:  NRKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRM

A0A6J1E8X0 uncharacterized protein LOC111430970 isoform X12.0e-16358.57Show/hide
Query:  DVKNPNQKPHNSFVDRTGRLVSGPQKAFSSSVRNIMKAPFQRLHRDTCYWSFSRRHHDLALSPTLDTHFLITFVVSSKVLTG---LAGCDIAWTNLGIFL
        DVKNPN+KP NS VDRTGRL+SG +    S +++ +                   +H+ A+S         ++++   V  G   +A     + NLG+FL
Subjt:  DVKNPNQKPHNSFVDRTGRLVSGPQKAFSSSVRNIMKAPFQRLHRDTCYWSFSRRHHDLALSPTLDTHFLITFVVSSKVLTG---LAGCDIAWTNLGIFL

Query:  SRNDPNIGASIIIKMEGRGVSELYRNASEELFLKSWVENAIGI---------------------------------------------------IASESV
        SR DP     IIIKMEGRGVSELYRNASEELFLKSW EN+IG+                                                   I+SE V
Subjt:  SRNDPNIGASIIIKMEGRGVSELYRNASEELFLKSWVENAIGI---------------------------------------------------IASESV

Query:  ELSSSQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGA
        +LSS QYV KHQKRISNDIL PQSASVADDV+G NQQ FR++V+GE QGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGI+S++DLSGH  
Subjt:  ELSSSQYVAKHQKRISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGA

Query:  NAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARF------GNLSFDRSSDS
         AMKLD+  SQ F D STCE+PNQPV F S SNSSAS FNA  MYDVDKISSVVNMLK T+ERKKLNNQI+K A EDSSNA F      G+ SF+RSS++
Subjt:  NAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARF------GNLSFDRSSDS

Query:  YIHQIPNSLEEFSTVQVVDNRILQMFEGSAELGFDAF---VYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQS
        Y+HQ PN L E S VQV DNRIL+    S ++ F AF   V PVNP+QSGRVSQEPSQSESSAAAA +SSGFEA DG SNS  THSN  SSR Q+     
Subjt:  YIHQIPNSLEEFSTVQVVDNRILQMFEGSAELGFDAF---VYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQS

Query:  LKNGSSR----PKEEECNSLRVRNISCF--------------------RMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQ
         K+   R     K++      VRN S                      +MAEAKERNMTPTIPSDMQSVLKRCD LEKEVRSLKLNLSFMNRKDSEQTKQ
Subjt:  LKNGSSR----PKEEECNSLRVRNISCF--------------------RMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQ

Query:  IEDLQKQNEDLANEKECLLEEIEKILSETG
        IEDLQKQNEDLA+EKE LLEEIE+ILSETG
Subjt:  IEDLQKQNEDLANEKECLLEEIEKILSETG

A0A6J1HAD0 uncharacterized protein LOC111461994 isoform X11.1e-20177.44Show/hide
Query:  MEGRGVSELYRNASEELFLKSWVENAIGI---------------------------------------------------IASESVELSSSQYVAKHQKR
        MEGRGVSELYRNASEELFLKSWVENAIG+                                                   IASESVELSS QYVAKHQKR
Subjt:  MEGRGVSELYRNASEELFLKSWVENAIGI---------------------------------------------------IASESVELSSSQYVAKHQKR

Query:  ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI
        ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI
Subjt:  ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI

Query:  DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEFSTVQVVDN
        DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE SSNARFGNLSFDRSSDSYIHQ+PNSLEEFSTVQVVDN
Subjt:  DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEFSTVQVVDN

Query:  RILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQSLKNGSSRPKEEECNS--------
        RILQ  EGSAEL FDAFVYPVNPIQSGRVSQEP QSESSAAAA +SSGFEACDGLSNSTLTHSNGASSR QV GNQSL+NGSSR K     +        
Subjt:  RILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQSLKNGSSRPKEEECNS--------

Query:  ------------LRVRNISCF------------------RMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNE
                    +R  +++                    +MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNE
Subjt:  ------------LRVRNISCF------------------RMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNE

Query:  DLANEKECLLEEIEKILSETGRM
        DLANEKECLLEEIEKILSETGRM
Subjt:  DLANEKECLLEEIEKILSETGRM

A0A6J1HC08 uncharacterized protein LOC111461994 isoform X23.6e-20277.78Show/hide
Query:  MEGRGVSELYRNASEELFLKSWVENAIGI---------------------------------------------------IASESVELSSSQYVAKHQKR
        MEGRGVSELYRNASEELFLKSWVENAIG+                                                   IASESVELSS QYVAKHQKR
Subjt:  MEGRGVSELYRNASEELFLKSWVENAIGI---------------------------------------------------IASESVELSSSQYVAKHQKR

Query:  ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI
        ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI
Subjt:  ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI

Query:  DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEFSTVQVVDN
        DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPE SSNARFGNLSFDRSSDSYIHQ+PNSLEEFSTVQVVDN
Subjt:  DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEFSTVQVVDN

Query:  RILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQSLKNGSSRPKEEECNSL-------
        RILQ  EGSAEL FDAFVYPVNPIQSGRVSQEP QSESSAAAA +SSGFEACDGLSNSTLTHSNGASSR QV GNQSL+NGSSR K     +        
Subjt:  RILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQSLKNGSSRPKEEECNSL-------

Query:  ------RVRNISCF------------------------RMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNED
              R R++  +                        +MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNED
Subjt:  ------RVRNISCF------------------------RMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNED

Query:  LANEKECLLEEIEKILSETGRM
        LANEKECLLEEIEKILSETGRM
Subjt:  LANEKECLLEEIEKILSETGRM

A0A6J1L1E0 uncharacterized protein LOC111498273 isoform X13.0e-19675.86Show/hide
Query:  MEGRGVSELYRNASEELFLKSWVENAIGI---------------------------------------------------IASESVELSSSQYVAKHQKR
        MEGR VSELYRNASEELFLKSWVEN+IG+                                                   IASE+VELSS QYVAKHQKR
Subjt:  MEGRGVSELYRNASEELFLKSWVENAIGI---------------------------------------------------IASESVELSSSQYVAKHQKR

Query:  ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI
        ISNDIL+PQSASVADDVAG NQQTFRDVV+GE QGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQ+FI
Subjt:  ISNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI

Query:  DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEFSTVQVVDN
        DPSTCEIPNQPV FVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNA FGNLSFDRSSDSY+HQIPNSLEEFSTVQVVDN
Subjt:  DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEFSTVQVVDN

Query:  RILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQSLKNGSSRPKEEECNSL-------
        RILQ  EGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA +SSGFEAC GLSNSTLTHSNGASSR QV GNQS++NGSSR K     +        
Subjt:  RILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAAA-ISSGFEACDGLSNSTLTHSNGASSRTQVTGNQSLKNGSSRPKEEECNSL-------

Query:  ------RVRNI------------------------SCFRMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNED
              R R++                           +MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNED
Subjt:  ------RVRNI------------------------SCFRMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNED

Query:  LANEKECLLEEIEKILSETGRM
        LA+EKECLLEEIEKILSETGRM
Subjt:  LANEKECLLEEIEKILSETGRM

SwissProt top hitse value%identityAlignment
A7TUE1 Protein CYCLOPS1.2e-8844.94Show/hide
Query:  MEGRGVSELYRNASEELFLKSWVENAIGI--------------------------------------------------IASESVELSSSQYVAKHQKRI
        MEGRG S LY+N+SEELFLK+ +E+ IG+                                                  I++E   +S+ Q++    +  
Subjt:  MEGRGVSELYRNASEELFLKSWVENAIGI--------------------------------------------------IASESVELSSSQYVAKHQKRI

Query:  SNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFID
        +ND    Q+  +A+DV+       RD VD E Q SNL+LAKAWF + Q MTRSRSSELRRRY  MQ++Q+  G++SM  +  H  N +K ++ N   F  
Subjt:  SNDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFID

Query:  PSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARF------GNLSFDRSSDSYIHQIPNSLEEFSTV
         S CE+P+Q   F+S SNSS+S FN   + DVDK+SS V+MLKGTL+RKKL  Q+EK A ED  N  F         +F+        ++ N   +F T 
Subjt:  PSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARF------GNLSFDRSSDSYIHQIPNSLEEFSTV

Query:  QVVDNRILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAA-AISSGFEACDGLSNSTLTHSN------GASSRTQVTG-----NQSLKNGSSR
        QV D  ++Q  EG+     D F    N IQ    S EPSQSESSAAA  ISSG +AC+G SNS  T  +      G S++ +V G       +LK+   R
Subjt:  QVVDNRILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAA-AISSGFEACDGLSNSTLTHSN------GASSRTQVTG-----NQSLKNGSSR

Query:  PKEEECNSL------------RVRNISCFR-MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECL
           E   S+            + R +   R MAEAKERN+TPTIPSDMQ++LKRC+NLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLA+EKE L
Subjt:  PKEEECNSL------------RVRNISCFR-MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECL

Query:  LEEIEKILSETGRM
        LEEIE+ILSETG++
Subjt:  LEEIEKILSETGRM

A9XMT3 Protein CYCLOPS8.9e-8945.17Show/hide
Query:  MEGRGVSELYRNASEELFLKSWVENAIGI----------------IASESVEL-------------SSSQYVAKHQKRISNDILHP--------------
        MEGRG S LYRN+SEELFLK+ +E+ IG+                  ++S EL             SS  + ++  KRIS ++++               
Subjt:  MEGRGVSELYRNASEELFLKSWVENAIGI----------------IASESVEL-------------SSSQYVAKHQKRISNDILHP--------------

Query:  ------QSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDP
              Q+  +A+DV+G      RD VD E Q SNL+LAKAWF S Q MTRSRSSELRRRY+ MQ+  ++ GIES+     HGA A K ++ N   +   
Subjt:  ------QSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDP

Query:  STCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARF------GNLSFDRSSDSYIHQIPNSLEEFSTVQ
        S CE+P+Q   F+S SNS +S FN P   D+DK+SS V+MLKGTL+R++L++Q+EK A ED  N  F          FD+  +++ +Q P +++  S  +
Subjt:  STCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARF------GNLSFDRSSDSYIHQIPNSLEEFSTVQ

Query:  VVDNRILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESS-AAAAISSGFEACDG--LSNSTLTHSN----GASSRTQVTGNQ----------SLKN
        V D+ +LQ  EGS     D F   +N I  G  S EPSQSESS AA  ISSG + C+G   SN TL  S+    G S  ++ T N+          +LK+
Subjt:  VVDNRILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESS-AAAAISSGFEACDG--LSNSTLTHSN----GASSRTQVTGNQ----------SLKN

Query:  GSSRPKEEECNSL------------RVRNISCFR-MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANE
           R   E   S+            + R +   R MAEAKERN TP++PSDMQ+VLKRC+NLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNE+LA+E
Subjt:  GSSRPKEEECNSL------------RVRNISCFR-MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANE

Query:  KECLLEEIEKILSETGRM
        KE LLEEIE+ILSET +M
Subjt:  KECLLEEIEKILSETGRM

A9XMT4 Protein CYCLOPS1.4e-8944.25Show/hide
Query:  MEGRGVSELYRNASEELFLKSWVENAIGI-------------------------------------------------IASESVELSSSQYVAKHQKRIS
        MEGRG S LY+N+SEELFLK+ +E+ IG+                                                 I++E V +S+ Q+V    +  +
Subjt:  MEGRGVSELYRNASEELFLKSWVENAIGI-------------------------------------------------IASESVELSSSQYVAKHQKRIS

Query:  NDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDP
        ND    Q++ + +DV+G      R+ VD E Q  NL+LAKAWF + Q MTRSRSSELRRRY  MQ+ Q+  G++SM     H AN +K ++ +   F   
Subjt:  NDILHPQSASVADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDP

Query:  STCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFG------NLSFDRSSDSYIHQIPNSLEEFSTVQ
        S CEIP+Q   F+S SNSS+S FN   + DVDK+SS V+MLKGTL+RK+L  Q+EK A ED  N  FG         F+   +++ HQ   +++   T Q
Subjt:  STCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFG------NLSFDRSSDSYIHQIPNSLEEFSTVQ

Query:  VVDNRILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAA-AISSGFEACDGLSNS--TLTHSN----GASSRTQVTG-----NQSLKNGSSRP
        V D  +++  EG+A    + F    + I  G  S EPSQSESSAAA  ISSG +AC+G SNS  TL  S+    G S++ +  G       +LK+   R 
Subjt:  VVDNRILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAAA-AISSGFEACDGLSNS--TLTHSN----GASSRTQVTG-----NQSLKNGSSRP

Query:  K------------EEECNSLRVRNISCFR-MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLL
        +            +++ ++ + R +   R MAEAKERN+TPTIPSDMQ+V+KRC+NLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNE+LA+EKE LL
Subjt:  K------------EEECNSLRVRNISCFR-MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLANEKECLL

Query:  EEIEKILSETGRM
        EEIE++LSETG++
Subjt:  EEIEKILSETGRM

A9XMT5 Protein CYCLOPS3.9e-6840.12Show/hide
Query:  MEGRGVSELYRNASEELFLKSWVENAIGIIAS------------------ESVELSSS--------------------------------QYVAKHQKRI
        MEGRG+SEL+RN SE++FLK+ +EN++G+ A+                  +S EL +S                                Q     Q+  
Subjt:  MEGRGVSELYRNASEELFLKSWVENAIGIIAS------------------ESVELSSS--------------------------------QYVAKHQKRI

Query:  SNDILHPQSASV-ADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI
          D L PQ+ +V ++     NQQ  ++  +   Q S+L LAKAWFHS+QPMTRSRSSELR+RYAAMQSN      E++       AN ++ D+TN+    
Subjt:  SNDILHPQSASV-ADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFI

Query:  DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEF---STVQV
               P+Q   FVS S+SS S  + P++   D I+SVV+MLK TLERKKL++       + SS   FG       S  +   I    + F   +T Q+
Subjt:  DPSTCEIPNQPVQFVSTSNSSASMFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEF---STVQV

Query:  VDNRILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAA-AAISSGFEACDGL---------SNSTLTH-----------SNGASSRTQV-----
         D+ +L   E   E G   FV P N +  G  S+EPSQS SS A  A S+GFE CD L           ST T+           S G   R ++     
Subjt:  VDNRILQMFEGSAELGFDAFVYPVNPIQSGRVSQEPSQSESSAA-AAISSGFEACDGL---------SNSTLTH-----------SNGASSRTQV-----

Query:  -----TGNQSLKNGSSRPKEEECNSLRVRNISCFR-MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLAN
              G+ +     S  + ++ +  + R +   R MAEAKER+ TP IPSD+Q VLKRC+ LEKEVRSLKLNLSFMNRKDSEQTKQIE+LQKQNEDL  
Subjt:  -----TGNQSLKNGSSRPKEEECNSLRVRNISCFR-MAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMNRKDSEQTKQIEDLQKQNEDLAN

Query:  EKECLLEEIEKILSET
        EKE LLEEIE+I+S+T
Subjt:  EKECLLEEIEKILSET

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCAACTCATCAACTGTTTACACAGCCGAAACGAAGACGAAGATTTCTTTCATCCAGCAACTCCCAACTTGAGGAGCAGAAGGCTGATGTTGAAATTTTCTATAAA
CAATTCAGCAACAAAACCTCAATCGAAGGATGATGTGAAGAATCCGAATCAGAAACCGCATAATTCGTTCGTTGATCGAACAGGTCGGCTTGTTTCTGGCCCTCAAAAGG
CCTTCTCCAGCAGCGTACGCAATATCATGAAAGCGCCATTCCAACGGCTTCATAGAGATACTTGCTATTGGTCCTTTTCACGTCGCCATCACGATCTGGCTCTATCTCCA
ACCCTAGATACACACTTTTTGATTACGTTTGTTGTTAGTTCTAAAGTTCTTACCGGCCTAGCTGGTTGTGATATTGCTTGGACGAATTTAGGCATCTTTCTGTCTCGAAA
TGATCCAAACATTGGAGCAAGTATCATCATAAAGATGGAAGGAAGGGGCGTTTCAGAACTATATAGAAACGCAAGTGAGGAGCTATTTCTTAAATCTTGGGTCGAAAATG
CAATCGGAATAATTGCCTCGGAGTCTGTTGAACTGTCTAGTTCTCAGTATGTGGCAAAACATCAAAAGAGAATAAGCAATGATATATTGCATCCACAAAGTGCCTCCGTG
GCTGATGATGTCGCAGGAGCTAATCAACAAACTTTCAGGGATGTTGTTGATGGTGAAGGGCAAGGTAGTAACCTTTATCTTGCGAAGGCATGGTTCCACAGTTCTCAACC
TATGACAAGAAGTCGATCATCTGAGCTAAGGAGGAGGTATGCTGCAATGCAAAGCAATCAAAGCTCATTCGGTATAGAGTCCATGCATGACTTGTCAGGGCATGGAGCCA
ACGCGATGAAACTAGATATCACAAATTCACAGAATTTCATTGACCCTTCTACTTGTGAGATTCCAAACCAGCCTGTTCAATTTGTATCCACATCCAATTCATCGGCATCA
ATGTTCAATGCACCAAACATGTATGATGTAGATAAAATTTCTTCTGTTGTAAACATGCTAAAGGGCACATTAGAACGGAAGAAACTAAATAACCAGATTGAAAAAGTGGC
ACCGGAGGATAGTTCAAATGCACGTTTTGGCAACCTTAGTTTCGATCGAAGCAGTGACAGTTATATACATCAAATACCGAACAGTCTCGAGGAATTTTCTACTGTTCAAG
TTGTGGATAATAGAATTTTACAAATGTTTGAGGGATCAGCAGAGCTCGGTTTCGATGCTTTCGTATATCCTGTAAATCCCATTCAGTCAGGTAGAGTTTCTCAAGAACCT
TCTCAAAGTGAATCTTCTGCTGCTGCAGCAATTTCATCCGGTTTTGAGGCGTGTGATGGTCTTAGCAACTCAACTCTAACTCATAGCAACGGTGCAAGCTCAAGGACACA
AGTTACAGGCAATCAGAGCTTAAAAAATGGATCATCACGACCTAAAGAGGAGGAGTGTAATTCGTTACGGGTCCGTAACATCAGCTGCTTCAGAATGGCAGAGGCGAAGG
AGAGGAATATGACACCAACTATTCCATCAGATATGCAATCAGTTCTGAAGCGGTGTGACAATCTTGAGAAGGAAGTGCGGTCACTAAAACTCAATTTGTCCTTTATGAAT
AGGAAGGATTCTGAGCAGACTAAGCAGATAGAGGATCTTCAGAAGCAGAATGAGGACTTAGCAAATGAAAAAGAATGCCTACTTGAAGAGATTGAGAAGATTCTCTCAGA
AACTGGAAGAATGTTCTTGAGCTTGACAGCTGGAAAAGCCATCATAGTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTCAACTCATCAACTGTTTACACAGCCGAAACGAAGACGAAGATTTCTTTCATCCAGCAACTCCCAACTTGAGGAGCAGAAGGCTGATGTTGAAATTTTCTATAAA
CAATTCAGCAACAAAACCTCAATCGAAGGATGATGTGAAGAATCCGAATCAGAAACCGCATAATTCGTTCGTTGATCGAACAGGTCGGCTTGTTTCTGGCCCTCAAAAGG
CCTTCTCCAGCAGCGTACGCAATATCATGAAAGCGCCATTCCAACGGCTTCATAGAGATACTTGCTATTGGTCCTTTTCACGTCGCCATCACGATCTGGCTCTATCTCCA
ACCCTAGATACACACTTTTTGATTACGTTTGTTGTTAGTTCTAAAGTTCTTACCGGCCTAGCTGGTTGTGATATTGCTTGGACGAATTTAGGCATCTTTCTGTCTCGAAA
TGATCCAAACATTGGAGCAAGTATCATCATAAAGATGGAAGGAAGGGGCGTTTCAGAACTATATAGAAACGCAAGTGAGGAGCTATTTCTTAAATCTTGGGTCGAAAATG
CAATCGGAATAATTGCCTCGGAGTCTGTTGAACTGTCTAGTTCTCAGTATGTGGCAAAACATCAAAAGAGAATAAGCAATGATATATTGCATCCACAAAGTGCCTCCGTG
GCTGATGATGTCGCAGGAGCTAATCAACAAACTTTCAGGGATGTTGTTGATGGTGAAGGGCAAGGTAGTAACCTTTATCTTGCGAAGGCATGGTTCCACAGTTCTCAACC
TATGACAAGAAGTCGATCATCTGAGCTAAGGAGGAGGTATGCTGCAATGCAAAGCAATCAAAGCTCATTCGGTATAGAGTCCATGCATGACTTGTCAGGGCATGGAGCCA
ACGCGATGAAACTAGATATCACAAATTCACAGAATTTCATTGACCCTTCTACTTGTGAGATTCCAAACCAGCCTGTTCAATTTGTATCCACATCCAATTCATCGGCATCA
ATGTTCAATGCACCAAACATGTATGATGTAGATAAAATTTCTTCTGTTGTAAACATGCTAAAGGGCACATTAGAACGGAAGAAACTAAATAACCAGATTGAAAAAGTGGC
ACCGGAGGATAGTTCAAATGCACGTTTTGGCAACCTTAGTTTCGATCGAAGCAGTGACAGTTATATACATCAAATACCGAACAGTCTCGAGGAATTTTCTACTGTTCAAG
TTGTGGATAATAGAATTTTACAAATGTTTGAGGGATCAGCAGAGCTCGGTTTCGATGCTTTCGTATATCCTGTAAATCCCATTCAGTCAGGTAGAGTTTCTCAAGAACCT
TCTCAAAGTGAATCTTCTGCTGCTGCAGCAATTTCATCCGGTTTTGAGGCGTGTGATGGTCTTAGCAACTCAACTCTAACTCATAGCAACGGTGCAAGCTCAAGGACACA
AGTTACAGGCAATCAGAGCTTAAAAAATGGATCATCACGACCTAAAGAGGAGGAGTGTAATTCGTTACGGGTCCGTAACATCAGCTGCTTCAGAATGGCAGAGGCGAAGG
AGAGGAATATGACACCAACTATTCCATCAGATATGCAATCAGTTCTGAAGCGGTGTGACAATCTTGAGAAGGAAGTGCGGTCACTAAAACTCAATTTGTCCTTTATGAAT
AGGAAGGATTCTGAGCAGACTAAGCAGATAGAGGATCTTCAGAAGCAGAATGAGGACTTAGCAAATGAAAAAGAATGCCTACTTGAAGAGATTGAGAAGATTCTCTCAGA
AACTGGAAGAATGTTCTTGAGCTTGACAGCTGGAAAAGCCATCATAGTCTAA
Protein sequenceShow/hide protein sequence
MLQLINCLHSRNEDEDFFHPATPNLRSRRLMLKFSINNSATKPQSKDDVKNPNQKPHNSFVDRTGRLVSGPQKAFSSSVRNIMKAPFQRLHRDTCYWSFSRRHHDLALSP
TLDTHFLITFVVSSKVLTGLAGCDIAWTNLGIFLSRNDPNIGASIIIKMEGRGVSELYRNASEELFLKSWVENAIGIIASESVELSSSQYVAKHQKRISNDILHPQSASV
ADDVAGANQQTFRDVVDGEGQGSNLYLAKAWFHSSQPMTRSRSSELRRRYAAMQSNQSSFGIESMHDLSGHGANAMKLDITNSQNFIDPSTCEIPNQPVQFVSTSNSSAS
MFNAPNMYDVDKISSVVNMLKGTLERKKLNNQIEKVAPEDSSNARFGNLSFDRSSDSYIHQIPNSLEEFSTVQVVDNRILQMFEGSAELGFDAFVYPVNPIQSGRVSQEP
SQSESSAAAAISSGFEACDGLSNSTLTHSNGASSRTQVTGNQSLKNGSSRPKEEECNSLRVRNISCFRMAEAKERNMTPTIPSDMQSVLKRCDNLEKEVRSLKLNLSFMN
RKDSEQTKQIEDLQKQNEDLANEKECLLEEIEKILSETGRMFLSLTAGKAIIV