; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014907 (gene) of Snake gourd v1 genome

Gene IDTan0014907
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionpre-mRNA-splicing factor CWC22 homolog isoform X2
Genome locationLG01:108670296..108673379
RNA-Seq ExpressionTan0014907
SyntenyTan0014907
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7024815.1 hypothetical protein SDJN02_13634, partial [Cucurbita argyrosperma subsp. argyrosperma]6.4e-24179.74Show/hide
Query:  MGK-FSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDY
        MGK  SSR KERSK SSSQRSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDD+KV RSRSKTR   KN+K SKKR+KKQS+D QSR+ 
Subjt:  MGK-FSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDY

Query:  SPHPRKRKHSKRDNRCEVKKTNKKKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEV
        SPHPRKRKHSKR +R E KKTNKKK RRD SV AT SDSL  STCGDGS+TS++SEIDRRRGR  KRK NMVKTE  R+RSKS SPCSLCS+GSD QNEV
Subjt:  SPHPRKRKHSKRDNRCEVKKTNKKKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEV

Query:  EDDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEG
        EDD YVEN+ RRL+S+IVVVGEE++L+TF  NEQQE VTHQ DD+HPSFGDM+SKDG +KRELDYVISKEAPEVESK ++V PDNRNS+++ DDGV+NEG
Subjt:  EDDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEG

Query:  SNNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQ
        SN NHGG ++DHSL+ERK+GCSGN++SINCIDLESILRQ+ALENLRKFK V PRNVE  ANCK +NNND KQ  SPVSKSV +TSPRD+AE+N   FSRQ
Subjt:  SNNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQ

Query:  SGRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLT
         G +A+NSMIV+ NG KSTD ID+AVAS HDPV SSQNLGKISNGSNG+NELKQDISSLDQE +NDNI  KADADI STT+ SNLVIAA R ESKVDS  
Subjt:  SGRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLT

Query:  KQASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKPSSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIPK
        K+ASA QE IQ K SISDI VDETAQTQTQM NNDDQ I NGFGSSA+KPSSSLNSISGENSLDKSR ESGEGSQFEQKTMSV RGGEMVQVNYKVYIPK
Subjt:  KQASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKPSSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIPK

Query:  RAPALTRRQLKR
        RAPALTRRQLKR
Subjt:  RAPALTRRQLKR

KAG7037416.1 hypothetical protein SDJN02_01042 [Cucurbita argyrosperma subsp. argyrosperma]8.3e-24179.12Show/hide
Query:  MGKFSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDYS
        MGKFSSR K RSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPS SD D E+STS+SSSS+EDDEKVGRSRSK RKNA      KKRAKK+S+D Q RDYS
Subjt:  MGKFSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDYS

Query:  PHPRKRKHSKRDNRCEVKKTNKKKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEVE
        PHPRKRKHSKRD  CE+KK+NKKK +RDASV AT SDSL CSTCGDGSTTSNE EIDRR+GR RKRK NM K ERS++RS S SPCSLCSEGSDHQNEVE
Subjt:  PHPRKRKHSKRDNRCEVKKTNKKKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEVE

Query:  DDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEGS
        D CYVENNFRRLRSVIVVVGEE+KLETF+ NE QE VTHQPD DHPSFGD++  DGM+ RELD +IS+EAP       V+I DNRNS+VVK+DGVQNEGS
Subjt:  DDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEGS

Query:  NNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQS
        NNNHGG++HDH L ERK+GCSGN++  NCIDLESILRQRALENLRK+KRV PRNVETP+NC+ DN+ND KQ  SPVSKS  +TSPRDEA ++GN FSRQ 
Subjt:  NNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQS

Query:  GRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLTK
        G NA+NSMI+ ENGVKSTDE+DSAVAST+DPVYSSQ LGKISNGSN +NELK+ ISS+DQE VND+I   ADADIC TT+RSNLVIAAL+ ES VDSLT+
Subjt:  GRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLTK

Query:  QASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKP--SSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIP
        QASASQESIQ KPS SD+G DETAQTQTQMRNND Q IG+GFGSSAHKP  SSSLNSISGEN L++SRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIP
Subjt:  QASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKP--SSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIP

Query:  KRAPALTRRQLKR
        KRAP L+RRQLKR
Subjt:  KRAPALTRRQLKR

XP_022139776.1 uncharacterized protein LOC111010601 [Momordica charantia]5.6e-24581.37Show/hide
Query:  MGKFSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDYS
        MGK  SR KERSKTSSSQRSRRK+RSSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSSSSEDDEKVGRSRS      KNAK  KKRAKK+S D Q RD S
Subjt:  MGKFSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDYS

Query:  PHPRKRKHSKRDNRCEVKKTNK-KKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEV
        PHPRKRKHSKR +RCEVKKTNK KK RRD SVSAT  DSLSCSTCGDGSTTSNESEIDR RGR  KRK N  KTERSR+RSKS SPCSLCSEGSD+QNEV
Subjt:  PHPRKRKHSKRDNRCEVKKTNK-KKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEV

Query:  EDDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEG
        ED  YVENNFRRLRSVIVVVGEENKL+TF+ NEQQEEV H PDDDHPSFGDMDS DGM+KRELD V S EA EVE+KKEVVIPD RN +VVKD GVQNEG
Subjt:  EDDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEG

Query:  SNNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQ
        SNNNHGG ++DH LNE  +G SGN+D INCIDLESILRQRALENLRKFK VPP+NVET ANC+ DN+ND KQ YSPVS SVR+ SPRD+AE+NG  FS Q
Subjt:  SNNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQ

Query:  SGRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLT
         G NA+N MIVEENGV+ST+ IDSAVASTHDP+YSSQNLGKIS+ SNG+NELKQDISSLDQE VNDNI QK DADICSTTSRSNLV AALR +SKVD L 
Subjt:  SGRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLT

Query:  KQASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKPSSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIPK
        KQASA QE IQ KPSISD+GVDE AQ Q Q RNNDDQ I NGF SSAHKPSSSLN  SGENSL+K RHESGEGSQFEQKTMSV RGGEMVQVNYKVYIPK
Subjt:  KQASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKPSSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIPK

Query:  RAPALTRRQLKR
        RAPAL RRQLKR
Subjt:  RAPALTRRQLKR

XP_023535556.1 transcriptional regulator ATRX isoform X1 [Cucurbita pepo subsp. pepo]3.1e-24379.58Show/hide
Query:  MGK-FSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDY
        MGK  SSR KERSKTSSSQRSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDD+KV RSRSKTR   KN+K SKKR+KKQS+D QSR+ 
Subjt:  MGK-FSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDY

Query:  SPHPRKRKHSKRDNRCEVKKTNKKKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEV
         PHPRKRKHSKR +R E KKTNKKK RRD SV AT SDSLS STCGDGS+TS++SEIDRRRGR  KRK NMVKTE  R+RSKS SPCSLCS+G D QNEV
Subjt:  SPHPRKRKHSKRDNRCEVKKTNKKKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEV

Query:  EDDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEG
        EDD YVEN+ RRL+S+IVVVGEE++L+TF  NEQQE VTHQ D++HPSFGDM+SKDG +KRELDYVISKEAPEVESK ++V PDNRNS+++ DDGV+NEG
Subjt:  EDDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEG

Query:  SNNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQ
        SN NHGG ++DHSL+ERK+GCSGN+DSINCI+LESILRQ+ALENLRKFK V PRNVE  +NCK +NNND KQ  SPVSKSV +T PRD+AE+NG  FSRQ
Subjt:  SNNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQ

Query:  SGRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLT
         G +A+NSMIV+ENG KSTD ID+AVAS HDPV SSQNLGKISNGSNG+NELKQDISSLDQE +NDNI  KADADI STT+RSNLVIAA R ESKVDSL 
Subjt:  SGRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLT

Query:  KQASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKPSSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIPK
        ++ASA+QE I+ KPSISDI VDETAQT+TQM+NN+DQ I NGFGSSA+KPSSSLNSISGENSLDKSRHESGEGSQFEQKTMSV RGGEMVQVNYKVYIPK
Subjt:  KQASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKPSSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIPK

Query:  RAPALTRRQLKR
        RAPALTRRQLKR
Subjt:  RAPALTRRQLKR

XP_038897880.1 histone-lysine N-methyltransferase SETD2 isoform X1 [Benincasa hispida]1.4e-24080.13Show/hide
Query:  MGK-FSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDY
        MGK  SSR KERSKTSSSQRSRRKS+SS+KLKSKKLRYRHDSPSCSDTDFESSTSV SSSSEDD++V RSRSKTR   KNAK SKKR+K+QS+D QSR+ 
Subjt:  MGK-FSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDY

Query:  SPHPRKRKHSKRDNRCEVKKTNKKKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEV
        SPHPRKRKHSKR++ CE KK  KKK RRDASV A  SDS SCSTCG+GSTTSNESE+ RRRGR  KRKGNM KTER R+RSKS SPCSL S+ SD+QNEV
Subjt:  SPHPRKRKHSKRDNRCEVKKTNKKKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEV

Query:  EDDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPD--DDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQN
        +DD YV NNFRRLRS+IV+ GEENKL+TF  NEQQE  THQP+  DDHPS GDMDSKD  +KRELDYVISKE P VE KKEV +P+NRNSMVVKDDGVQN
Subjt:  EDDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPD--DDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQN

Query:  EGSNNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFS
        EGSN N GG ++DHSL+ERK+GCSG +DS+N IDLESILRQRALENLRKFK  PPRNVET ANCK D+NND KQ  SPVSKSV +TSPRD+AE+N   FS
Subjt:  EGSNNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFS

Query:  RQSGRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDS
        RQ G NA+NSMIV+ENGVKSTD IDS+V S HDPVYSSQNLGKISNGSNG+NELKQ+ISSLDQE +NDNI QKADADICSTT+RSNLVIAALR ESKVDS
Subjt:  RQSGRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDS

Query:  LTKQASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKPSSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYI
        L KQA A+QESIQ KPSISDIGVDETAQTQTQMRNNDDQ I NG  SSAHKP SSLNSISGENSL  SRHESG+ SQFEQKTMSV RGGEMVQVNYKVYI
Subjt:  LTKQASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKPSSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYI

Query:  PKRAPALTRRQLKR
        PKRAPALTRRQLKR
Subjt:  PKRAPALTRRQLKR

TrEMBL top hitse value%identityAlignment
A0A6J1CDR0 uncharacterized protein LOC1110106012.7e-24581.37Show/hide
Query:  MGKFSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDYS
        MGK  SR KERSKTSSSQRSRRK+RSSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSSSSEDDEKVGRSRS      KNAK  KKRAKK+S D Q RD S
Subjt:  MGKFSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDYS

Query:  PHPRKRKHSKRDNRCEVKKTNK-KKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEV
        PHPRKRKHSKR +RCEVKKTNK KK RRD SVSAT  DSLSCSTCGDGSTTSNESEIDR RGR  KRK N  KTERSR+RSKS SPCSLCSEGSD+QNEV
Subjt:  PHPRKRKHSKRDNRCEVKKTNK-KKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEV

Query:  EDDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEG
        ED  YVENNFRRLRSVIVVVGEENKL+TF+ NEQQEEV H PDDDHPSFGDMDS DGM+KRELD V S EA EVE+KKEVVIPD RN +VVKD GVQNEG
Subjt:  EDDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEG

Query:  SNNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQ
        SNNNHGG ++DH LNE  +G SGN+D INCIDLESILRQRALENLRKFK VPP+NVET ANC+ DN+ND KQ YSPVS SVR+ SPRD+AE+NG  FS Q
Subjt:  SNNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQ

Query:  SGRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLT
         G NA+N MIVEENGV+ST+ IDSAVASTHDP+YSSQNLGKIS+ SNG+NELKQDISSLDQE VNDNI QK DADICSTTSRSNLV AALR +SKVD L 
Subjt:  SGRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLT

Query:  KQASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKPSSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIPK
        KQASA QE IQ KPSISD+GVDE AQ Q Q RNNDDQ I NGF SSAHKPSSSLN  SGENSL+K RHESGEGSQFEQKTMSV RGGEMVQVNYKVYIPK
Subjt:  KQASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKPSSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIPK

Query:  RAPALTRRQLKR
        RAPAL RRQLKR
Subjt:  RAPALTRRQLKR

A0A6J1F6D1 uncharacterized protein LOC111442542 isoform X11.1e-23879.08Show/hide
Query:  MGK-FSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDY
        MGK  SSR KERSKTSSSQRSRRKS+SSR+LKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDD+KV RSRSKTR   KN+K SKKR+KKQS+D QSR+ 
Subjt:  MGK-FSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDY

Query:  SPHPRKRKHSKRDNRCEVKKTNKKKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEV
        SPHPRKRKHSKR +R E KKTNKKK RRDASV AT SDSL  STCGDGS+TS++SEIDRRRGR  KRK NMVKTE  R+RSKS SPCSLCS+GSD QNEV
Subjt:  SPHPRKRKHSKRDNRCEVKKTNKKKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEV

Query:  EDDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEG
        EDD YVEN+ RRL+S+IVVVGEE++L+TF  NEQQE VTHQ DD+HPSFGDM+SKDG +KRELDYVISKEAPEVESK ++V PDNRNS+++ DDGV+NEG
Subjt:  EDDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEG

Query:  SNNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQ
        SN NHGG ++DHSL+ERK+GCSGN++SINCIDLESILRQ+ALENLRKFK V PRNVE  ANCK +NNND KQ  SPVSKSV +T PRD+AE+N   FSRQ
Subjt:  SNNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQ

Query:  SGRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLT
         G +A+NSMIV+ NG KSTD ID+AVAS HDPV SSQNLGKISNGSNG+NE KQDISSLDQE +NDNI  KADADI STT+RSNLVIAA R ESKVDS  
Subjt:  SGRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLT

Query:  KQASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKPSSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIPK
        K+AS  QE IQ K SISDI VDETAQTQTQM NNDDQ I NGFGSSA+K SSSLN ISGEN LDKSR ESGEGSQFEQKTMSV RGGEMVQVNYKVYIPK
Subjt:  KQASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKPSSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIPK

Query:  RAPALTRRQLKR
        RAPALTRRQLKR
Subjt:  RAPALTRRQLKR

A0A6J1FN60 pre-mRNA-splicing factor CWC22 homolog isoform X11.2e-22978.36Show/hide
Query:  QRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDYSPHPRKRKHSKRDNRCEV
        +RSRRKSRSSRKLKS  +RYRHDSPS SD D ESSTS+SSSS+EDDEKVGRSRSK RKNA      KKRAKK+S+D Q RDYSPHPRKRKHSKRD  CE+
Subjt:  QRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDYSPHPRKRKHSKRDNRCEV

Query:  KKTNKKKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEVEDDCYVENNFRRLRSVIV
        KK+NKKK +RDASV AT SDSLSCSTCGDGSTTSNE EIDRR+GR RKRK NM K ERSR+ SKS SPCSLCSEGSDHQNEVE+ CYVENNFRRLRSVIV
Subjt:  KKTNKKKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEVEDDCYVENNFRRLRSVIV

Query:  VVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEGSNNNHGGASHDHSLNERK
        VVGEE+KLETF+ NE QE VTHQPD DHPSFGD++  DGM+ RELD +IS+EAP       V+I DNRNS+VVKDDGVQNEGSNNNHGG++HDH L ERK
Subjt:  VVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEGSNNNHGGASHDHSLNERK

Query:  SGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQSGRNAINSMIVEENGVKS
        +GCS N+D  NCIDLESILRQRALENLRK+KRV PRNVETPAN + DN+ND KQ  SPVSK V +TSPRDEA +NG+ FSRQ G NA+NSMI+ ENGVKS
Subjt:  SGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQSGRNAINSMIVEENGVKS

Query:  TDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLTKQASASQESIQIKPSISD
        TDE+DSAVAST+DPVYSSQ+LGKISNGSN +NELKQ ISS+DQE VND+I   ADADIC TT+RSNLVIAAL+ ES VDSLT+QASASQESIQ KPS SD
Subjt:  TDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLTKQASASQESIQIKPSISD

Query:  IGVDETAQTQTQMRNNDDQKIGNGFGSSAHKP--SSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIPKRAPALTRRQLKR
        +G  ET QTQTQMRNND Q IG+GFGSSAHKP  SSSLNSISGE+ L+ SRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIPKRAP L+RRQLKR
Subjt:  IGVDETAQTQTQMRNNDDQKIGNGFGSSAHKP--SSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIPKRAPALTRRQLKR

A0A6J1FTH5 pre-mRNA-splicing factor CWC22 homolog isoform X23.0e-23678.79Show/hide
Query:  MGKFSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDYS
        MGKFSSR K RSKTSSSQRSRRKSRSSRKLKS  +RYRHDSPS SD D ESSTS+SSSS+EDDEKVGRSRSK RKNA      KKRAKK+S+D Q RDYS
Subjt:  MGKFSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDYS

Query:  PHPRKRKHSKRDNRCEVKKTNKKKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEVE
        PHPRKRKHSKRD  CE+KK+NKKK +RDASV AT SDSLSCSTCGDGSTTSNE EIDRR+GR RKRK NM K ERSR+ SKS SPCSLCSEGSDHQNEVE
Subjt:  PHPRKRKHSKRDNRCEVKKTNKKKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEVE

Query:  DDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEGS
        + CYVENNFRRLRSVIVVVGEE+KLETF+ NE QE VTHQPD DHPSFGD++  DGM+ RELD +IS+EAP       V+I DNRNS+VVKDDGVQNEGS
Subjt:  DDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEGS

Query:  NNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQS
        NNNHGG++HDH L ERK+GCS N+D  NCIDLESILRQRALENLRK+KRV PRNVETPAN + DN+ND KQ  SPVSK V +TSPRDEA +NG+ FSRQ 
Subjt:  NNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQS

Query:  GRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLTK
        G NA+NSMI+ ENGVKSTDE+DSAVAST+DPVYSSQ+LGKISNGSN +NELKQ ISS+DQE VND+I   ADADIC TT+RSNLVIAAL+ ES VDSLT+
Subjt:  GRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLTK

Query:  QASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKP--SSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIP
        QASASQESIQ KPS SD+G  ET QTQTQMRNND Q IG+GFGSSAHKP  SSSLNSISGE+ L+ SRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIP
Subjt:  QASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKP--SSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIP

Query:  KRAPALTRRQLKR
        KRAP L+RRQLKR
Subjt:  KRAPALTRRQLKR

A0A6J1IGY0 uncharacterized protein LOC111476850 isoform X14.2e-23878.59Show/hide
Query:  MGK-FSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDY
        MGK  SSR KERSKTSSSQRSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDD+KV RSRSKTR   KN+K SKKR+KKQS+D QSR+ 
Subjt:  MGK-FSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDY

Query:  SPHPRKRKHSKRDNRCEVKKTNKKKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEV
        SPHPRKRKHSKR++R E KKTNKKK RRD SV AT SDSLS STCGDGS+TS++SEIDRRRGR  KRK NMVKTE  R+RSKS SPCSLCS+GSD QNEV
Subjt:  SPHPRKRKHSKRDNRCEVKKTNKKKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEV

Query:  EDDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEG
        EDD YV+N  RRL+S+IVVVGEE++L+TF  NEQQE VTHQ DD+HP F DM+SKDG  KRELDYVISKEAPEVESK ++  PDNRNS+++ +DGV+NEG
Subjt:  EDDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEG

Query:  SNNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQ
        SN NHGG ++DHSL+ERK+GCSGN+D+INCIDLESILRQ+ALENLRKFK   PRNVE  ANCK +NNND KQ +SPVSKSV + SPRD+AE NG  FSRQ
Subjt:  SNNNHGGASHDHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQ

Query:  SGRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLT
         G +A+NSMI++ NG KSTD ID+AVAS HDPV SSQNLGKISNGSNG+NELKQDISSLDQE +NDNI  KADA+I STT+RSNLVIAA R ESKVDSL 
Subjt:  SGRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLT

Query:  KQASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKPSSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIPK
        ++ASA+QE IQ KPSISDI VDE +QTQTQ  NNDDQ I NGFGSSA+KPSSSLNSISGENSLDKSR ESGEGSQFEQKTMSV RGGEMVQVNYKVYIPK
Subjt:  KQASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKPSSSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIPK

Query:  RAPALTRRQLKR
        RAPALTRRQLKR
Subjt:  RAPALTRRQLKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G53930.1 unknown protein5.6e-1730.46Show/hide
Query:  MGKFSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDYS
        MGK SS +K    +S+  RS +K +S R  KSKK+R   D    S +D     S   SSSEDD +        RK  + +K SKKR++K+ + S+S D S
Subjt:  MGKFSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDYS

Query:  PHPR---KRKHSKRDNRCEVKKTNK----KKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGS
           R   K+K SKR +    KK  K    K+ +RD S S+T S+     +  DGS    ES+  +R  R R R+   VK  RSR R +        SE  
Subjt:  PHPR---KRKHSKRDNRCEVKKTNK----KKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGS

Query:  DHQNEVEDDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDD
        D   + ED+   E N RRL+S++VV         +   E++EE      DD     D+    G   REL Y  S+++ E++ +       +  S +  DD
Subjt:  DHQNEVEDDCYVENNFRRLRSVIVVVGEENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDD

Query:  GVQNEGSNNNHGGASH-DHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMN
            E + +     SH D+SL +               DLE+IL++RALENL++F+ V  +            +   K+  S VS+   M    ++ E  
Subjt:  GVQNEGSNNNHGGASH-DHSLNERKSGCSGNSDSINCIDLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMN

Query:  GNEFSRQSGRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLE
         ++   Q                      DSAV+     + +S+ +  + N       L    S  DQ+   D    K  + + S T++  LV   L  +
Subjt:  GNEFSRQSGRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGKISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLE

Query:  SKVDSLTKQASASQ--ESIQIKPSISDIGVDETA---QTQTQMRNNDDQKIGNGFG--SSAHKPSSSLNSI-SGENSLDKSRHESGEGSQFEQKTMSVKR
        S   +  K+AS SQ  E+  I  S  D    E+     T+ +  + +  K+ +     SS+H  +  ++ +  G  S  K+  E+ + SQ+EQKTM+V R
Subjt:  SKVDSLTKQASASQ--ESIQIKPSISDIGVDETA---QTQTQMRNNDDQKIGNGFG--SSAHKPSSSLNSI-SGENSLDKSRHESGEGSQFEQKTMSVKR

Query:  GGEMVQVNYKVYIPKRAPALTRRQLKR
        GGEMVQV+YKVYIPK+A +L RR+L R
Subjt:  GGEMVQVNYKVYIPKRAPALTRRQLKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAAGTTCTCTTCTCGCAACAAGGAGCGTTCCAAGACTTCCTCATCCCAGAGAAGTAGGAGAAAGAGTAGGAGTAGTAGGAAATTGAAGTCCAAGAAGCTGCGGTA
TCGCCACGATTCTCCCTCTTGCTCTGACACTGATTTTGAAAGTTCAACTTCAGTGTCTTCTTCTAGCTCGGAGGATGATGAAAAAGTGGGAAGATCTCGATCCAAGACGC
GAAAGAATGCTAAGAATGCTAAGTCTAGTAAAAAGAGAGCTAAGAAGCAATCTAACGACAGTCAAAGTAGAGATTACTCCCCTCATCCCAGAAAGAGGAAGCATTCAAAG
AGAGACAACCGTTGTGAGGTGAAGAAGACCAATAAAAAGAAGCCTAGAAGAGATGCGAGTGTTAGTGCCACACGTAGCGACTCTTTGAGCTGCTCAACTTGTGGAGATGG
GAGTACAACCAGCAATGAGAGTGAAATTGATAGGCGTAGGGGAAGGTTTAGAAAGAGGAAAGGAAATATGGTGAAGACTGAAAGAAGTAGACACAGGTCAAAGAGTCCTT
CACCATGCTCTTTATGTAGTGAAGGTAGTGATCATCAGAATGAGGTTGAGGATGACTGTTATGTTGAAAACAACTTTAGACGACTGAGATCCGTAATAGTTGTAGTAGGA
GAGGAAAATAAATTAGAGACATTTGAGAGGAATGAACAACAAGAAGAGGTCACACATCAGCCTGATGATGACCACCCTTCTTTTGGAGATATGGACAGTAAGGATGGGAT
GAATAAAAGAGAATTAGATTATGTTATATCGAAAGAGGCACCAGAGGTAGAAAGCAAAAAAGAAGTGGTTATACCTGACAATAGAAACTCTATGGTTGTAAAAGATGATG
GAGTTCAAAATGAGGGAAGCAACAATAACCATGGAGGAGCATCTCATGACCATTCTTTAAATGAAAGAAAGAGTGGCTGTTCTGGAAATTCTGACAGCATAAATTGTATC
GATTTAGAGTCAATTTTAAGACAGAGGGCTTTGGAAAACCTAAGAAAGTTCAAACGGGTGCCCCCAAGGAATGTGGAAACTCCGGCTAATTGCAAAGGTGACAATAATAA
TGATGTGAAGCAATTTTACTCTCCTGTCTCTAAGTCAGTTCGCATGACTTCCCCTAGGGATGAGGCCGAGATGAATGGTAATGAGTTCTCTAGACAAAGTGGAAGGAATG
CAATAAATTCAATGATAGTTGAGGAGAATGGTGTTAAATCTACTGATGAAATAGATTCAGCAGTTGCATCTACGCATGATCCTGTCTATTCTTCACAGAACCTGGGTAAG
ATTTCCAATGGAAGCAATGGTATCAATGAATTAAAGCAGGATATCTCTTCATTAGACCAGGAGGGTGTAAATGATAATATTTTCCAGAAGGCAGATGCAGATATTTGTTC
TACAACTAGCAGAAGCAATTTGGTTATTGCAGCTTTGAGGCTCGAGTCAAAAGTGGATTCTCTTACAAAGCAAGCATCTGCTTCTCAGGAATCTATCCAAATAAAGCCAT
CTATATCTGACATTGGTGTTGATGAAACTGCTCAAACTCAGACCCAGATGAGGAATAATGATGATCAAAAGATTGGTAATGGTTTTGGTTCTTCAGCTCACAAGCCTTCT
TCTTCCCTTAATTCTATTTCAGGAGAAAATAGCTTGGATAAGTCCAGACACGAGAGTGGCGAAGGCTCGCAGTTTGAACAGAAAACCATGTCTGTGAAGCGGGGTGGTGA
AATGGTGCAGGTGAACTACAAGGTCTACATCCCAAAGAGAGCTCCTGCTTTGACTAGAAGGCAACTCAAGCGGTGA
mRNA sequenceShow/hide mRNA sequence
CTTAGTTTAGGGACCCAAGGGAGAAAATAAAAAGAATTTGGTACCGAAACGGAAATAGCATTTAAGAGAGACATAATGTTTTAACACGAAATAAACTTAGAGAAAGATTA
CGGGTTGTCGACGAAGAGAGAACAGACCTCTTCCAGTTCGCGATGAGATCCTCCGGAAACAGAGCAAGCCCTGCATTATTAAGCTCCAATCCCTCCACTTTTCTAATTTC
CCAACCCTAATTCCCACATTCAAATCGATTCTTCTTCTTTTGATTTCTGTTGTCCCCCTTTTTTATCTGTTTTTTCTTTCTTGGGCCTCGTAATTGCAGGTTAAATCTCG
ATGGGAAAGTTCTCTTCTCGCAACAAGGAGCGTTCCAAGACTTCCTCATCCCAGAGAAGTAGGAGAAAGAGTAGGAGTAGTAGGAAATTGAAGTCCAAGAAGCTGCGGTA
TCGCCACGATTCTCCCTCTTGCTCTGACACTGATTTTGAAAGTTCAACTTCAGTGTCTTCTTCTAGCTCGGAGGATGATGAAAAAGTGGGAAGATCTCGATCCAAGACGC
GAAAGAATGCTAAGAATGCTAAGTCTAGTAAAAAGAGAGCTAAGAAGCAATCTAACGACAGTCAAAGTAGAGATTACTCCCCTCATCCCAGAAAGAGGAAGCATTCAAAG
AGAGACAACCGTTGTGAGGTGAAGAAGACCAATAAAAAGAAGCCTAGAAGAGATGCGAGTGTTAGTGCCACACGTAGCGACTCTTTGAGCTGCTCAACTTGTGGAGATGG
GAGTACAACCAGCAATGAGAGTGAAATTGATAGGCGTAGGGGAAGGTTTAGAAAGAGGAAAGGAAATATGGTGAAGACTGAAAGAAGTAGACACAGGTCAAAGAGTCCTT
CACCATGCTCTTTATGTAGTGAAGGTAGTGATCATCAGAATGAGGTTGAGGATGACTGTTATGTTGAAAACAACTTTAGACGACTGAGATCCGTAATAGTTGTAGTAGGA
GAGGAAAATAAATTAGAGACATTTGAGAGGAATGAACAACAAGAAGAGGTCACACATCAGCCTGATGATGACCACCCTTCTTTTGGAGATATGGACAGTAAGGATGGGAT
GAATAAAAGAGAATTAGATTATGTTATATCGAAAGAGGCACCAGAGGTAGAAAGCAAAAAAGAAGTGGTTATACCTGACAATAGAAACTCTATGGTTGTAAAAGATGATG
GAGTTCAAAATGAGGGAAGCAACAATAACCATGGAGGAGCATCTCATGACCATTCTTTAAATGAAAGAAAGAGTGGCTGTTCTGGAAATTCTGACAGCATAAATTGTATC
GATTTAGAGTCAATTTTAAGACAGAGGGCTTTGGAAAACCTAAGAAAGTTCAAACGGGTGCCCCCAAGGAATGTGGAAACTCCGGCTAATTGCAAAGGTGACAATAATAA
TGATGTGAAGCAATTTTACTCTCCTGTCTCTAAGTCAGTTCGCATGACTTCCCCTAGGGATGAGGCCGAGATGAATGGTAATGAGTTCTCTAGACAAAGTGGAAGGAATG
CAATAAATTCAATGATAGTTGAGGAGAATGGTGTTAAATCTACTGATGAAATAGATTCAGCAGTTGCATCTACGCATGATCCTGTCTATTCTTCACAGAACCTGGGTAAG
ATTTCCAATGGAAGCAATGGTATCAATGAATTAAAGCAGGATATCTCTTCATTAGACCAGGAGGGTGTAAATGATAATATTTTCCAGAAGGCAGATGCAGATATTTGTTC
TACAACTAGCAGAAGCAATTTGGTTATTGCAGCTTTGAGGCTCGAGTCAAAAGTGGATTCTCTTACAAAGCAAGCATCTGCTTCTCAGGAATCTATCCAAATAAAGCCAT
CTATATCTGACATTGGTGTTGATGAAACTGCTCAAACTCAGACCCAGATGAGGAATAATGATGATCAAAAGATTGGTAATGGTTTTGGTTCTTCAGCTCACAAGCCTTCT
TCTTCCCTTAATTCTATTTCAGGAGAAAATAGCTTGGATAAGTCCAGACACGAGAGTGGCGAAGGCTCGCAGTTTGAACAGAAAACCATGTCTGTGAAGCGGGGTGGTGA
AATGGTGCAGGTGAACTACAAGGTCTACATCCCAAAGAGAGCTCCTGCTTTGACTAGAAGGCAACTCAAGCGGTGACATAAGATCCGTGATTTTGCAGCCTTGTAGGATG
ATCCTGTTGCTTGCCATTCCAAAGATTGTAAAGTTTTAGTAGTAAAATTTGTCATCACTTTCTATGCCAAGGATCTATAAGCTTAAAAACATGAACAATAATTAGTGCAG
CCAATTGTTAATATTTGTGTTAACTTTTCCATCATAATGATCAAAATGACCATTTGATCAAAGGAGAGCCTCATGTTTACAAAGGCTTCTTATTAAATTTTCCTTGTGAT
GCCAAAACCTGGTATATGTTAGTTTATGATCATAATTATTTTCACATTCTATTCTATAATGTTATTGTGAATTAACAAAACCCTCTAGGTCGTAATTCTTAAGAATTGGT
GCAAAGTATCGAGTTTGGTTC
Protein sequenceShow/hide protein sequence
MGKFSSRNKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSSSEDDEKVGRSRSKTRKNAKNAKSSKKRAKKQSNDSQSRDYSPHPRKRKHSK
RDNRCEVKKTNKKKPRRDASVSATRSDSLSCSTCGDGSTTSNESEIDRRRGRFRKRKGNMVKTERSRHRSKSPSPCSLCSEGSDHQNEVEDDCYVENNFRRLRSVIVVVG
EENKLETFERNEQQEEVTHQPDDDHPSFGDMDSKDGMNKRELDYVISKEAPEVESKKEVVIPDNRNSMVVKDDGVQNEGSNNNHGGASHDHSLNERKSGCSGNSDSINCI
DLESILRQRALENLRKFKRVPPRNVETPANCKGDNNNDVKQFYSPVSKSVRMTSPRDEAEMNGNEFSRQSGRNAINSMIVEENGVKSTDEIDSAVASTHDPVYSSQNLGK
ISNGSNGINELKQDISSLDQEGVNDNIFQKADADICSTTSRSNLVIAALRLESKVDSLTKQASASQESIQIKPSISDIGVDETAQTQTQMRNNDDQKIGNGFGSSAHKPS
SSLNSISGENSLDKSRHESGEGSQFEQKTMSVKRGGEMVQVNYKVYIPKRAPALTRRQLKR