; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g1601 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g1601
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description18S pre-ribosomal assembly protein gar2-related, putative isoform 2
Genome locationMC09:21664890..21669822
RNA-Seq ExpressionMC09g1601
SyntenyMC09g1601
Gene Ontology termsGO:0009786 - regulation of asymmetric cell division (biological process)
InterPro domainsIPR040378 - Protein BREAKING OF ASYMMETRY IN THE STOMATAL LINEAGE


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7032146.1 hypothetical protein SDJN02_06189, partial [Cucurbita argyrosperma subsp. argyrosperma]1.74e-19666.53Show/hide
Query:  MKVEGEHVVCHSNISVKFVPKS--SDNDAHDSGGMMLEDQKELTSPPKCNDQDADQFVPKHDCLGLDDFNHYNEVEASVSPFTNSSKVDLFEEDSELYME
        MKVEGE VV HS+ S KF+PKS   DNDA DSGGM LED K+ T   K N ++A+    K+  +GLDD N ++EV+A V   TNSSKVDLFEEDSELYME
Subjt:  MKVEGEHVVCHSNISVKFVPKS--SDNDAHDSGGMMLEDQKELTSPPKCNDQDADQFVPKHDCLGLDDFNHYNEVEASVSPFTNSSKVDLFEEDSELYME

Query:  KSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGN-SLDEKAVCAIAPSEKDWKDELEGELEKRKMFSS---EHAESFSNKDSPKQCDLKDLG
        KSIVECQLPELIVCYKEN CNIVKDICID+GVPSRD LLCG+ SLDEKAVC I P E+DWKDEL   LE+  MF+S   EH+ESF  KDSPKQ D +DL 
Subjt:  KSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGN-SLDEKAVCAIAPSEKDWKDELEGELEKRKMFSS---EHAESFSNKDSPKQCDLKDLG

Query:  RIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIA-------ASTEPKNSSSVNEISYNS
        R PEAEYDV YFTDNDI NL M DL  ES+KPL N+K+E +PQSEQVFIE+ SLEVPV     E+SYS T E I+       A+ EPKNS S  +ISYNS
Subjt:  RIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIA-------ASTEPKNSSSVNEISYNS

Query:  KVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDL--------PKAEVGISSSHSVST
        K+D GNITFDFNS AS ASDG+EH DNG  NSSAP+TSASVDC DTSS +   SADK Q  C+  SSNPK VEYEDL         KAEVG S S+SVS+
Subjt:  KVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDL--------PKAEVGISSSHSVST

Query:  QVQHGIGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADR---RKHRGWKHGLLCCRF
        QVQHG+GE S SSM  LGSL+SNSGRIGYSGSIS RSDSSTTST SFAFPI             +QSEWNSSPVRMAKAD+   RK RGW+ GLLCCRF
Subjt:  QVQHGIGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADR---RKHRGWKHGLLCCRF

XP_008446468.1 PREDICTED: uncharacterized protein LOC103489197 isoform X1 [Cucumis melo]2.64e-19664.94Show/hide
Query:  MKVEGEHVVCHSNISVKFVPKS--SDNDAHDSGGMMLEDQKELTSPPKCN------DQDADQFVPK-HDCLGLDDFNHYNEVEASVSPFTNSSKVDLFEE
        MKVEGE +VCHSN S KFVPKS   DND  +SGGM LEDQKE TS  K N      +  AD +V K  +CL LDDFN Y++V+A VSP  NS KVDL EE
Subjt:  MKVEGEHVVCHSNISVKFVPKS--SDNDAHDSGGMMLEDQKELTSPPKCN------DQDADQFVPK-HDCLGLDDFNHYNEVEASVSPFTNSSKVDLFEE

Query:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKDWKDELEGELEKRKMFSS---EHAESFSNKDSPKQC
        DSELYMEKSIVECQLPELIVCYKENICNIVKDICID+G P RD L CG+SLDE+ VC+I P  KDWKDE  GEL++R MF+S   EH+ESF +KDSP QC
Subjt:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKDWKDELEGELEKRKMFSS---EHAESFSNKDSPKQC

Query:  DLKDLGRIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYS----TTGEAIAASTEPKNSSSVNEIS
        D KDL   PEAEYDVAYFTDND+P   M DLV ESLKPL ++K + HPQSEQV IE+   EVPV     ++S+     TT E+I ++ +PKNS S N +S
Subjt:  DLKDLGRIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYS----TTGEAIAASTEPKNSSSVNEIS

Query:  YNSKVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDLP--------KAEVGISSSHS
        YNSKVD GNITFDFNS A  ASDG+E  DNG  NSSAP+TSASV C++T+S +P  SADKS+ QCH+TSSNPK VEYEDLP        K EVG   SH+
Subjt:  YNSKVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDLP--------KAEVGISSSHS

Query:  VSTQVQHGIGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRR---KHRGWKHGLLCC
        VS++VQ G+GETSFS + PLGSL+SNSGRIGYSGSIS RSDSSTTSTRSFAFPI             +Q+EWNSSPVRMAK DR+   KHRGW+HG+LCC
Subjt:  VSTQVQHGIGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRR---KHRGWKHGLLCC

Query:  RF
        RF
Subjt:  RF

XP_022149065.1 uncharacterized protein LOC111017570 [Momordica charantia]0.096.93Show/hide
Query:  MCGKNPLEQCMQIMKVEGEHVVCHSNISVKFVPKSSDNDAHDSGGMMLEDQKELTSPPKCNDQDADQFVPKHDCLGLDDFNHYNEVEASVSPFTNSSKVD
        MCGKNPLEQCMQIMKVEGEHVVCHSNISVKFVPKSSDNDAHDSGGMMLEDQKELTSPPKCNDQDADQFVPKHDCLGLDDFNHYNEVEASVSPFTNSSKVD
Subjt:  MCGKNPLEQCMQIMKVEGEHVVCHSNISVKFVPKSSDNDAHDSGGMMLEDQKELTSPPKCNDQDADQFVPKHDCLGLDDFNHYNEVEASVSPFTNSSKVD

Query:  LFEEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKDWKDELEGELEKRKMFSSEHAESFSNKDSPKQ
        LFEEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKDWKDELEGELEKRKMFSSEHAESFSNKDSPKQ
Subjt:  LFEEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKDWKDELEGELEKRKMFSSEHAESFSNKDSPKQ

Query:  CDLKDLGRIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIAASTEPKNSSSVNEISYNS
        CDLKDLGRIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIAASTEPKNSSSVNEISYNS
Subjt:  CDLKDLGRIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIAASTEPKNSSSVNEISYNS

Query:  KVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDLPKAEVGISSSHSVSTQVQHGIGE
        KVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDLPKAEVGIS S SVSTQVQHGIGE
Subjt:  KVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDLPKAEVGISSSHSVSTQVQHGIGE

Query:  TSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRRKHRGWKHGLLCCRF
        TSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPI             IQSEWNSSPVRMAKADRRKHRGWKHGLLCCRF
Subjt:  TSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRRKHRGWKHGLLCCRF

XP_022956433.1 uncharacterized protein LOC111458170 isoform X1 [Cucurbita moschata]1.22e-19666.53Show/hide
Query:  MKVEGEHVVCHSNISVKFVPKS--SDNDAHDSGGMMLEDQKELTSPPKCNDQDADQFVPKHDCLGLDDFNHYNEVEASVSPFTNSSKVDLFEEDSELYME
        MKVEGE VV HS+ S KF+PKS   DNDA DSGGM LED K+ T   K N +DA+    K+  +GLDD N ++EV+A V   TNSSKVDLFEEDSELYME
Subjt:  MKVEGEHVVCHSNISVKFVPKS--SDNDAHDSGGMMLEDQKELTSPPKCNDQDADQFVPKHDCLGLDDFNHYNEVEASVSPFTNSSKVDLFEEDSELYME

Query:  KSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNS-LDEKAVCAIAPSEKDWKDELEGELEKRKMFSS---EHAESFSNKDSPKQCDLKDLG
        KSIVECQLPELIVCYKEN CNIVKDICID+GVPSRD LLCG+S LDEKAVC I P E+DWKDEL   LE+  MF+S   EH+ESF  KDSPKQ D +DL 
Subjt:  KSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNS-LDEKAVCAIAPSEKDWKDELEGELEKRKMFSS---EHAESFSNKDSPKQCDLKDLG

Query:  RIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIA-------ASTEPKNSSSVNEISYNS
        R PEAEYDV YFTDNDI NL M DL  ES+KPL N+K+E +PQSEQVFIE+ SLEVPV     E+SYS T E I+       A+ EPKNS S  +ISYNS
Subjt:  RIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIA-------ASTEPKNSSSVNEISYNS

Query:  KVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDL--------PKAEVGISSSHSVST
        K+D GNITFDFNS AS ASDG+EH DNG  NSSAP+TSASVDC D+SS +   SADK Q  C+  SSNPK VEYEDL         KAEVG S S+SVS+
Subjt:  KVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDL--------PKAEVGISSSHSVST

Query:  QVQHGIGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADR---RKHRGWKHGLLCCRF
        QVQHG+GE S SSM  LGSL+SNSGRIGYSGSIS RSDSSTTST SFAFPI             +QSEWNSSPVRMAKAD+   RK RGW+ GLLCCRF
Subjt:  QVQHGIGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADR---RKHRGWKHGLLCCRF

XP_038892052.1 uncharacterized protein LOC120081347 isoform X1 [Benincasa hispida]3.15e-20566.6Show/hide
Query:  MKVEGEHVVCHSNISVKFVPKS--SDNDAHDSGGMMLEDQKELTSPPKCNDQD------ADQFVP-KHDCLGLDDFNHYNEVEASVSPFTNSSKVDLFEE
        MKVEGE +V HSN S +FVPKS   DNDA +SGGM LEDQKE TS  K N+        AD +V  K +CL LDDFN Y+EV+A VSP TNSSKVDLFEE
Subjt:  MKVEGEHVVCHSNISVKFVPKS--SDNDAHDSGGMMLEDQKELTSPPKCNDQD------ADQFVP-KHDCLGLDDFNHYNEVEASVSPFTNSSKVDLFEE

Query:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKDWKDELEGELEKRKMFSS---EHAESFSNKDSPKQC
        DSELYMEKS VECQLPELIVCYKENICNIVKDICID+GVPSRD LLCG+SLDEK VC+I P    WKD+L  ELEKR +++S   EH+ESF NKDSPKQ 
Subjt:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKDWKDELEGELEKRKMFSS---EHAESFSNKDSPKQC

Query:  DLKDLGRIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIAAST-------EPKNSSSVN
        D  DL R PEAEYDVAYFTDND+P   M D V ESLKPL N+K E HP+SEQVFIE+ SLEVPV     E+S+S + E I+ ST       E KNS S N
Subjt:  DLKDLGRIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIAAST-------EPKNSSSVN

Query:  EISYNSKVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDLP--------KAEVGISS
        ++SYNSKVD GNITFDFNS AS ASDG+EH DN   N+SAP+TSASV CQ+TSS +P  SADK Q QCH TS+NPKCVEYEDLP        K EVG   
Subjt:  EISYNSKVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDLP--------KAEVGISS

Query:  SHSVSTQVQHG----------IGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRR--
        SH+VS+QVQHG          +GETSFSSM PLGSL+SNSG IGYSGSISLRSDSSTTSTRSFAFPI             +QSEWNSSPVRMAKADRR  
Subjt:  SHSVSTQVQHG----------IGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRR--

Query:  -KHRGWKHGLLCCRF
         KHRGW+ G+LCCRF
Subjt:  -KHRGWKHGLLCCRF

TrEMBL top hitse value%identityAlignment
A0A1S3BFZ0 uncharacterized protein LOC103489197 isoform X11.28e-19664.94Show/hide
Query:  MKVEGEHVVCHSNISVKFVPKS--SDNDAHDSGGMMLEDQKELTSPPKCN------DQDADQFVPK-HDCLGLDDFNHYNEVEASVSPFTNSSKVDLFEE
        MKVEGE +VCHSN S KFVPKS   DND  +SGGM LEDQKE TS  K N      +  AD +V K  +CL LDDFN Y++V+A VSP  NS KVDL EE
Subjt:  MKVEGEHVVCHSNISVKFVPKS--SDNDAHDSGGMMLEDQKELTSPPKCN------DQDADQFVPK-HDCLGLDDFNHYNEVEASVSPFTNSSKVDLFEE

Query:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKDWKDELEGELEKRKMFSS---EHAESFSNKDSPKQC
        DSELYMEKSIVECQLPELIVCYKENICNIVKDICID+G P RD L CG+SLDE+ VC+I P  KDWKDE  GEL++R MF+S   EH+ESF +KDSP QC
Subjt:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKDWKDELEGELEKRKMFSS---EHAESFSNKDSPKQC

Query:  DLKDLGRIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYS----TTGEAIAASTEPKNSSSVNEIS
        D KDL   PEAEYDVAYFTDND+P   M DLV ESLKPL ++K + HPQSEQV IE+   EVPV     ++S+     TT E+I ++ +PKNS S N +S
Subjt:  DLKDLGRIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYS----TTGEAIAASTEPKNSSSVNEIS

Query:  YNSKVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDLP--------KAEVGISSSHS
        YNSKVD GNITFDFNS A  ASDG+E  DNG  NSSAP+TSASV C++T+S +P  SADKS+ QCH+TSSNPK VEYEDLP        K EVG   SH+
Subjt:  YNSKVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDLP--------KAEVGISSSHS

Query:  VSTQVQHGIGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRR---KHRGWKHGLLCC
        VS++VQ G+GETSFS + PLGSL+SNSGRIGYSGSIS RSDSSTTSTRSFAFPI             +Q+EWNSSPVRMAK DR+   KHRGW+HG+LCC
Subjt:  VSTQVQHGIGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRR---KHRGWKHGLLCC

Query:  RF
        RF
Subjt:  RF

A0A6J1D4Q3 uncharacterized protein LOC1110175700.096.93Show/hide
Query:  MCGKNPLEQCMQIMKVEGEHVVCHSNISVKFVPKSSDNDAHDSGGMMLEDQKELTSPPKCNDQDADQFVPKHDCLGLDDFNHYNEVEASVSPFTNSSKVD
        MCGKNPLEQCMQIMKVEGEHVVCHSNISVKFVPKSSDNDAHDSGGMMLEDQKELTSPPKCNDQDADQFVPKHDCLGLDDFNHYNEVEASVSPFTNSSKVD
Subjt:  MCGKNPLEQCMQIMKVEGEHVVCHSNISVKFVPKSSDNDAHDSGGMMLEDQKELTSPPKCNDQDADQFVPKHDCLGLDDFNHYNEVEASVSPFTNSSKVD

Query:  LFEEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKDWKDELEGELEKRKMFSSEHAESFSNKDSPKQ
        LFEEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKDWKDELEGELEKRKMFSSEHAESFSNKDSPKQ
Subjt:  LFEEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKDWKDELEGELEKRKMFSSEHAESFSNKDSPKQ

Query:  CDLKDLGRIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIAASTEPKNSSSVNEISYNS
        CDLKDLGRIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIAASTEPKNSSSVNEISYNS
Subjt:  CDLKDLGRIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIAASTEPKNSSSVNEISYNS

Query:  KVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDLPKAEVGISSSHSVSTQVQHGIGE
        KVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDLPKAEVGIS S SVSTQVQHGIGE
Subjt:  KVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDLPKAEVGISSSHSVSTQVQHGIGE

Query:  TSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRRKHRGWKHGLLCCRF
        TSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPI             IQSEWNSSPVRMAKADRRKHRGWKHGLLCCRF
Subjt:  TSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRRKHRGWKHGLLCCRF

A0A6J1GWB6 uncharacterized protein LOC111458170 isoform X21.98e-19165.73Show/hide
Query:  MKVEGEHVVCHSNISVKFVPKS--SDNDAHDSGGMMLEDQKELTSPPKCNDQDADQFVPKHDCLGLDDFNHYNEVEASVSPFTNSSKVDLFEEDSELYME
        MKVEGE VV HS+ S KF+PKS   DNDA DSGGM LED K+ T   K N +DA+    K+  +GLDD N ++EV+A V   TNSSKVDLFEEDSELYME
Subjt:  MKVEGEHVVCHSNISVKFVPKS--SDNDAHDSGGMMLEDQKELTSPPKCNDQDADQFVPKHDCLGLDDFNHYNEVEASVSPFTNSSKVDLFEEDSELYME

Query:  KSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNS-LDEKAVCAIAPSEKDWKDELEGELEKRKMFSS---EHAESFSNKDSPKQCDLKDLG
        KSIVECQLPELIVCYKEN CNIVKDICID+GVPSRD LLCG+S LDEKAVC I P E+DWKDEL   LE+  MF+S   EH+ESF  KDSPKQ D +DL 
Subjt:  KSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNS-LDEKAVCAIAPSEKDWKDELEGELEKRKMFSS---EHAESFSNKDSPKQCDLKDLG

Query:  RIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIA-------ASTEPKNSSSVNEISYNS
        R PEAEYDV YFTDNDI NL M DL  ES+KPL N+K+E +PQSEQV      LEVPV     E+SYS T E I+       A+ EPKNS S  +ISYNS
Subjt:  RIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIA-------ASTEPKNSSSVNEISYNS

Query:  KVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDL--------PKAEVGISSSHSVST
        K+D GNITFDFNS AS ASDG+EH DNG  NSSAP+TSASVDC D+SS +   SADK Q  C+  SSNPK VEYEDL         KAEVG S S+SVS+
Subjt:  KVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDL--------PKAEVGISSSHSVST

Query:  QVQHGIGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADR---RKHRGWKHGLLCCRF
        QVQHG+GE S SSM  LGSL+SNSGRIGYSGSIS RSDSSTTST SFAFPI             +QSEWNSSPVRMAKAD+   RK RGW+ GLLCCRF
Subjt:  QVQHGIGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADR---RKHRGWKHGLLCCRF

A0A6J1GWT7 uncharacterized protein LOC111458170 isoform X15.93e-19766.53Show/hide
Query:  MKVEGEHVVCHSNISVKFVPKS--SDNDAHDSGGMMLEDQKELTSPPKCNDQDADQFVPKHDCLGLDDFNHYNEVEASVSPFTNSSKVDLFEEDSELYME
        MKVEGE VV HS+ S KF+PKS   DNDA DSGGM LED K+ T   K N +DA+    K+  +GLDD N ++EV+A V   TNSSKVDLFEEDSELYME
Subjt:  MKVEGEHVVCHSNISVKFVPKS--SDNDAHDSGGMMLEDQKELTSPPKCNDQDADQFVPKHDCLGLDDFNHYNEVEASVSPFTNSSKVDLFEEDSELYME

Query:  KSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNS-LDEKAVCAIAPSEKDWKDELEGELEKRKMFSS---EHAESFSNKDSPKQCDLKDLG
        KSIVECQLPELIVCYKEN CNIVKDICID+GVPSRD LLCG+S LDEKAVC I P E+DWKDEL   LE+  MF+S   EH+ESF  KDSPKQ D +DL 
Subjt:  KSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNS-LDEKAVCAIAPSEKDWKDELEGELEKRKMFSS---EHAESFSNKDSPKQCDLKDLG

Query:  RIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIA-------ASTEPKNSSSVNEISYNS
        R PEAEYDV YFTDNDI NL M DL  ES+KPL N+K+E +PQSEQVFIE+ SLEVPV     E+SYS T E I+       A+ EPKNS S  +ISYNS
Subjt:  RIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIA-------ASTEPKNSSSVNEISYNS

Query:  KVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDL--------PKAEVGISSSHSVST
        K+D GNITFDFNS AS ASDG+EH DNG  NSSAP+TSASVDC D+SS +   SADK Q  C+  SSNPK VEYEDL         KAEVG S S+SVS+
Subjt:  KVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDL--------PKAEVGISSSHSVST

Query:  QVQHGIGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADR---RKHRGWKHGLLCCRF
        QVQHG+GE S SSM  LGSL+SNSGRIGYSGSIS RSDSSTTST SFAFPI             +QSEWNSSPVRMAKAD+   RK RGW+ GLLCCRF
Subjt:  QVQHGIGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADR---RKHRGWKHGLLCCRF

A0A6J1IHW9 uncharacterized protein LOC111477604 isoform X11.31e-18864.8Show/hide
Query:  MKVEGEHVVCHSNISVKFVPKS--SDNDAHDSGGMMLEDQKELTSPPKCNDQDADQFVPKHDCLGLDDFNHYNEVEASVSPFTNSSKVDLFEEDSELYME
        MKVEGE VV HS+ S KF+PKS   DNDA DSGGM LED K+ T   K N +DA+Q   K+  +GLDD N ++EV+A V   TNSSKVDLFEEDSELYME
Subjt:  MKVEGEHVVCHSNISVKFVPKS--SDNDAHDSGGMMLEDQKELTSPPKCNDQDADQFVPKHDCLGLDDFNHYNEVEASVSPFTNSSKVDLFEEDSELYME

Query:  KSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNS-LDEKAVCAIAPSEKDWKDELEGELEKRKMFSS---EHAESFSNKDSPKQCDLKDLG
        KSIVECQLPELIVCYKEN CNIVKDICID+GVPSRD LLCG+S LDEKAVC I P E+DWKDEL   LE+  MF+S   EH+ESF  KDSPKQ D ++L 
Subjt:  KSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNS-LDEKAVCAIAPSEKDWKDELEGELEKRKMFSS---EHAESFSNKDSPKQCDLKDLG

Query:  RIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIA--------ASTEPKNSSSVNEISYN
        R P+AEYDV Y TDND+ NL + DL  ES+KPL N+K+E +PQSEQV                E+SYS T EAI+        A+ EPKNS S  +ISYN
Subjt:  RIPEAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIA--------ASTEPKNSSSVNEISYN

Query:  SKVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDL--------PKAEVGISSSHSVS
        SK+D GNITFDFNS AS ASDG+EH DNG  NSSAP+TSASVDC DTSS +   SADK QV C+  SSNPK VEYEDL         KAEVG S SHSVS
Subjt:  SKVDNGNITFDFNSSASIASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDL--------PKAEVGISSSHSVS

Query:  TQVQHGIGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADR---RKHRGWKHGLLCCRF
        +QVQHG+GE S SSM  LGSL+SNSGRIGYSGSIS RSDSSTTSTRSFAFPI             +QSEWNSSPVRMAKAD+   RK RGW+ GLLCCRF
Subjt:  TQVQHGIGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADR---RKHRGWKHGLLCCRF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G03810.1 18S pre-ribosomal assembly protein gar2-related2.1e-3030.06Show/hide
Query:  SDNDAHDSGGMMLEDQKELTSPPKCND---QDADQFVPKHD------CLGLDDFNHYNEVEASVSPFTNSSKVDL-------FEEDSELYMEKSIVECQL
        +DND   + G  +E  K+ + P +C D   +DA+  VP++       C    D     E E         +  D         ++D   YM+K++  C L
Subjt:  SDNDAHDSGGMMLEDQKELTSPPKCND---QDADQFVPKHD------CLGLDDFNHYNEVEASVSPFTNSSKVDL-------FEEDSELYMEKSIVECQL

Query:  PELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKD-WKDELEGELEKRKMFSSEHAESFSNKDSPKQCDLKDLGRIPEAEYDVAY
        PE++VCYKEN  +IVKDIC+DEGVP ++  L G              EKD  K     +L K    +   +E+ S +DS        + ++ ++E+   +
Subjt:  PELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKD-WKDELEGELEKRKMFSSEHAESFSNKDSPKQCDLKDLGRIPEAEYDVAY

Query:  FTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVE-DSYSTTGEAIAASTEPKNSSSVNEISYNSKVDNGNITFDFNSSASI
         TD D+   S +D   ++     N+  E    +E+V    AS    +S S +E D  S    AI+   + K   ++ +I   S+ D        N S+  
Subjt:  FTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVE-DSYSTTGEAIAASTEPKNSSSVNEISYNSKVDNGNITFDFNSSASI

Query:  ASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSES-----ADKSQVQCHHTSSNPKCVEYEDLPKAEVGISSSHSVSTQVQHGIGETSFSSMG--PLG
          +            S  TT+   + + T  P   E      +  +  + + T + P+  E E+  +    + +S+          GETSFS+     + 
Subjt:  ASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSES-----ADKSQVQCHHTSSNPKCVEYEDLPKAEVGISSSHSVSTQVQHGIGETSFSSMG--PLG

Query:  SLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRRKHR-GWKHGLLCCRF
          ++ SG I YSGS+S+RSD+STTS RSFAFPI             +QSEWNSSPVRMAKAD+R+ + GW+H LLCCRF
Subjt:  SLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRRKHR-GWKHGLLCCRF

AT2G03810.2 18S pre-ribosomal assembly protein gar2-related2.1e-3030.06Show/hide
Query:  SDNDAHDSGGMMLEDQKELTSPPKCND---QDADQFVPKHD------CLGLDDFNHYNEVEASVSPFTNSSKVDL-------FEEDSELYMEKSIVECQL
        +DND   + G  +E  K+ + P +C D   +DA+  VP++       C    D     E E         +  D         ++D   YM+K++  C L
Subjt:  SDNDAHDSGGMMLEDQKELTSPPKCND---QDADQFVPKHD------CLGLDDFNHYNEVEASVSPFTNSSKVDL-------FEEDSELYMEKSIVECQL

Query:  PELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKD-WKDELEGELEKRKMFSSEHAESFSNKDSPKQCDLKDLGRIPEAEYDVAY
        PE++VCYKEN  +IVKDIC+DEGVP ++  L G              EKD  K     +L K    +   +E+ S +DS        + ++ ++E+   +
Subjt:  PELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKD-WKDELEGELEKRKMFSSEHAESFSNKDSPKQCDLKDLGRIPEAEYDVAY

Query:  FTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVE-DSYSTTGEAIAASTEPKNSSSVNEISYNSKVDNGNITFDFNSSASI
         TD D+   S +D   ++     N+  E    +E+V    AS    +S S +E D  S    AI+   + K   ++ +I   S+ D        N S+  
Subjt:  FTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVE-DSYSTTGEAIAASTEPKNSSSVNEISYNSKVDNGNITFDFNSSASI

Query:  ASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSES-----ADKSQVQCHHTSSNPKCVEYEDLPKAEVGISSSHSVSTQVQHGIGETSFSSMG--PLG
          +            S  TT+   + + T  P   E      +  +  + + T + P+  E E+  +    + +S+          GETSFS+     + 
Subjt:  ASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSES-----ADKSQVQCHHTSSNPKCVEYEDLPKAEVGISSSHSVSTQVQHGIGETSFSSMG--PLG

Query:  SLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRRKHR-GWKHGLLCCRF
          ++ SG I YSGS+S+RSD+STTS RSFAFPI             +QSEWNSSPVRMAKAD+R+ + GW+H LLCCRF
Subjt:  SLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRRKHR-GWKHGLLCCRF

AT2G03810.3 18S pre-ribosomal assembly protein gar2-related2.1e-3030.06Show/hide
Query:  SDNDAHDSGGMMLEDQKELTSPPKCND---QDADQFVPKHD------CLGLDDFNHYNEVEASVSPFTNSSKVDL-------FEEDSELYMEKSIVECQL
        +DND   + G  +E  K+ + P +C D   +DA+  VP++       C    D     E E         +  D         ++D   YM+K++  C L
Subjt:  SDNDAHDSGGMMLEDQKELTSPPKCND---QDADQFVPKHD------CLGLDDFNHYNEVEASVSPFTNSSKVDL-------FEEDSELYMEKSIVECQL

Query:  PELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKD-WKDELEGELEKRKMFSSEHAESFSNKDSPKQCDLKDLGRIPEAEYDVAY
        PE++VCYKEN  +IVKDIC+DEGVP ++  L G              EKD  K     +L K    +   +E+ S +DS        + ++ ++E+   +
Subjt:  PELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKD-WKDELEGELEKRKMFSSEHAESFSNKDSPKQCDLKDLGRIPEAEYDVAY

Query:  FTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVE-DSYSTTGEAIAASTEPKNSSSVNEISYNSKVDNGNITFDFNSSASI
         TD D+   S +D   ++     N+  E    +E+V    AS    +S S +E D  S    AI+   + K   ++ +I   S+ D        N S+  
Subjt:  FTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVE-DSYSTTGEAIAASTEPKNSSSVNEISYNSKVDNGNITFDFNSSASI

Query:  ASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSES-----ADKSQVQCHHTSSNPKCVEYEDLPKAEVGISSSHSVSTQVQHGIGETSFSSMG--PLG
          +            S  TT+   + + T  P   E      +  +  + + T + P+  E E+  +    + +S+          GETSFS+     + 
Subjt:  ASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSES-----ADKSQVQCHHTSSNPKCVEYEDLPKAEVGISSSHSVSTQVQHGIGETSFSSMG--PLG

Query:  SLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRRKHR-GWKHGLLCCRF
          ++ SG I YSGS+S+RSD+STTS RSFAFPI             +QSEWNSSPVRMAKAD+R+ + GW+H LLCCRF
Subjt:  SLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRRKHR-GWKHGLLCCRF

AT2G03810.4 18S pre-ribosomal assembly protein gar2-related2.1e-3030.06Show/hide
Query:  SDNDAHDSGGMMLEDQKELTSPPKCND---QDADQFVPKHD------CLGLDDFNHYNEVEASVSPFTNSSKVDL-------FEEDSELYMEKSIVECQL
        +DND   + G  +E  K+ + P +C D   +DA+  VP++       C    D     E E         +  D         ++D   YM+K++  C L
Subjt:  SDNDAHDSGGMMLEDQKELTSPPKCND---QDADQFVPKHD------CLGLDDFNHYNEVEASVSPFTNSSKVDL-------FEEDSELYMEKSIVECQL

Query:  PELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKD-WKDELEGELEKRKMFSSEHAESFSNKDSPKQCDLKDLGRIPEAEYDVAY
        PE++VCYKEN  +IVKDIC+DEGVP ++  L G              EKD  K     +L K    +   +E+ S +DS        + ++ ++E+   +
Subjt:  PELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKD-WKDELEGELEKRKMFSSEHAESFSNKDSPKQCDLKDLGRIPEAEYDVAY

Query:  FTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVE-DSYSTTGEAIAASTEPKNSSSVNEISYNSKVDNGNITFDFNSSASI
         TD D+   S +D   ++     N+  E    +E+V    AS    +S S +E D  S    AI+   + K   ++ +I   S+ D        N S+  
Subjt:  FTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVE-DSYSTTGEAIAASTEPKNSSSVNEISYNSKVDNGNITFDFNSSASI

Query:  ASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSES-----ADKSQVQCHHTSSNPKCVEYEDLPKAEVGISSSHSVSTQVQHGIGETSFSSMG--PLG
          +            S  TT+   + + T  P   E      +  +  + + T + P+  E E+  +    + +S+          GETSFS+     + 
Subjt:  ASDGMEHHDNGYSNSSAPTTSASVDCQDTSSPDPSES-----ADKSQVQCHHTSSNPKCVEYEDLPKAEVGISSSHSVSTQVQHGIGETSFSSMG--PLG

Query:  SLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRRKHR-GWKHGLLCCRF
          ++ SG I YSGS+S+RSD+STTS RSFAFPI             +QSEWNSSPVRMAKAD+R+ + GW+H LLCCRF
Subjt:  SLVSNSGRIGYSGSISLRSDSSTTSTRSFAFPISELMKPKSLFSCRIQSEWNSSPVRMAKADRRKHR-GWKHGLLCCRF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTGGAAAAAATCCATTGGAACAGTGCATGCAAATCATGAAAGTTGAGGGTGAACATGTAGTTTGCCATTCAAATATTAGCGTCAAGTTTGTTCCTAAGTCAAGTGA
TAATGATGCTCATGATTCTGGTGGGATGATGCTTGAAGATCAGAAAGAACTTACAAGCCCTCCGAAATGTAACGATCAGGATGCCGATCAGTTTGTTCCTAAGCATGACT
GTTTAGGTCTTGATGATTTTAATCACTACAATGAGGTTGAAGCCTCTGTGTCACCGTTCACTAATTCTTCTAAAGTAGACTTGTTTGAGGAAGACTCAGAGCTATACATG
GAAAAGAGTATTGTTGAATGTCAACTTCCTGAACTGATAGTTTGTTACAAAGAAAATATTTGCAATATTGTTAAGGATATTTGTATCGATGAGGGCGTACCCTCTCGGGA
TATGCTCTTGTGTGGTAATAGTTTGGATGAGAAGGCGGTGTGTGCCATTGCCCCTTCTGAGAAAGATTGGAAAGATGAGTTGGAAGGAGAACTGGAGAAAAGAAAGATGT
TTTCTTCAGAGCATGCAGAATCTTTTTCCAACAAGGATTCACCCAAACAATGTGATTTGAAGGATTTGGGAAGAATACCTGAGGCAGAATATGATGTGGCGTATTTTACT
GACAATGATATACCAAATCTTTCAATGAAAGACTTGGTTGTAGAGAGCTTAAAGCCATTGATCAACCATAAGGATGAGTCTCACCCTCAGTCTGAACAGGTTTTTATTGA
ATCTGCAAGTTTGGAAGTCCCAGTTTCCGTATCTGCAGTTGAAGATTCTTATAGTACCACCGGGGAAGCAATAGCAGCATCCACAGAACCAAAAAATAGCTCTTCTGTGA
ACGAGATATCATACAATAGTAAAGTGGATAATGGAAACATTACTTTTGATTTTAATTCTTCAGCATCTATAGCTAGTGATGGAATGGAGCATCATGATAATGGCTACTCA
AACTCTTCAGCTCCTACGACCAGCGCCTCAGTGGACTGCCAAGATACTAGCAGCCCTGACCCTTCAGAATCTGCCGATAAATCCCAAGTACAGTGCCATCATACTAGTAG
TAACCCCAAATGTGTGGAATATGAAGACTTACCCAAGGCAGAAGTTGGGATTTCTAGTAGTCACTCGGTTTCAACCCAAGTTCAACATGGCATCGGGGAAACCAGTTTCT
CTTCTATGGGACCTCTGGGAAGTCTGGTGTCTAATTCGGGCCGTATAGGTTACTCTGGCAGCATCTCTCTTCGATCCGACAGTAGCACAACCAGCACCCGTTCATTCGCC
TTTCCCATTTCAGAACTAATGAAACCAAAATCTTTATTTTCCTGCAGAATACAATCTGAGTGGAATAGTAGTCCTGTTAGAATGGCTAAAGCTGATCGCCGGAAGCATAG
GGGTTGGAAACATGGCCTTCTCTGCTGTAGATTC
mRNA sequenceShow/hide mRNA sequence
TCCTTTTCATTTTCTTCCACTACTTTTACCCCATGGAACATACGATACAAAAATGTGAAACTATCTACGTACCAATGGTTGCGCTATCAAATAACCTTTAAAATCATTCG
TCGAAATTTCAATAAGTCATAATAGATATGTAGCAAAAATTTGAGTAGATAGGGTGAATTATTGTTGGAAACTTCTAATAACTTTCAAATAAATCTAACAAAATTGTCCT
CAATTACAATTGTCATATATCTCCAACCAAAAAGTTTTCGATCCACATCTCATGACATAATTAAATGTTGTTCATATAATTGAGACACAGTTTCAGCCCATGCTCATATA
ATTGACAAGTGGAAAAAAAATTCCTTAGTTATAAAAAAAAAGCTTTAAAAATGTTGTTTTTAGTTTTTGAATTTTAGTAAAGCTATCAAAAGTAAAAATGCAAAATAAAG
AAATATATAGTAAATAAGCTTAATTTAAAAAATGGAAACCTAAAGAAAACCCAAAAAAATTCTAAACTTTGAAAATTTCAATTTGTTTAAGTATATCAAAAGATGGAAAG
TAAATAAAATAAAATCATCCAAATTACAAATATATACCGTCTTCCGTAATAAAACAAAATAGAACCTTTTCAAGGAAATGATCGCATAAAAATGGCAGAGATATTCAGAA
TCCCAATTCCCAAATCAGAATGATACGAGAGAGAAGGACCCACAAAAATATAGAGAAAGGGGACCGCCTCTGATCAGAAAACCAGGGCAGAGCGAGCTTTTGGGGGTAAA
CCAGCAAGAAATTCAGAAATCAGAGGCTATACAGAAGAAATTCAGAAATCAGAATCAGCAGCAATCTGAGGCAGCGTTTGAAATTGAATCGAACCTGGTAGGGATACAGT
ACTCTACTCGCCGCAAGGAAACAACAACTTCGTCCAGCTAATTTCCCCTGCTCTCGCTTTTCCCGGCAAAAAGATGAAACGATCAACAAAAACAAATTAAAATTCTCCGG
CGGTTGCTTTTGGGGGTGGAAACATGTCCGGTTTGACTTTGTAACAGAAGAAAAAGCCCTCTGGGGTGCACTTTCTGATATGATTGTCAGCCCCAGAAAAAGGTGTCTTC
CAACAGTTCTTTCCCTGTTAATTTAGGGCGAGGGGATGGTCAAATCAAACTCTCTCCAATTTCACGGGGACCCCAAGAGAGCATTCGGGATTGCTGCTCGCTCGATCGGC
TTATAAAAGCCACATTCCAGCAGCTTCTTCGGGATTGGAATTCGGAATCGTTCAGGCACACCTCTTTGTTCCAGTTTGCTTGGTGGGTTCTTTCTGAACAGTTGCTTTAA
CGCATTGTGGAATGCATTTTCCTCTGTTCCTATAGGCGCCAAGAGTCTCCACATGGTAGCCATCTCAAGGCAAATTTGATGCATGTGTGGAAAAAATCCATTGGAACAGT
GCATGCAAATCATGAAAGTTGAGGGTGAACATGTAGTTTGCCATTCAAATATTAGCGTCAAGTTTGTTCCTAAGTCAAGTGATAATGATGCTCATGATTCTGGTGGGATG
ATGCTTGAAGATCAGAAAGAACTTACAAGCCCTCCGAAATGTAACGATCAGGATGCCGATCAGTTTGTTCCTAAGCATGACTGTTTAGGTCTTGATGATTTTAATCACTA
CAATGAGGTTGAAGCCTCTGTGTCACCGTTCACTAATTCTTCTAAAGTAGACTTGTTTGAGGAAGACTCAGAGCTATACATGGAAAAGAGTATTGTTGAATGTCAACTTC
CTGAACTGATAGTTTGTTACAAAGAAAATATTTGCAATATTGTTAAGGATATTTGTATCGATGAGGGCGTACCCTCTCGGGATATGCTCTTGTGTGGTAATAGTTTGGAT
GAGAAGGCGGTGTGTGCCATTGCCCCTTCTGAGAAAGATTGGAAAGATGAGTTGGAAGGAGAACTGGAGAAAAGAAAGATGTTTTCTTCAGAGCATGCAGAATCTTTTTC
CAACAAGGATTCACCCAAACAATGTGATTTGAAGGATTTGGGAAGAATACCTGAGGCAGAATATGATGTGGCGTATTTTACTGACAATGATATACCAAATCTTTCAATGA
AAGACTTGGTTGTAGAGAGCTTAAAGCCATTGATCAACCATAAGGATGAGTCTCACCCTCAGTCTGAACAGGTTTTTATTGAATCTGCAAGTTTGGAAGTCCCAGTTTCC
GTATCTGCAGTTGAAGATTCTTATAGTACCACCGGGGAAGCAATAGCAGCATCCACAGAACCAAAAAATAGCTCTTCTGTGAACGAGATATCATACAATAGTAAAGTGGA
TAATGGAAACATTACTTTTGATTTTAATTCTTCAGCATCTATAGCTAGTGATGGAATGGAGCATCATGATAATGGCTACTCAAACTCTTCAGCTCCTACGACCAGCGCCT
CAGTGGACTGCCAAGATACTAGCAGCCCTGACCCTTCAGAATCTGCCGATAAATCCCAAGTACAGTGCCATCATACTAGTAGTAACCCCAAATGTGTGGAATATGAAGAC
TTACCCAAGGCAGAAGTTGGGATTTCTAGTAGTCACTCGGTTTCAACCCAAGTTCAACATGGCATCGGGGAAACCAGTTTCTCTTCTATGGGACCTCTGGGAAGTCTGGT
GTCTAATTCGGGCCGTATAGGTTACTCTGGCAGCATCTCTCTTCGATCCGACAGTAGCACAACCAGCACCCGTTCATTCGCCTTTCCCATTTCAGAACTAATGAAACCAA
AATCTTTATTTTCCTGCAGAATACAATCTGAGTGGAATAGTAGTCCTGTTAGAATGGCTAAAGCTGATCGCCGGAAGCATAGGGGTTGGAAACATGGCCTTCTCTGCTGT
AGATTC
Protein sequenceShow/hide protein sequence
MCGKNPLEQCMQIMKVEGEHVVCHSNISVKFVPKSSDNDAHDSGGMMLEDQKELTSPPKCNDQDADQFVPKHDCLGLDDFNHYNEVEASVSPFTNSSKVDLFEEDSELYM
EKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDMLLCGNSLDEKAVCAIAPSEKDWKDELEGELEKRKMFSSEHAESFSNKDSPKQCDLKDLGRIPEAEYDVAYFT
DNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLEVPVSVSAVEDSYSTTGEAIAASTEPKNSSSVNEISYNSKVDNGNITFDFNSSASIASDGMEHHDNGYS
NSSAPTTSASVDCQDTSSPDPSESADKSQVQCHHTSSNPKCVEYEDLPKAEVGISSSHSVSTQVQHGIGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDSSTTSTRSFA
FPISELMKPKSLFSCRIQSEWNSSPVRMAKADRRKHRGWKHGLLCCRF