; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS005986 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS005986
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionOBERON-like protein
Genome locationscaffold254:2979418..2981717
RNA-Seq ExpressionMS005986
SyntenyMS005986
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR032881 - Oberon, PHD finger domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582301.1 Protein OBERON 1, partial [Cucurbita argyrosperma subsp. sororia]1.9e-21979.26Show/hide
Query:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER
        MSGDPVETEVL DING   + NK++LILRPVSQDESGEGLPYAPENWPN GDNW+WRVG+RVAITGHF DRYLY PR I +  NS+RRG  FAS+LSVER
Subjt:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER

Query:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC
        +IQSEFPNADVDAFFASFSWKIPA KSS AQG R KQ+  PLPSKE  ECSASD QID   CKAGNK+C+SLS AENP+LLKSMSCDICCSEPRFC DCC
Subjt:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC

Query:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL
        CILCSKIIDTT  S SYIKC+A VGDGYICGHHAHI CGL+SY+AGTVGG IGLDAEYYCRRCDARTDLVSHV+ FLQ CQSTD RDDI +ILS+G+CIL
Subjt:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL

Query:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF
        RGS K RAKELLRH++L I+KLK+GTCLEE+WKMEE+ SA CTDA DNAD S +GSHD S SII+S+WT+ TPFDHW ESLKLE EIDQVLQALKRSQEF
Subjt:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF

Query:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE
        EYNLAEEKL  HKNYL NLFQQLDKEQ EL  Q+SSTGQN FL+NV NRVDQ+KREVK+LKRM  VA+GFGMTP  ILKE+F L+VE
Subjt:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE

KAG7018712.1 Protein VERNALIZATION INSENSITIVE 3 [Cucurbita argyrosperma subsp. argyrosperma]3.2e-21979.26Show/hide
Query:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER
        MSGDPVETEVL DING   + NK++LILRPVSQDESGEGLPYAPENWPN GDNW+WRVG+RVAITGHF DRYLY PR I +  NS+RRG  FAS+LSVER
Subjt:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER

Query:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC
        +IQSEFPNADVDAFFASFSWKIPA KSS AQG R KQ+  PLPSKE  ECSASD QID   CKAGNK+C+SLS AENP+LLKSMSCDICCSEPRFC DCC
Subjt:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC

Query:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL
        CILCSKIIDTT  S SYIKC+A VGDGYICGHHAHI CGL+SY+AGTVGG IGLDAEYYCRRCDARTDLVSHV+ FLQ CQSTD RDDI +ILS+G+CIL
Subjt:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL

Query:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF
        RGS K RAKELLRH++L I+KLK+GTCLEE+WKMEE+ SA CTDA DNAD S +GSHD S SII+S+WT+ TPFDHW ESLKLE EIDQVLQALKRSQEF
Subjt:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF

Query:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE
        EYNLAEEKL  HKNYL NLFQQLDKEQ EL  Q+SSTGQN FL+NV NRVDQ+KREVK+LKRM  VA+GFGMTP  ILKE+F L+VE
Subjt:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE

XP_022138271.1 OBERON-like protein [Momordica charantia]1.6e-282100Show/hide
Query:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER
        MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER
Subjt:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER

Query:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC
        FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC
Subjt:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC

Query:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL
        CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL
Subjt:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL

Query:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF
        RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF
Subjt:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF

Query:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE
        EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE
Subjt:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE

XP_022979490.1 OBERON-like protein isoform X1 [Cucurbita maxima]1.4e-21979.26Show/hide
Query:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER
        MSGDPVETEVL DING   + NK+DLILRPVSQDESGEGLPYAPENWPN GDNW+WRVG+RVAITGHF DRYLY PR I +  NS+RRG GFAS+LSVER
Subjt:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER

Query:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC
        +IQSEFP+ADVDAFFASFSWKIPA KSS AQG R KQ+  PLPSKE  ECSASDSQID   CKAGNK+C+SLS AE P+LLKSMSCDICCSEP+FC DCC
Subjt:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC

Query:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL
        CILCSK IDTT  S SYIKC+A VGDGYICGHHAHI CGL+SY+AGTVGG IGLDAEYYCRRCDARTDLVSHV+ FLQ CQSTD RDDI +ILS+G+CIL
Subjt:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL

Query:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF
        RGS K RAKELLRH +L I+KLKTGTCLEE+WKMEE+ SA CTDA DNAD S +GSHD S SII+S+WT+STPFDHW ESLKLE EIDQVLQALK+SQEF
Subjt:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF

Query:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE
        EYNLAEEKL  HKNYL NLFQQLDKEQ EL  Q+SSTGQN FL+NV NRVDQ+KREVK+LKRM  VA+GFGMTP  ILKE+F L+VE
Subjt:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE

XP_023527162.1 OBERON-like protein isoform X1 [Cucurbita pepo subsp. pepo]5.0e-22079.47Show/hide
Query:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER
        MSGDPVETEVL DING   + NK++LILRPVSQDESGEGLPYAPENWPN GDNW+WRVG+RVAITGHF DRYLY PR I +  NS+RRG  FAS+LSV R
Subjt:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER

Query:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC
        +IQSEFPNADVDAFFASFSWKIPA KSS AQG R KQ+  PLPSKE  ECSASDSQID   CKAGNK+C+SLS AENP+LLKSMSCDICCSEPRFC DCC
Subjt:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC

Query:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL
        CILCSKIIDTT  S SYIKC+A VGDGYICGHHAHI CGL+SY+AGTVGG IGLDAEYYCRRCDARTDLVSHV+ FLQ CQSTD  DDI +ILS+G+CIL
Subjt:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL

Query:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF
        RGS K RAKELLRH++L I+KLKTGTCLEE+WKMEE+ SA CTDA DNAD S +GSHD S SII+S+WT+STPFDHW ESLKLE+EIDQVLQALKRSQEF
Subjt:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF

Query:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE
        EYNLAEEKL  HKNYL NLFQQLDKEQ EL  Q+SSTGQN FL+NV NRVDQ+KREVK+LKRM  VA+GFGMTP  ILKE+F L+VE
Subjt:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE

TrEMBL top hitse value%identityAlignment
A0A1S3AWZ1 uncharacterized protein LOC103483705 isoform X23.8e-21878.23Show/hide
Query:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER
        M+GDPV+TEVLED NG +  VNK++LILRPV+QDESGEGLPYAPENWPNPGD W+WRVGKRVAITGHFLDRYLY PR I+  ENS R+G  FASKLSVER
Subjt:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER

Query:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC
        +IQSEFPNAD+DAFFASFSWKIPA KSS AQG R KQ+P PLPSK+  ECSAS+SQ D  GCKAGNK+CSSLS +ENP+  KSMSC ICCSEPRFC DCC
Subjt:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC

Query:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL
        CILC KIIDTT  SYSYIKC+  VGDGYICGHHAHI CGL+SY AGTVGGSIGLDAEYYCRRCDARTDLVSHV+ FLQSCQS D RDD+E+IL++G+CIL
Subjt:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL

Query:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF
        RGS K RAKELLRHIEL I K+KTG CLEEIWKMEE+ SA CTDA D AD S + SH+TSGSII+S+WT+STPFDHW ESLKLE EIDQVL  LKRSQEF
Subjt:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF

Query:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE
        EYNLAEEKL LHKNYL NLFQQL+KEQTELR QT STGQN+    V NRVDQ+KREVK+LKRM  VA+GFGMTP  ILKE+F L+VE
Subjt:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE

A0A1S4DSZ4 uncharacterized protein LOC103483705 isoform X13.8e-21878.23Show/hide
Query:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER
        M+GDPV+TEVLED NG +  VNK++LILRPV+QDESGEGLPYAPENWPNPGD W+WRVGKRVAITGHFLDRYLY PR I+  ENS R+G  FASKLSVER
Subjt:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER

Query:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC
        +IQSEFPNAD+DAFFASFSWKIPA KSS AQG R KQ+P PLPSK+  ECSAS+SQ D  GCKAGNK+CSSLS +ENP+  KSMSC ICCSEPRFC DCC
Subjt:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC

Query:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL
        CILC KIIDTT  SYSYIKC+  VGDGYICGHHAHI CGL+SY AGTVGGSIGLDAEYYCRRCDARTDLVSHV+ FLQSCQS D RDD+E+IL++G+CIL
Subjt:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL

Query:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF
        RGS K RAKELLRHIEL I K+KTG CLEEIWKMEE+ SA CTDA D AD S + SH+TSGSII+S+WT+STPFDHW ESLKLE EIDQVL  LKRSQEF
Subjt:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF

Query:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE
        EYNLAEEKL LHKNYL NLFQQL+KEQTELR QT STGQN+    V NRVDQ+KREVK+LKRM  VA+GFGMTP  ILKE+F L+VE
Subjt:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE

A0A6J1C998 OBERON-like protein7.6e-283100Show/hide
Query:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER
        MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER
Subjt:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER

Query:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC
        FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC
Subjt:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC

Query:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL
        CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL
Subjt:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL

Query:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF
        RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF
Subjt:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF

Query:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE
        EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE
Subjt:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE

A0A6J1GVC0 protein OBERON 4-like isoform X15.0e-21878.85Show/hide
Query:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER
        MSGDPVETEVL DING   + NK++L LRPVSQDESGEGLPYAPENWPN GDNW+WRVG+RVAITGHF DRYLY PR I +  NS+RRG  FAS+LSVER
Subjt:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER

Query:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC
        +IQSEFPNADVDAFFASFSWKIPA KSS AQG R +Q+  PLPSKE  ECSASDSQID   CKAGNK+C+SLS AENP+LLKSMSCDICCSEPRFC DCC
Subjt:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC

Query:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL
        CILCSKIIDTT  S S+IKC+A V DGYICGHHAHI CGL+SY+AGTVGG IGLDAEYYCRRCDARTDLVSHV+ FLQ CQSTD RDDI +ILS+G+CIL
Subjt:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL

Query:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF
        RGS K RAKELLRH++L I+KLK+GTCLEE+WKMEE+ SA CTDA DNAD S QGSHD S SII+S+WT+ TPFDHW ESLKLE EIDQVLQALKRSQEF
Subjt:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF

Query:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE
        EYNLAEEKL  HKNYL NLFQQLDKEQ EL  Q+SSTGQN FL+NV NRVDQ+KREVK+LKRM  VA+GFGMTP  ILKE+F L+VE
Subjt:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE

A0A6J1ITE5 OBERON-like protein isoform X17.0e-22079.26Show/hide
Query:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER
        MSGDPVETEVL DING   + NK+DLILRPVSQDESGEGLPYAPENWPN GDNW+WRVG+RVAITGHF DRYLY PR I +  NS+RRG GFAS+LSVER
Subjt:  MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVER

Query:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC
        +IQSEFP+ADVDAFFASFSWKIPA KSS AQG R KQ+  PLPSKE  ECSASDSQID   CKAGNK+C+SLS AE P+LLKSMSCDICCSEP+FC DCC
Subjt:  FIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCC

Query:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL
        CILCSK IDTT  S SYIKC+A VGDGYICGHHAHI CGL+SY+AGTVGG IGLDAEYYCRRCDARTDLVSHV+ FLQ CQSTD RDDI +ILS+G+CIL
Subjt:  CILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCIL

Query:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF
        RGS K RAKELLRH +L I+KLKTGTCLEE+WKMEE+ SA CTDA DNAD S +GSHD S SII+S+WT+STPFDHW ESLKLE EIDQVLQALK+SQEF
Subjt:  RGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEF

Query:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE
        EYNLAEEKL  HKNYL NLFQQLDKEQ EL  Q+SSTGQN FL+NV NRVDQ+KREVK+LKRM  VA+GFGMTP  ILKE+F L+VE
Subjt:  EYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05410.1 Protein of unknown function (DUF1423)1.2e-12649.68Show/hide
Query:  LILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNIT-LPENSTRRGQGFASKLSVERFIQSEFPNADVDAFFASFSWKIPA
        L+LRPVS  ESGEGLPYAPENWPNPGD W W+VG R++  G+F+DRYLYPP+ +  L     R+ + F S+LS++R+I+  FP ADV  FFASFSW IP 
Subjt:  LILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNIT-LPENSTRRGQGFASKLSVERFIQSEFPNADVDAFFASFSWKIPA

Query:  TKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCCCILCSKIIDTTNGSYSYIKCEAKV
                 +  Q+P     ++      SD+ +    CKAGN+ C SL        L +M CDICC E +FC DCCCILC K+I   +G YSYIKCEA V
Subjt:  TKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCCCILCSKIIDTTNGSYSYIKCEAKV

Query:  GDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCILRGSLKKRAKELLRHIELIISKLKT
         +G+ICGH AH+ C LR+Y+AGT+GGS+GLD EYYCRRCDA+ DL  HV  FL+ CQ+ + + D+EKIL++G+CILRG+ +  AKELL  IE  + KLK 
Subjt:  GDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCILRGSLKKRAKELLRHIELIISKLKT

Query:  GTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEFEYNLAEEKLRLHKNYLLNLFQQLD
        GT LE++W   ++   I +D +D+ +      +DT  S+ +       PF+H  E  KLE EI +VL+AL+++QEFEY +AE KL   K  L +L++QL+
Subjt:  GTCLEEIWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEFEYNLAEEKLRLHKNYLLNLFQQLD

Query:  KEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE
        KE++EL R+ S T  NS + NV+ R+DQ+++EV KLK M  VA GFG TP G+L+E F L +E
Subjt:  KEQTELRRQTSSTGQNSFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE

AT1G05410.2 Protein of unknown function (DUF1423)1.4e-10847.33Show/hide
Query:  VGKRVAITGHFLDRYLYPPRNIT-LPENSTRRGQGFASKLSVERFIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQ
        VG R++  G+F+DRYLYPP+ +  L     R+ + F S+LS++R+I+  FP ADV  FFASFSW IP          +  Q+P     ++      SD+ 
Subjt:  VGKRVAITGHFLDRYLYPPRNIT-LPENSTRRGQGFASKLSVERFIQSEFPNADVDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQ

Query:  IDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCCCILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDA
        +    CKAGN+ C SL        L +M CDICC E +FC DCCCILC K+I   +G YSYIKCEA V +G+ICGH AH+ C LR+Y+AGT+GGS+GLD 
Subjt:  IDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCCCILCSKIIDTTNGSYSYIKCEAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDA

Query:  EYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCILRGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGS
        EYYCRRCDA+ DL  HV  FL+ CQ+ + + D+EKIL++G+CILRG+ +  AKELL  IE  + KLK GT LE++W   ++   I +D +D+ +      
Subjt:  EYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCILRGSLKKRAKELLRHIELIISKLKTGTCLEEIWKMEEEISAICTDAADNADTSIQGS

Query:  HDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEFEYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKRE
        +DT  S+ +       PF+H  E  KLE EI +VL+AL+++QEFEY +AE KL   K  L +L++QL+KE++EL R+ S T  NS + NV+ R+DQ+++E
Subjt:  HDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEFEYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQNSFLNNVINRVDQVKRE

Query:  VKKLKRMANVANGFGMTPNGILKEEFGLEVE
        V KLK M  VA GFG TP G+L+E F L +E
Subjt:  VKKLKRMANVANGFGMTPNGILKEEFGLEVE

AT3G22520.1 unknown protein3.2e-2342.06Show/hide
Query:  DPVETEVLED----INGSTHRVNKSDLILRP-VSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSV
        DP  ++  E+    ++ S     ++DL   P +    +G+GLPYAP +WP+PGD WTWRVG+RV   G+  DR+L  P+ +          + FASK  +
Subjt:  DPVETEVLED----INGSTHRVNKSDLILRP-VSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSV

Query:  ERFIQSEFPNADVDAFFASFSWKIPA
         R+++S+FP  D DAFFASFSWK+PA
Subjt:  ERFIQSEFPNADVDAFFASFSWKIPA

AT4G14840.1 unknown protein2.8e-1941.23Show/hide
Query:  EDINGSTHRVNKSDLILRP-VSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVERFIQSEFPNAD
        +D+  +  R + +DL   P +    SG+GLP+AP ++P+PGD WTWRVG+RV   G   DR L  P  +          + FASK ++ R++++ FP+ D
Subjt:  EDINGSTHRVNKSDLILRP-VSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVERFIQSEFPNAD

Query:  VDAFFASFSWKIPA
         +AFFASF+W IPA
Subjt:  VDAFFASFSWKIPA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGGGGATCCTGTGGAGACTGAAGTTCTTGAGGATATAAATGGGAGCACGCATAGGGTAAATAAAAGTGATCTGATCCTTAGACCAGTTTCTCAAGATGAATCTGG
GGAGGGATTGCCATATGCTCCTGAAAATTGGCCCAATCCTGGTGATAACTGGACTTGGAGAGTGGGGAAGAGAGTTGCTATAACTGGCCATTTTTTGGATAGGTACCTTT
ATCCTCCTCGCAATATTACTCTTCCTGAGAACTCAACTCGTAGAGGGCAAGGTTTTGCAAGCAAGCTTTCAGTTGAAAGATTTATCCAGTCTGAGTTCCCTAATGCAGAC
GTTGATGCATTTTTTGCTTCATTCAGCTGGAAGATACCAGCAACAAAGTCATCTTTCGCACAAGGGAGTCGAGCCAAACAAGTTCCATTCCCATTACCCTCAAAAGAGAA
GACAGAATGCTCAGCATCTGATTCCCAGATTGATCTAGAGGGTTGTAAGGCTGGAAACAAGGACTGTAGTAGTTTATCTGCTGCAGAAAACCCAGCTTTATTAAAATCCA
TGTCCTGTGATATTTGCTGCAGCGAACCTCGGTTTTGCCATGATTGCTGCTGTATTCTTTGCAGCAAGATTATAGACACAACCAATGGAAGTTATAGCTACATAAAGTGT
GAAGCAAAGGTGGGTGATGGTTATATTTGTGGACATCATGCTCATATAATATGTGGTCTTCGATCTTATGTGGCTGGGACAGTCGGAGGAAGCATAGGGTTGGATGCTGA
GTATTATTGTCGGCGTTGTGATGCTAGAACAGATTTGGTATCACATGTTCAAGGATTTTTGCAGTCATGTCAATCAACTGATTCTCGGGATGATATTGAGAAGATCTTAA
GCATTGGTGTCTGCATTTTGCGTGGTTCGCTGAAAAAGAGAGCAAAGGAGTTGCTAAGACATATTGAGTTGATTATTTCAAAGCTTAAAACTGGGACTTGCTTAGAAGAG
ATTTGGAAGATGGAGGAAGAAATCTCAGCTATTTGCACTGATGCAGCTGATAATGCTGATACTTCCATACAAGGTTCTCACGACACTTCAGGCTCCATTATAAACTCAGA
CTGGACTGTGTCCACCCCTTTTGATCATTGGACCGAGTCCCTAAAACTCGAAGCTGAGATCGATCAGGTTCTGCAGGCACTCAAAAGATCACAAGAGTTCGAGTACAATT
TAGCAGAAGAAAAGCTTCGATTGCATAAAAATTATCTACTAAATCTATTCCAGCAACTTGACAAGGAGCAAACTGAACTCAGACGTCAAACATCATCAACTGGACAAAAT
TCTTTCCTGAATAATGTAATAAATAGAGTGGATCAAGTAAAACGAGAAGTAAAGAAACTCAAGAGAATGGCAAATGTGGCCAATGGATTTGGAATGACTCCAAATGGTAT
TCTCAAGGAGGAATTCGGTTTGGAAGTTGAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAGGGGATCCTGTGGAGACTGAAGTTCTTGAGGATATAAATGGGAGCACGCATAGGGTAAATAAAAGTGATCTGATCCTTAGACCAGTTTCTCAAGATGAATCTGG
GGAGGGATTGCCATATGCTCCTGAAAATTGGCCCAATCCTGGTGATAACTGGACTTGGAGAGTGGGGAAGAGAGTTGCTATAACTGGCCATTTTTTGGATAGGTACCTTT
ATCCTCCTCGCAATATTACTCTTCCTGAGAACTCAACTCGTAGAGGGCAAGGTTTTGCAAGCAAGCTTTCAGTTGAAAGATTTATCCAGTCTGAGTTCCCTAATGCAGAC
GTTGATGCATTTTTTGCTTCATTCAGCTGGAAGATACCAGCAACAAAGTCATCTTTCGCACAAGGGAGTCGAGCCAAACAAGTTCCATTCCCATTACCCTCAAAAGAGAA
GACAGAATGCTCAGCATCTGATTCCCAGATTGATCTAGAGGGTTGTAAGGCTGGAAACAAGGACTGTAGTAGTTTATCTGCTGCAGAAAACCCAGCTTTATTAAAATCCA
TGTCCTGTGATATTTGCTGCAGCGAACCTCGGTTTTGCCATGATTGCTGCTGTATTCTTTGCAGCAAGATTATAGACACAACCAATGGAAGTTATAGCTACATAAAGTGT
GAAGCAAAGGTGGGTGATGGTTATATTTGTGGACATCATGCTCATATAATATGTGGTCTTCGATCTTATGTGGCTGGGACAGTCGGAGGAAGCATAGGGTTGGATGCTGA
GTATTATTGTCGGCGTTGTGATGCTAGAACAGATTTGGTATCACATGTTCAAGGATTTTTGCAGTCATGTCAATCAACTGATTCTCGGGATGATATTGAGAAGATCTTAA
GCATTGGTGTCTGCATTTTGCGTGGTTCGCTGAAAAAGAGAGCAAAGGAGTTGCTAAGACATATTGAGTTGATTATTTCAAAGCTTAAAACTGGGACTTGCTTAGAAGAG
ATTTGGAAGATGGAGGAAGAAATCTCAGCTATTTGCACTGATGCAGCTGATAATGCTGATACTTCCATACAAGGTTCTCACGACACTTCAGGCTCCATTATAAACTCAGA
CTGGACTGTGTCCACCCCTTTTGATCATTGGACCGAGTCCCTAAAACTCGAAGCTGAGATCGATCAGGTTCTGCAGGCACTCAAAAGATCACAAGAGTTCGAGTACAATT
TAGCAGAAGAAAAGCTTCGATTGCATAAAAATTATCTACTAAATCTATTCCAGCAACTTGACAAGGAGCAAACTGAACTCAGACGTCAAACATCATCAACTGGACAAAAT
TCTTTCCTGAATAATGTAATAAATAGAGTGGATCAAGTAAAACGAGAAGTAAAGAAACTCAAGAGAATGGCAAATGTGGCCAATGGATTTGGAATGACTCCAAATGGTAT
TCTCAAGGAGGAATTCGGTTTGGAAGTTGAG
Protein sequenceShow/hide protein sequence
MSGDPVETEVLEDINGSTHRVNKSDLILRPVSQDESGEGLPYAPENWPNPGDNWTWRVGKRVAITGHFLDRYLYPPRNITLPENSTRRGQGFASKLSVERFIQSEFPNAD
VDAFFASFSWKIPATKSSFAQGSRAKQVPFPLPSKEKTECSASDSQIDLEGCKAGNKDCSSLSAAENPALLKSMSCDICCSEPRFCHDCCCILCSKIIDTTNGSYSYIKC
EAKVGDGYICGHHAHIICGLRSYVAGTVGGSIGLDAEYYCRRCDARTDLVSHVQGFLQSCQSTDSRDDIEKILSIGVCILRGSLKKRAKELLRHIELIISKLKTGTCLEE
IWKMEEEISAICTDAADNADTSIQGSHDTSGSIINSDWTVSTPFDHWTESLKLEAEIDQVLQALKRSQEFEYNLAEEKLRLHKNYLLNLFQQLDKEQTELRRQTSSTGQN
SFLNNVINRVDQVKREVKKLKRMANVANGFGMTPNGILKEEFGLEVE