; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g14930 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g14930
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr3:10053408..10058999
RNA-Seq ExpressionMoc03g14930
SyntenyMoc03g14930
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]2.0e-23884.91Show/hide
Query:  AGAGSRSE----NGVTRVDVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
        AG  +R E     G     VEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IAL
Subjt:  AGAGSRSE----NGVTRVDVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSGRLWYRKLPARSISTYSQLRREFLAQLSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAPAT
        TGS RLWYR+LPA SISTYSQLRREFLA  SSRHYDKKTATHLATIRQKEGETLREYVTRFQ+EQLKVAHCSDDSAMCYFLTGLADE LTVKLGEEAPAT
Subjt:  TGSGRLWYRKLPARSISTYSQLRREFLAQLSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAECRRAESGPTTSRPYERFTPTTIPISEILTNIEKSGMEKLLKRP
        FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGRAE RRAE+GPT SRPYERFTPTTIPISEILTNIE+SGMEKLLKRP
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAECRRAESGPTTSRPYERFTPTTIPISEILTNIEKSGMEKLLKRP

Query:  EKLRGAPERRNKVKYCRFHREHGHNTSDCWELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAR
        EKLRGAPERR+K KYCRFHREHGHNTSD WEL RQIE+LIQDGYFKKFVGKPR SS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELAR
Subjt:  EKLRGAPERRNKVKYCRFHREHGHNTSDCWELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAR

Query:  AARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIDPLIDHVV-------------------------------RSPTPLVGFSGESVIPEGCIDLP
        AARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVI PLIDHVV                               +SPTPLVGFSGESVIPEG IDLP
Subjt:  AARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIDPLIDHVV-------------------------------RSPTPLVGFSGESVIPEGCIDLP

Query:  VTLGQDRTRVTQMAEFV
        VTLGQD+T+VTQMAEFV
Subjt:  VTLGQDRTRVTQMAEFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]2.5e-21793.53Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSGRLWYRKLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGS RLWYR+LPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSGRLWYRKLPARSISTYSQLRREFLAQ

Query:  LSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
         SSR Y KKT THLATIRQKEG TLREYVTRFQ+EQLKVAHCSDDSAMCYFLTGLADE LTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  LSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKD-ERADPKSKDKGSFSSGRAECRRAESGPTTSRPYERFTPTTIPISEILTNIEKSGMEKLLKRPEKLRGAPERRNKVKYCRFHREHGHNTSDC
        GRGRSGKD ERADPKSKDKGSFSSGRAE RRAESGPT SRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERR+K KYCRFHREHGHNTSDC
Subjt:  GRGRSGKD-ERADPKSKDKGSFSSGRAECRRAESGPTTSRPYERFTPTTIPISEILTNIEKSGMEKLLKRPEKLRGAPERRNKVKYCRFHREHGHNTSDC

Query:  WELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH
        WEL RQIEDLIQDGYFKKFVGKPR SS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGAD EEVH
Subjt:  WELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH

Query:  LPHNDALVIDPLIDHVV
        LPHNDA VI PLIDHVV
Subjt:  LPHNDALVIDPLIDHVV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]2.6e-23875.25Show/hide
Query:  GVTRVDVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSGRLWYRKLPA
        G     VEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGS RLW      
Subjt:  GVTRVDVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSGRLWYRKLPA

Query:  RSISTYSQLRREFLAQLSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAPATFAEVLQKAKKVID
                                                       FQ++QLKVA  SDDSAMCYFLTGLADE LTVKLG+EAPATFAEVLQKAKKVID
Subjt:  RSISTYSQLRREFLAQLSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAPATFAEVLQKAKKVID

Query:  GQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTTSRPYERFTPTTIPISEILTNIEKSGMEKLLKRPEKLRGAPERRNKVK
        GQELLRTKTGRPER I RGRSGKDE+AD KSKDKGSFSSGRAE RRA +GPT SRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERRNK K
Subjt:  GQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTTSRPYERFTPTTIPISEILTNIEKSGMEKLLKRPEKLRGAPERRNKVK

Query:  YCRFHREHGHNTSDCWELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGP
        YCRFHREH HNTSD WEL RQIEDLIQD YFKKFVGKPR SS EKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQ P
Subjt:  YCRFHREHGHNTSDCWELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGP

Query:  TCPITFDGADLEEVHLPHNDALVIDPLIDHVV-------------------------------RSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMA
        TCPITFD ADLEEVHLPHNDALVI PLIDHVV                               +S TPLVGFS ESVIPEGCIDLPVTLG D+T+VTQMA
Subjt:  TCPITFDGADLEEVHLPHNDALVIDPLIDHVV-------------------------------RSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMA

Query:  EFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTIRGEQAASSECYASALKGSSVCALETLAGRDRALEFEADLPRKEFAAPTEELELVP
        EFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG +RGEQ AS ECYASALKGSSVCALETL  RD  LEF+A+LPR+EFAAPTEELELVP
Subjt:  EFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTIRGEQAASSECYASALKGSSVCALETLAGRDRALEFEADLPRKEFAAPTEELELVP

Query:  LL
        LL
Subjt:  LL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.2e-24867.95Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRPSKATRGRGGTSKNGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+PSKA       +           T E FD L+ + +A  
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRPSKATRGRGGTSKNGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRTMEEMYNEMMLAAGAGSRSENGVTRVDVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAAS
                                       VEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+
Subjt:  TQMRTMEEMYNEMMLAAGAGSRSENGVTRVDVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAAS

Query:  DAIKCRAFQIALTGSGRLWYRKLPARSISTYSQLRREFLAQLSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADEV
        DAIKC AFQIALTGS RLWYR+LPAR ISTYSQLR+EF++Q SSRHYD+KT THLATIRQKEGETLREYVTRF +EQLKVAHCSDDSAMCYFLTGLADE 
Subjt:  DAIKCRAFQIALTGSGRLWYRKLPARSISTYSQLRREFLAQLSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADEV

Query:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAECRRAESGPTTSRPYERFTPTTIPISEILTN
        LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R + RR+ S    SRPYE +TPTTIPI EILTN
Subjt:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAECRRAESGPTTSRPYERFTPTTIPISEILTN

Query:  IEKSGMEKLLKRPEKLRGAPERRNKVKYCRFHREHGHNTSDCWELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVINTIFGGPS
        IE++GMEKLLKRPEKLRG PE+RN  KYCRFHR+HGHNTS+ WEL RQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN       
Subjt:  IEKSGMEKLLKRPEKLRGAPERRNKVKYCRFHREHGHNTSDCWELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVINTIFGGPS

Query:  GGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIDPLIDHVV-------------------------------RSPTPLVGFS
              K+KELAR ARREVCIIREQ PT  I F+ ADLE VHLPHNDALVI PLID V+                               +SPTPLVGFS
Subjt:  GGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIDPLIDHVV-------------------------------RSPTPLVGFS

Query:  GESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTIRGEQAASSECYASALKGSSVCALETLAG
        GES+  EGCIDLPV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGT+RGE   S ECYAS  K SSVCALE    
Subjt:  GESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTIRGEQAASSECYASALKGSSVCALETLAG

Query:  RD
        RD
Subjt:  RD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]6.6e-21878.4Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSGRLWYRKLPARSISTYSQLRREFLAQLSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGS RLWYR+LPARSISTYSQLR+EF++Q SS HYD+KTATHLATIRQKE ETLREYVTRFQ+EQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSGRLWYRKLPARSISTYSQLRREFLAQLSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFL

Query:  TGLADEVLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRAECRRAESGPTTSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I + +  +++R AD KS+DKGS SS  R E RR ESGP+ SRPYER+T +TIP
Subjt:  TGLADEVLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRAECRRAESGPTTSRPYERFTPTTIP

Query:  ISEILTNIEKSGMEKLLKRPEKLRGAPERRNKVKYCRFHREHGHNTSDCWELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIE+SGMEKLLKRPEKLRG  E+RNK KYCRFHR+HGHNT+ CWEL RQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEKSGMEKLLKRPEKLRGAPERRNKVKYCRFHREHGHNTSDCWELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIDPLIDH-VVRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRV
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE  PTC ITF  ADLE VHLPHNDALVI  LIDH +VR            +I  GCIDLPVT+GQD T+V
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIDPLIDH-VVRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRV

Query:  TQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTIRGEQAASSECYASALKGSSVCALETLAGRDRALEFEADLP---RKEFAAPT
        TQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY TPN VG +RGEQ  S ECYASALKGS+VCALE    R +  E EADLP   +++F  PT
Subjt:  TQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTIRGEQAASSECYASALKGSSVCALETLAGRDRALEFEADLP---RKEFAAPT

Query:  EELELVPLLSPEKQ
        EELELVPLLSPE+Q
Subjt:  EELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088139.5e-23984.91Show/hide
Query:  AGAGSRSE----NGVTRVDVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
        AG  +R E     G     VEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IAL
Subjt:  AGAGSRSE----NGVTRVDVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSGRLWYRKLPARSISTYSQLRREFLAQLSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAPAT
        TGS RLWYR+LPA SISTYSQLRREFLA  SSRHYDKKTATHLATIRQKEGETLREYVTRFQ+EQLKVAHCSDDSAMCYFLTGLADE LTVKLGEEAPAT
Subjt:  TGSGRLWYRKLPARSISTYSQLRREFLAQLSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAECRRAESGPTTSRPYERFTPTTIPISEILTNIEKSGMEKLLKRP
        FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGRAE RRAE+GPT SRPYERFTPTTIPISEILTNIE+SGMEKLLKRP
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAECRRAESGPTTSRPYERFTPTTIPISEILTNIEKSGMEKLLKRP

Query:  EKLRGAPERRNKVKYCRFHREHGHNTSDCWELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAR
        EKLRGAPERR+K KYCRFHREHGHNTSD WEL RQIE+LIQDGYFKKFVGKPR SS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELAR
Subjt:  EKLRGAPERRNKVKYCRFHREHGHNTSDCWELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAR

Query:  AARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIDPLIDHVV-------------------------------RSPTPLVGFSGESVIPEGCIDLP
        AARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVI PLIDHVV                               +SPTPLVGFSGESVIPEG IDLP
Subjt:  AARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIDPLIDHVV-------------------------------RSPTPLVGFSGESVIPEGCIDLP

Query:  VTLGQDRTRVTQMAEFV
        VTLGQD+T+VTQMAEFV
Subjt:  VTLGQDRTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188231.2e-23875.25Show/hide
Query:  GVTRVDVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSGRLWYRKLPA
        G     VEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGS RLW      
Subjt:  GVTRVDVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSGRLWYRKLPA

Query:  RSISTYSQLRREFLAQLSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAPATFAEVLQKAKKVID
                                                       FQ++QLKVA  SDDSAMCYFLTGLADE LTVKLG+EAPATFAEVLQKAKKVID
Subjt:  RSISTYSQLRREFLAQLSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAPATFAEVLQKAKKVID

Query:  GQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTTSRPYERFTPTTIPISEILTNIEKSGMEKLLKRPEKLRGAPERRNKVK
        GQELLRTKTGRPER I RGRSGKDE+AD KSKDKGSFSSGRAE RRA +GPT SRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERRNK K
Subjt:  GQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTTSRPYERFTPTTIPISEILTNIEKSGMEKLLKRPEKLRGAPERRNKVK

Query:  YCRFHREHGHNTSDCWELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGP
        YCRFHREH HNTSD WEL RQIEDLIQD YFKKFVGKPR SS EKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQ P
Subjt:  YCRFHREHGHNTSDCWELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGP

Query:  TCPITFDGADLEEVHLPHNDALVIDPLIDHVV-------------------------------RSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMA
        TCPITFD ADLEEVHLPHNDALVI PLIDHVV                               +S TPLVGFS ESVIPEGCIDLPVTLG D+T+VTQMA
Subjt:  TCPITFDGADLEEVHLPHNDALVIDPLIDHVV-------------------------------RSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMA

Query:  EFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTIRGEQAASSECYASALKGSSVCALETLAGRDRALEFEADLPRKEFAAPTEELELVP
        EFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG +RGEQ AS ECYASALKGSSVCALETL  RD  LEF+A+LPR+EFAAPTEELELVP
Subjt:  EFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTIRGEQAASSECYASALKGSSVCALETLAGRDRALEFEADLPRKEFAAPTEELELVP

Query:  LL
        LL
Subjt:  LL

A0A6J1D9W7 uncharacterized protein LOC1110187081.2e-21793.53Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSGRLWYRKLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGS RLWYR+LPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSGRLWYRKLPARSISTYSQLRREFLAQ

Query:  LSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
         SSR Y KKT THLATIRQKEG TLREYVTRFQ+EQLKVAHCSDDSAMCYFLTGLADE LTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  LSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKD-ERADPKSKDKGSFSSGRAECRRAESGPTTSRPYERFTPTTIPISEILTNIEKSGMEKLLKRPEKLRGAPERRNKVKYCRFHREHGHNTSDC
        GRGRSGKD ERADPKSKDKGSFSSGRAE RRAESGPT SRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERR+K KYCRFHREHGHNTSDC
Subjt:  GRGRSGKD-ERADPKSKDKGSFSSGRAECRRAESGPTTSRPYERFTPTTIPISEILTNIEKSGMEKLLKRPEKLRGAPERRNKVKYCRFHREHGHNTSDC

Query:  WELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH
        WEL RQIEDLIQDGYFKKFVGKPR SS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGAD EEVH
Subjt:  WELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH

Query:  LPHNDALVIDPLIDHVV
        LPHNDA VI PLIDHVV
Subjt:  LPHNDALVIDPLIDHVV

A0A6J1DHB3 uncharacterized protein LOC1110204796.0e-24967.95Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRPSKATRGRGGTSKNGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+PSKA       +           T E FD L+ + +A  
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRPSKATRGRGGTSKNGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRTMEEMYNEMMLAAGAGSRSENGVTRVDVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAAS
                                       VEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+
Subjt:  TQMRTMEEMYNEMMLAAGAGSRSENGVTRVDVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAAS

Query:  DAIKCRAFQIALTGSGRLWYRKLPARSISTYSQLRREFLAQLSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADEV
        DAIKC AFQIALTGS RLWYR+LPAR ISTYSQLR+EF++Q SSRHYD+KT THLATIRQKEGETLREYVTRF +EQLKVAHCSDDSAMCYFLTGLADE 
Subjt:  DAIKCRAFQIALTGSGRLWYRKLPARSISTYSQLRREFLAQLSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADEV

Query:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAECRRAESGPTTSRPYERFTPTTIPISEILTN
        LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R + RR+ S    SRPYE +TPTTIPI EILTN
Subjt:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAECRRAESGPTTSRPYERFTPTTIPISEILTN

Query:  IEKSGMEKLLKRPEKLRGAPERRNKVKYCRFHREHGHNTSDCWELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVINTIFGGPS
        IE++GMEKLLKRPEKLRG PE+RN  KYCRFHR+HGHNTS+ WEL RQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN       
Subjt:  IEKSGMEKLLKRPEKLRGAPERRNKVKYCRFHREHGHNTSDCWELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVINTIFGGPS

Query:  GGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIDPLIDHVV-------------------------------RSPTPLVGFS
              K+KELAR ARREVCIIREQ PT  I F+ ADLE VHLPHNDALVI PLID V+                               +SPTPLVGFS
Subjt:  GGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIDPLIDHVV-------------------------------RSPTPLVGFS

Query:  GESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTIRGEQAASSECYASALKGSSVCALETLAG
        GES+  EGCIDLPV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGT+RGE   S ECYAS  K SSVCALE    
Subjt:  GESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTIRGEQAASSECYASALKGSSVCALETLAG

Query:  RD
        RD
Subjt:  RD

A0A6J1DZB9 uncharacterized protein LOC1110249043.2e-21878.4Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSGRLWYRKLPARSISTYSQLRREFLAQLSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGS RLWYR+LPARSISTYSQLR+EF++Q SS HYD+KTATHLATIRQKE ETLREYVTRFQ+EQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSGRLWYRKLPARSISTYSQLRREFLAQLSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFL

Query:  TGLADEVLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRAECRRAESGPTTSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I + +  +++R AD KS+DKGS SS  R E RR ESGP+ SRPYER+T +TIP
Subjt:  TGLADEVLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRAECRRAESGPTTSRPYERFTPTTIP

Query:  ISEILTNIEKSGMEKLLKRPEKLRGAPERRNKVKYCRFHREHGHNTSDCWELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIE+SGMEKLLKRPEKLRG  E+RNK KYCRFHR+HGHNT+ CWEL RQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEKSGMEKLLKRPEKLRGAPERRNKVKYCRFHREHGHNTSDCWELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIDPLIDH-VVRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRV
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE  PTC ITF  ADLE VHLPHNDALVI  LIDH +VR            +I  GCIDLPVT+GQD T+V
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIDPLIDH-VVRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRV

Query:  TQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTIRGEQAASSECYASALKGSSVCALETLAGRDRALEFEADLP---RKEFAAPT
        TQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY TPN VG +RGEQ  S ECYASALKGS+VCALE    R +  E EADLP   +++F  PT
Subjt:  TQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTIRGEQAASSECYASALKGSSVCALETLAGRDRALEFEADLP---RKEFAAPT

Query:  EELELVPLLSPEKQ
        EELELVPLLSPE+Q
Subjt:  EELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGCCGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAATGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCACCATGGAGGAAATGTAT
AACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATGGAGTGACGCGCGTGGACGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAA
CGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCA
AGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGGGCGATTGTGGTAT
CGGAAACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTATCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCAC
CATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTCCAGAAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCA
CCGGTCTAGCCGACGAAGTCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTC
CGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGC
TGAGTGTCGAAGGGCGGAGAGCGGACCTACCACGAGCCGACCTTACGAACGATTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGAAATCTGGAA
TGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAACAAGGTCAAGTATTGCCGCTTCCACCGGGAGCACGGACATAACACGTCAGACTGC
TGGGAGTTGAATCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGGCCAGCTCGACAGAGAAAAAGGAAGAGCGAAAGCGTTC
AAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAAGAGTTAGCCCGTGCAGCCA
GGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGAT
CCATTGATTGATCATGTGGTGAGAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCG
AACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGAGCCATTCCCTCAACAC
TGCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGATCCGAGGAGAACAGGCCGCTTCGAGCGAGTGTTATGCCTCCGCACTCAAGGGTTCATCGGTCTGC
GCCCTCGAAACGCTCGCCGGTAGGGATAGAGCGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAG
TCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCTGTCGAGATCCTAGACAATCCCTCGATCTCAGAGTCAGATCTGATGGAGATCGGCG
CTCCAGAATCTTCATGGATGGACCCGATCGCAGACTTCATTAGGGGCAACTCACCACAAGATCCCAAGGAATGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTCGTC
CGAGATGGGACATTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAGATACCTAACCCCTGAAGAGGGCCTAGTAGAGCATTTCGAACCTACGACAAATGAGGAAGAGCT
GCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGG
CCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTAGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACA
TACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGCTATTATCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGCCGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAATGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCACCATGGAGGAAATGTAT
AACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATGGAGTGACGCGCGTGGACGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAA
CGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCA
AGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGGGCGATTGTGGTAT
CGGAAACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTATCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCAC
CATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTCCAGAAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCA
CCGGTCTAGCCGACGAAGTCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTC
CGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGC
TGAGTGTCGAAGGGCGGAGAGCGGACCTACCACGAGCCGACCTTACGAACGATTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGAAATCTGGAA
TGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAACAAGGTCAAGTATTGCCGCTTCCACCGGGAGCACGGACATAACACGTCAGACTGC
TGGGAGTTGAATCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGGCCAGCTCGACAGAGAAAAAGGAAGAGCGAAAGCGTTC
AAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAAGAGTTAGCCCGTGCAGCCA
GGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGAT
CCATTGATTGATCATGTGGTGAGAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCG
AACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGAGCCATTCCCTCAACAC
TGCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGATCCGAGGAGAACAGGCCGCTTCGAGCGAGTGTTATGCCTCCGCACTCAAGGGTTCATCGGTCTGC
GCCCTCGAAACGCTCGCCGGTAGGGATAGAGCGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAG
TCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCTGTCGAGATCCTAGACAATCCCTCGATCTCAGAGTCAGATCTGATGGAGATCGGCG
CTCCAGAATCTTCATGGATGGACCCGATCGCAGACTTCATTAGGGGCAACTCACCACAAGATCCCAAGGAATGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTCGTC
CGAGATGGGACATTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAGATACCTAACCCCTGAAGAGGGCCTAGTAGAGCATTTCGAACCTACGACAAATGAGGAAGAGCT
GCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGG
CCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTAGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACA
TACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGCTATTATCCCTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRPSKATRGRGGTSKNGARGPAPAPTSENFDALQREMEAMRTQMRTMEEMY
NEMMLAAGAGSRSENGVTRVDVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSGRLWY
RKLPARSISTYSQLRREFLAQLSSRHYDKKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAPATFAEVLQKAKKVIDGQELL
RTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTTSRPYERFTPTTIPISEILTNIEKSGMEKLLKRPEKLRGAPERRNKVKYCRFHREHGHNTSDC
WELNRQIEDLIQDGYFKKFVGKPRASSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVID
PLIDHVVRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTIRGEQAASSECYASALKGSSVC
ALETLAGRDRALEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSISESDLMEIGAPESSWMDPIADFIRGNSPQDPKECRKLARRAARFVV
RDGTLYRRGFSLPLLRYLTPEEGLVEHFEPTTNEEELLLNLDLLEERRAMAQLRLAEYQGRMARHYNARVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFEIKGIVRPGT
YILADLKGDVLAHPWNAEHLKRYYP