; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g06410 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g06410
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr5:4453644..4456112
RNA-Seq ExpressionMoc05g06410
SyntenyMoc05g06410
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]2.8e-26590.53Show/hide
Query:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTLDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEG LNDGDLGESPFT DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTLDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLSTIRQKEGETLREYVTRFQEAQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHL+TIRQKEGETLREYVTRFQE QLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLSTIRQKEGETLREYVTRFQEAQLKVAHCSDDSAMCYFLTGLADEAL

Query:  T-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRVENGPTRSRPYERFTPTTISISEILTNIE
        T                         ELLRTKTGRPERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRR ENGPTRSRPYERFTPTTI ISEILTNIE
Subjt:  T-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRVENGPTRSRPYERFTPTTISISEILTNIE

Query:  ESGMEKLLKRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRP KLRGAPERRSKDKYCRFHR+HGHNTSD WELK QIE+LIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTCPITFD  DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQL KSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]7.0e-26479.4Show/hide
Query:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTLDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESS NPATP GVITREEFDQLRG+L+AQVEALKAKCEQKEG LNDGDLGESPFT DVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTLDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLSTIRQKEGETLREYVTRFQEAQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLSTIRQKEGETLREYVTRFQEAQLKVAHCSDDSAMCYFLTGLA

Query:  DEALT-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRVENGPTRSRPYERFTPTTISISEIL
        DEALT                         ELLRTKTGRPER I RGRSGKD EKAD KSKDKGSFSSGRAE+RR  NGPTRSRPYERFTPTTI ISEIL
Subjt:  DEALT-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRVENGPTRSRPYERFTPTTISISEIL

Query:  TNIEESGMEKLLKRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGG
        TNIEESGMEKLLKRP KLRGAPERR+KDKYCRFHR+H HNTSD WELK QIEDLIQD YFKKFVGKPRTSS EKKEERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVG
        PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDS DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQL KS TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQIASRECYASALKGSSFCALETL
        FS ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQIASRECYASALKGSS CALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQIASRECYASALKGSSFCALETL

Query:  AGRDGTLEFEADLPRREFAAPTEELELVPLL
          RDGTLEF+A+LPRREFAAPTEELELVPLL
Subjt:  AGRDGTLEFEADLPRREFAAPTEELELVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]9.9e-21086.26Show/hide
Query:  MCYFLTGLADEALT-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRVENGPTRSRPYERFTP
        MCYFLTGLADEALT                         ELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+GRAEYRR ENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALT-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRVENGPTRSRPYERFTP

Query:  TTISISEILTNIEESGMEKLLKRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRP
        TTI ISEILTNIEESGMEKLLKRP KLRGAPERRSKDKYCRFHR+HGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRP
Subjt:  TTISISEILTNIEESGMEKLLKRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQRPTCPITFD  DL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  TKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQIASRECYASALKG
         KSPTPLVGFSGESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQ ASRECYAS LKG
Subjt:  TKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQIASRECYASALKG

Query:  SSFCALETLAGRDGTLEFEADLPRREFAAPTEELELVPLLSPEK
        +S CALETL  RDGTLEFEADLP REFAAP EELELVPLLS EK
Subjt:  SSFCALETLAGRDGTLEFEADLPRREFAAPTEELELVPLLSPEK

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.8e-25763.89Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGIATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAM
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ + TEPL RSARIT PVLPPAHP                                        
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGIATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAM

Query:  RTQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQTPSRSHRSSNQQAESSRNPA
                                                                                         PS+        AESS NP 
Subjt:  RTQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQTPSRSHRSSNQQAESSRNPA

Query:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTLDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQI
        TP GVITREEFDQL+ + DAQVEALKA+CE+KE S +DGDLGE  F+ D+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQI
Subjt:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTLDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQI

Query:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLSTIRQKEGETLREYVTRFQEAQLKVAHCSDDSAMCYFLTGLADEALT--------
        ALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THL+TIRQKEGETLREYVTRF E QLKVAHCSDDSAMCYFLTGLADE LT        
Subjt:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLSTIRQKEGETLREYVTRFQEAQLKVAHCSDDSAMCYFLTGLADEALT--------

Query:  -----------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRVENGPTRSRPYERFTPTTISISEILTNIEESGMEKLL
                         ELLRTKTGRPE+ I +GR+GKD  KAD KS+DKG S SS R +YRR  +   +SRPYE +TPTTI I EILTNIEE+GMEKLL
Subjt:  -----------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRVENGPTRSRPYERFTPTTISISEILTNIEESGMEKLL

Query:  KRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKE
        KRP KLRG PE+R+ DKYCRFHR HGHNTS+ WELK QIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KE
Subjt:  KRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKE

Query:  LARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCI
        LAR ARREVCIIREQRPT  I F+  DLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQL KSPTPLVGFSGES+  EGCI
Subjt:  LARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCI

Query:  DLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQIASRECYASALKGSSFCALETLAGRD
        DLPV++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SS CALE    RD
Subjt:  DLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQIASRECYASALKGSSFCALETLAGRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]7.4e-20571.45Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLSTIRQKEGETLREYVTRFQEAQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHL+TIRQKE ETLREYVTRFQE QLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLSTIRQKEGETLREYVTRFQEAQLKVAHCSDDSAMCYFL

Query:  TGLADEALT-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRVENGPTRSRPYERFTPTTIS
        T LADE LT                         ELLRTKTGRPE++I + +  ++  KAD KS+DKGS SS  R EYRR+E+GP+RSRPYER+T +TI 
Subjt:  TGLADEALT-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRVENGPTRSRPYERFTPTTIS

Query:  ISEILTNIEESGMEKLLKRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKRP KLRG  E+R+K+KYCRFHR HGHNT+ CWELK QIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSP
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE +PTC ITF   DLE VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQIASRECYASALKGSSFC
                      GCIDLPVT+GQD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS+ C
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQIASRECYASALKGSSFC

Query:  ALETLAGRDGTLEFEADLP---RREFAAPTEELELVPLLSPEK
        ALE    R    E EADLP   +R+F  PTEELELVPLLSPE+
Subjt:  ALETLAGRDGTLEFEADLP---RREFAAPTEELELVPLLSPEK

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.4e-26590.53Show/hide
Query:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTLDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEG LNDGDLGESPFT DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTLDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLSTIRQKEGETLREYVTRFQEAQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHL+TIRQKEGETLREYVTRFQE QLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLSTIRQKEGETLREYVTRFQEAQLKVAHCSDDSAMCYFLTGLADEAL

Query:  T-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRVENGPTRSRPYERFTPTTISISEILTNIE
        T                         ELLRTKTGRPERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRR ENGPTRSRPYERFTPTTI ISEILTNIE
Subjt:  T-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRVENGPTRSRPYERFTPTTISISEILTNIE

Query:  ESGMEKLLKRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRP KLRGAPERRSKDKYCRFHR+HGHNTSD WELK QIE+LIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTCPITFD  DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQL KSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188233.4e-26479.4Show/hide
Query:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTLDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESS NPATP GVITREEFDQLRG+L+AQVEALKAKCEQKEG LNDGDLGESPFT DVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTLDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLSTIRQKEGETLREYVTRFQEAQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLSTIRQKEGETLREYVTRFQEAQLKVAHCSDDSAMCYFLTGLA

Query:  DEALT-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRVENGPTRSRPYERFTPTTISISEIL
        DEALT                         ELLRTKTGRPER I RGRSGKD EKAD KSKDKGSFSSGRAE+RR  NGPTRSRPYERFTPTTI ISEIL
Subjt:  DEALT-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRVENGPTRSRPYERFTPTTISISEIL

Query:  TNIEESGMEKLLKRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGG
        TNIEESGMEKLLKRP KLRGAPERR+KDKYCRFHR+H HNTSD WELK QIEDLIQD YFKKFVGKPRTSS EKKEERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVG
        PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDS DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQL KS TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQIASRECYASALKGSSFCALETL
        FS ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQIASRECYASALKGSS CALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQIASRECYASALKGSSFCALETL

Query:  AGRDGTLEFEADLPRREFAAPTEELELVPLL
          RDGTLEF+A+LPRREFAAPTEELELVPLL
Subjt:  AGRDGTLEFEADLPRREFAAPTEELELVPLL

A0A6J1DD03 uncharacterized protein LOC1110198994.8e-21086.26Show/hide
Query:  MCYFLTGLADEALT-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRVENGPTRSRPYERFTP
        MCYFLTGLADEALT                         ELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+GRAEYRR ENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALT-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRVENGPTRSRPYERFTP

Query:  TTISISEILTNIEESGMEKLLKRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRP
        TTI ISEILTNIEESGMEKLLKRP KLRGAPERRSKDKYCRFHR+HGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRP
Subjt:  TTISISEILTNIEESGMEKLLKRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQRPTCPITFD  DL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  TKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQIASRECYASALKG
         KSPTPLVGFSGESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQ ASRECYAS LKG
Subjt:  TKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQIASRECYASALKG

Query:  SSFCALETLAGRDGTLEFEADLPRREFAAPTEELELVPLLSPEK
        +S CALETL  RDGTLEFEADLP REFAAP EELELVPLLS EK
Subjt:  SSFCALETLAGRDGTLEFEADLPRREFAAPTEELELVPLLSPEK

A0A6J1DHB3 uncharacterized protein LOC1110204791.4e-25763.89Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGIATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAM
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ + TEPL RSARIT PVLPPAHP                                        
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGIATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAM

Query:  RTQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQTPSRSHRSSNQQAESSRNPA
                                                                                         PS+        AESS NP 
Subjt:  RTQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQTPSRSHRSSNQQAESSRNPA

Query:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTLDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQI
        TP GVITREEFDQL+ + DAQVEALKA+CE+KE S +DGDLGE  F+ D+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQI
Subjt:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTLDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQI

Query:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLSTIRQKEGETLREYVTRFQEAQLKVAHCSDDSAMCYFLTGLADEALT--------
        ALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THL+TIRQKEGETLREYVTRF E QLKVAHCSDDSAMCYFLTGLADE LT        
Subjt:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLSTIRQKEGETLREYVTRFQEAQLKVAHCSDDSAMCYFLTGLADEALT--------

Query:  -----------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRVENGPTRSRPYERFTPTTISISEILTNIEESGMEKLL
                         ELLRTKTGRPE+ I +GR+GKD  KAD KS+DKG S SS R +YRR  +   +SRPYE +TPTTI I EILTNIEE+GMEKLL
Subjt:  -----------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRVENGPTRSRPYERFTPTTISISEILTNIEESGMEKLL

Query:  KRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKE
        KRP KLRG PE+R+ DKYCRFHR HGHNTS+ WELK QIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KE
Subjt:  KRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKE

Query:  LARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCI
        LAR ARREVCIIREQRPT  I F+  DLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQL KSPTPLVGFSGES+  EGCI
Subjt:  LARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCI

Query:  DLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQIASRECYASALKGSSFCALETLAGRD
        DLPV++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SS CALE    RD
Subjt:  DLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQIASRECYASALKGSSFCALETLAGRD

A0A6J1DZB9 uncharacterized protein LOC1110249043.6e-20571.45Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLSTIRQKEGETLREYVTRFQEAQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHL+TIRQKE ETLREYVTRFQE QLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLSTIRQKEGETLREYVTRFQEAQLKVAHCSDDSAMCYFL

Query:  TGLADEALT-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRVENGPTRSRPYERFTPTTIS
        T LADE LT                         ELLRTKTGRPE++I + +  ++  KAD KS+DKGS SS  R EYRR+E+GP+RSRPYER+T +TI 
Subjt:  TGLADEALT-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRVENGPTRSRPYERFTPTTIS

Query:  ISEILTNIEESGMEKLLKRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKRP KLRG  E+R+K+KYCRFHR HGHNT+ CWELK QIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSP
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE +PTC ITF   DLE VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQIASRECYASALKGSSFC
                      GCIDLPVT+GQD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS+ C
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQIASRECYASALKGSSFC

Query:  ALETLAGRDGTLEFEADLP---RREFAAPTEELELVPLLSPEK
        ALE    R    E EADLP   +R+F  PTEELELVPLLSPE+
Subjt:  ALETLAGRDGTLEFEADLP---RREFAAPTEELELVPLLSPEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGT
CACGACGGCATAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCCCCAAGGACATCCAAGGCCACCCGTGGCCGA
GGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGG
TCCATGGAGGAAATGTATAACGAGATGATACTAGCAGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCAC
CTCGGCCCAGTCGAGGAGGAACATCCTGAAGACAACGAGAGCGAGGGACACACTCGCCGGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCT
CTCCGAAAAGGACAGACACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAG
TTCGACCAGCTGAGGGGCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTTCACTGAACGATGGCGATTTGGGAGAATCGCCC
TTCACCTTGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTT
GAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCC
AGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCTCCACCATCAGACAG
AAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGCGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGT
CTAGCTGACGAAGCCCTCACGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCC
AAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGTGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACG
ATTTCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGCGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGAC
AAGTATTGCCGCTTCCATCGGAAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCACCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTT
GTGGGAAAGCCCAGGACCAGCTCGACAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTC
GGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATC
ACCTTCGACAGTACAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGAC
GGAGGCGCATCTGCTAACATTCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGACGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGA
GAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACACTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCG
GCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCGTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACG
GTCCGAGGAGAACAGATCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGTTCTGCGCCCTCGAAACTCTCGCTGGTAGGGATGGGACGCTCGAG
TTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGT
CACGACGGCATAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCCCCAAGGACATCCAAGGCCACCCGTGGCCGA
GGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGG
TCCATGGAGGAAATGTATAACGAGATGATACTAGCAGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCAC
CTCGGCCCAGTCGAGGAGGAACATCCTGAAGACAACGAGAGCGAGGGACACACTCGCCGGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCT
CTCCGAAAAGGACAGACACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAG
TTCGACCAGCTGAGGGGCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTTCACTGAACGATGGCGATTTGGGAGAATCGCCC
TTCACCTTGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTT
GAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCC
AGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCTCCACCATCAGACAG
AAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGCGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGT
CTAGCTGACGAAGCCCTCACGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCC
AAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGTGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACG
ATTTCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGCGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGAC
AAGTATTGCCGCTTCCATCGGAAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCACCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTT
GTGGGAAAGCCCAGGACCAGCTCGACAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTC
GGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATC
ACCTTCGACAGTACAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGAC
GGAGGCGCATCTGCTAACATTCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGACGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGA
GAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACACTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCG
GCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCGTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACG
GTCCGAGGAGAACAGATCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGTTCTGCGCCCTCGAAACTCTCGCTGGTAGGGATGGGACGCTCGAG
TTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGTAA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGIATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMR
SMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQTPSRSHRSSNQQAESSRNPATPAGVITREE
FDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTLDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA
RSISTYSQLRREFLAQFSSRHYDKKTATHLSTIRQKEGETLREYVTRFQEAQLKVAHCSDDSAMCYFLTGLADEALTELLRTKTGRPERKIGRGRSGKDIEKADP
KSKDKGSFSSGRAEYRRVENGPTRSRPYERFTPTTISISEILTNIEESGMEKLLKRPAKLRGAPERRSKDKYCRFHRKHGHNTSDCWELKHQIEDLIQDGYFKKF
VGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVD
GGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGT
VRGEQIASRECYASALKGSSFCALETLAGRDGTLEFEADLPRREFAAPTEELELVPLLSPEK