; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g11030 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g11030
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr10:8432964..8436635
RNA-Seq ExpressionMoc10g11030
SyntenyMoc10g11030
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]3.6e-22681.25Show/hide
Query:  QVQSSRNPATPAGVITREEFDQLRGQLDAQ---------------------------DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASD
        + +SSRNPATPAGVITREEFDQLRGQLDAQ                           DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASD
Subjt:  QVQSSRNPATPAGVITREEFDQLRGQLDAQ---------------------------DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKGSFPSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT RPERKIGRGRSGKDIE ADPKSKDKGSF SGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKGSFPSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  KSGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPERRSKDKY RFHREHGHNTSD WELKRQIE+LIQDGYFK+FVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  KSGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKE---------------------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE
        QSG KRKE                                                   RVLVDGG SANILSLPTYLALGWTRSQL KSPTPLVGFSGE
Subjt:  QSGHKRKE---------------------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE

Query:  SVIPEGCIDLLVTHGQDQTQVTQMAEFV
        SVIPEG IDL VT GQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLLVTHGQDQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.2e-21870.63Show/hide
Query:  SSNQQVQSSRNPATPAGVITREEFDQLRGQLDAQ------------------DVLEAPIPPK-FKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKC
        SSNQQ +SS NPATP GVITREEFDQLRG+L+AQ                  D+ E+P      +APTVK YDGSKDPKDYVEVFE LMDFQAASDAIKC
Subjt:  SSNQQVQSSRNPATPAGVITREEFDQLRGQLDAQ------------------DVLEAPIPPK-FKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKC

Query:  RAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL
        RAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLADEALTVKL
Subjt:  RAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL

Query:  GEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKGSFPSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGM
        G+EAPATFAEVLQKAKKVIDGQELLRTKT RPER I RGRSGKD EKAD KSKDKGSF SGRAE+RRA NGPTRSRPYERFTPTTIPISEILTNIE+SGM
Subjt:  GEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKGSFPSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGM

Query:  EKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGH
        EKLLKRPEKLRGAPERR+KDKY RFHREH HNTSD WELKRQIEDLIQD YFK+FVGKPRTSS EKKEERK SRTP RR DRPAVINTIFGGPSGGQSGH
Subjt:  EKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGH

Query:  KRKE---------------------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIP
        KRKE                                                   RVLVD G SANI+SL TYLALGWTRSQL KS TPLVGFS ESVIP
Subjt:  KRKE---------------------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIP

Query:  EGCIDLLVTHGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAILLTLHQVLKYSTPNGVGTVRGEQTASRECYASALKDSSVCALETLAGRDETLE
        EGCIDL VT G DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAI  TLHQVLKYSTPNGVG VRGEQ ASRECYASALK SSVCALETL  RD TLE
Subjt:  EGCIDLLVTHGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAILLTLHQVLKYSTPNGVGTVRGEQTASRECYASALKDSSVCALETLAGRDETLE

Query:  FEADLPRREFAAPTEELELVPLL
        F+A+LPRREFAAPTEELELVPLL
Subjt:  FEADLPRREFAAPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]6.8e-22559.62Show/hide
Query:  MVQPAKSTNTADRRTLAASDAHQREGGAAVVEGKGHDGLATEPLRRSARITAPILPPAHPPRTSKATRGRGGTSKKSARGPAPAPTSENLDALQREMEAM
        MVQPA STNTADRR LAA+  HQRE GA VVEG+GH+ L TEPL RSARIT P+LPPAHP                                        
Subjt:  MVQPAKSTNTADRRTLAASDAHQREGGAAVVEGKGHDGLATEPLRRSARITAPILPPAHPPRTSKATRGRGGTSKKSARGPAPAPTSENLDALQREMEAM

Query:  RTKMRSMEEMYNEMISAAGTGSRSENRVTRVGIREQRGSHLGPIEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVQSSRNPA
                                                                                         PS++        +SS NP 
Subjt:  RTKMRSMEEMYNEMISAAGTGSRSENRVTRVGIREQRGSHLGPIEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVQSSRNPA

Query:  TPAGVITREEFDQLRGQLDAQ---------------------------DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFQI
        TP GVITREEFDQL+ + DAQ                           D+LEA IPPKFK PT+KPYDGSKDPKDYVEVFESLMDFQAA+DAIKC AFQI
Subjt:  TPAGVITREEFDQLRGQLDAQ---------------------------DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFQI

Query:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAP
        ALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAP
Subjt:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAP

Query:  ATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKG-SFPSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGMEKLL
        ATFAEVLQK KKVIDGQELLRTKT RPE+ I +GR+GKD  KAD KS+DKG S  S R +YRR+ +   +SRPYE +TPTTIPI EILTNIE++GMEKLL
Subjt:  ATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKG-SFPSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGMEKLL

Query:  KRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVIN-----------------
        KRPEKLRG PE+R+ DKY RFHR+HGHNTS+ WELKRQIEDLIQDGYFK+FVGKPR++SVEKKEERKR RTPPRR DRPAVIN                 
Subjt:  KRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVIN-----------------

Query:  ------TIFGGPSGGQSGHKRK---------------ERVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLLVTHGQDQTQV
              +I    +  +  H                   R+LVDGGASANILSL TYLALGWTRSQL KSPTPLVGFSGES+  EGCIDL V+  QD TQV
Subjt:  ------TIFGGPSGGQSGHKRK---------------ERVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLLVTHGQDQTQV

Query:  TQMAEFVVIDGRSAYNAIFGRPIIHSFRAILLTLHQVLKYSTPNGVGTVRGEQTASRECYASALKDSSVCALETLAGRDE
        TQMAEFVVIDGRSAYNAIFGRPIIHSFRA+  TLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE    RDE
Subjt:  TQMAEFVVIDGRSAYNAIFGRPIIHSFRAILLTLHQVLKYSTPNGVGTVRGEQTASRECYASALKDSSVCALETLAGRDE

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]9.9e-19289.14Show/hide
Query:  GVITREEFDQLRGQLDAQ---------------------------DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFQIALT
        G+ITREEFDQLRG+LDAQ                           DVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAFQIALT
Subjt:  GVITREEFDQLRGQLDAQ---------------------------DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKGSFPSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGMEKLLKRPE
        AEVLQKAKKVIDGQELLRTKT RPERKIGRGRSGKD+E+ADPKSKDKGSF SGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE+SGMEKLLKRPE
Subjt:  AEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKGSFPSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGMEKLLKRPE

Query:  KLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKE
        KLRGAPERRSKDKY RFHREHGHNTSD WELKRQIEDLIQDGYFK+FVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQ GHKRKE
Subjt:  KLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKE

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]2.0e-19272.9Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKGSFPS-GRAEYRRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKT RPE++I + +  ++  KAD KS+DKGS  S  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKGSFPS-GRAEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEKSGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIE+SGMEKLLKRPEKLRG  E+R+K+KY RFHR+HGHNT+ CWELKRQIEDLIQDGYFK+FVGKPR++SVEKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEKSGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKERV------------------LVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLLVTHGQDQTQVTQM
        TIFGGP+GGQSG+KRKE                    +  G A    + LP   AL    S +       V      +I  GCIDL VT GQD TQVTQM
Subjt:  TIFGGPSGGQSGHKRKERV------------------LVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLLVTHGQDQTQVTQM

Query:  AEFVVIDGRSAYNAIFGRPIIHSFRAILLTLHQVLKYSTPNGVGTVRGEQTASRECYASALKDSSVCALETLAGRDETLEFEADLP---RREFAAPTEEL
        AEFVVIDGRSAYNAIFGRPIIHSFRA+  TLHQVLKYSTPN VG VRGEQ  SRECYASALK S+VCALE    R +  E EADLP   +R+F  PTEEL
Subjt:  AEFVVIDGRSAYNAIFGRPIIHSFRAILLTLHQVLKYSTPNGVGTVRGEQTASRECYASALKDSSVCALETLAGRDETLEFEADLP---RREFAAPTEEL

Query:  ELVPLLSPEKQVS
        ELVPLLSPE+Q +
Subjt:  ELVPLLSPEKQVS

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.7e-22681.25Show/hide
Query:  QVQSSRNPATPAGVITREEFDQLRGQLDAQ---------------------------DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASD
        + +SSRNPATPAGVITREEFDQLRGQLDAQ                           DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASD
Subjt:  QVQSSRNPATPAGVITREEFDQLRGQLDAQ---------------------------DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKGSFPSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT RPERKIGRGRSGKDIE ADPKSKDKGSF SGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKGSFPSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  KSGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPERRSKDKY RFHREHGHNTSD WELKRQIE+LIQDGYFK+FVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  KSGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKE---------------------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE
        QSG KRKE                                                   RVLVDGG SANILSLPTYLALGWTRSQL KSPTPLVGFSGE
Subjt:  QSGHKRKE---------------------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE

Query:  SVIPEGCIDLLVTHGQDQTQVTQMAEFV
        SVIPEG IDL VT GQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLLVTHGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188236.0e-21970.63Show/hide
Query:  SSNQQVQSSRNPATPAGVITREEFDQLRGQLDAQ------------------DVLEAPIPPK-FKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKC
        SSNQQ +SS NPATP GVITREEFDQLRG+L+AQ                  D+ E+P      +APTVK YDGSKDPKDYVEVFE LMDFQAASDAIKC
Subjt:  SSNQQVQSSRNPATPAGVITREEFDQLRGQLDAQ------------------DVLEAPIPPK-FKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKC

Query:  RAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL
        RAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLADEALTVKL
Subjt:  RAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL

Query:  GEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKGSFPSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGM
        G+EAPATFAEVLQKAKKVIDGQELLRTKT RPER I RGRSGKD EKAD KSKDKGSF SGRAE+RRA NGPTRSRPYERFTPTTIPISEILTNIE+SGM
Subjt:  GEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKGSFPSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGM

Query:  EKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGH
        EKLLKRPEKLRGAPERR+KDKY RFHREH HNTSD WELKRQIEDLIQD YFK+FVGKPRTSS EKKEERK SRTP RR DRPAVINTIFGGPSGGQSGH
Subjt:  EKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGH

Query:  KRKE---------------------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIP
        KRKE                                                   RVLVD G SANI+SL TYLALGWTRSQL KS TPLVGFS ESVIP
Subjt:  KRKE---------------------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIP

Query:  EGCIDLLVTHGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAILLTLHQVLKYSTPNGVGTVRGEQTASRECYASALKDSSVCALETLAGRDETLE
        EGCIDL VT G DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAI  TLHQVLKYSTPNGVG VRGEQ ASRECYASALK SSVCALETL  RD TLE
Subjt:  EGCIDLLVTHGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAILLTLHQVLKYSTPNGVGTVRGEQTASRECYASALKDSSVCALETLAGRDETLE

Query:  FEADLPRREFAAPTEELELVPLL
        F+A+LPRREFAAPTEELELVPLL
Subjt:  FEADLPRREFAAPTEELELVPLL

A0A6J1DHB3 uncharacterized protein LOC1110204793.3e-22559.62Show/hide
Query:  MVQPAKSTNTADRRTLAASDAHQREGGAAVVEGKGHDGLATEPLRRSARITAPILPPAHPPRTSKATRGRGGTSKKSARGPAPAPTSENLDALQREMEAM
        MVQPA STNTADRR LAA+  HQRE GA VVEG+GH+ L TEPL RSARIT P+LPPAHP                                        
Subjt:  MVQPAKSTNTADRRTLAASDAHQREGGAAVVEGKGHDGLATEPLRRSARITAPILPPAHPPRTSKATRGRGGTSKKSARGPAPAPTSENLDALQREMEAM

Query:  RTKMRSMEEMYNEMISAAGTGSRSENRVTRVGIREQRGSHLGPIEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVQSSRNPA
                                                                                         PS++        +SS NP 
Subjt:  RTKMRSMEEMYNEMISAAGTGSRSENRVTRVGIREQRGSHLGPIEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVQSSRNPA

Query:  TPAGVITREEFDQLRGQLDAQ---------------------------DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFQI
        TP GVITREEFDQL+ + DAQ                           D+LEA IPPKFK PT+KPYDGSKDPKDYVEVFESLMDFQAA+DAIKC AFQI
Subjt:  TPAGVITREEFDQLRGQLDAQ---------------------------DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFQI

Query:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAP
        ALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAP
Subjt:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAP

Query:  ATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKG-SFPSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGMEKLL
        ATFAEVLQK KKVIDGQELLRTKT RPE+ I +GR+GKD  KAD KS+DKG S  S R +YRR+ +   +SRPYE +TPTTIPI EILTNIE++GMEKLL
Subjt:  ATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKG-SFPSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGMEKLL

Query:  KRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVIN-----------------
        KRPEKLRG PE+R+ DKY RFHR+HGHNTS+ WELKRQIEDLIQDGYFK+FVGKPR++SVEKKEERKR RTPPRR DRPAVIN                 
Subjt:  KRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVIN-----------------

Query:  ------TIFGGPSGGQSGHKRK---------------ERVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLLVTHGQDQTQV
              +I    +  +  H                   R+LVDGGASANILSL TYLALGWTRSQL KSPTPLVGFSGES+  EGCIDL V+  QD TQV
Subjt:  ------TIFGGPSGGQSGHKRK---------------ERVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLLVTHGQDQTQV

Query:  TQMAEFVVIDGRSAYNAIFGRPIIHSFRAILLTLHQVLKYSTPNGVGTVRGEQTASRECYASALKDSSVCALETLAGRDE
        TQMAEFVVIDGRSAYNAIFGRPIIHSFRA+  TLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE    RDE
Subjt:  TQMAEFVVIDGRSAYNAIFGRPIIHSFRAILLTLHQVLKYSTPNGVGTVRGEQTASRECYASALKDSSVCALETLAGRDE

A0A6J1DS95 uncharacterized protein LOC1110234214.8e-19289.14Show/hide
Query:  GVITREEFDQLRGQLDAQ---------------------------DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFQIALT
        G+ITREEFDQLRG+LDAQ                           DVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAFQIALT
Subjt:  GVITREEFDQLRGQLDAQ---------------------------DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKGSFPSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGMEKLLKRPE
        AEVLQKAKKVIDGQELLRTKT RPERKIGRGRSGKD+E+ADPKSKDKGSF SGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE+SGMEKLLKRPE
Subjt:  AEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKGSFPSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGMEKLLKRPE

Query:  KLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKE
        KLRGAPERRSKDKY RFHREHGHNTSD WELKRQIEDLIQDGYFK+FVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQ GHKRKE
Subjt:  KLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKE

A0A6J1DZB9 uncharacterized protein LOC1110249049.7e-19372.9Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKGSFPS-GRAEYRRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKT RPE++I + +  ++  KAD KS+DKGS  S  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKSKDKGSFPS-GRAEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEKSGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIE+SGMEKLLKRPEKLRG  E+R+K+KY RFHR+HGHNT+ CWELKRQIEDLIQDGYFK+FVGKPR++SVEKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEKSGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKERV------------------LVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLLVTHGQDQTQVTQM
        TIFGGP+GGQSG+KRKE                    +  G A    + LP   AL    S +       V      +I  GCIDL VT GQD TQVTQM
Subjt:  TIFGGPSGGQSGHKRKERV------------------LVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLLVTHGQDQTQVTQM

Query:  AEFVVIDGRSAYNAIFGRPIIHSFRAILLTLHQVLKYSTPNGVGTVRGEQTASRECYASALKDSSVCALETLAGRDETLEFEADLP---RREFAAPTEEL
        AEFVVIDGRSAYNAIFGRPIIHSFRA+  TLHQVLKYSTPN VG VRGEQ  SRECYASALK S+VCALE    R +  E EADLP   +R+F  PTEEL
Subjt:  AEFVVIDGRSAYNAIFGRPIIHSFRAILLTLHQVLKYSTPNGVGTVRGEQTASRECYASALKDSSVCALETLAGRDETLEFEADLP---RREFAAPTEEL

Query:  ELVPLLSPEKQVS
        ELVPLLSPE+Q +
Subjt:  ELVPLLSPEKQVS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAAGTCGACAAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGGCGGAGCAGCAGTGGTAGAGGGGAAAGGT
CACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTATTCTACCACCTGCGCACCCCCCAAGGACATCCAAGGCCACCCGTGGCCGA
GGTGGAACCTCTAAGAAGAGCGCCCGGGGTCCAGCCCCGGCCCCGACAAGTGAGAACTTGGATGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCGG
TCCATGGAGGAAATGTATAACGAAATGATATCAGCTGCAGGCACAGGGTCTCGATCTGAGAACCGAGTGACGCGCGTTGGCATACGCGAGCAAAGGGGTTCCCAC
CTCGGCCCAATCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCT
CTCCGAAAAGGACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGTTCAATCCTCTCGCAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAG
TTCGACCAGCTGAGGGGCCAGCTCGACGCTCAGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGAC
CCCAAGGATTATGTTGAGGTCTTTGAAAGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGA
TTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCG
ACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCG
GCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAA
GTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGACCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCC
AAGGACAAGGGATCCTTTCCCAGCGGTCGAGCTGAATATCGAAGGGCGGAGAATGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCA
ATTTCCGAGATCCTAACGAACATCGAGAAGTCTGGAATGGAAAAATTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTAT
TACCGCTTCCATCGGGAGCACGGCCACAATACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGGAATTTGTGGGA
AAGCCCAGGACCAGCTCGGTAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTTGGAGGG
CCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCTTGTCCTTACCGACCTACCTCGCCCTGGGATGG
ACGAGGTCGCAATTGACGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCTGGTCACACATGGGCAGGAC
CAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCTATCATCCACTCATTTCGGGCCATTCTC
TTGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGAC
TCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGAGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAG
CTTGTTCCTCTGCTTAGTCCCGAGAAGCAAGTAAGCATAGGAACCAAGCTGGGGGCCACCGACAGAGAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTC
GTGGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTGAGGGTCCAAACCCATGTGGGTGCCCTT
GACCCGACCTGGGAGGGGCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGTGCACCCGTGGAACGCG
GAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAAGTCGACAAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGGCGGAGCAGCAGTGGTAGAGGGGAAAGGT
CACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTATTCTACCACCTGCGCACCCCCCAAGGACATCCAAGGCCACCCGTGGCCGA
GGTGGAACCTCTAAGAAGAGCGCCCGGGGTCCAGCCCCGGCCCCGACAAGTGAGAACTTGGATGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCGG
TCCATGGAGGAAATGTATAACGAAATGATATCAGCTGCAGGCACAGGGTCTCGATCTGAGAACCGAGTGACGCGCGTTGGCATACGCGAGCAAAGGGGTTCCCAC
CTCGGCCCAATCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCT
CTCCGAAAAGGACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGTTCAATCCTCTCGCAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAG
TTCGACCAGCTGAGGGGCCAGCTCGACGCTCAGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGAC
CCCAAGGATTATGTTGAGGTCTTTGAAAGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGA
TTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCG
ACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCG
GCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAA
GTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGACCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCC
AAGGACAAGGGATCCTTTCCCAGCGGTCGAGCTGAATATCGAAGGGCGGAGAATGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCA
ATTTCCGAGATCCTAACGAACATCGAGAAGTCTGGAATGGAAAAATTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTAT
TACCGCTTCCATCGGGAGCACGGCCACAATACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGGAATTTGTGGGA
AAGCCCAGGACCAGCTCGGTAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTTGGAGGG
CCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCTTGTCCTTACCGACCTACCTCGCCCTGGGATGG
ACGAGGTCGCAATTGACGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCTGGTCACACATGGGCAGGAC
CAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCTATCATCCACTCATTTCGGGCCATTCTC
TTGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGAC
TCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGAGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAG
CTTGTTCCTCTGCTTAGTCCCGAGAAGCAAGTAAGCATAGGAACCAAGCTGGGGGCCACCGACAGAGAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTC
GTGGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTGAGGGTCCAAACCCATGTGGGTGCCCTT
GACCCGACCTGGGAGGGGCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGTGCACCCGTGGAACGCG
GAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPAKSTNTADRRTLAASDAHQREGGAAVVEGKGHDGLATEPLRRSARITAPILPPAHPPRTSKATRGRGGTSKKSARGPAPAPTSENLDALQREMEAMRTKMR
SMEEMYNEMISAAGTGSRSENRVTRVGIREQRGSHLGPIEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVQSSRNPATPAGVITREE
FDQLRGQLDAQDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTA
THLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIEKADPKS
KDKGSFPSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDGYFKEFVG
KPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKERVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLLVTHGQD
QTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAILLTLHQVLKYSTPNGVGTVRGEQTASRECYASALKDSSVCALETLAGRDETLEFEADLPRREFAAPTEELE
LVPLLSPEKQVSIGTKLGATDREERRKLARRAARFVVRGGALYRRGFSLPLLRCLTPEEGLRVQTHVGALDPTWEGPFEVKGIVRPGTYILADLKGDVLVHPWNA
EHLKRYYP