; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g10160 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g10160
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:7802310..7804612
RNA-Seq ExpressionMoc07g10160
SyntenyMoc07g10160
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]4.9e-24084.66Show/hide
Query:  QAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASD
        +AESS NPATPAGVITREEFDQLRG+LDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEA IPPKFKAPTVKPYDGSKDPK+YVEVFE LMDFQAASD
Subjt:  QAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVT-------------------------SRRSP
        AIKCRAF+IALTGSARLWYRRLPA SISTY+QLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVT                         +  + 
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVT-------------------------SRRSP

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGISRGRSGKD-EKADLKSKDKGSFSSGRAEFRRA----------------------ILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD E AD KSKDKGSFSSGRAE+RRA                      ILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGISRGRSGKD-EKADLKSKDKGSFSSGRAEFRRA----------------------ILTNIE

Query:  ESGMEKLLKRPEKLRGAPERHSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPER SKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKE RKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERHSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARRDVCIMREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRMLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG KRKELARAARR+VCI+REQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVV R+LVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGHKRKELARAARRDVCIMREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRMLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTFGQDQTQVTQMAEFV
        SVIPEG IDLPVT GQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTFGQDQTQVTQMAEFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]1.5e-17780.57Show/hide
Query:  KEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQ
        K+  LNDGDLGES FTSDVLEA IPPKFKAPTVKPYDGSKDPK+YVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTY+QLRREFLAQ
Subjt:  KEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVT-------------------------SRRSPTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGI
        FSSR Y KKT THLATIRQKEG TLREYVT                         +  + TVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+R I
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVT-------------------------SRRSPTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGI

Query:  SRGRSGKD-EKADLKSKDKGSFSSGRAEFRRA----------------------ILTNIEESGMEKLLKRPEKLRGAPERHSKDKYCRFHREHGHNTSDC
         RGRSGKD E+AD KSKDKGSFSSGRAE+RRA                      ILTNIEESGMEKLLKRPEKLRGAPER SKDKYCRFHREHGHNTSDC
Subjt:  SRGRSGKD-EKADLKSKDKGSFSSGRAEFRRA----------------------ILTNIEESGMEKLLKRPEKLRGAPERHSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARRDVCIMREQRPTCPITFDSADLEEVH
        WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKE RKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARR+VCI+REQ PTCPITFD AD EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARRDVCIMREQRPTCPITFDSADLEEVH

Query:  LPHNDALVIAPLIDHVVVRRML
        LPHNDA VIAPLIDHVVVRR+L
Subjt:  LPHNDALVIAPLIDHVVVRRML

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]4.4e-23380.58Show/hide
Query:  SSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQ
        SSNQQAESSHNPATP GVITREEFDQLRGKL+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPK+YVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTSRRSPTVKLGEEAPATFAEVLQKAKK
        AASDAIKCRAFQIALTGSARLW++           QL+   +AQ S    D     +  T    E            + TVKLG+EAPATFAEVLQKAKK
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTSRRSPTVKLGEEAPATFAEVLQKAKK

Query:  VIDGQELLRTKTGRPERGISRGRSGKDEKADLKSKDKGSFSSGRAEFRRA----------------------ILTNIEESGMEKLLKRPEKLRGAPERHS
        VIDGQELLRTKTGRPERGI RGRSGKDEKADLKSKDKGSFSSGRAEFRRA                      ILTNIEESGMEKLLKRPEKLRGAPER +
Subjt:  VIDGQELLRTKTGRPERGISRGRSGKDEKADLKSKDKGSFSSGRAEFRRA----------------------ILTNIEESGMEKLLKRPEKLRGAPERHS

Query:  KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARRDVCIMRE
        KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKE RK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARR+VCI+RE
Subjt:  KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARRDVCIMRE

Query:  QRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRMLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTFGQDQTQVT
        QRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRR+LVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESVIPEGCIDLPVT G DQTQVT
Subjt:  QRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRMLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTFGQDQTQVT

Query:  QMAEFVVIDGRSAYYAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASR
        QMAEFVVIDGRSAY AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASR
Subjt:  QMAEFVVIDGRSAYYAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASR

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.1e-23561.64Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAYPRTSKATRGRGGTSKKGVRGPAPTPTSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPA+P+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAYPRTSKATRGRGGTSKKGVRGPAPTPTSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENRVTRIDVREQSGSRLGPVEEERPEDNESEGHTRQRGDLREHLDRKRGSSLRKGQSPSRSYRSSNQQAESSHNPAT
                                                                                                   AESS+NP T
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENRVTRIDVREQSGSRLGPVEEERPEDNESEGHTRQRGDLREHLDRKRGSSLRKGQSPSRSYRSSNQQAESSHNPAT

Query:  PAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASDAIKCRAFQIA
        P GVITREEFDQL+ K DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPK+YVEVFE LMDFQAA+DAIKC AFQIA
Subjt:  PAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVT-------------------------SRRSPTVKLGEEAPA
        LTGSARLWYRRLPAR ISTY+QLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVT                         +  + TVKL EEAPA
Subjt:  LTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVT-------------------------SRRSPTVKLGEEAPA

Query:  TFAEVLQKAKKVIDGQELLRTKTGRPERGISRGRSGKDE-KADLKSKDKG-SFSSGRAEFRRA----------------------ILTNIEESGMEKLLK
        TFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ KAD KS+DKG S SS R ++RR+                      ILTNIEE+GMEKLLK
Subjt:  TFAEVLQKAKKVIDGQELLRTKTGRPERGISRGRSGKDE-KADLKSKDKG-SFSSGRAEFRRA----------------------ILTNIEESGMEKLLK

Query:  RPEKLRGAPERHSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL
        RPEKLRG PE+ + DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKE RKR RTPPRR DRPAVIN             K+KEL
Subjt:  RPEKLRGAPERHSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARAARRDVCIMREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRMLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCID
        AR ARR+VCI+REQRPT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCID
Subjt:  ARAARRDVCIMREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRMLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCID

Query:  LPVTFGQDQTQVTQMAEFVVIDGRSAYYAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASR
        LPV+  QD TQVTQMAEFVVIDGRSAY AIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SR
Subjt:  LPVTFGQDQTQVTQMAEFVVIDGRSAYYAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASR

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]7.4e-20468.58Show/hide
Query:  EQSGSRLGPVEEERPEDNESEGHTRQRGDLREHL-DRKRGSSL--RKGQSPSRSYRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQ
        E+    + P   E   ++E   ++ +  DLR+HL D+K+ +S       S SR + +SN +A+S + P  P  VI REEFD ++ + D QVEALKA+CE+
Subjt:  EQSGSRLGPVEEERPEDNESEGHTRQRGDLREHL-DRKRGSSL--RKGQSPSRSYRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQ

Query:  KEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQ
        KE P +D DLGESPFTSD++EA IPPKFK PT+KPYDGSKDPK+YVEVFEGLMDFQAA+DAIKC AFQIALTGSARLW RRLPARSISTY+QLR+EF+ Q
Subjt:  KEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTSRRSPTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGISRGR-SGKDEKADLKSKDKGSFSSG
        FS RHYD+KTATHLATIRQKE ETL          TVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPE+ I + R S K  K D KSKDKGS SSG
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTSRRSPTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGISRGR-SGKDEKADLKSKDKGSFSSG

Query:  -RAEFRRAILTNIEESGMEKLLKRPEKLRGAPERHSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDR
         R E+RR+      ESG  +   RP +                          CWELKRQIEDLIQD YFKKFVGKPR++S EKKE RKRSRTPPRR DR
Subjt:  -RAEFRRAILTNIEESGMEKLLKRPEKLRGAPERHSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDR

Query:  PAVINTIFGGPSGGQSGHKRKELARAARRDVCIMREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRMLVDGGASANILSLPTYLALGWTRSQ
        PAVINTIFGGPSGGQ  +KRKELA  ARR V I+REQ+PTC ITF   DLE VHLPHNDALVIAPLIDHV+VRR+LVDGGASANILSLPTYLAL  TRSQ
Subjt:  PAVINTIFGGPSGGQSGHKRKELARAARRDVCIMREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRMLVDGGASANILSLPTYLALGWTRSQ

Query:  LKKSPTPLVGFSGESVIPEGCIDLPVTFGQDQTQVTQMAEFVVIDGRSAYYAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASR
        LKKSPTPLVGFS ESV PEGCIDLPVT GQD TQVTQMAEFVVIDGR AY AIF RPIIHSF+A+PS LHQVLKYSTPNGVGTVRGEQ  SR
Subjt:  LKKSPTPLVGFSGESVIPEGCIDLPVTFGQDQTQVTQMAEFVVIDGRSAYYAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASR

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088132.4e-24084.66Show/hide
Query:  QAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASD
        +AESS NPATPAGVITREEFDQLRG+LDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEA IPPKFKAPTVKPYDGSKDPK+YVEVFE LMDFQAASD
Subjt:  QAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVT-------------------------SRRSP
        AIKCRAF+IALTGSARLWYRRLPA SISTY+QLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVT                         +  + 
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVT-------------------------SRRSP

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGISRGRSGKD-EKADLKSKDKGSFSSGRAEFRRA----------------------ILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD E AD KSKDKGSFSSGRAE+RRA                      ILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGISRGRSGKD-EKADLKSKDKGSFSSGRAEFRRA----------------------ILTNIE

Query:  ESGMEKLLKRPEKLRGAPERHSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPER SKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKE RKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERHSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARRDVCIMREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRMLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG KRKELARAARR+VCI+REQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVV R+LVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGHKRKELARAARRDVCIMREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRMLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTFGQDQTQVTQMAEFV
        SVIPEG IDLPVT GQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTFGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188232.1e-23380.58Show/hide
Query:  SSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQ
        SSNQQAESSHNPATP GVITREEFDQLRGKL+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPK+YVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTSRRSPTVKLGEEAPATFAEVLQKAKK
        AASDAIKCRAFQIALTGSARLW++           QL+   +AQ S    D     +  T    E            + TVKLG+EAPATFAEVLQKAKK
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTSRRSPTVKLGEEAPATFAEVLQKAKK

Query:  VIDGQELLRTKTGRPERGISRGRSGKDEKADLKSKDKGSFSSGRAEFRRA----------------------ILTNIEESGMEKLLKRPEKLRGAPERHS
        VIDGQELLRTKTGRPERGI RGRSGKDEKADLKSKDKGSFSSGRAEFRRA                      ILTNIEESGMEKLLKRPEKLRGAPER +
Subjt:  VIDGQELLRTKTGRPERGISRGRSGKDEKADLKSKDKGSFSSGRAEFRRA----------------------ILTNIEESGMEKLLKRPEKLRGAPERHS

Query:  KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARRDVCIMRE
        KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKE RK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARR+VCI+RE
Subjt:  KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARRDVCIMRE

Query:  QRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRMLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTFGQDQTQVT
        QRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRR+LVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESVIPEGCIDLPVT G DQTQVT
Subjt:  QRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRMLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTFGQDQTQVT

Query:  QMAEFVVIDGRSAYYAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASR
        QMAEFVVIDGRSAY AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASR
Subjt:  QMAEFVVIDGRSAYYAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASR

A0A6J1D9W7 uncharacterized protein LOC1110187087.5e-17880.57Show/hide
Query:  KEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQ
        K+  LNDGDLGES FTSDVLEA IPPKFKAPTVKPYDGSKDPK+YVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTY+QLRREFLAQ
Subjt:  KEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVT-------------------------SRRSPTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGI
        FSSR Y KKT THLATIRQKEG TLREYVT                         +  + TVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+R I
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVT-------------------------SRRSPTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGI

Query:  SRGRSGKD-EKADLKSKDKGSFSSGRAEFRRA----------------------ILTNIEESGMEKLLKRPEKLRGAPERHSKDKYCRFHREHGHNTSDC
         RGRSGKD E+AD KSKDKGSFSSGRAE+RRA                      ILTNIEESGMEKLLKRPEKLRGAPER SKDKYCRFHREHGHNTSDC
Subjt:  SRGRSGKD-EKADLKSKDKGSFSSGRAEFRRA----------------------ILTNIEESGMEKLLKRPEKLRGAPERHSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARRDVCIMREQRPTCPITFDSADLEEVH
        WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKE RKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARR+VCI+REQ PTCPITFD AD EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARRDVCIMREQRPTCPITFDSADLEEVH

Query:  LPHNDALVIAPLIDHVVVRRML
        LPHNDA VIAPLIDHVVVRR+L
Subjt:  LPHNDALVIAPLIDHVVVRRML

A0A6J1DHB3 uncharacterized protein LOC1110204791.0e-23561.64Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAYPRTSKATRGRGGTSKKGVRGPAPTPTSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPA+P+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAYPRTSKATRGRGGTSKKGVRGPAPTPTSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENRVTRIDVREQSGSRLGPVEEERPEDNESEGHTRQRGDLREHLDRKRGSSLRKGQSPSRSYRSSNQQAESSHNPAT
                                                                                                   AESS+NP T
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENRVTRIDVREQSGSRLGPVEEERPEDNESEGHTRQRGDLREHLDRKRGSSLRKGQSPSRSYRSSNQQAESSHNPAT

Query:  PAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASDAIKCRAFQIA
        P GVITREEFDQL+ K DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPK+YVEVFE LMDFQAA+DAIKC AFQIA
Subjt:  PAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVT-------------------------SRRSPTVKLGEEAPA
        LTGSARLWYRRLPAR ISTY+QLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVT                         +  + TVKL EEAPA
Subjt:  LTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVT-------------------------SRRSPTVKLGEEAPA

Query:  TFAEVLQKAKKVIDGQELLRTKTGRPERGISRGRSGKDE-KADLKSKDKG-SFSSGRAEFRRA----------------------ILTNIEESGMEKLLK
        TFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ KAD KS+DKG S SS R ++RR+                      ILTNIEE+GMEKLLK
Subjt:  TFAEVLQKAKKVIDGQELLRTKTGRPERGISRGRSGKDE-KADLKSKDKG-SFSSGRAEFRRA----------------------ILTNIEESGMEKLLK

Query:  RPEKLRGAPERHSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL
        RPEKLRG PE+ + DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKE RKR RTPPRR DRPAVIN             K+KEL
Subjt:  RPEKLRGAPERHSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARAARRDVCIMREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRMLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCID
        AR ARR+VCI+REQRPT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCID
Subjt:  ARAARRDVCIMREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRMLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCID

Query:  LPVTFGQDQTQVTQMAEFVVIDGRSAYYAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASR
        LPV+  QD TQVTQMAEFVVIDGRSAY AIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SR
Subjt:  LPVTFGQDQTQVTQMAEFVVIDGRSAYYAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASR

A0A6J1DPC9 uncharacterized protein LOC1110222803.6e-20468.58Show/hide
Query:  EQSGSRLGPVEEERPEDNESEGHTRQRGDLREHL-DRKRGSSL--RKGQSPSRSYRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQ
        E+    + P   E   ++E   ++ +  DLR+HL D+K+ +S       S SR + +SN +A+S + P  P  VI REEFD ++ + D QVEALKA+CE+
Subjt:  EQSGSRLGPVEEERPEDNESEGHTRQRGDLREHL-DRKRGSSL--RKGQSPSRSYRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQ

Query:  KEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQ
        KE P +D DLGESPFTSD++EA IPPKFK PT+KPYDGSKDPK+YVEVFEGLMDFQAA+DAIKC AFQIALTGSARLW RRLPARSISTY+QLR+EF+ Q
Subjt:  KEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTSRRSPTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGISRGR-SGKDEKADLKSKDKGSFSSG
        FS RHYD+KTATHLATIRQKE ETL          TVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPE+ I + R S K  K D KSKDKGS SSG
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTSRRSPTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGISRGR-SGKDEKADLKSKDKGSFSSG

Query:  -RAEFRRAILTNIEESGMEKLLKRPEKLRGAPERHSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDR
         R E+RR+      ESG  +   RP +                          CWELKRQIEDLIQD YFKKFVGKPR++S EKKE RKRSRTPPRR DR
Subjt:  -RAEFRRAILTNIEESGMEKLLKRPEKLRGAPERHSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDR

Query:  PAVINTIFGGPSGGQSGHKRKELARAARRDVCIMREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRMLVDGGASANILSLPTYLALGWTRSQ
        PAVINTIFGGPSGGQ  +KRKELA  ARR V I+REQ+PTC ITF   DLE VHLPHNDALVIAPLIDHV+VRR+LVDGGASANILSLPTYLAL  TRSQ
Subjt:  PAVINTIFGGPSGGQSGHKRKELARAARRDVCIMREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRMLVDGGASANILSLPTYLALGWTRSQ

Query:  LKKSPTPLVGFSGESVIPEGCIDLPVTFGQDQTQVTQMAEFVVIDGRSAYYAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASR
        LKKSPTPLVGFS ESV PEGCIDLPVT GQD TQVTQMAEFVVIDGR AY AIF RPIIHSF+A+PS LHQVLKYSTPNGVGTVRGEQ  SR
Subjt:  LKKSPTPLVGFSGESVIPEGCIDLPVTFGQDQTQVTQMAEFVVIDGRSAYYAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCAGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCGCGAATCACCGCGCCTGTTCTACCACCTGCGTACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGTCCGAGGTCCAGCCCCGACTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAAATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTAACGCGCATTGACGTACGCGAGCAAAGCGGTTCCCGCCTCGGCCCAGTCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCGACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CATACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCTGCAGGGGTGATCACAAGGGAGGAGTTTGACCAGCTGAGGGGCAAGCTCGATGCTCAG
GTTGAGGCCTTAAAGGCCAAGTGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACCTGGGAGAATCGCCATTCACCTCGGACGTTTTGGAAGCATCGATCCCTCCGAA
GTTCAAAGCTCCAACTGTGAAACCTTATGATGGGTCAAAGGACCCTAAGAATTATGTTGAGGTCTTTGAGGGCCTCATGGATTTTCAAGCAGCATCAGACGCAATCAAAT
GTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACGCTCAGCTGAGAAGGGAGTTCCTTGCCCAG
TTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAAAAGGAAGGTGAGACGCTGCGGGAATATGTCACCAGCCGACGAAGCCCTACGGT
GAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAAGTGCTGCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAG
GGATCAGCCGGGGCAGAAGCGGAAAAGATGAAAAGGCGGATCTCAAGTCCAAGGACAAGGGATCTTTCTCCAGTGGCCGAGCTGAGTTTCGAAGGGCGATCCTAACGAAC
ATCGAGGAGTCTGGAATGGAAAAACTACTCAAACGTCCTGAGAAACTTCGGGGAGCCCCAGAGAGGCACAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCA
CAACACGTCAGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCTAGGACCAGCTCGGCAGAGAAAAAGG
AAGCGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGGGGGCCAAGCGGGGGCCAATCCGGACATAAAAGAAAGGAG
TTAGCTCGTGCAGCCAGGCGCGATGTGTGTATCATGAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGA
TGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGAAGGATGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGAT
GGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACATTTGGGCAGGACCAA
ACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATTACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACT
TCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCAGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCGCGAATCACCGCGCCTGTTCTACCACCTGCGTACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGTCCGAGGTCCAGCCCCGACTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAAATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTAACGCGCATTGACGTACGCGAGCAAAGCGGTTCCCGCCTCGGCCCAGTCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCGACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CATACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCTGCAGGGGTGATCACAAGGGAGGAGTTTGACCAGCTGAGGGGCAAGCTCGATGCTCAG
GTTGAGGCCTTAAAGGCCAAGTGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACCTGGGAGAATCGCCATTCACCTCGGACGTTTTGGAAGCATCGATCCCTCCGAA
GTTCAAAGCTCCAACTGTGAAACCTTATGATGGGTCAAAGGACCCTAAGAATTATGTTGAGGTCTTTGAGGGCCTCATGGATTTTCAAGCAGCATCAGACGCAATCAAAT
GTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACGCTCAGCTGAGAAGGGAGTTCCTTGCCCAG
TTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAAAAGGAAGGTGAGACGCTGCGGGAATATGTCACCAGCCGACGAAGCCCTACGGT
GAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAAGTGCTGCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAG
GGATCAGCCGGGGCAGAAGCGGAAAAGATGAAAAGGCGGATCTCAAGTCCAAGGACAAGGGATCTTTCTCCAGTGGCCGAGCTGAGTTTCGAAGGGCGATCCTAACGAAC
ATCGAGGAGTCTGGAATGGAAAAACTACTCAAACGTCCTGAGAAACTTCGGGGAGCCCCAGAGAGGCACAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCA
CAACACGTCAGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCTAGGACCAGCTCGGCAGAGAAAAAGG
AAGCGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGGGGGCCAAGCGGGGGCCAATCCGGACATAAAAGAAAGGAG
TTAGCTCGTGCAGCCAGGCGCGATGTGTGTATCATGAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGA
TGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGAAGGATGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGAT
GGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACATTTGGGCAGGACCAA
ACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATTACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACT
TCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGTAG
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAYPRTSKATRGRGGTSKKGVRGPAPTPTSENFDALQREMEAMRTQMRSMEEMY
NEMILAAGAGSRSENRVTRIDVREQSGSRLGPVEEERPEDNESEGHTRQRGDLREHLDRKRGSSLRKGQSPSRSYRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQ
VEALKAKCEQKEGPLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGSKDPKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQ
FSSRHYDKKTATHLATIRQKEGETLREYVTSRRSPTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGISRGRSGKDEKADLKSKDKGSFSSGRAEFRRAILTN
IEESGMEKLLKRPEKLRGAPERHSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEARKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKE
LARAARRDVCIMREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRMLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTFGQDQ
TQVTQMAEFVVIDGRSAYYAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASR