; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g29130 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g29130
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:21856642..21861441
RNA-Seq ExpressionMoc09g29130
SyntenyMoc09g29130
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]7.8e-16965.44Show/hide
Query:  QAESSYNPITPEEVITREEFDQLKSKFDAQVEALKAKCEKKESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQAATD
        +AESS NP TP  VITREEFDQL+ + DAQVEALKAKCE+KE P +DGDLGES FTSD+LEAPIPPKFK PT+KPYDG KDPKDYVEVFE L+DFQAA+D
Subjt:  QAESSYNPITPEEVITREEFDQLKSKFDAQVEALKAKCEKKESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQAATD

Query:  AIKCRAFQIALTASTPV--------------------------------TTTHLATIRQKEGETLREYVTRFQEEQLK----------------------
        AIKCRAF+IALT S  +                                T THLATIRQKEGETLREYVTRFQEEQLK                      
Subjt:  AIKCRAFQIALTASTPV--------------------------------TTTHLATIRQKEGETLREYVTRFQEEQLK----------------------

Query:  -------APATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEKRKIDSKSKDKGSPSSGSRTEYRRSEIGLNRSRPYERYTPTTIPISEILTNI
               APATFAEVLQK KKVIDGQELLRTKTGRPE++I + +  ++    D KSKDKGS SSG R EYRR+E G  RSRPYER+TPTTIPISEILTNI
Subjt:  -------APATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEKRKIDSKSKDKGSPSSGSRTEYRRSEIGLNRSRPYERYTPTTIPISEILTNI

Query:  EESGMEKLLEQPEKLQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIEDVIQDGYFKKFVGKP-----SHQYHLRRLK--------------------W
        EESGMEKLL++PEKL+G PE+  KDKYCRFHR+HGHNTS+ WELKRQIE++IQDGYFKKFVGKP       +   +R +                     
Subjt:  EESGMEKLLEQPEKLQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIEDVIQDGYFKKFVGKP-----SHQYHLRRLK--------------------W

Query:  GQSGIKRKELAREARREVCTIRKQQPTCSITFDDSDLEGVYLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLTTYLALGWTRAQLKKSPTPLVGFSG
        GQSG KRKELAR ARREVC IR+Q+PTC ITFD +DLE V+LPHNDALVIAPLIDHV+V +VLVDGG SANILSL TYLALGWTR+QLKKSPTPLVGFSG
Subjt:  GQSGIKRKELAREARREVCTIRKQQPTCSITFDDSDLEGVYLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLTTYLALGWTRAQLKKSPTPLVGFSG

Query:  ESVSPEGCIDLPVTIGQN
        ESV PEG IDLPVT+GQ+
Subjt:  ESVSPEGCIDLPVTIGQN

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]2.1e-16660.52Show/hide
Query:  NSNQQAESSYNPITPEEVITREEFDQLKSKFDAQVEALKAKCEKKESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQ
        +SNQQAESS+NP TP+ VITREEFDQL+ K +AQVEALKAKCE+KE P +DGDLGES FTSD+LEA        PT+K YDG KDPKDYVEVFEGL+DFQ
Subjt:  NSNQQAESSYNPITPEEVITREEFDQLKSKFDAQVEALKAKCEKKESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQ

Query:  AATDAIKCRAFQIALTASTPV--TTTHLATIRQKEGETLREYVTRFQEEQL------KAPATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEK
        AA+DAIKCRAFQIALT S  +      L   +  +   +  ++T   +E L      +APATFAEVLQK KKVIDGQELLRTKTGRPE+ ID+ +  +++
Subjt:  AATDAIKCRAFQIALTASTPV--TTTHLATIRQKEGETLREYVTRFQEEQL------KAPATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEK

Query:  RKIDSKSKDKGSPSSGSRTEYRRSEIGLNRSRPYERYTPTTIPISEILTNIEESGMEKLLEQPEKLQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIE
         K D KSKDKGS SSG R E+RR+  G  RSRPYER+TPTTIPISEILTNIEESGMEKLL++PEKL+G PE+ +KDKYCRFHR+H HNTS+ WELKRQIE
Subjt:  RKIDSKSKDKGSPSSGSRTEYRRSEIGLNRSRPYERYTPTTIPISEILTNIEESGMEKLLEQPEKLQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIE

Query:  DVIQDGYFKKFVGKPS------------HQYHLRRL-------------KWGQSGIKRKELAREARREVCTIRKQQPTCSITFDDSDLEGVYLPHNDALV
        D+IQD YFKKFVGKP              +  LRR+               GQSG KRKELAR ARREVC IR+Q+PTC ITFD +DLE V+LPHNDALV
Subjt:  DVIQDGYFKKFVGKPS------------HQYHLRRL-------------KWGQSGIKRKELAREARREVCTIRKQQPTCSITFDDSDLEGVYLPHNDALV

Query:  IAPLIDHVLVRKVLVDGGASANILSLTTYLALGWTRAQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQN-------------------------------
        IAPLIDHV+VR+VLVD G SANI+SL TYLALGWTR+QLKKS TPLVGFS ESV PEGCIDLPVT+G +                               
Subjt:  IAPLIDHVLVRKVLVDGGASANILSLTTYLALGWTRAQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQN-------------------------------

Query:  -------------------SIVQGEQRMSRECYASALKGSSVYALEKQVERVGKQSFEADLPREGAKEFSAPTEELELHP
                            +V+GEQ  SRECYASALKGSSV ALE  V R G   F+A+LPR   +EF+APTEELEL P
Subjt:  -------------------SIVQGEQRMSRECYASALKGSSVYALEKQVERVGKQSFEADLPREGAKEFSAPTEELELHP

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]3.7e-18753.56Show/hide
Query:  SNSTNTADRRALVANDGHQRKAGAKVVEGQVHEGMGTEPLRRLARITTPVLPSVHPKPSKANRGRGGASKRTTRGPAPAPTRENFYALQKEMEAMRTQMR
        +NSTNTADRRAL AN GHQR+ GA+VVEGQ HE +GTEPL R ARITTPVLP  HPKPSK                                        
Subjt:  SNSTNTADRRALVANDGHQRKAGAKVVEGQVHEGMGTEPLRRLARITTPVLPSVHPKPSKANRGRGGASKRTTRGPAPAPTRENFYALQKEMEAMRTQMR

Query:  TMEEMYNEMVQTAGAGSRFENRVARDDMREQRGHHLGPVEEEHPEGGEDEEYTHQRGDLRKHLNKKRSSSLRKGQSPSCSHRNSNQQAESSYNPITPEEV
                                                                                               AESSYNPITP  V
Subjt:  TMEEMYNEMVQTAGAGSRFENRVARDDMREQRGHHLGPVEEEHPEGGEDEEYTHQRGDLRKHLNKKRSSSLRKGQSPSCSHRNSNQQAESSYNPITPEEV

Query:  ITREEFDQLKSKFDAQVEALKAKCEKKESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQAATDAIKCRAFQIALTAS
        ITREEFDQLKSKFDAQVEALKA+CEKKES FDDGDLGE SF+SDILEA IPPKFKTPTMKPYDG KDPKDYVEVFE L+DFQAATDAIKC AFQIALT S
Subjt:  ITREEFDQLKSKFDAQVEALKAKCEKKESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQAATDAIKCRAFQIALTAS

Query:  TPV--------------------------------TTTHLATIRQKEGETLREYVTRFQEEQLK-----------------------------APATFAE
          +                                T THLATIRQKEGETLREYVTRF EEQLK                             APATFAE
Subjt:  TPV--------------------------------TTTHLATIRQKEGETLREYVTRFQEEQLK-----------------------------APATFAE

Query:  VLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEKRKIDSKSKDKGSPSSGSRTEYRRSEIGLNRSRPYERYTPTTIPISEILTNIEESGMEKLLEQPEK
        VLQK KKVIDGQELLRTKTGRPEK IDQ +  ++K K DSKS+DKG  SS SR +YRRS    N+SRPYE YTPTTIPI EILTNIEE+GMEKLL++PEK
Subjt:  VLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEKRKIDSKSKDKGSPSSGSRTEYRRSEIGLNRSRPYERYTPTTIPISEILTNIEESGMEKLLEQPEK

Query:  LQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIEDVIQDGYFKKFVGKP-----SHQYHLRRLKWGQSG-------IKRKELAREARREVCTIRKQQPT
        L+GDPEK + DKYCRFHRDHGHNTSN WELKRQIED+IQDGYFKKFVGKP       +   +RL+             K+KELAREARREVC IR+Q+PT
Subjt:  LQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIEDVIQDGYFKKFVGKP-----SHQYHLRRLKWGQSG-------IKRKELAREARREVCTIRKQQPT

Query:  CSITFDDSDLEGVYLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLTTYLALGWTRAQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQN---------
         SI F+ +DLEGV+LPHNDALVIAPLID VLVR++LVDGGASANILSL+TYLALGWTR+QLKKSPTPLVGFSGES+S EGCIDLPV+I Q+         
Subjt:  CSITFDDSDLEGVYLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLTTYLALGWTRAQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQN---------

Query:  -----------------------------------------SIVQGEQRMSRECYASALKGSSVYALEKQVER
                                                   V+GE + SRECYAS  K SSV ALE+Q  R
Subjt:  -----------------------------------------SIVQGEQRMSRECYASALKGSSVYALEKQVER

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]1.3e-14453.42Show/hide
Query:  EQRGHHLGPVEEEHPEGGEDEEYTHQRGDLRKHL-NKKRSSSLRKGQSPSCSHR--NSNQQAESSYNPITPEEVITREEFDQLKSKFDAQVEALKAKCEK
        E+    + P   E     E  +Y+ +  DLRKHL +KK+ +S     S S S    NSN +A+S Y P+ PE VI REEFD +K +FD QVEALKA+CEK
Subjt:  EQRGHHLGPVEEEHPEGGEDEEYTHQRGDLRKHL-NKKRSSSLRKGQSPSCSHR--NSNQQAESSYNPITPEEVITREEFDQLKSKFDAQVEALKAKCEK

Query:  KESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQAATDAIKCRAFQIALTASTPV-----------------------
        KESPFDD DLGES FTSDI+EAPIPPKFKTPTMKPYDG KDPKDYVEVFEGL+DFQAATDAIKC AFQIALT S  +                       
Subjt:  KESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQAATDAIKCRAFQIALTASTPV-----------------------

Query:  ---------TTTHLATIRQKEGETLREYVTRFQEEQLKAPATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEKRKIDSKSKDKGSPSSGSRTE
                 T THLATIRQKE ETL     +  EE   APATFAEVLQ  KKVIDGQELLRTKT RPEKQIDQK+ +Q+KRK DSKSKDKGS SSGSRTE
Subjt:  ---------TTTHLATIRQKEGETLREYVTRFQEEQLKAPATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEKRKIDSKSKDKGSPSSGSRTE

Query:  YRRSEIGLNRSRPYERYTPTTIPISEILTNIEESGMEKLLEQPEKLQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIEDVIQDGYFKKFVGKP-----
        YRRSE G +RSRPYER                                                       CWELKRQIED+IQD YFKKFVGKP     
Subjt:  YRRSEIGLNRSRPYERYTPTTIPISEILTNIEESGMEKLLEQPEKLQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIEDVIQDGYFKKFVGKP-----

Query:  SHQYHLRRLK--------------------WGQSGIKRKELAREARREVCTIRKQQPTCSITFDDSDLEGVYLPHNDALVIAPLIDHVLVRKVLVDGGAS
          +   +R +                     GQ   KRKELA EARR+V  IR+Q+PTCSITF D+DLEGV+LPHNDALVIAPLIDHVLVR+VLVDGGAS
Subjt:  SHQYHLRRLK--------------------WGQSGIKRKELAREARREVCTIRKQQPTCSITFDDSDLEGVYLPHNDALVIAPLIDHVLVRKVLVDGGAS

Query:  ANILSLTTYLALGWTRAQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQNS--------------------------------------------------
        ANILSL TYLAL  TR+QLKKSPTPLVGFS ESVSPEGCIDLPVTIGQ+S                                                  
Subjt:  ANILSLTTYLALGWTRAQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQNS--------------------------------------------------

Query:  IVQGEQRMSRECYASALKGSSVYALEKQVERVGKQSFEADLPRE
         V+GEQ+ SRECYASALK SSV ALE+Q         + DLPRE
Subjt:  IVQGEQRMSRECYASALKGSSVYALEKQVERVGKQSFEADLPRE

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia]1.0e-14456.99Show/hide
Query:  MEAMRTQMRTMEEMYNEMVQTAGAGSRFENRVARDDMREQRGHHLGPVEEEHPEGGEDEEYTHQRGDLRKHLNKKRSSSLRKGQSPSCSHRNSNQQAESS
        MEAMRTQMRTMEEMYN+MVQ AGA SR  ++V  +D+ EQ   H  PV+EE           H  GDLR HLN+KR+SS R  ++ +  H+NSNQQAESS
Subjt:  MEAMRTQMRTMEEMYNEMVQTAGAGSRFENRVARDDMREQRGHHLGPVEEEHPEGGEDEEYTHQRGDLRKHLNKKRSSSLRKGQSPSCSHRNSNQQAESS

Query:  YNPITPEEVITREEFDQLKSKFDAQVEALKAKCEKKESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQAATDAIKCR
        YNPI PE VITREEF+QLKSKFDAQVEALK +CEKKES FDDGDLGES FTSDILEA IPPKFKTPTMK YDG KDPKDYVEVFEGL+DFQAATDAIKCR
Subjt:  YNPITPEEVITREEFDQLKSKFDAQVEALKAKCEKKESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQAATDAIKCR

Query:  AFQIALTASTPV--------------------------------TTTHLATIRQKEGETLREYVTRFQEEQLK---------------------------
        AFQIALT S  +                                TTTHLATIRQKEG+TL+EY+TRFQEEQLK                           
Subjt:  AFQIALTASTPV--------------------------------TTTHLATIRQKEGETLREYVTRFQEEQLK---------------------------

Query:  --APATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEKRKIDSKSKDKGSPSSGSRTEYRRSEIGLNRSRPYERYTPTTIPISEILTNIEESGM
          A ATFAEVLQ  KK IDGQELLRTKT RPEKQIDQKK +Q+KRK DSKSKDKGS SS SRT+Y RS                         ++E+   
Subjt:  --APATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEKRKIDSKSKDKGSPSSGSRTEYRRSEIGLNRSRPYERYTPTTIPISEILTNIEESGM

Query:  EKLLEQPEKLQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIEDVIQDGYFKKFVGKPSHQYHLRRLKWGQSGIKRKELAREARREVCTIRKQQPTCSI
         K    P +L   P                                          G PS          GQSG KRKELAREA REVC IR+Q+PTCS+
Subjt:  EKLLEQPEKLQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIEDVIQDGYFKKFVGKPSHQYHLRRLKWGQSGIKRKELAREARREVCTIRKQQPTCSI

Query:  TFDDSDLEGVYLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLTTYLALGWTRAQLKKSPTPLVGFSGESVSPEGCI
        TFDDSDLEGV+LP+NDALVIAPLIDHVLVR+VLVDGGASANILS    LALGWTR+QLKKSPTPLVGFS ESVS +G +
Subjt:  TFDDSDLEGVYLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLTTYLALGWTRAQLKKSPTPLVGFSGESVSPEGCI

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.8e-16965.44Show/hide
Query:  QAESSYNPITPEEVITREEFDQLKSKFDAQVEALKAKCEKKESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQAATD
        +AESS NP TP  VITREEFDQL+ + DAQVEALKAKCE+KE P +DGDLGES FTSD+LEAPIPPKFK PT+KPYDG KDPKDYVEVFE L+DFQAA+D
Subjt:  QAESSYNPITPEEVITREEFDQLKSKFDAQVEALKAKCEKKESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQAATD

Query:  AIKCRAFQIALTASTPV--------------------------------TTTHLATIRQKEGETLREYVTRFQEEQLK----------------------
        AIKCRAF+IALT S  +                                T THLATIRQKEGETLREYVTRFQEEQLK                      
Subjt:  AIKCRAFQIALTASTPV--------------------------------TTTHLATIRQKEGETLREYVTRFQEEQLK----------------------

Query:  -------APATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEKRKIDSKSKDKGSPSSGSRTEYRRSEIGLNRSRPYERYTPTTIPISEILTNI
               APATFAEVLQK KKVIDGQELLRTKTGRPE++I + +  ++    D KSKDKGS SSG R EYRR+E G  RSRPYER+TPTTIPISEILTNI
Subjt:  -------APATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEKRKIDSKSKDKGSPSSGSRTEYRRSEIGLNRSRPYERYTPTTIPISEILTNI

Query:  EESGMEKLLEQPEKLQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIEDVIQDGYFKKFVGKP-----SHQYHLRRLK--------------------W
        EESGMEKLL++PEKL+G PE+  KDKYCRFHR+HGHNTS+ WELKRQIE++IQDGYFKKFVGKP       +   +R +                     
Subjt:  EESGMEKLLEQPEKLQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIEDVIQDGYFKKFVGKP-----SHQYHLRRLK--------------------W

Query:  GQSGIKRKELAREARREVCTIRKQQPTCSITFDDSDLEGVYLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLTTYLALGWTRAQLKKSPTPLVGFSG
        GQSG KRKELAR ARREVC IR+Q+PTC ITFD +DLE V+LPHNDALVIAPLIDHV+V +VLVDGG SANILSL TYLALGWTR+QLKKSPTPLVGFSG
Subjt:  GQSGIKRKELAREARREVCTIRKQQPTCSITFDDSDLEGVYLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLTTYLALGWTRAQLKKSPTPLVGFSG

Query:  ESVSPEGCIDLPVTIGQN
        ESV PEG IDLPVT+GQ+
Subjt:  ESVSPEGCIDLPVTIGQN

A0A6J1D9E1 uncharacterized protein LOC1110188231.0e-16660.52Show/hide
Query:  NSNQQAESSYNPITPEEVITREEFDQLKSKFDAQVEALKAKCEKKESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQ
        +SNQQAESS+NP TP+ VITREEFDQL+ K +AQVEALKAKCE+KE P +DGDLGES FTSD+LEA        PT+K YDG KDPKDYVEVFEGL+DFQ
Subjt:  NSNQQAESSYNPITPEEVITREEFDQLKSKFDAQVEALKAKCEKKESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQ

Query:  AATDAIKCRAFQIALTASTPV--TTTHLATIRQKEGETLREYVTRFQEEQL------KAPATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEK
        AA+DAIKCRAFQIALT S  +      L   +  +   +  ++T   +E L      +APATFAEVLQK KKVIDGQELLRTKTGRPE+ ID+ +  +++
Subjt:  AATDAIKCRAFQIALTASTPV--TTTHLATIRQKEGETLREYVTRFQEEQL------KAPATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEK

Query:  RKIDSKSKDKGSPSSGSRTEYRRSEIGLNRSRPYERYTPTTIPISEILTNIEESGMEKLLEQPEKLQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIE
         K D KSKDKGS SSG R E+RR+  G  RSRPYER+TPTTIPISEILTNIEESGMEKLL++PEKL+G PE+ +KDKYCRFHR+H HNTS+ WELKRQIE
Subjt:  RKIDSKSKDKGSPSSGSRTEYRRSEIGLNRSRPYERYTPTTIPISEILTNIEESGMEKLLEQPEKLQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIE

Query:  DVIQDGYFKKFVGKPS------------HQYHLRRL-------------KWGQSGIKRKELAREARREVCTIRKQQPTCSITFDDSDLEGVYLPHNDALV
        D+IQD YFKKFVGKP              +  LRR+               GQSG KRKELAR ARREVC IR+Q+PTC ITFD +DLE V+LPHNDALV
Subjt:  DVIQDGYFKKFVGKPS------------HQYHLRRL-------------KWGQSGIKRKELAREARREVCTIRKQQPTCSITFDDSDLEGVYLPHNDALV

Query:  IAPLIDHVLVRKVLVDGGASANILSLTTYLALGWTRAQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQN-------------------------------
        IAPLIDHV+VR+VLVD G SANI+SL TYLALGWTR+QLKKS TPLVGFS ESV PEGCIDLPVT+G +                               
Subjt:  IAPLIDHVLVRKVLVDGGASANILSLTTYLALGWTRAQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQN-------------------------------

Query:  -------------------SIVQGEQRMSRECYASALKGSSVYALEKQVERVGKQSFEADLPREGAKEFSAPTEELELHP
                            +V+GEQ  SRECYASALKGSSV ALE  V R G   F+A+LPR   +EF+APTEELEL P
Subjt:  -------------------SIVQGEQRMSRECYASALKGSSVYALEKQVERVGKQSFEADLPREGAKEFSAPTEELELHP

A0A6J1DHB3 uncharacterized protein LOC1110204791.8e-18753.56Show/hide
Query:  SNSTNTADRRALVANDGHQRKAGAKVVEGQVHEGMGTEPLRRLARITTPVLPSVHPKPSKANRGRGGASKRTTRGPAPAPTRENFYALQKEMEAMRTQMR
        +NSTNTADRRAL AN GHQR+ GA+VVEGQ HE +GTEPL R ARITTPVLP  HPKPSK                                        
Subjt:  SNSTNTADRRALVANDGHQRKAGAKVVEGQVHEGMGTEPLRRLARITTPVLPSVHPKPSKANRGRGGASKRTTRGPAPAPTRENFYALQKEMEAMRTQMR

Query:  TMEEMYNEMVQTAGAGSRFENRVARDDMREQRGHHLGPVEEEHPEGGEDEEYTHQRGDLRKHLNKKRSSSLRKGQSPSCSHRNSNQQAESSYNPITPEEV
                                                                                               AESSYNPITP  V
Subjt:  TMEEMYNEMVQTAGAGSRFENRVARDDMREQRGHHLGPVEEEHPEGGEDEEYTHQRGDLRKHLNKKRSSSLRKGQSPSCSHRNSNQQAESSYNPITPEEV

Query:  ITREEFDQLKSKFDAQVEALKAKCEKKESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQAATDAIKCRAFQIALTAS
        ITREEFDQLKSKFDAQVEALKA+CEKKES FDDGDLGE SF+SDILEA IPPKFKTPTMKPYDG KDPKDYVEVFE L+DFQAATDAIKC AFQIALT S
Subjt:  ITREEFDQLKSKFDAQVEALKAKCEKKESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQAATDAIKCRAFQIALTAS

Query:  TPV--------------------------------TTTHLATIRQKEGETLREYVTRFQEEQLK-----------------------------APATFAE
          +                                T THLATIRQKEGETLREYVTRF EEQLK                             APATFAE
Subjt:  TPV--------------------------------TTTHLATIRQKEGETLREYVTRFQEEQLK-----------------------------APATFAE

Query:  VLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEKRKIDSKSKDKGSPSSGSRTEYRRSEIGLNRSRPYERYTPTTIPISEILTNIEESGMEKLLEQPEK
        VLQK KKVIDGQELLRTKTGRPEK IDQ +  ++K K DSKS+DKG  SS SR +YRRS    N+SRPYE YTPTTIPI EILTNIEE+GMEKLL++PEK
Subjt:  VLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEKRKIDSKSKDKGSPSSGSRTEYRRSEIGLNRSRPYERYTPTTIPISEILTNIEESGMEKLLEQPEK

Query:  LQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIEDVIQDGYFKKFVGKP-----SHQYHLRRLKWGQSG-------IKRKELAREARREVCTIRKQQPT
        L+GDPEK + DKYCRFHRDHGHNTSN WELKRQIED+IQDGYFKKFVGKP       +   +RL+             K+KELAREARREVC IR+Q+PT
Subjt:  LQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIEDVIQDGYFKKFVGKP-----SHQYHLRRLKWGQSG-------IKRKELAREARREVCTIRKQQPT

Query:  CSITFDDSDLEGVYLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLTTYLALGWTRAQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQN---------
         SI F+ +DLEGV+LPHNDALVIAPLID VLVR++LVDGGASANILSL+TYLALGWTR+QLKKSPTPLVGFSGES+S EGCIDLPV+I Q+         
Subjt:  CSITFDDSDLEGVYLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLTTYLALGWTRAQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQN---------

Query:  -----------------------------------------SIVQGEQRMSRECYASALKGSSVYALEKQVER
                                                   V+GE + SRECYAS  K SSV ALE+Q  R
Subjt:  -----------------------------------------SIVQGEQRMSRECYASALKGSSVYALEKQVER

A0A6J1DPC9 uncharacterized protein LOC1110222806.5e-14553.42Show/hide
Query:  EQRGHHLGPVEEEHPEGGEDEEYTHQRGDLRKHL-NKKRSSSLRKGQSPSCSHR--NSNQQAESSYNPITPEEVITREEFDQLKSKFDAQVEALKAKCEK
        E+    + P   E     E  +Y+ +  DLRKHL +KK+ +S     S S S    NSN +A+S Y P+ PE VI REEFD +K +FD QVEALKA+CEK
Subjt:  EQRGHHLGPVEEEHPEGGEDEEYTHQRGDLRKHL-NKKRSSSLRKGQSPSCSHR--NSNQQAESSYNPITPEEVITREEFDQLKSKFDAQVEALKAKCEK

Query:  KESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQAATDAIKCRAFQIALTASTPV-----------------------
        KESPFDD DLGES FTSDI+EAPIPPKFKTPTMKPYDG KDPKDYVEVFEGL+DFQAATDAIKC AFQIALT S  +                       
Subjt:  KESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQAATDAIKCRAFQIALTASTPV-----------------------

Query:  ---------TTTHLATIRQKEGETLREYVTRFQEEQLKAPATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEKRKIDSKSKDKGSPSSGSRTE
                 T THLATIRQKE ETL     +  EE   APATFAEVLQ  KKVIDGQELLRTKT RPEKQIDQK+ +Q+KRK DSKSKDKGS SSGSRTE
Subjt:  ---------TTTHLATIRQKEGETLREYVTRFQEEQLKAPATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEKRKIDSKSKDKGSPSSGSRTE

Query:  YRRSEIGLNRSRPYERYTPTTIPISEILTNIEESGMEKLLEQPEKLQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIEDVIQDGYFKKFVGKP-----
        YRRSE G +RSRPYER                                                       CWELKRQIED+IQD YFKKFVGKP     
Subjt:  YRRSEIGLNRSRPYERYTPTTIPISEILTNIEESGMEKLLEQPEKLQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIEDVIQDGYFKKFVGKP-----

Query:  SHQYHLRRLK--------------------WGQSGIKRKELAREARREVCTIRKQQPTCSITFDDSDLEGVYLPHNDALVIAPLIDHVLVRKVLVDGGAS
          +   +R +                     GQ   KRKELA EARR+V  IR+Q+PTCSITF D+DLEGV+LPHNDALVIAPLIDHVLVR+VLVDGGAS
Subjt:  SHQYHLRRLK--------------------WGQSGIKRKELAREARREVCTIRKQQPTCSITFDDSDLEGVYLPHNDALVIAPLIDHVLVRKVLVDGGAS

Query:  ANILSLTTYLALGWTRAQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQNS--------------------------------------------------
        ANILSL TYLAL  TR+QLKKSPTPLVGFS ESVSPEGCIDLPVTIGQ+S                                                  
Subjt:  ANILSLTTYLALGWTRAQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQNS--------------------------------------------------

Query:  IVQGEQRMSRECYASALKGSSVYALEKQVERVGKQSFEADLPRE
         V+GEQ+ SRECYASALK SSV ALE+Q         + DLPRE
Subjt:  IVQGEQRMSRECYASALKGSSVYALEKQVERVGKQSFEADLPRE

A0A6J1DPN4 uncharacterized protein LOC1110230605.0e-14556.99Show/hide
Query:  MEAMRTQMRTMEEMYNEMVQTAGAGSRFENRVARDDMREQRGHHLGPVEEEHPEGGEDEEYTHQRGDLRKHLNKKRSSSLRKGQSPSCSHRNSNQQAESS
        MEAMRTQMRTMEEMYN+MVQ AGA SR  ++V  +D+ EQ   H  PV+EE           H  GDLR HLN+KR+SS R  ++ +  H+NSNQQAESS
Subjt:  MEAMRTQMRTMEEMYNEMVQTAGAGSRFENRVARDDMREQRGHHLGPVEEEHPEGGEDEEYTHQRGDLRKHLNKKRSSSLRKGQSPSCSHRNSNQQAESS

Query:  YNPITPEEVITREEFDQLKSKFDAQVEALKAKCEKKESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQAATDAIKCR
        YNPI PE VITREEF+QLKSKFDAQVEALK +CEKKES FDDGDLGES FTSDILEA IPPKFKTPTMK YDG KDPKDYVEVFEGL+DFQAATDAIKCR
Subjt:  YNPITPEEVITREEFDQLKSKFDAQVEALKAKCEKKESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQAATDAIKCR

Query:  AFQIALTASTPV--------------------------------TTTHLATIRQKEGETLREYVTRFQEEQLK---------------------------
        AFQIALT S  +                                TTTHLATIRQKEG+TL+EY+TRFQEEQLK                           
Subjt:  AFQIALTASTPV--------------------------------TTTHLATIRQKEGETLREYVTRFQEEQLK---------------------------

Query:  --APATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEKRKIDSKSKDKGSPSSGSRTEYRRSEIGLNRSRPYERYTPTTIPISEILTNIEESGM
          A ATFAEVLQ  KK IDGQELLRTKT RPEKQIDQKK +Q+KRK DSKSKDKGS SS SRT+Y RS                         ++E+   
Subjt:  --APATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEKRKIDSKSKDKGSPSSGSRTEYRRSEIGLNRSRPYERYTPTTIPISEILTNIEESGM

Query:  EKLLEQPEKLQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIEDVIQDGYFKKFVGKPSHQYHLRRLKWGQSGIKRKELAREARREVCTIRKQQPTCSI
         K    P +L   P                                          G PS          GQSG KRKELAREA REVC IR+Q+PTCS+
Subjt:  EKLLEQPEKLQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIEDVIQDGYFKKFVGKPSHQYHLRRLKWGQSGIKRKELAREARREVCTIRKQQPTCSI

Query:  TFDDSDLEGVYLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLTTYLALGWTRAQLKKSPTPLVGFSGESVSPEGCI
        TFDDSDLEGV+LP+NDALVIAPLIDHVLVR+VLVDGGASANILS    LALGWTR+QLKKSPTPLVGFS ESVS +G +
Subjt:  TFDDSDLEGVYLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLTTYLALGWTRAQLKKSPTPLVGFSGESVSPEGCI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAAAGGCTAAGTGTACATGCCGAAGTCCGATCTGATGGAGGTCCACTTTTGTGTTCAGGTCGAGCCGGGGACCAATTTCGAGCTAGGTTCGTAAAGTGTCGTTC
AAACTCAACCAACACGGCCGACCGAAGAGCTCTAGTGGCCAATGATGGCCACCAGAGGAAGGCCGGGGCAAAGGTGGTAGAGGGGCAGGTTCATGAAGGCATGGGGACAG
AGCCCCTCCGCAGGTTGGCACGCATCACCACGCCCGTTCTGCCATCAGTACATCCAAAGCCATCCAAGGCCAATCGCGGCCGAGGTGGGGCTTCGAAGAGAACCACTCGA
GGACCAGCCCCAGCTCCAACAAGGGAGAATTTTTATGCACTCCAGAAAGAAATGGAGGCAATGCGCACTCAGATGCGCACCATGGAAGAGATGTACAACGAGATGGTGCA
AACTGCGGGCGCTGGATCTCGGTTTGAAAACCGGGTGGCACGCGATGACATGCGCGAGCAAAGGGGTCATCACCTCGGTCCAGTCGAGGAAGAGCACCCTGAAGGAGGTG
AGGACGAAGAGTACACTCACCAGAGGGGTGATCTCCGAAAGCATCTCAACAAAAAGAGAAGCTCGTCCCTCCGAAAGGGACAATCTCCGTCCTGCTCACACAGGAACTCC
AACCAGCAGGCAGAGTCCTCCTACAATCCAATAACTCCCGAGGAAGTGATCACAAGGGAGGAGTTCGACCAGCTAAAGAGCAAGTTTGATGCTCAGGTTGAGGCCTTGAA
GGCTAAGTGCGAGAAAAAAGAAAGTCCATTCGATGATGGCGACCTGGGAGAGTCGTCATTCACCTCGGACATCTTAGAGGCCCCAATCCCCCCGAAGTTCAAAACTCCCA
CCATGAAACCTTATGATGGGTATAAGGACCCTAAGGATTATGTTGAGGTCTTCGAAGGCCTCATAGACTTTCAAGCGGCAACAGATGCAATCAAGTGCCGCGCCTTCCAG
ATCGCGCTCACCGCCAGCACGCCTGTGACAACAACTCACCTCGCCACCATCAGACAGAAGGAAGGTGAGACGCTGAGAGAATATGTCACAAGGTTCCAAGAGGAGCAGTT
GAAGGCTCCAGCAACCTTCGCCGAAGTTTTGCAAAAGGTGAAGAAAGTCATTGATGGACAAGAGCTTCTCCGAACCAAGACTGGCAGACCTGAGAAGCAAATCGACCAGA
AGAAGCCAAACCAAGAAAAGAGGAAGATTGATTCCAAGTCGAAGGATAAGGGATCGCCCTCATCCGGTAGCCGAACTGAGTATCGAAGGTCGGAGATCGGCCTCAATCGA
AGCCGACCTTACGAACGTTATACTCCAACCACCATCCCCATCTCTGAGATACTAACAAACATCGAGGAAAGTGGGATGGAAAAGCTCCTCGAGCAACCTGAGAAGCTCCA
AGGAGACCCAGAAAAGTGCCATAAAGATAAATATTGTCGTTTTCATCGCGATCACGGCCATAATACATCAAATTGCTGGGAATTGAAGCGCCAAATTGAAGACGTCATTC
AAGATGGCTACTTCAAAAAATTTGTTGGCAAACCGAGTCATCAATACCATCTTCGGAGGCTCAAGTGGGGGCAGTCTGGGATCAAAAGGAAAGAGCTAGCTCGAGAAGCC
AGGCGCGAGGTGTGCACCATTAGGAAGCAGCAACCGACCTGCTCCATTACCTTTGATGACTCCGACTTAGAGGGGGTCTATCTGCCCCATAACGACGCACTTGTGATCGC
TCCTCTCATCGATCATGTCCTGGTTCGAAAAGTATTGGTGGATGGTGGTGCGTCTGCCAATATTCTGTCCCTCACAACATATCTTGCTTTGGGATGGACTAGAGCACAAT
TGAAGAAAAGTCCAACGCCCTTGGTTGGATTTTCAGGGGAATCAGTCTCCCCAGAAGGGTGTATTGACTTGCCAGTTACGATTGGGCAGAACAGCATAGTCCAAGGGGAG
CAAAGGATGTCAAGGGAGTGCTATGCCTCGGCACTTAAAGGGTCCTCGGTATACGCCCTCGAGAAACAAGTGGAACGAGTTGGAAAGCAAAGCTTCGAAGCCGACCTACC
AAGGGAAGGAGCAAAGGAGTTTTCTGCACCAACAGAGGAGCTTGAGCTTCACCCTCGTGGATGGACCCAATCGCGGAATTCATCAAAGGAAATCCGTCGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAAAAGGCTAAGTGTACATGCCGAAGTCCGATCTGATGGAGGTCCACTTTTGTGTTCAGGTCGAGCCGGGGACCAATTTCGAGCTAGGTTCGTAAAGTGTCGTTC
AAACTCAACCAACACGGCCGACCGAAGAGCTCTAGTGGCCAATGATGGCCACCAGAGGAAGGCCGGGGCAAAGGTGGTAGAGGGGCAGGTTCATGAAGGCATGGGGACAG
AGCCCCTCCGCAGGTTGGCACGCATCACCACGCCCGTTCTGCCATCAGTACATCCAAAGCCATCCAAGGCCAATCGCGGCCGAGGTGGGGCTTCGAAGAGAACCACTCGA
GGACCAGCCCCAGCTCCAACAAGGGAGAATTTTTATGCACTCCAGAAAGAAATGGAGGCAATGCGCACTCAGATGCGCACCATGGAAGAGATGTACAACGAGATGGTGCA
AACTGCGGGCGCTGGATCTCGGTTTGAAAACCGGGTGGCACGCGATGACATGCGCGAGCAAAGGGGTCATCACCTCGGTCCAGTCGAGGAAGAGCACCCTGAAGGAGGTG
AGGACGAAGAGTACACTCACCAGAGGGGTGATCTCCGAAAGCATCTCAACAAAAAGAGAAGCTCGTCCCTCCGAAAGGGACAATCTCCGTCCTGCTCACACAGGAACTCC
AACCAGCAGGCAGAGTCCTCCTACAATCCAATAACTCCCGAGGAAGTGATCACAAGGGAGGAGTTCGACCAGCTAAAGAGCAAGTTTGATGCTCAGGTTGAGGCCTTGAA
GGCTAAGTGCGAGAAAAAAGAAAGTCCATTCGATGATGGCGACCTGGGAGAGTCGTCATTCACCTCGGACATCTTAGAGGCCCCAATCCCCCCGAAGTTCAAAACTCCCA
CCATGAAACCTTATGATGGGTATAAGGACCCTAAGGATTATGTTGAGGTCTTCGAAGGCCTCATAGACTTTCAAGCGGCAACAGATGCAATCAAGTGCCGCGCCTTCCAG
ATCGCGCTCACCGCCAGCACGCCTGTGACAACAACTCACCTCGCCACCATCAGACAGAAGGAAGGTGAGACGCTGAGAGAATATGTCACAAGGTTCCAAGAGGAGCAGTT
GAAGGCTCCAGCAACCTTCGCCGAAGTTTTGCAAAAGGTGAAGAAAGTCATTGATGGACAAGAGCTTCTCCGAACCAAGACTGGCAGACCTGAGAAGCAAATCGACCAGA
AGAAGCCAAACCAAGAAAAGAGGAAGATTGATTCCAAGTCGAAGGATAAGGGATCGCCCTCATCCGGTAGCCGAACTGAGTATCGAAGGTCGGAGATCGGCCTCAATCGA
AGCCGACCTTACGAACGTTATACTCCAACCACCATCCCCATCTCTGAGATACTAACAAACATCGAGGAAAGTGGGATGGAAAAGCTCCTCGAGCAACCTGAGAAGCTCCA
AGGAGACCCAGAAAAGTGCCATAAAGATAAATATTGTCGTTTTCATCGCGATCACGGCCATAATACATCAAATTGCTGGGAATTGAAGCGCCAAATTGAAGACGTCATTC
AAGATGGCTACTTCAAAAAATTTGTTGGCAAACCGAGTCATCAATACCATCTTCGGAGGCTCAAGTGGGGGCAGTCTGGGATCAAAAGGAAAGAGCTAGCTCGAGAAGCC
AGGCGCGAGGTGTGCACCATTAGGAAGCAGCAACCGACCTGCTCCATTACCTTTGATGACTCCGACTTAGAGGGGGTCTATCTGCCCCATAACGACGCACTTGTGATCGC
TCCTCTCATCGATCATGTCCTGGTTCGAAAAGTATTGGTGGATGGTGGTGCGTCTGCCAATATTCTGTCCCTCACAACATATCTTGCTTTGGGATGGACTAGAGCACAAT
TGAAGAAAAGTCCAACGCCCTTGGTTGGATTTTCAGGGGAATCAGTCTCCCCAGAAGGGTGTATTGACTTGCCAGTTACGATTGGGCAGAACAGCATAGTCCAAGGGGAG
CAAAGGATGTCAAGGGAGTGCTATGCCTCGGCACTTAAAGGGTCCTCGGTATACGCCCTCGAGAAACAAGTGGAACGAGTTGGAAAGCAAAGCTTCGAAGCCGACCTACC
AAGGGAAGGAGCAAAGGAGTTTTCTGCACCAACAGAGGAGCTTGAGCTTCACCCTCGTGGATGGACCCAATCGCGGAATTCATCAAAGGAAATCCGTCGCTAG
Protein sequenceShow/hide protein sequence
MEKRLSVHAEVRSDGGPLLCSGRAGDQFRARFVKCRSNSTNTADRRALVANDGHQRKAGAKVVEGQVHEGMGTEPLRRLARITTPVLPSVHPKPSKANRGRGGASKRTTR
GPAPAPTRENFYALQKEMEAMRTQMRTMEEMYNEMVQTAGAGSRFENRVARDDMREQRGHHLGPVEEEHPEGGEDEEYTHQRGDLRKHLNKKRSSSLRKGQSPSCSHRNS
NQQAESSYNPITPEEVITREEFDQLKSKFDAQVEALKAKCEKKESPFDDGDLGESSFTSDILEAPIPPKFKTPTMKPYDGYKDPKDYVEVFEGLIDFQAATDAIKCRAFQ
IALTASTPVTTTHLATIRQKEGETLREYVTRFQEEQLKAPATFAEVLQKVKKVIDGQELLRTKTGRPEKQIDQKKPNQEKRKIDSKSKDKGSPSSGSRTEYRRSEIGLNR
SRPYERYTPTTIPISEILTNIEESGMEKLLEQPEKLQGDPEKCHKDKYCRFHRDHGHNTSNCWELKRQIEDVIQDGYFKKFVGKPSHQYHLRRLKWGQSGIKRKELAREA
RREVCTIRKQQPTCSITFDDSDLEGVYLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLTTYLALGWTRAQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQNSIVQGE
QRMSRECYASALKGSSVYALEKQVERVGKQSFEADLPREGAKEFSAPTEELELHPRGWTQSRNSSKEIRR