; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g10030 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g10030
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:7655286..7660801
RNA-Seq ExpressionMoc07g10030
SyntenyMoc07g10030
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]2.9e-24284.47Show/hide
Query:  QAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQVALD
        +AESS NPATPAGVITREEFDQLRG+LDAQVEALKAKCEQKEGPLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYD SKDPKDYVEVFE LMDFQ A D
Subjt:  QAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQVALD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQ+EGETL EYVTRF+EEQLKVAHC DDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKGSFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEILTNIE
        TVKLGEEAPATFAEVLQK KKVIDGQELLRTKTGRPE+KIGRGRSGK++E AD KSKDKGSFSSGRAEYRRAENGP RSRPYE FTPT IPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKGSFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEILTNIE

Query:  EFGMEKLLKRPEKLREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGGPSGG
        E GMEKLLKRPEKLR APERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEK EERKRSRTPPR TDRPAVINTIFGGPSGG
Subjt:  EFGMEKLLKRPEKLREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARCE--------------------------------------------VLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVRFSGE
        QSG KRKELARAAR E                                            VLVDGG SANILSLPTYL LGWTRSQLKKSPTPLV FSGE
Subjt:  QSGHKRKELARAARCE--------------------------------------------VLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVRFSGE

Query:  SVIPEGCIDLPVTFGQDKTQVTQMAEFV
        SVIPEG IDLPVT GQD+TQVTQMAEFV
Subjt:  SVIPEGCIDLPVTFGQDKTQVTQMAEFV

XP_022141796.1 uncharacterized protein LOC111012081 [Momordica charantia]1.9e-20191.16Show/hide
Query:  VITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQVALDAIKCRAFQIALTG
        +ITREEFDQLRG+LDAQ EALKAKCEQKEGPLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYD SKDPKDYVEVFEGLMDFQ   DAIKCRAFQIALTG
Subjt:  VITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQVALDAIKCRAFQIALTG

Query:  SARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLADEALTVKLGEEAPATFA
        SARLWYRRLPARSISTYSQLRREFLA+FSSRHYDKKTATHLATIRQ+EGETL EYVTRF+EEQLKVAHC DDSAMCYFLTGLADEALTVKLGEEAP+TF 
Subjt:  SARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLADEALTVKLGEEAPATFA

Query:  EVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKGSFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEILTNIEEFGMEKLLKRPEK
        EVLQK KKVIDG ELLRTKTGRPE+KI RGRSGK++EK D KSKDKGSFSSGR EYRRAENGP RSRPYE FTPT IPI EILT IEE GMEKLLKRPEK
Subjt:  EVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKGSFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEILTNIEEFGMEKLLKRPEK

Query:  LREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGGPSGGQSGHKRKEL
        LR   ERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEK EERKRSRTPPR TDRPAVINTIFGGPSGGQSGHKRKEL
Subjt:  LREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGGPSGGQSGHKRKEL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]6.9e-22871.77Show/hide
Query:  SSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPATP GVITREEFDQLRGKL+AQVEALKAKCEQKEGPLNDGDLGES FTSDVLE        APTVK YD SKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQ

Query:  VALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLA
         A DAIKCRAFQIALTGSARLW                                                     F+E+QLKVA   DDSAMCYFLTGLA
Subjt:  VALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKGSFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEIL
        DEALTVKLG+EAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I RGRSGK+ EKAD KSKDKGSFSSGRAE+RRA NGP RSRPYE FTPT IPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKGSFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEIL

Query:  TNIEEFGMEKLLKRPEKLREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGG
        TNIEE GMEKLLKRPEKLR APERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEK EERK SRTP R  DRPAVINTIFGG
Subjt:  TNIEEFGMEKLLKRPEKLREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARCE--------------------------------------------VLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVR
        PSGGQSGHKRKELARAAR E                                            VLVD G SANI+SL TYL LGWTRSQLKKS TPLV 
Subjt:  PSGGQSGHKRKELARAARCE--------------------------------------------VLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVR

Query:  FSGESVIPEGCIDLPVTFGQDKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCTLEAH
        FS ESVIPEGCIDLPVT G D+TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVC LE  
Subjt:  FSGESVIPEGCIDLPVTFGQDKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCTLEAH

Query:  DSGDETPEFEPICREGSSPRPLRSLSL
         S D T EF+          P   L L
Subjt:  DSGDETPEFEPICREGSSPRPLRSLSL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]6.6e-24762.12Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQDHNGLATEPLRRSARITAPVLPPVHPRTSKATRGRGGTSKKGARGPAPAPTNENFDALQREMEAMC
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQ H  L TEPL RSARIT PVLPP HP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQDHNGLATEPLRRSARITAPVLPPVHPRTSKATRGRGGTSKKGARGPAPAPTNENFDALQREMEAMC

Query:  TQIRSMEEMYNEMILAAGAGSRSENRVTRVDIREQRGSHLGPIEQEHPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT
                                                                                                   AESS+NP T
Subjt:  TQIRSMEEMYNEMILAAGAGSRSENRVTRVDIREQRGSHLGPIEQEHPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT

Query:  PAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQVALDAIKCRAFQIA
        P GVITREEFDQL+ K DAQVEALKA+CE+KE   +DGDLGE SF+SD+LEA IPPKFK PT+KPYD SKDPKDYVEVFE LMDFQ A DAIKC AFQIA
Subjt:  PAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQVALDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLADEALTVKLGEEAPA
        LTGSARLWYRRLPAR ISTYSQLR+EF+++FSSRHYD+KT THLATIRQ+EGETL EYVTRF EEQLKVAHC DDSAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKG-SFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEILTNIEEFGMEKLLK
        TFAEVLQK KKVIDGQELLRTKTGRPEK I +GR+GK+  KADSKS+DKG S SS R +YRR+ +  N+SRPYEH+TPT IPI EILTNIEE GMEKLLK
Subjt:  TFAEVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKG-SFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEILTNIEEFGMEKLLK

Query:  RPEKLREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGGPSGGQSGHKRKEL
        RPEKLR  PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EK EERKR RTPPR  DRPAVIN             K+KEL
Subjt:  RPEKLREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARAARCEV--------------------------------------------LVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVRFSGESVIPEGCID
        AR AR EV                                            LVDGGASANILSL TYL LGWTRSQLKKSPTPLV FSGES+  EGCID
Subjt:  ARAARCEV--------------------------------------------LVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVRFSGESVIPEGCID

Query:  LPVTFGQDKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCTLEAHDSGDE
        LPV+  QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVC LE     DE
Subjt:  LPVTFGQDKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCTLEAHDSGDE

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]2.5e-20691.85Show/hide
Query:  GVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQVALDAIKCRAFQIALT
        G+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGES FTSDVLEAPIPPKFKAPTVKPYD +KDPKDYVEVFEGLMDFQ A DAIKCRAFQIALT
Subjt:  GVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQVALDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLA+FSSRHYDKKTATHLATIRQ+EGETL EYVTRF+EEQLKVAHC DDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKGSFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEILTNIEEFGMEKLLKRPE
        AEVLQK KKVIDGQELLRTKTGRPE+KIGRGRSGK+VE+AD KSKDKGSFSSGRAEYRRAENGP RSRPYE FTPT IPI EILTNIEE GMEKLLKRPE
Subjt:  AEVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKGSFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEILTNIEEFGMEKLLKRPE

Query:  KLREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGGPSGGQSGHKRKELARA
        KLR APERRSKDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKFVGKPRTSSAEK EERKRSRTPPR TDRPAVINTIFGGPSGGQ GHKRKELARA
Subjt:  KLREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGGPSGGQSGHKRKELARA

Query:  ARCEV
        AR E+
Subjt:  ARCEV

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.4e-24284.47Show/hide
Query:  QAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQVALD
        +AESS NPATPAGVITREEFDQLRG+LDAQVEALKAKCEQKEGPLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYD SKDPKDYVEVFE LMDFQ A D
Subjt:  QAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQVALD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQ+EGETL EYVTRF+EEQLKVAHC DDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKGSFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEILTNIE
        TVKLGEEAPATFAEVLQK KKVIDGQELLRTKTGRPE+KIGRGRSGK++E AD KSKDKGSFSSGRAEYRRAENGP RSRPYE FTPT IPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKGSFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEILTNIE

Query:  EFGMEKLLKRPEKLREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGGPSGG
        E GMEKLLKRPEKLR APERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEK EERKRSRTPPR TDRPAVINTIFGGPSGG
Subjt:  EFGMEKLLKRPEKLREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARCE--------------------------------------------VLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVRFSGE
        QSG KRKELARAAR E                                            VLVDGG SANILSLPTYL LGWTRSQLKKSPTPLV FSGE
Subjt:  QSGHKRKELARAARCE--------------------------------------------VLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVRFSGE

Query:  SVIPEGCIDLPVTFGQDKTQVTQMAEFV
        SVIPEG IDLPVT GQD+TQVTQMAEFV
Subjt:  SVIPEGCIDLPVTFGQDKTQVTQMAEFV

A0A6J1CKB3 uncharacterized protein LOC1110120819.1e-20291.16Show/hide
Query:  VITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQVALDAIKCRAFQIALTG
        +ITREEFDQLRG+LDAQ EALKAKCEQKEGPLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYD SKDPKDYVEVFEGLMDFQ   DAIKCRAFQIALTG
Subjt:  VITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQVALDAIKCRAFQIALTG

Query:  SARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLADEALTVKLGEEAPATFA
        SARLWYRRLPARSISTYSQLRREFLA+FSSRHYDKKTATHLATIRQ+EGETL EYVTRF+EEQLKVAHC DDSAMCYFLTGLADEALTVKLGEEAP+TF 
Subjt:  SARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLADEALTVKLGEEAPATFA

Query:  EVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKGSFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEILTNIEEFGMEKLLKRPEK
        EVLQK KKVIDG ELLRTKTGRPE+KI RGRSGK++EK D KSKDKGSFSSGR EYRRAENGP RSRPYE FTPT IPI EILT IEE GMEKLLKRPEK
Subjt:  EVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKGSFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEILTNIEEFGMEKLLKRPEK

Query:  LREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGGPSGGQSGHKRKEL
        LR   ERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEK EERKRSRTPPR TDRPAVINTIFGGPSGGQSGHKRKEL
Subjt:  LREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGGPSGGQSGHKRKEL

A0A6J1D9E1 uncharacterized protein LOC1110188233.3e-22871.77Show/hide
Query:  SSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPATP GVITREEFDQLRGKL+AQVEALKAKCEQKEGPLNDGDLGES FTSDVLE        APTVK YD SKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQ

Query:  VALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLA
         A DAIKCRAFQIALTGSARLW                                                     F+E+QLKVA   DDSAMCYFLTGLA
Subjt:  VALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKGSFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEIL
        DEALTVKLG+EAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I RGRSGK+ EKAD KSKDKGSFSSGRAE+RRA NGP RSRPYE FTPT IPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKGSFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEIL

Query:  TNIEEFGMEKLLKRPEKLREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGG
        TNIEE GMEKLLKRPEKLR APERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEK EERK SRTP R  DRPAVINTIFGG
Subjt:  TNIEEFGMEKLLKRPEKLREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARCE--------------------------------------------VLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVR
        PSGGQSGHKRKELARAAR E                                            VLVD G SANI+SL TYL LGWTRSQLKKS TPLV 
Subjt:  PSGGQSGHKRKELARAARCE--------------------------------------------VLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVR

Query:  FSGESVIPEGCIDLPVTFGQDKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCTLEAH
        FS ESVIPEGCIDLPVT G D+TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVC LE  
Subjt:  FSGESVIPEGCIDLPVTFGQDKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCTLEAH

Query:  DSGDETPEFEPICREGSSPRPLRSLSL
         S D T EF+          P   L L
Subjt:  DSGDETPEFEPICREGSSPRPLRSLSL

A0A6J1DHB3 uncharacterized protein LOC1110204793.2e-24762.12Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQDHNGLATEPLRRSARITAPVLPPVHPRTSKATRGRGGTSKKGARGPAPAPTNENFDALQREMEAMC
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQ H  L TEPL RSARIT PVLPP HP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQDHNGLATEPLRRSARITAPVLPPVHPRTSKATRGRGGTSKKGARGPAPAPTNENFDALQREMEAMC

Query:  TQIRSMEEMYNEMILAAGAGSRSENRVTRVDIREQRGSHLGPIEQEHPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT
                                                                                                   AESS+NP T
Subjt:  TQIRSMEEMYNEMILAAGAGSRSENRVTRVDIREQRGSHLGPIEQEHPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT

Query:  PAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQVALDAIKCRAFQIA
        P GVITREEFDQL+ K DAQVEALKA+CE+KE   +DGDLGE SF+SD+LEA IPPKFK PT+KPYD SKDPKDYVEVFE LMDFQ A DAIKC AFQIA
Subjt:  PAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQVALDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLADEALTVKLGEEAPA
        LTGSARLWYRRLPAR ISTYSQLR+EF+++FSSRHYD+KT THLATIRQ+EGETL EYVTRF EEQLKVAHC DDSAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKG-SFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEILTNIEEFGMEKLLK
        TFAEVLQK KKVIDGQELLRTKTGRPEK I +GR+GK+  KADSKS+DKG S SS R +YRR+ +  N+SRPYEH+TPT IPI EILTNIEE GMEKLLK
Subjt:  TFAEVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKG-SFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEILTNIEEFGMEKLLK

Query:  RPEKLREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGGPSGGQSGHKRKEL
        RPEKLR  PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EK EERKR RTPPR  DRPAVIN             K+KEL
Subjt:  RPEKLREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARAARCEV--------------------------------------------LVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVRFSGESVIPEGCID
        AR AR EV                                            LVDGGASANILSL TYL LGWTRSQLKKSPTPLV FSGES+  EGCID
Subjt:  ARAARCEV--------------------------------------------LVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVRFSGESVIPEGCID

Query:  LPVTFGQDKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCTLEAHDSGDE
        LPV+  QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVC LE     DE
Subjt:  LPVTFGQDKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCTLEAHDSGDE

A0A6J1DS95 uncharacterized protein LOC1110234211.2e-20691.85Show/hide
Query:  GVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQVALDAIKCRAFQIALT
        G+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGES FTSDVLEAPIPPKFKAPTVKPYD +KDPKDYVEVFEGLMDFQ A DAIKCRAFQIALT
Subjt:  GVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQVALDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLA+FSSRHYDKKTATHLATIRQ+EGETL EYVTRF+EEQLKVAHC DDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAKFSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKGSFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEILTNIEEFGMEKLLKRPE
        AEVLQK KKVIDGQELLRTKTGRPE+KIGRGRSGK+VE+AD KSKDKGSFSSGRAEYRRAENGP RSRPYE FTPT IPI EILTNIEE GMEKLLKRPE
Subjt:  AEVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVEKADSKSKDKGSFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEILTNIEEFGMEKLLKRPE

Query:  KLREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGGPSGGQSGHKRKELARA
        KLR APERRSKDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKFVGKPRTSSAEK EERKRSRTPPR TDRPAVINTIFGGPSGGQ GHKRKELARA
Subjt:  KLREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGGPSGGQSGHKRKELARA

Query:  ARCEV
        AR E+
Subjt:  ARCEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGATCACAA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGTGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAATGAGAACTTTGACGCACTCCAAAGAGAAATGGAGGCAATGTGCACGCAAATACGGTCCATGGAGGAAATGTAT
AACGAAATGATACTAGCTGCAGGTGCAGGGTCCCGATCTGAAAACCGAGTGACGCGCGTTGACATACGCGAGCAAAGGGGTTCACACCTCGGCCCAATCGAGCAGGAACA
TCCCGAAGACAACGAGAGTGAGGGATACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGTTCATCCCTCCGAAAAGGACAGTCACCATCCCGCT
CACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACGAGGGAGGAGTTCGACCAACTTAGGGGCAAGCTCGATGCTCAG
GTTGAGGCCTTAAAGGCCAAATGCGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGTCATTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAA
GTTCAAAGCTCCTACCGTGAAACCTTATGATAGGTCGAAAGACCCTAAGGATTATGTTGAGGTCTTTGAGGGCCTCATGGATTTTCAAGTGGCATTAGATGCAATCAAAT
GCCGTGCTTTTCAGATCGCGCTTACTGGCAGCGCGCGTTTGTGGTATCGGAGATTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTTGCCAAG
TTCTCTTCTCGGCACTATGACAAAAAGACAGCGACTCATCTCGCCACCATCAGGCAAAGGGAAGGTGAGACGCTGTGGGAGTACGTCACCAGGTTCGAAGAGGAACAATT
GAAGGTCGCACACTGCTTCGACGACTCGGCCATGTGCTACTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCG
AAGTACTGCAGAAGGTGAAGAAAGTCATCGATGGGCAGGAGCTCCTCCGAACCAAAACCGGCCGGCCAGAGAAAAAGATCGGCCGGGGCAGAAGTGGGAAAGAAGTAGAA
AAGGCGGATTCCAAGTCCAAGGACAAGGGATCTTTCTCTAGTGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTAACAGGAGCCGACCTTACGAACACTTCACCCC
GACCATGATTCCAATTTCCGAGATCCTAACGAACATTGAGGAGTTCGGAATGGAAAAACTCCTCAAGCGTCCTGAGAAGCTTCGGGAAGCCCCGGAGAGGCGTAGCAAGG
ACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGATTGTTGGGAATTGAAGCGTCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTG
GGGAAGCCCAGGACCAGCTCGGCAGAGAAAAATGAAGAGAGGAAGCGTTCGAGGACGCCGCCCCGGCACACTGACCGACCCGCGGTCATCAATACCATTTTCGGAGGGCC
AAGCGGGGGCCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGTGCGAGGTACTAGTAGACGGGGGCGCATCTGCTAACATCCTGTCCCTACCGACCTACC
TCACCCTGGGATGGACAAGGTCGCAATTGAAGAAAAGCCCGACACCGCTAGTTAGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACATTT
GGGCAAGACAAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGTAGATCGGCGTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCAT
TCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACAGTCCGAGGAGAACAGACCGCTTCGAGGGAATGCTATGCCTCCGCACTCAAAGGGT
CATCGGTATGCACTCTCGAAGCTCACGACAGTGGGGATGAGACGCCCGAGTTCGAGCCGATCTGCCGCGAAGGGAGTTCTCCGCGCCCACTGAGGAGCTTGAGCTTGTTC
CTCTGCTTAGTCCTGAGAAACAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCGGAGCCAGATCTGATGGAGATCGACGCTCCAGAGCCCTCATGGATGGA
CCCGATTGTGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAAAAAGTTGGCAAGGCAGGCAGCTCGGAAGGTGCAAACCCATGTGGGTGCCATTGATC
CGACCTGGGAGGGGCCGTTTGAGGTCAAGGGAATATTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACATCCTCGCGCACCCATGGAACGCGGAGCACCTG
AAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGATCACAA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGTGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAATGAGAACTTTGACGCACTCCAAAGAGAAATGGAGGCAATGTGCACGCAAATACGGTCCATGGAGGAAATGTAT
AACGAAATGATACTAGCTGCAGGTGCAGGGTCCCGATCTGAAAACCGAGTGACGCGCGTTGACATACGCGAGCAAAGGGGTTCACACCTCGGCCCAATCGAGCAGGAACA
TCCCGAAGACAACGAGAGTGAGGGATACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGTTCATCCCTCCGAAAAGGACAGTCACCATCCCGCT
CACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACGAGGGAGGAGTTCGACCAACTTAGGGGCAAGCTCGATGCTCAG
GTTGAGGCCTTAAAGGCCAAATGCGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGTCATTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAA
GTTCAAAGCTCCTACCGTGAAACCTTATGATAGGTCGAAAGACCCTAAGGATTATGTTGAGGTCTTTGAGGGCCTCATGGATTTTCAAGTGGCATTAGATGCAATCAAAT
GCCGTGCTTTTCAGATCGCGCTTACTGGCAGCGCGCGTTTGTGGTATCGGAGATTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTTGCCAAG
TTCTCTTCTCGGCACTATGACAAAAAGACAGCGACTCATCTCGCCACCATCAGGCAAAGGGAAGGTGAGACGCTGTGGGAGTACGTCACCAGGTTCGAAGAGGAACAATT
GAAGGTCGCACACTGCTTCGACGACTCGGCCATGTGCTACTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCG
AAGTACTGCAGAAGGTGAAGAAAGTCATCGATGGGCAGGAGCTCCTCCGAACCAAAACCGGCCGGCCAGAGAAAAAGATCGGCCGGGGCAGAAGTGGGAAAGAAGTAGAA
AAGGCGGATTCCAAGTCCAAGGACAAGGGATCTTTCTCTAGTGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTAACAGGAGCCGACCTTACGAACACTTCACCCC
GACCATGATTCCAATTTCCGAGATCCTAACGAACATTGAGGAGTTCGGAATGGAAAAACTCCTCAAGCGTCCTGAGAAGCTTCGGGAAGCCCCGGAGAGGCGTAGCAAGG
ACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGATTGTTGGGAATTGAAGCGTCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTG
GGGAAGCCCAGGACCAGCTCGGCAGAGAAAAATGAAGAGAGGAAGCGTTCGAGGACGCCGCCCCGGCACACTGACCGACCCGCGGTCATCAATACCATTTTCGGAGGGCC
AAGCGGGGGCCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGTGCGAGGTACTAGTAGACGGGGGCGCATCTGCTAACATCCTGTCCCTACCGACCTACC
TCACCCTGGGATGGACAAGGTCGCAATTGAAGAAAAGCCCGACACCGCTAGTTAGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACATTT
GGGCAAGACAAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGTAGATCGGCGTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCAT
TCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACAGTCCGAGGAGAACAGACCGCTTCGAGGGAATGCTATGCCTCCGCACTCAAAGGGT
CATCGGTATGCACTCTCGAAGCTCACGACAGTGGGGATGAGACGCCCGAGTTCGAGCCGATCTGCCGCGAAGGGAGTTCTCCGCGCCCACTGAGGAGCTTGAGCTTGTTC
CTCTGCTTAGTCCTGAGAAACAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCGGAGCCAGATCTGATGGAGATCGACGCTCCAGAGCCCTCATGGATGGA
CCCGATTGTGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAAAAAGTTGGCAAGGCAGGCAGCTCGGAAGGTGCAAACCCATGTGGGTGCCATTGATC
CGACCTGGGAGGGGCCGTTTGAGGTCAAGGGAATATTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACATCCTCGCGCACCCATGGAACGCGGAGCACCTG
AAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQDHNGLATEPLRRSARITAPVLPPVHPRTSKATRGRGGTSKKGARGPAPAPTNENFDALQREMEAMCTQIRSMEEMY
NEMILAAGAGSRSENRVTRVDIREQRGSHLGPIEQEHPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQ
VEALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDRSKDPKDYVEVFEGLMDFQVALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAK
FSSRHYDKKTATHLATIRQREGETLWEYVTRFEEEQLKVAHCFDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKVKKVIDGQELLRTKTGRPEKKIGRGRSGKEVE
KADSKSKDKGSFSSGRAEYRRAENGPNRSRPYEHFTPTMIPISEILTNIEEFGMEKLLKRPEKLREAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV
GKPRTSSAEKNEERKRSRTPPRHTDRPAVINTIFGGPSGGQSGHKRKELARAARCEVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVRFSGESVIPEGCIDLPVTF
GQDKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCTLEAHDSGDETPEFEPICREGSSPRPLRSLSLF
LCLVLRNRSVPVEILDNPSISEPDLMEIDAPEPSWMDPIVDFIRGNSPQDPKERKKLARQAARKVQTHVGAIDPTWEGPFEVKGIFRPGTYILADLKGDILAHPWNAEHL
KRYYP