CuGenDBv2

Moc06g12190 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g12190
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr6:9311236..9315768
RNA-Seq Expression: Moc06g12190
Synteny: Moc06g12190
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain; IPR021109 - Aspartic peptidase domain superfamily
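
The genome location in this record uses a `chromosome:start..end` string. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which the record does not state; the names `Region` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the length.
        return self.end - self.start + 1


# Matches strings such as "chr6:9311236..9315768".
_LOCATION = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Region:
    """Parse a 'chrom:start..end' string into a Region."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Region(match["chrom"], int(match["start"]), int(match["end"]))


region = parse_location("chr6:9311236..9315768")
print(region.chrom, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption, the gene spans 4533 bp on chr6.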


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] (e value: 1.6e-223; % identity: 80.3)
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDIQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMD QAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDIQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAE-------------------------IGRGRSGKD-ERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAE                         IGRGRSGKD E ADPKSKDKGSFSSGRAE RRAE+GPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAE-------------------------IGRGRSGKD-ERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPRGAARTKVCRKAQDQ--------------------------------LSRKKGEERKRSRTPPRRTDRPAVIDTIFGGPSGG
        ESGMEKLLKRPEKLRGAP   ++ K CR  ++                                  S +K EERKRSRTPPRRTDRPAVI+TIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPRGAARTKVCRKAQDQ--------------------------------LSRKKGEERKRSRTPPRRTDRPAVIDTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRE
        QSG KRKELARAARREVCIIREQ PTCPI+FDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFS E
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRE

Query:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV
        SVIPEG IDLPVTLGQD+T+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] (e value: 1.1e-227; % identity: 71.11)
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDIQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMD Q
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDIQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAE-------------------------IGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAE                         I RGRSGKDE+AD KSKDKGSFSSGRAE RRA +GPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAE-------------------------IGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILT

Query:  NIEESGMEKLLKRPEKLRGAPRGAARTKVC-----------------RKAQDQL---------------SRKKGEERKRSRTPPRRTDRPAVIDTIFGGP
        NIEESGMEKLLKRPEKLRGAP    + K C                 R+ +D +               S +K EERK SRTP RR DRPAVI+TIFGGP
Subjt:  NIEESGMEKLLKRPEKLRGAPRGAARTKVC-----------------RKAQDQL---------------SRKKGEERKRSRTPPRRTDRPAVIDTIFGGP

Query:  SGGQSGHKRKELARAARREVCIIREQGPTCPISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQSGHKRKELARAARREVCIIREQ PTCPI+FD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARAARREVCIIREQGPTCPISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SRESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTVHQVLKYPTPNGVGTIRGEQTASRECYASALKGSSVCALETLA
        SRESVIPEGCIDLPVTLG D+T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPST+HQVLKY TPNGVG +RGEQ ASRECYASALKGSSVCALETL 
Subjt:  SRESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTVHQVLKYPTPNGVGTIRGEQTASRECYASALKGSSVCALETLA

Query:  GRDGALEFEADLPRKEFAAPTEELELVPLL
         RDG LEF+A+LPR+EFAAPTEELELVPLL
Subjt:  GRDGALEFEADLPRKEFAAPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] (e value: 7.9e-223; % identity: 58.48)
Query:  MVQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMESMR
        MVQPANSTNT DRR LAA++ HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMESMR

Query:  TQMRTMEEMYSEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEEYTRQRGDLREHLNRKRCSSLRKRQSPSHSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRTMEEMYSEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEEYTRQRGDLREHLNRKRCSSLRKRQSPSHSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDIQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMD QAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDIQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAE-------------------------IGRGRSGKDE-RADPKSKDKG-SFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKR
        FAE                         I +GR+GKD+ +AD KS+DKG S SS R + RR+ S   +SRPYE +TPTTIPI EILTNIEE+GMEKLLKR
Subjt:  FAE-------------------------IGRGRSGKDE-RADPKSKDKG-SFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKR

Query:  PEKLRGAPRGAARTKVCRKAQD---------QLSR-----------------------KKGEERKRSRTPPRRTDRPAVIDTIFGGPSGGQSGHKRKELA
        PEKLRG P      K CR  +D         +L R                       +K EERKR RTPPRR DRPAVI             +K+KELA
Subjt:  PEKLRGAPRGAARTKVCRKAQD---------QLSR-----------------------KKGEERKRSRTPPRRTDRPAVIDTIFGGPSGGQSGHKRKELA

Query:  RAARREVCIIREQGPTCPISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDL
        R ARREVCIIREQ PT  I+F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFS ES+  EGCIDL
Subjt:  RAARREVCIIREQGPTCPISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDL

Query:  PVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTVHQVLKYPTPNGVGTIRGEQTASRECYASALKGSSVCALETLAGRD
        PV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PST+HQVLKY T NGVGT+RGE   SRECYAS  K SSVCALE    RD
Subjt:  PVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTVHQVLKYPTPNGVGTIRGEQTASRECYASALKGSSVCALETLAGRD

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia] (e value: 4.0e-182; % identity: 59.44)
Query:  EQRGSHLGPAEEERPEDNESEEYTRQRGDLREHL-NRKRCSSLRKRQSPSHSHR--SSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQ
        E+    + P   E   ++E  +Y+ +  DLR+HL ++K+ +S     S S+S    +SN +A+S +    P  +I REEFD ++   D QVEALKA+CE+
Subjt:  EQRGSHLGPAEEERPEDNESEEYTRQRGDLREHL-NRKRCSSLRKRQSPSHSHR--SSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQ

Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDIQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        K+   +D DLGESPFTSD++EAPIPPKFK PT+KPYDG+KDPKDYVEVFEGLMD QAA+DAIKC AFQIALTGSARLW RRLPARSISTYSQLR+EF+ Q
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDIQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEI-------------------------
        FS RHYD+KTATHLATIRQKE                                   DE LTVKLGEEAPATFAE+                         
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEI-------------------------

Query:  -GRGRSGKDERADPKSKDKGSFSSG-RAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPRGAARTKVCRKAQDQLSRKKG
          +  S K  + D KSKDKGS SSG R E RR+ESGP+RSRPYER       I ++   I++S  +K + +P                       S +K 
Subjt:  -GRGRSGKDERADPKSKDKGSFSSG-RAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPRGAARTKVCRKAQDQLSRKKG

Query:  EERKRSRTPPRRTDRPAVIDTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANI
        EERKRSRTPPRR DRPAVI+TIFGGPSGGQ  +KRKELA  ARR+V IIREQ PTC I+F   DLE VHLPHNDALVIAPLIDHV+VRRVLVDGGASANI
Subjt:  EERKRSRTPPRRTDRPAVIDTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANI

Query:  LSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTVHQVLKYPTPNGVGTIR
        LSLPTYLAL  TRSQLK+SPTPLVGFS ESV PEGCIDLPVT+GQD T+VTQMAEFVVIDGR AYNAIF RPIIHSF+A+PS +HQVLKY TPNGVGT+R
Subjt:  LSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTVHQVLKYPTPNGVGTIR

Query:  GEQTASRECYASALKGSSVCALETLAGRDGALEFEADLPRK
        GEQ  SRECYASALK SSVCALE    +D       DLPR+
Subjt:  GEQTASRECYASALKGSSVCALETLAGRDGALEFEADLPRK

XP_022157676.1 uncharacterized protein LOC111024332 [Momordica charantia] (e value: 5.7e-181; % identity: 83.95)
Query:  LADEALTVKLGEEAPATFAE-------------------------IGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEI
        +ADEALTVKLGEEAPATFAE                         IGRGRSGKDERADPKSKDKGSFSSGRAE RRAE+GPTRSRPYERFTPTTIPISEI
Subjt:  LADEALTVKLGEEAPATFAE-------------------------IGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEI

Query:  LTNIEESGMEKLLKRPEKLRGAPRGAARTKVCRKAQDQLSRKKGEERKRSRTPPRRTDRPAVIDTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTC
        LTNIE+SGMEKLLKRPEKLRGAP   ++ K         S +K EERKRSRTPPRRTDRPAVI+TIFGGPSGGQSGHKRKELAR ARREVCIIREQGPTC
Subjt:  LTNIEESGMEKLLKRPEKLRGAPRGAARTKVCRKAQDQLSRKKGEERKRSRTPPRRTDRPAVIDTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTC

Query:  PISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVTQMAEF
        PI+FDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFS ESVIPEGCIDLPVTLGQD+TRVTQM EF
Subjt:  PISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVTQMAEF

Query:  VVIDGRSAYNAIFGRPIIHSFRAIPSTVHQVLKYPTPNGVGTIRGEQTASRECYASALKGSSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLL
        VV+DGRS YNAIFGRPIIHSFR IPST+HQVLKY TPNGVGT+RGEQT SRECYA+ALKGSSVCALETL  RDG LE EADLPRKEFAAPTEELELVPLL
Subjt:  VVIDGRSAYNAIFGRPIIHSFRAIPSTVHQVLKYPTPNGVGTIRGEQTASRECYASALKGSSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLL

Query:  SPEKQ
        SPEKQ
Subjt:  SPEKQ
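
In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a similar one, and a space for a mismatch or gap. The sketch below shows one way to compute percent identity for a single block from the query line and that middle line. It assumes the usual BLAST convention of dividing identical columns by the total number of aligned columns, gaps included; the function name `block_identity` is hypothetical. The % identity reported for each hit covers the whole alignment, so a per-block value will generally differ from it.

```python
def block_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block.

    A column counts as identical when the midline repeats the query
    residue; '+' (similar) and ' ' (mismatch or gap) do not count.
    ASSUMPTION: the denominator is the number of aligned columns,
    including gap columns.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment block")
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


# Last block of the first GenBank hit (XP_022137317.1) in this record.
query = "SVIPEGCIDLPVTLGQDRTRVTQMAEFV"
midline = "SVIPEG IDLPVTLGQD+T+VTQMAEFV"
print(round(block_identity(query, midline), 1))
```

The midline is used rather than the `Subjt:` line because, in this record, the `Subjt:` lines repeat the query text and so carry no information about mismatches.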

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813 (e value: 7.7e-224; % identity: 80.3)
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDIQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMD QAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDIQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAE-------------------------IGRGRSGKD-ERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAE                         IGRGRSGKD E ADPKSKDKGSFSSGRAE RRAE+GPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAE-------------------------IGRGRSGKD-ERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPRGAARTKVCRKAQDQ--------------------------------LSRKKGEERKRSRTPPRRTDRPAVIDTIFGGPSGG
        ESGMEKLLKRPEKLRGAP   ++ K CR  ++                                  S +K EERKRSRTPPRRTDRPAVI+TIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPRGAARTKVCRKAQDQ--------------------------------LSRKKGEERKRSRTPPRRTDRPAVIDTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRE
        QSG KRKELARAARREVCIIREQ PTCPI+FDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFS E
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRE

Query:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV
        SVIPEG IDLPVTLGQD+T+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC111018823 (e value: 5.1e-228; % identity: 71.11)
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDIQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMD Q
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDIQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAE-------------------------IGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAE                         I RGRSGKDE+AD KSKDKGSFSSGRAE RRA +GPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAE-------------------------IGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILT

Query:  NIEESGMEKLLKRPEKLRGAPRGAARTKVC-----------------RKAQDQL---------------SRKKGEERKRSRTPPRRTDRPAVIDTIFGGP
        NIEESGMEKLLKRPEKLRGAP    + K C                 R+ +D +               S +K EERK SRTP RR DRPAVI+TIFGGP
Subjt:  NIEESGMEKLLKRPEKLRGAPRGAARTKVC-----------------RKAQDQL---------------SRKKGEERKRSRTPPRRTDRPAVIDTIFGGP

Query:  SGGQSGHKRKELARAARREVCIIREQGPTCPISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQSGHKRKELARAARREVCIIREQ PTCPI+FD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARAARREVCIIREQGPTCPISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SRESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTVHQVLKYPTPNGVGTIRGEQTASRECYASALKGSSVCALETLA
        SRESVIPEGCIDLPVTLG D+T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPST+HQVLKY TPNGVG +RGEQ ASRECYASALKGSSVCALETL 
Subjt:  SRESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTVHQVLKYPTPNGVGTIRGEQTASRECYASALKGSSVCALETLA

Query:  GRDGALEFEADLPRKEFAAPTEELELVPLL
         RDG LEF+A+LPR+EFAAPTEELELVPLL
Subjt:  GRDGALEFEADLPRKEFAAPTEELELVPLL

A0A6J1DHB3 uncharacterized protein LOC111020479 (e value: 3.8e-223; % identity: 58.48)
Query:  MVQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMESMR
        MVQPANSTNT DRR LAA++ HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMESMR

Query:  TQMRTMEEMYSEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEEYTRQRGDLREHLNRKRCSSLRKRQSPSHSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRTMEEMYSEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEEYTRQRGDLREHLNRKRCSSLRKRQSPSHSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDIQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMD QAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDIQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAE-------------------------IGRGRSGKDE-RADPKSKDKG-SFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKR
        FAE                         I +GR+GKD+ +AD KS+DKG S SS R + RR+ S   +SRPYE +TPTTIPI EILTNIEE+GMEKLLKR
Subjt:  FAE-------------------------IGRGRSGKDE-RADPKSKDKG-SFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKR

Query:  PEKLRGAPRGAARTKVCRKAQD---------QLSR-----------------------KKGEERKRSRTPPRRTDRPAVIDTIFGGPSGGQSGHKRKELA
        PEKLRG P      K CR  +D         +L R                       +K EERKR RTPPRR DRPAVI             +K+KELA
Subjt:  PEKLRGAPRGAARTKVCRKAQD---------QLSR-----------------------KKGEERKRSRTPPRRTDRPAVIDTIFGGPSGGQSGHKRKELA

Query:  RAARREVCIIREQGPTCPISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDL
        R ARREVCIIREQ PT  I+F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFS ES+  EGCIDL
Subjt:  RAARREVCIIREQGPTCPISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDL

Query:  PVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTVHQVLKYPTPNGVGTIRGEQTASRECYASALKGSSVCALETLAGRD
        PV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PST+HQVLKY T NGVGT+RGE   SRECYAS  K SSVCALE    RD
Subjt:  PVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTVHQVLKYPTPNGVGTIRGEQTASRECYASALKGSSVCALETLAGRD

A0A6J1DPC9 uncharacterized protein LOC111022280 (e value: 1.9e-182; % identity: 59.44)
Query:  EQRGSHLGPAEEERPEDNESEEYTRQRGDLREHL-NRKRCSSLRKRQSPSHSHR--SSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQ
        E+    + P   E   ++E  +Y+ +  DLR+HL ++K+ +S     S S+S    +SN +A+S +    P  +I REEFD ++   D QVEALKA+CE+
Subjt:  EQRGSHLGPAEEERPEDNESEEYTRQRGDLREHL-NRKRCSSLRKRQSPSHSHR--SSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQ

Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDIQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        K+   +D DLGESPFTSD++EAPIPPKFK PT+KPYDG+KDPKDYVEVFEGLMD QAA+DAIKC AFQIALTGSARLW RRLPARSISTYSQLR+EF+ Q
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDIQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEI-------------------------
        FS RHYD+KTATHLATIRQKE                                   DE LTVKLGEEAPATFAE+                         
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEI-------------------------

Query:  -GRGRSGKDERADPKSKDKGSFSSG-RAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPRGAARTKVCRKAQDQLSRKKG
          +  S K  + D KSKDKGS SSG R E RR+ESGP+RSRPYER       I ++   I++S  +K + +P                       S +K 
Subjt:  -GRGRSGKDERADPKSKDKGSFSSG-RAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPRGAARTKVCRKAQDQLSRKKG

Query:  EERKRSRTPPRRTDRPAVIDTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANI
        EERKRSRTPPRR DRPAVI+TIFGGPSGGQ  +KRKELA  ARR+V IIREQ PTC I+F   DLE VHLPHNDALVIAPLIDHV+VRRVLVDGGASANI
Subjt:  EERKRSRTPPRRTDRPAVIDTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANI

Query:  LSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTVHQVLKYPTPNGVGTIR
        LSLPTYLAL  TRSQLK+SPTPLVGFS ESV PEGCIDLPVT+GQD T+VTQMAEFVVIDGR AYNAIF RPIIHSF+A+PS +HQVLKY TPNGVGT+R
Subjt:  LSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTVHQVLKYPTPNGVGTIR

Query:  GEQTASRECYASALKGSSVCALETLAGRDGALEFEADLPRK
        GEQ  SRECYASALK SSVCALE    +D       DLPR+
Subjt:  GEQTASRECYASALKGSSVCALETLAGRDGALEFEADLPRK

A0A6J1DYW5 uncharacterized protein LOC111024332 (e value: 2.8e-181; % identity: 83.95)
Query:  LADEALTVKLGEEAPATFAE-------------------------IGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEI
        +ADEALTVKLGEEAPATFAE                         IGRGRSGKDERADPKSKDKGSFSSGRAE RRAE+GPTRSRPYERFTPTTIPISEI
Subjt:  LADEALTVKLGEEAPATFAE-------------------------IGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEI

Query:  LTNIEESGMEKLLKRPEKLRGAPRGAARTKVCRKAQDQLSRKKGEERKRSRTPPRRTDRPAVIDTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTC
        LTNIE+SGMEKLLKRPEKLRGAP   ++ K         S +K EERKRSRTPPRRTDRPAVI+TIFGGPSGGQSGHKRKELAR ARREVCIIREQGPTC
Subjt:  LTNIEESGMEKLLKRPEKLRGAPRGAARTKVCRKAQDQLSRKKGEERKRSRTPPRRTDRPAVIDTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTC

Query:  PISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVTQMAEF
        PI+FDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFS ESVIPEGCIDLPVTLGQD+TRVTQM EF
Subjt:  PISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVTQMAEF

Query:  VVIDGRSAYNAIFGRPIIHSFRAIPSTVHQVLKYPTPNGVGTIRGEQTASRECYASALKGSSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLL
        VV+DGRS YNAIFGRPIIHSFR IPST+HQVLKY TPNGVGT+RGEQT SRECYA+ALKGSSVCALETL  RDG LE EADLPRKEFAAPTEELELVPLL
Subjt:  VVIDGRSAYNAIFGRPIIHSFRAIPSTVHQVLKYPTPNGVGTIRGEQTASRECYASALKGSSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLL

Query:  SPEKQ
        SPEKQ
Subjt:  SPEKQ

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCAATGCCCACCAGAGAGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTGGCAACCGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTTCAGAGAGAGATGGAGTCAATGCGCACACAAATGCGCACCATGGAGGAAATGTAT
AGCGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAGCG
TCCCGAAGACAACGAGAGCGAGGAGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGATGCTCGTCTCTCCGAAAAAGGCAGTCACCATCCCACT
CACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAAGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCGCCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTACGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACATCCAAGCGGCATCAGACGCAATCAAGTGCCGCGCCT
TTCAGATCGCGCTCACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCT
CGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGC
ACACTGCTCTGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGATCGGCC
GGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTGTCGAAGGGCGGAGAGCGGACCTACCAGGAGC
CGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGG
AGCCCCGAGAGGCGCAGCAAGGACAAAAGTTTGTAGGAAAGCCCAGGACCAGCTCAGTAGGAAAAAAGGAGAGGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCG
ACCGACCTGCGGTCATCGATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAAGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGG
GAGCAGGGGCCGACCTGCCCAATCTCCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGT
CAGGAGAGTGCTGGTAGATGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAGAAGCCCGACACCGCTGG
TTGGGTTCTCTAGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGAC
GGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACAGTGCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGG
CACGATCCGAGGAGAACAGACCGCGTCGAGGGAGTGTTATGCCTCCGCACTCAAGGGCTCATCGGTCTGCGCCCTCGAAACGCTCGCCGGTAGGGATGGGGCGCTCGAGT
TCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGGGGCAACTCACCGCAAGACCCCAAAGA
GCGCAGAAAGTTGGCATGGCGGGTAGCTCGGTTCGTGGTCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCTTGCCTCTATTGAGATGCCTAACCCCTGA
mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCAATGCCCACCAGAGAGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTGGCAACCGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTTCAGAGAGAGATGGAGTCAATGCGCACACAAATGCGCACCATGGAGGAAATGTAT
AGCGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAGCG
TCCCGAAGACAACGAGAGCGAGGAGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGATGCTCGTCTCTCCGAAAAAGGCAGTCACCATCCCACT
CACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAAGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCGCCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTACGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACATCCAAGCGGCATCAGACGCAATCAAGTGCCGCGCCT
TTCAGATCGCGCTCACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCT
CGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGC
ACACTGCTCTGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGATCGGCC
GGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTGTCGAAGGGCGGAGAGCGGACCTACCAGGAGC
CGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGG
AGCCCCGAGAGGCGCAGCAAGGACAAAAGTTTGTAGGAAAGCCCAGGACCAGCTCAGTAGGAAAAAAGGAGAGGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCG
ACCGACCTGCGGTCATCGATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAAGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGG
GAGCAGGGGCCGACCTGCCCAATCTCCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGT
CAGGAGAGTGCTGGTAGATGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAGAAGCCCGACACCGCTGG
TTGGGTTCTCTAGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGAC
GGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACAGTGCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGG
CACGATCCGAGGAGAACAGACCGCGTCGAGGGAGTGTTATGCCTCCGCACTCAAGGGCTCATCGGTCTGCGCCCTCGAAACGCTCGCCGGTAGGGATGGGGCGCTCGAGT
TCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGGGGCAACTCACCGCAAGACCCCAAAGA
GCGCAGAAAGTTGGCATGGCGGGTAGCTCGGTTCGTGGTCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCTTGCCTCTATTGAGATGCCTAACCCCTGA
Protein sequence
MVQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMESMRTQMRTMEEMY
SEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEEYTRQRGDLREHLNRKRCSSLRKRQSPSHSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEA
LKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDIQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSS
RHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEIGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRS
RPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPRGAARTKVCRKAQDQLSRKKGEERKRSRTPPRRTDRPAVIDTIFGGPSGGQSGHKRKELARAARREVCIIR
EQGPTCPISFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSRESVIPEGCIDLPVTLGQDRTRVTQMAEFVVID
GRSAYNAIFGRPIIHSFRAIPSTVHQVLKYPTPNGVGTIRGEQTASRECYASALKGSSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQGQLTARPQR
AQKVGMAGSSVRGPRWGIVPTWLFLASIEMPNP
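
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (the record does not name a translation table) and uses a hypothetical helper, `translate`. It is demonstrated on the first 60 nucleotides and the last 12 nucleotides of the CDS from this record; passing in the full CDS with line breaks removed works the same way.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
# with bases cycling in the order T, C, A, G. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt and last 12 nt of the CDS given in this record.
cds_start = "ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCAAT"
cds_end = "CCTAACCCCTGA"
print(translate(cds_start))  # matches the start of the protein sequence
print(translate(cds_end))    # matches the end of the protein sequence
```

Both fragments translate to the corresponding ends of the listed protein (`MVQPANSTNTTDRRTLAASN` and `PNP`), which is consistent with the CDS terminating in a `TGA` stop codon.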