; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g08470 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g08470
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:6120105..6125690
RNA-Seq ExpressionMoc04g08470
SyntenyMoc04g08470
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]2.1e-26791.29Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDSDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAALD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LND DLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAA D
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDSDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAALD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKD-ERADPKSKDKGSFSSGRGEYRRTENGPTRSQLYERFTPTTILISEILTNIE
        TVKLGEEAPATFAE+LQKAKKVIDGQEL+RTKTGRPERKIGR RSGKD E ADPKSKDKGSFSSGR EYRR ENGPTRS+ YERFTPTTI ISEILTNIE
Subjt:  TVKLGEEAPATFAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKD-ERADPKSKDKGSFSSGRGEYRRTENGPTRSQLYERFTPTTILISEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPER SK KYCRFHREHGHN SD WELKRQIE+LIQ+GYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQ   CPITFD ADL+EVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV
        SVIPEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]3.0e-26675.45Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDSDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LND DLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDSDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AA DAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKDERADPKSKDKGSFSSGRGEYRRTENGPTRSQLYERFTPTTILISEILT
        DEALTVKLG+EAPATFAE+LQKAKKVIDGQEL+RTKTGRPER I R RSGKDE+AD KSKDKGSFSSGR E+RR  NGPTRS+ YERFTPTTI ISEILT
Subjt:  DEALTVKLGEEAPATFAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKDERADPKSKDKGSFSSGRGEYRRTENGPTRSQLYERFTPTTILISEILT

Query:  NIEDSGMEKLLKRPEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIE+SGMEKLLKRPEKLRGAPER +K KYCRFHREH HN SD WELKRQIEDLIQ+ YFKKFVGKP TSSAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEDSGMEKLLKRPEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQSGHKRKELARAARREVCIIREQ   CPITFD ADL+EVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVVPSTLHQILKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-
        S ESVIPEGCIDLPVTLG DQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFR +PSTLHQ+LKYSTPNGVG VRGEQ ASRECYA+ALKGSSVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVVPSTLHQILKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-

Query:  -RDGALEFEADLPKKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSILEPDLMEIGAPE
         RDG LEF+A+LP++EFAAPTEELELVPLL  +   ++     ++   + + ++ D+   G PE
Subjt:  -RDGALEFEADLPKKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSILEPDLMEIGAPE

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]5.7e-21788.39Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKD-ERADPKSKDKGSFSSGRGEYRRTENGPTRSQLYERFTP
        MCYFLTGLADEALTVKL EEAPATFAE+LQKAKKVIDGQEL+RT       KIG+ RSGKD E  DPKSKDKGSFS+GR EYRR ENGPTRS+ YERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKD-ERADPKSKDKGSFSSGRGEYRRTENGPTRSQLYERFTP

Query:  TTILISEILTNIEDSGMEKLLKRPEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRP
        TTI ISEILTNIE+SGMEKLLKRPEKLRGAPER SK KYCRFHREHGHN SD WELK QIEDLIQ+GYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRP
Subjt:  TTILISEILTNIEDSGMEKLLKRPEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQ   CPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVVPSTLHQILKYSTPNGVGTVRGEQTASRECYAAALKG
        K+SPTPLVGFSGESV+PEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFR +PSTLHQ+LKYSTPNGVGTVRGEQTASRECYA+ LKG
Subjt:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVVPSTLHQILKYSTPNGVGTVRGEQTASRECYAAALKG

Query:  SSVCALETL--RDGALEFEADLPKKEFAAPTEELELVPLLSPEKQTDL
        +SVCALETL  RDG LEFEADLP +EFAAP EELELVPLLS EKQ  L
Subjt:  SSVCALETL--RDGALEFEADLPKKEFAAPTEELELVPLLSPEKQTDL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]5.8e-26264.18Show/hide
Query:  MVQPADSTNTTDRRTLAASDAHPREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATCGRGGTSKKGAQGPAPAPTSENFDALQREMEAMR
        MVQPA+STNT DRR LAA+  H REVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPADSTNTTDRRTLAASDAHPREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATCGRGGTSKKGAQGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQGGSHLGPVEEERLEDNESEGYTRQRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQGGSHLGPVEEERLEDNESEGYTRQRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDSDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAALDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +D DLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDSDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAALDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKDE-RADPKSKDKG-SFSSGRGEYRRTENGPTRSQLYERFTPTTILISEILTNIEDSGMEKLLKR
        FAE+LQK KKVIDGQEL+RTKTGRPE+ I + R+GKD+ +AD KS+DKG S SS R +YRR+ +   +S+ YE +TPTTI I EILTNIE++GMEKLLKR
Subjt:  FAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKDE-RADPKSKDKG-SFSSGRGEYRRTENGPTRSQLYERFTPTTILISEILTNIEDSGMEKLLKR

Query:  PEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA
        PEKLRG PE+ +  KYCRFHR+HGHN S+ WELKRQIEDLIQ+GYFKKFVGKP ++S EKKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  PEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA

Query:  RAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDL
        R ARREVCIIREQ     I F+ ADL+ VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+  EGCIDL
Subjt:  RAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDL

Query:  PVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVVPSTLHQILKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLRD
        PV++ QD T+VTQMAEFVV+DGRSAYNAIFGRPIIHSFR VPSTLHQ+LKYST NGVGTVRGE   SRECYA+  K SSVCALE  T+RD
Subjt:  PVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVVPSTLHQILKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]5.4e-21572.29Show/hide
Query:  MDFQAALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKDER-ADPKSKDKGSFSS-GRGEYRRTENGPTRSQLYERFTPTTIL
        T LADE LTVKLGEEAP TF E+LQKAKKVIDGQEL+RTKTGRPE++I + +  +++R AD KS+DKGS SS  R EYRR E+GP+RS+ YER+T +TI 
Subjt:  TGLADEALTVKLGEEAPATFAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKDER-ADPKSKDKGSFSS-GRGEYRRTENGPTRSQLYERFTPTTIL

Query:  ISEILTNIEDSGMEKLLKRPEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIE+SGMEKLLKRPEKLRG  E+ +K KYCRFHR+HGHN + CWELKRQIEDLIQ+GYFKKFVGKP ++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEDSGMEKLLKRPEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSP
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE    C ITF DADL+ VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVVPSTLHQILKYSTPNGVGTVRGEQTASRECYAAALKGSSVC
                      GCIDLPVT+GQD T+VTQMAEFVV+DGRSAYNAIFGRPIIHSFR VPSTLHQ+LKYSTPN VG VRGEQ  SRECYA+ALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVVPSTLHQILKYSTPNGVGTVRGEQTASRECYAAALKGSSVC

Query:  ALE--TLRDGALEFEADLP---KKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSILE
        ALE  T R    E EADLP   K++F  PTEELELVPLLSPE+Q +  +   V  ++ P  L+
Subjt:  ALE--TLRDGALEFEADLP---KKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSILE

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.0e-26791.29Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDSDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAALD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LND DLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAA D
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDSDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAALD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKD-ERADPKSKDKGSFSSGRGEYRRTENGPTRSQLYERFTPTTILISEILTNIE
        TVKLGEEAPATFAE+LQKAKKVIDGQEL+RTKTGRPERKIGR RSGKD E ADPKSKDKGSFSSGR EYRR ENGPTRS+ YERFTPTTI ISEILTNIE
Subjt:  TVKLGEEAPATFAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKD-ERADPKSKDKGSFSSGRGEYRRTENGPTRSQLYERFTPTTILISEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPER SK KYCRFHREHGHN SD WELKRQIE+LIQ+GYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQ   CPITFD ADL+EVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV
        SVIPEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188231.4e-26675.45Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDSDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LND DLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDSDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AA DAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKDERADPKSKDKGSFSSGRGEYRRTENGPTRSQLYERFTPTTILISEILT
        DEALTVKLG+EAPATFAE+LQKAKKVIDGQEL+RTKTGRPER I R RSGKDE+AD KSKDKGSFSSGR E+RR  NGPTRS+ YERFTPTTI ISEILT
Subjt:  DEALTVKLGEEAPATFAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKDERADPKSKDKGSFSSGRGEYRRTENGPTRSQLYERFTPTTILISEILT

Query:  NIEDSGMEKLLKRPEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIE+SGMEKLLKRPEKLRGAPER +K KYCRFHREH HN SD WELKRQIEDLIQ+ YFKKFVGKP TSSAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEDSGMEKLLKRPEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQSGHKRKELARAARREVCIIREQ   CPITFD ADL+EVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVVPSTLHQILKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-
        S ESVIPEGCIDLPVTLG DQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFR +PSTLHQ+LKYSTPNGVG VRGEQ ASRECYA+ALKGSSVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVVPSTLHQILKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-

Query:  -RDGALEFEADLPKKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSILEPDLMEIGAPE
         RDG LEF+A+LP++EFAAPTEELELVPLL  +   ++     ++   + + ++ D+   G PE
Subjt:  -RDGALEFEADLPKKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSILEPDLMEIGAPE

A0A6J1DD03 uncharacterized protein LOC1110198992.8e-21788.39Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKD-ERADPKSKDKGSFSSGRGEYRRTENGPTRSQLYERFTP
        MCYFLTGLADEALTVKL EEAPATFAE+LQKAKKVIDGQEL+RT       KIG+ RSGKD E  DPKSKDKGSFS+GR EYRR ENGPTRS+ YERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKD-ERADPKSKDKGSFSSGRGEYRRTENGPTRSQLYERFTP

Query:  TTILISEILTNIEDSGMEKLLKRPEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRP
        TTI ISEILTNIE+SGMEKLLKRPEKLRGAPER SK KYCRFHREHGHN SD WELK QIEDLIQ+GYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRP
Subjt:  TTILISEILTNIEDSGMEKLLKRPEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQ   CPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVVPSTLHQILKYSTPNGVGTVRGEQTASRECYAAALKG
        K+SPTPLVGFSGESV+PEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFR +PSTLHQ+LKYSTPNGVGTVRGEQTASRECYA+ LKG
Subjt:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVVPSTLHQILKYSTPNGVGTVRGEQTASRECYAAALKG

Query:  SSVCALETL--RDGALEFEADLPKKEFAAPTEELELVPLLSPEKQTDL
        +SVCALETL  RDG LEFEADLP +EFAAP EELELVPLLS EKQ  L
Subjt:  SSVCALETL--RDGALEFEADLPKKEFAAPTEELELVPLLSPEKQTDL

A0A6J1DHB3 uncharacterized protein LOC1110204792.8e-26264.18Show/hide
Query:  MVQPADSTNTTDRRTLAASDAHPREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATCGRGGTSKKGAQGPAPAPTSENFDALQREMEAMR
        MVQPA+STNT DRR LAA+  H REVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPADSTNTTDRRTLAASDAHPREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATCGRGGTSKKGAQGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQGGSHLGPVEEERLEDNESEGYTRQRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQGGSHLGPVEEERLEDNESEGYTRQRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDSDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAALDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +D DLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDSDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAALDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKDE-RADPKSKDKG-SFSSGRGEYRRTENGPTRSQLYERFTPTTILISEILTNIEDSGMEKLLKR
        FAE+LQK KKVIDGQEL+RTKTGRPE+ I + R+GKD+ +AD KS+DKG S SS R +YRR+ +   +S+ YE +TPTTI I EILTNIE++GMEKLLKR
Subjt:  FAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKDE-RADPKSKDKG-SFSSGRGEYRRTENGPTRSQLYERFTPTTILISEILTNIEDSGMEKLLKR

Query:  PEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA
        PEKLRG PE+ +  KYCRFHR+HGHN S+ WELKRQIEDLIQ+GYFKKFVGKP ++S EKKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  PEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA

Query:  RAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDL
        R ARREVCIIREQ     I F+ ADL+ VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+  EGCIDL
Subjt:  RAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDL

Query:  PVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVVPSTLHQILKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLRD
        PV++ QD T+VTQMAEFVV+DGRSAYNAIFGRPIIHSFR VPSTLHQ+LKYST NGVGTVRGE   SRECYA+  K SSVCALE  T+RD
Subjt:  PVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVVPSTLHQILKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLRD

A0A6J1DZB9 uncharacterized protein LOC1110249042.6e-21572.29Show/hide
Query:  MDFQAALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKDER-ADPKSKDKGSFSS-GRGEYRRTENGPTRSQLYERFTPTTIL
        T LADE LTVKLGEEAP TF E+LQKAKKVIDGQEL+RTKTGRPE++I + +  +++R AD KS+DKGS SS  R EYRR E+GP+RS+ YER+T +TI 
Subjt:  TGLADEALTVKLGEEAPATFAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKDER-ADPKSKDKGSFSS-GRGEYRRTENGPTRSQLYERFTPTTIL

Query:  ISEILTNIEDSGMEKLLKRPEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIE+SGMEKLLKRPEKLRG  E+ +K KYCRFHR+HGHN + CWELKRQIEDLIQ+GYFKKFVGKP ++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEDSGMEKLLKRPEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSP
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE    C ITF DADL+ VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVVPSTLHQILKYSTPNGVGTVRGEQTASRECYAAALKGSSVC
                      GCIDLPVT+GQD T+VTQMAEFVV+DGRSAYNAIFGRPIIHSFR VPSTLHQ+LKYSTPN VG VRGEQ  SRECYA+ALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVVPSTLHQILKYSTPNGVGTVRGEQTASRECYAAALKGSSVC

Query:  ALE--TLRDGALEFEADLP---KKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSILE
        ALE  T R    E EADLP   K++F  PTEELELVPLLSPE+Q +  +   V  ++ P  L+
Subjt:  ALE--TLRDGALEFEADLP---KKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSILE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGGATTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCCTAGGGAGGTCGGAGCAGCGGCGGTAGAGGGGCAAGGTCACGA
CGGCCTCGCAACGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCTGTGGCCGAGGTGGGACCTCTA
AAAAGGGCGCCCAGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTAT
AACGAAATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAGGGGGTTCCCACCTCGGCCCAGTCGAGGAAGAACG
TCTCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGCGAGCATCTCAACAAAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCTAAATGTGAGCAGAAAGACGATTCACTGAACGACAGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGACGGGACTAAGGACCCCAAGGACTACGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATTAGACGCAATCAAATGCCGCGCCT
TTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCT
CGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCCGATTCCAGGAGGAGCAGTTGAAGGTTGC
ACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGATGCTCC
AGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCATCCGGACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGAGCAGAAGTGGAAAAGATGAAAGGGCAGATCCC
AAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGGTGAGTATCGAAGGACGGAGAACGGACCTACCAGGAGCCAACTTTACGAGCGCTTCACCCCAACCACGATTCT
AATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAATTACTCAAGCGTCCGGAGAAGCTTCGGGGAGCCCCGGAGAGGCACAGCAAGTACAAGTATTGCC
GCTTCCATCGGGAGCACGGCCACAACATGTCGGACTGCTGGGAATTGAAGCGTCAAATTGAGGATCTAATTCAAGAAGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGG
ACCAGCTCAGCAGAGAAGAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGACGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCA
ATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATTAGGGAGCAGGGGCTGCCCTGCCCAATCACCTTCGACGATGCAGACTTGAAGG
AGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATAGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTA
CCAACCTACCTCGCCTTGGGTTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTCGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCC
GGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAGATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCAT
TTCGGGTCGTTCCCTCAACACTTCATCAAATTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAATGCTATGCCGCCGCA
CTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGGCGCTCGAGTTCGAGGCCGACCTGCCGAAGAAGGAGTTTGCCGCACCTACGGAGGAGCTCGAGCT
TGTTCCTCTGCTTAGTCCCGAGAAGCAGACCGACCTCGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTC
CAGAATCCTCATGGATGGACCCGATCACGGACTTCATTAGGGGCAGCTCACCACAAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGAGGGTCCAAGCG
CATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAAGAGATGTCCTCGCGCACCC
GTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGGATTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCCTAGGGAGGTCGGAGCAGCGGCGGTAGAGGGGCAAGGTCACGA
CGGCCTCGCAACGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCTGTGGCCGAGGTGGGACCTCTA
AAAAGGGCGCCCAGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTAT
AACGAAATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAGGGGGTTCCCACCTCGGCCCAGTCGAGGAAGAACG
TCTCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGCGAGCATCTCAACAAAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCTAAATGTGAGCAGAAAGACGATTCACTGAACGACAGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGACGGGACTAAGGACCCCAAGGACTACGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATTAGACGCAATCAAATGCCGCGCCT
TTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCT
CGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCCGATTCCAGGAGGAGCAGTTGAAGGTTGC
ACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGATGCTCC
AGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCATCCGGACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGAGCAGAAGTGGAAAAGATGAAAGGGCAGATCCC
AAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGGTGAGTATCGAAGGACGGAGAACGGACCTACCAGGAGCCAACTTTACGAGCGCTTCACCCCAACCACGATTCT
AATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAATTACTCAAGCGTCCGGAGAAGCTTCGGGGAGCCCCGGAGAGGCACAGCAAGTACAAGTATTGCC
GCTTCCATCGGGAGCACGGCCACAACATGTCGGACTGCTGGGAATTGAAGCGTCAAATTGAGGATCTAATTCAAGAAGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGG
ACCAGCTCAGCAGAGAAGAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGACGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCA
ATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATTAGGGAGCAGGGGCTGCCCTGCCCAATCACCTTCGACGATGCAGACTTGAAGG
AGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATAGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTA
CCAACCTACCTCGCCTTGGGTTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTCGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCC
GGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAGATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCAT
TTCGGGTCGTTCCCTCAACACTTCATCAAATTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAATGCTATGCCGCCGCA
CTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGGCGCTCGAGTTCGAGGCCGACCTGCCGAAGAAGGAGTTTGCCGCACCTACGGAGGAGCTCGAGCT
TGTTCCTCTGCTTAGTCCCGAGAAGCAGACCGACCTCGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTC
CAGAATCCTCATGGATGGACCCGATCACGGACTTCATTAGGGGCAGCTCACCACAAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGAGGGTCCAAGCG
CATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAAGAGATGTCCTCGCGCACCC
GTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPADSTNTTDRRTLAASDAHPREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATCGRGGTSKKGAQGPAPAPTSENFDALQREMEAMRTQMRSMEAMY
NEMVLAAGAGSRSENRVTRMDVREQGGSHLGPVEEERLEDNESEGYTRQRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEA
LKAKCEQKDDSLNDSDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSS
RHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEMLQKAKKVIDGQELIRTKTGRPERKIGRSRSGKDERADP
KSKDKGSFSSGRGEYRRTENGPTRSQLYERFTPTTILISEILTNIEDSGMEKLLKRPEKLRGAPERHSKYKYCRFHREHGHNMSDCWELKRQIEDLIQEGYFKKFVGKPG
TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGLPCPITFDDADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSL
PTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVVPSTLHQILKYSTPNGVGTVRGEQTASRECYAAA
LKGSSVCALETLRDGALEFEADLPKKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSILEPDLMEIGAPESSWMDPITDFIRGSSPQDPKERRKLARRAARRVQA
HVGALDPAWEGPFEIKGIVRPGTYILADLKRDVLAHPWNAEHLKRYYP