; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g16440 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g16440
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:12649429..12656852
RNA-Seq ExpressionMoc08g16440
SyntenyMoc08g16440
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]6.4e-21977.76Show/hide
Query:  LKAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPPKFKTPTMNPYDGSKDPKDYVE------------
        +KA+S   P TP  VITREEFD ++ + D QVEALKA+CE+KE   +DGDLGESPFTSD+LEAPIPPKFK PT+ PYDGSKDPKDYVE            
Subjt:  LKAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPPKFKTPTMNPYDGSKDPKDYVE------------

Query:  ---------IALTGSARLWYRRLPASSISTYSQLRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLADET
                 IALTGSARLWYRRLPA SISTYSQLR+EF+  FSS HY +K ATHLA IRQKEGETLREYVT FQEEQLKVAHCSDDSAMCYFLTGLADE 
Subjt:  ---------IALTGSARLWYRRLPASSISTYSQLRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLADET

Query:  LTVKLGEEARATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEILTN
        LTVKLGEEA ATFAEVLQKAKKVIDGQELLRTKTGRPE++I + +  ++   AD KSKDKGS SSG R EYRR+E G  RSRPYER+TPTTIPISEILTN
Subjt:  LTVKLGEEARATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEILTN

Query:  IEESGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFGGPS
        IEESGMEKLLKRPEKLRG PE+ +KDKYCRFHR+H HNT++ WELKRQIE+LIQDGYFKKFVGKPR +S EKKEERKRSRTPPRR DR AVINTIFGGPS
Subjt:  IEESGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFGGPS

Query:  GGQSGNKRKELAREARHEVCIIREQKPTCSITFGDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLVGFS
        GGQSG KRKELAR AR EVCIIREQ+PTC ITF   DLE VHLP+ND LVIAPLIDHV+V RVLVDGG S NILSLPTYLALGWTRSQLKKSPTPLVGFS
Subjt:  GGQSGNKRKELAREARHEVCIIREQKPTCSITFGDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLVGFS

Query:  GESVSPEGCIDFPVTIG
        GESV PEG ID PVT+G
Subjt:  GESVSPEGCIDFPVTIG

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]9.3e-20259.12Show/hide
Query:  NSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPPKFKTPTMNPYDGSKDPKDYVE---------
        +S+ +A+S + P TP+ VITREEFD ++ + + QVEALKA+CE+KE   +DGDLGESPFTSD+LEA        PT+  YDGSKDPKDYVE         
Subjt:  NSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPPKFKTPTMNPYDGSKDPKDYVE---------

Query:  ------------IALTGSARLWYRRLPASSISTYSQLRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLA
                    IALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  ------------IALTGSARLWYRRLPASSISTYSQLRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DETLTVKLGEEARATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEI
        DE LTVKLG+EA ATFAEVLQKAKKVIDGQELLRTKTGRPE+ ID+ + S +  KAD KSKDKGS SSG R E+RR+  G  RSRPYER+TPTTIPISEI
Subjt:  DETLTVKLGEEARATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEI

Query:  LTNIEESGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFG
        LTNIEESGMEKLLKRPEKLRG PE+ NKDKYCRFHR+H+HNT++ WELKRQIEDLIQD YFKKFVGKPR +S EKKEERK SRTP RR DR AVINTIFG
Subjt:  LTNIEESGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFG

Query:  GPSGGQSGNKRKELAREARHEVCIIREQKPTCSITFGDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLV
        GPSGGQSG+KRKELAR AR EVCIIREQ+PTC ITF   DLE VHLP+ND LVIAPLIDHV+VRRVLVD G S NI+SL TYLALGWTRSQLKKS TPLV
Subjt:  GPSGGQSGNKRKELAREARHEVCIIREQKPTCSITFGDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLV

Query:  GFSGESVSPEGCIDFPVTIGGNLIYTLNPNLGKGELGFDLGLLPSITCDRARLHESLIFLKVEKIPTPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQK
        GFS ESV PEGCID PVT+G +                   +   +  D    + ++          PIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ 
Subjt:  GFSGESVSPEGCIDFPVTIGGNLIYTLNPNLGKGELGFDLGLLPSITCDRARLHESLIFLKVEKIPTPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQK

Query:  TSRECYASALKGSSVCALEEQTSQD----HLSRVGKRQFSPPTEELELVPLLSPEKQTDLARSVPVEILDSPSILEPDVMEVDTPSPSWMDPIVEFIKGN
         SRECYASALKGSSVCALE   S+D      + + +R+F+ PTEELELVPLL  +   ++     ++   S + ++ D+     P P  +       K +
Subjt:  TSRECYASALKGSSVCALEEQTSQD----HLSRVGKRQFSPPTEELELVPLLSPEKQTDLARSVPVEILDSPSILEPDVMEVDTPSPSWMDPIVEFIKGN

Query:  PL
        PL
Subjt:  PL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]3.9e-24073.35Show/hide
Query:  KAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPPKFKTPTMNPYDGSKDPKDYVE-------------
        KA+S Y P+TP  VITREEFD +K +FD QVEALKARCEKKE SFDDGDLGE  F+SDILEA IPPKFKTPTM PYDGSKDPKDYVE             
Subjt:  KAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPPKFKTPTMNPYDGSKDPKDYVE-------------

Query:  --------IALTGSARLWYRRLPASSISTYSQLRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLADETL
                IALTGSARLWYRRLPA  ISTYSQLRKEFI+QFSS HY RK  THLA IRQKEGETLREYVT F EEQLKVAHCSDDSAMCYFLTGLADETL
Subjt:  --------IALTGSARLWYRRLPASSISTYSQLRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLADETL

Query:  TVKLGEEARATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEILTNI
        TVKL EEA ATFAEVLQK KKVIDGQELLRTKTGRPEK IDQ +  ++K KADSKS+DKG SSS SR +YRRS   HN+SRPYE YTPTTIPI EILTNI
Subjt:  TVKLGEEARATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFGGPSG
        EE+GMEKLLKRPEKLRGDPEK N DKYCRFHRDH HNT+N WELKRQIEDLIQDGYFKKFVGKPR NSVEKKEERKR RTPPRR+DR AVI         
Subjt:  EESGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFGGPSG

Query:  GQSGNKRKELAREARHEVCIIREQKPTCSITFGDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLVGFSG
            NK+KELAREAR EVCIIREQ+PT SI F   DLEGVHLP+ND LVIAPLID VLVRR+LVDGGAS NILSL TYLALGWTRSQLKKSPTPLVGFSG
Subjt:  GQSGNKRKELAREARHEVCIIREQKPTCSITFGDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLVGFSG

Query:  ESVSPEGCIDFPVTIGGNLIYTLNPNLGKGELGFDLGLLPSITCDRARLHESLIFLKVEKIPTPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRE
        ES+S EGCID PV+I  +                   +   +  D    + ++          PIIHSFRAVPSTLHQVLKYST NGVGTVRGE KTSRE
Subjt:  ESVSPEGCIDFPVTIGGNLIYTLNPNLGKGELGFDLGLLPSITCDRARLHESLIFLKVEKIPTPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRE

Query:  CYASALKGSSVCALEEQTSQDHL
        CYAS  K SSVCALEEQT +D L
Subjt:  CYASALKGSSVCALEEQTSQDHL

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]5.8e-23667.32Show/hide
Query:  VPGAPREKEGRVPSFHPGDREPVPNNEGVDYSLRDNDLRKHLTYKKKRASRESEDSPSYSREFSNSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQVE
        +PGAP EK    PS  PG+REP+PN+EGVDYSLRDNDLRKHLT KKK+AS E EDS SYSREFSNS+LKAQSKYKPL PEAVI REEFDLMKHRFDEQVE
Subjt:  VPGAPREKEGRVPSFHPGDREPVPNNEGVDYSLRDNDLRKHLTYKKKRASRESEDSPSYSREFSNSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQVE

Query:  ALKARCEKKECSFDDGDLGESPFTSDILEAPIPPKFKTPTMNPYDGSKDPKDYVE---------------------IALTGSARLWYRRLPASSISTYSQ
        ALKARCEKKE  FDD DLGESPFTSDI+EAPIPPKFKTPTM PYDGSKDPKDYVE                     IALTGSARLW RRLPA SISTYSQ
Subjt:  ALKARCEKKECSFDDGDLGESPFTSDILEAPIPPKFKTPTMNPYDGSKDPKDYVE---------------------IALTGSARLWYRRLPASSISTYSQ

Query:  LRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEARATFAEVLQKAKKVIDGQELLRTK
        LRKEFI QFS  HY RK ATHLA IRQKE                                   DETLTVKLGEEA ATFAEVLQ AKKVIDGQELLRTK
Subjt:  LRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEARATFAEVLQKAKKVIDGQELLRTK

Query:  TGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKCNKDKYCRFHR
        T RPEKQIDQK+LSQ+KRK DSKSKDKGSSSSGSRTEYRRSE G +RSRPYER                                               
Subjt:  TGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKCNKDKYCRFHR

Query:  DHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFGGPSGGQSGNKRKELAREARHEVCIIREQKPTCSITF
                CWELKRQIEDLIQD YFKKFVGKPR NSVEKKEERKRSRTPPRREDR AVINTIFGGPSGGQ  NKRKELA EAR +V IIREQKPTCSITF
Subjt:  DHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFGGPSGGQSGNKRKELAREARHEVCIIREQKPTCSITF

Query:  GDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDFPVTIGGNLIYTLNPNLGKGEL
         DTDLEGVHLP+ND LVIAPLIDHVLVRRVLVDGGAS NILSLPTYLAL  TRSQLKKSPTPLVGFS ESVSPEGCID PVTIG +              
Subjt:  GDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDFPVTIGGNLIYTLNPNLGKGEL

Query:  GFDLGLLPSITCDRARLHESLIFLKVEKIPTPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDHLSRVGKRQFS
           +  +        RL  + IF +      PIIHSF+AVPS LHQVLKYSTPNGVGTVRGEQKTSRECYASALK SSVCALEEQTSQD L R  K    
Subjt:  GFDLGLLPSITCDRARLHESLIFLKVEKIPTPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDHLSRVGKRQFS

Query:  PPTEELELVPLLS
               L P L+
Subjt:  PPTEELELVPLLS

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]1.7e-21172.18Show/hide
Query:  EIALTGSARLWYRRLPASSISTYSQLRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEE
        +IALTGSARLWYRRLPA SISTYSQLRKEFI+QFSS HY RK ATHLA IRQKE ETLREYVT FQEEQLKVAHCSDDSAMCYFLT LADETLTVKLGEE
Subjt:  EIALTGSARLWYRRLPASSISTYSQLRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEE

Query:  ARATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEILTNIEESGMEK
        A  TF EVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKS+DKGSSSS SRTEYRR E G +RSRPYERYT +TIPISEILTNIEESGMEK
Subjt:  ARATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEILTNIEESGMEK

Query:  LLKRPEKLRGDPEKCNKDKYCRFHRDHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFGGPSGGQSGNKR
        LLKRPEKLRGD EK NK+KYCRFHRDH HNTT+CWELKRQIEDLIQDGYFKKFVGKPR NSVEKKEERKRSRTPPRREDR AVINTIFGGP+GGQSGNKR
Subjt:  LLKRPEKLRGDPEKCNKDKYCRFHRDHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFGGPSGGQSGNKR

Query:  KELAREARHEVCIIREQKPTCSITFGDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEG
        KELAREAR EVCIIRE KPTCSITFGD DLEGVHLP+ND LVIA LIDH LVRRVL+DG                                        G
Subjt:  KELAREARHEVCIIREQKPTCSITFGDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEG

Query:  CIDFPVTIGGNLIYTLNPNLGKGELGFDLGLLPSITCDRARLHESLIFLKVEKIPTPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALK
        CID PVTIG +                   +   +  D    + ++          PIIHSFRAVPSTLHQVLKYSTPN VG VRGEQKTSRECYASALK
Subjt:  CIDFPVTIGGNLIYTLNPNLGKGELGFDLGLLPSITCDRARLHESLIFLKVEKIPTPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALK

Query:  GSSVCALEEQT-------SQDHLSRVGKRQFSPPTEELELVPLLSPEKQTDLARSVPVEILDSPSILE
        GS+VCALEEQT       S+  L + GKRQF PPTEELELVPLLSPE+Q +  +   V  +++P  L+
Subjt:  GSSVCALEEQT-------SQDHLSRVGKRQFSPPTEELELVPLLSPEKQTDLARSVPVEILDSPSILE

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.1e-21977.76Show/hide
Query:  LKAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPPKFKTPTMNPYDGSKDPKDYVE------------
        +KA+S   P TP  VITREEFD ++ + D QVEALKA+CE+KE   +DGDLGESPFTSD+LEAPIPPKFK PT+ PYDGSKDPKDYVE            
Subjt:  LKAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPPKFKTPTMNPYDGSKDPKDYVE------------

Query:  ---------IALTGSARLWYRRLPASSISTYSQLRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLADET
                 IALTGSARLWYRRLPA SISTYSQLR+EF+  FSS HY +K ATHLA IRQKEGETLREYVT FQEEQLKVAHCSDDSAMCYFLTGLADE 
Subjt:  ---------IALTGSARLWYRRLPASSISTYSQLRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLADET

Query:  LTVKLGEEARATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEILTN
        LTVKLGEEA ATFAEVLQKAKKVIDGQELLRTKTGRPE++I + +  ++   AD KSKDKGS SSG R EYRR+E G  RSRPYER+TPTTIPISEILTN
Subjt:  LTVKLGEEARATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEILTN

Query:  IEESGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFGGPS
        IEESGMEKLLKRPEKLRG PE+ +KDKYCRFHR+H HNT++ WELKRQIE+LIQDGYFKKFVGKPR +S EKKEERKRSRTPPRR DR AVINTIFGGPS
Subjt:  IEESGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFGGPS

Query:  GGQSGNKRKELAREARHEVCIIREQKPTCSITFGDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLVGFS
        GGQSG KRKELAR AR EVCIIREQ+PTC ITF   DLE VHLP+ND LVIAPLIDHV+V RVLVDGG S NILSLPTYLALGWTRSQLKKSPTPLVGFS
Subjt:  GGQSGNKRKELAREARHEVCIIREQKPTCSITFGDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLVGFS

Query:  GESVSPEGCIDFPVTIG
        GESV PEG ID PVT+G
Subjt:  GESVSPEGCIDFPVTIG

A0A6J1D9E1 uncharacterized protein LOC1110188234.5e-20259.12Show/hide
Query:  NSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPPKFKTPTMNPYDGSKDPKDYVE---------
        +S+ +A+S + P TP+ VITREEFD ++ + + QVEALKA+CE+KE   +DGDLGESPFTSD+LEA        PT+  YDGSKDPKDYVE         
Subjt:  NSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPPKFKTPTMNPYDGSKDPKDYVE---------

Query:  ------------IALTGSARLWYRRLPASSISTYSQLRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLA
                    IALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  ------------IALTGSARLWYRRLPASSISTYSQLRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DETLTVKLGEEARATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEI
        DE LTVKLG+EA ATFAEVLQKAKKVIDGQELLRTKTGRPE+ ID+ + S +  KAD KSKDKGS SSG R E+RR+  G  RSRPYER+TPTTIPISEI
Subjt:  DETLTVKLGEEARATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEI

Query:  LTNIEESGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFG
        LTNIEESGMEKLLKRPEKLRG PE+ NKDKYCRFHR+H+HNT++ WELKRQIEDLIQD YFKKFVGKPR +S EKKEERK SRTP RR DR AVINTIFG
Subjt:  LTNIEESGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFG

Query:  GPSGGQSGNKRKELAREARHEVCIIREQKPTCSITFGDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLV
        GPSGGQSG+KRKELAR AR EVCIIREQ+PTC ITF   DLE VHLP+ND LVIAPLIDHV+VRRVLVD G S NI+SL TYLALGWTRSQLKKS TPLV
Subjt:  GPSGGQSGNKRKELAREARHEVCIIREQKPTCSITFGDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLV

Query:  GFSGESVSPEGCIDFPVTIGGNLIYTLNPNLGKGELGFDLGLLPSITCDRARLHESLIFLKVEKIPTPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQK
        GFS ESV PEGCID PVT+G +                   +   +  D    + ++          PIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ 
Subjt:  GFSGESVSPEGCIDFPVTIGGNLIYTLNPNLGKGELGFDLGLLPSITCDRARLHESLIFLKVEKIPTPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQK

Query:  TSRECYASALKGSSVCALEEQTSQD----HLSRVGKRQFSPPTEELELVPLLSPEKQTDLARSVPVEILDSPSILEPDVMEVDTPSPSWMDPIVEFIKGN
         SRECYASALKGSSVCALE   S+D      + + +R+F+ PTEELELVPLL  +   ++     ++   S + ++ D+     P P  +       K +
Subjt:  TSRECYASALKGSSVCALEEQTSQD----HLSRVGKRQFSPPTEELELVPLLSPEKQTDLARSVPVEILDSPSILEPDVMEVDTPSPSWMDPIVEFIKGN

Query:  PL
        PL
Subjt:  PL

A0A6J1DHB3 uncharacterized protein LOC1110204791.9e-24073.35Show/hide
Query:  KAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPPKFKTPTMNPYDGSKDPKDYVE-------------
        KA+S Y P+TP  VITREEFD +K +FD QVEALKARCEKKE SFDDGDLGE  F+SDILEA IPPKFKTPTM PYDGSKDPKDYVE             
Subjt:  KAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPPKFKTPTMNPYDGSKDPKDYVE-------------

Query:  --------IALTGSARLWYRRLPASSISTYSQLRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLADETL
                IALTGSARLWYRRLPA  ISTYSQLRKEFI+QFSS HY RK  THLA IRQKEGETLREYVT F EEQLKVAHCSDDSAMCYFLTGLADETL
Subjt:  --------IALTGSARLWYRRLPASSISTYSQLRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLADETL

Query:  TVKLGEEARATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEILTNI
        TVKL EEA ATFAEVLQK KKVIDGQELLRTKTGRPEK IDQ +  ++K KADSKS+DKG SSS SR +YRRS   HN+SRPYE YTPTTIPI EILTNI
Subjt:  TVKLGEEARATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFGGPSG
        EE+GMEKLLKRPEKLRGDPEK N DKYCRFHRDH HNT+N WELKRQIEDLIQDGYFKKFVGKPR NSVEKKEERKR RTPPRR+DR AVI         
Subjt:  EESGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFGGPSG

Query:  GQSGNKRKELAREARHEVCIIREQKPTCSITFGDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLVGFSG
            NK+KELAREAR EVCIIREQ+PT SI F   DLEGVHLP+ND LVIAPLID VLVRR+LVDGGAS NILSL TYLALGWTRSQLKKSPTPLVGFSG
Subjt:  GQSGNKRKELAREARHEVCIIREQKPTCSITFGDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLVGFSG

Query:  ESVSPEGCIDFPVTIGGNLIYTLNPNLGKGELGFDLGLLPSITCDRARLHESLIFLKVEKIPTPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRE
        ES+S EGCID PV+I  +                   +   +  D    + ++          PIIHSFRAVPSTLHQVLKYST NGVGTVRGE KTSRE
Subjt:  ESVSPEGCIDFPVTIGGNLIYTLNPNLGKGELGFDLGLLPSITCDRARLHESLIFLKVEKIPTPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRE

Query:  CYASALKGSSVCALEEQTSQDHL
        CYAS  K SSVCALEEQT +D L
Subjt:  CYASALKGSSVCALEEQTSQDHL

A0A6J1DPC9 uncharacterized protein LOC1110222802.8e-23667.32Show/hide
Query:  VPGAPREKEGRVPSFHPGDREPVPNNEGVDYSLRDNDLRKHLTYKKKRASRESEDSPSYSREFSNSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQVE
        +PGAP EK    PS  PG+REP+PN+EGVDYSLRDNDLRKHLT KKK+AS E EDS SYSREFSNS+LKAQSKYKPL PEAVI REEFDLMKHRFDEQVE
Subjt:  VPGAPREKEGRVPSFHPGDREPVPNNEGVDYSLRDNDLRKHLTYKKKRASRESEDSPSYSREFSNSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQVE

Query:  ALKARCEKKECSFDDGDLGESPFTSDILEAPIPPKFKTPTMNPYDGSKDPKDYVE---------------------IALTGSARLWYRRLPASSISTYSQ
        ALKARCEKKE  FDD DLGESPFTSDI+EAPIPPKFKTPTM PYDGSKDPKDYVE                     IALTGSARLW RRLPA SISTYSQ
Subjt:  ALKARCEKKECSFDDGDLGESPFTSDILEAPIPPKFKTPTMNPYDGSKDPKDYVE---------------------IALTGSARLWYRRLPASSISTYSQ

Query:  LRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEARATFAEVLQKAKKVIDGQELLRTK
        LRKEFI QFS  HY RK ATHLA IRQKE                                   DETLTVKLGEEA ATFAEVLQ AKKVIDGQELLRTK
Subjt:  LRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEARATFAEVLQKAKKVIDGQELLRTK

Query:  TGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKCNKDKYCRFHR
        T RPEKQIDQK+LSQ+KRK DSKSKDKGSSSSGSRTEYRRSE G +RSRPYER                                               
Subjt:  TGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKCNKDKYCRFHR

Query:  DHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFGGPSGGQSGNKRKELAREARHEVCIIREQKPTCSITF
                CWELKRQIEDLIQD YFKKFVGKPR NSVEKKEERKRSRTPPRREDR AVINTIFGGPSGGQ  NKRKELA EAR +V IIREQKPTCSITF
Subjt:  DHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFGGPSGGQSGNKRKELAREARHEVCIIREQKPTCSITF

Query:  GDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDFPVTIGGNLIYTLNPNLGKGEL
         DTDLEGVHLP+ND LVIAPLIDHVLVRRVLVDGGAS NILSLPTYLAL  TRSQLKKSPTPLVGFS ESVSPEGCID PVTIG +              
Subjt:  GDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDFPVTIGGNLIYTLNPNLGKGEL

Query:  GFDLGLLPSITCDRARLHESLIFLKVEKIPTPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDHLSRVGKRQFS
           +  +        RL  + IF +      PIIHSF+AVPS LHQVLKYSTPNGVGTVRGEQKTSRECYASALK SSVCALEEQTSQD L R  K    
Subjt:  GFDLGLLPSITCDRARLHESLIFLKVEKIPTPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDHLSRVGKRQFS

Query:  PPTEELELVPLLS
               L P L+
Subjt:  PPTEELELVPLLS

A0A6J1DZB9 uncharacterized protein LOC1110249048.2e-21272.18Show/hide
Query:  EIALTGSARLWYRRLPASSISTYSQLRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEE
        +IALTGSARLWYRRLPA SISTYSQLRKEFI+QFSS HY RK ATHLA IRQKE ETLREYVT FQEEQLKVAHCSDDSAMCYFLT LADETLTVKLGEE
Subjt:  EIALTGSARLWYRRLPASSISTYSQLRKEFINQFSSGHYGRKIATHLAAIRQKEGETLREYVTMFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEE

Query:  ARATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEILTNIEESGMEK
        A  TF EVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKS+DKGSSSS SRTEYRR E G +RSRPYERYT +TIPISEILTNIEESGMEK
Subjt:  ARATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIGHNRSRPYERYTPTTIPISEILTNIEESGMEK

Query:  LLKRPEKLRGDPEKCNKDKYCRFHRDHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFGGPSGGQSGNKR
        LLKRPEKLRGD EK NK+KYCRFHRDH HNTT+CWELKRQIEDLIQDGYFKKFVGKPR NSVEKKEERKRSRTPPRREDR AVINTIFGGP+GGQSGNKR
Subjt:  LLKRPEKLRGDPEKCNKDKYCRFHRDHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRREDRSAVINTIFGGPSGGQSGNKR

Query:  KELAREARHEVCIIREQKPTCSITFGDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEG
        KELAREAR EVCIIRE KPTCSITFGD DLEGVHLP+ND LVIA LIDH LVRRVL+DG                                        G
Subjt:  KELAREARHEVCIIREQKPTCSITFGDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEG

Query:  CIDFPVTIGGNLIYTLNPNLGKGELGFDLGLLPSITCDRARLHESLIFLKVEKIPTPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALK
        CID PVTIG +                   +   +  D    + ++          PIIHSFRAVPSTLHQVLKYSTPN VG VRGEQKTSRECYASALK
Subjt:  CIDFPVTIGGNLIYTLNPNLGKGELGFDLGLLPSITCDRARLHESLIFLKVEKIPTPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALK

Query:  GSSVCALEEQT-------SQDHLSRVGKRQFSPPTEELELVPLLSPEKQTDLARSVPVEILDSPSILE
        GS+VCALEEQT       S+  L + GKRQF PPTEELELVPLLSPE+Q +  +   V  +++P  L+
Subjt:  GSSVCALEEQT-------SQDHLSRVGKRQFSPPTEELELVPLLSPEKQTDLARSVPVEILDSPSILE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGTCAAGGTCCACTCTAGTGTTCAGGACGGAACCGAAGACCGGGTTCGAGCTCGATTCGTGAAGAACCGTTGTCCAGCAAACTCTGCTAACACGACAGAACGGAG
GGGTTTGAATGCTGATAATGGCACTCAGCGAGACCTTGATGCGAGAATGGTCGAGGACCAGGTCAACCGAGGCCGAGGTGGGACCTCGAGAAAGACCTCCCGAAGGGCCA
ACCAGGTGGCAGACCCTGAAGCTTTGTCTACTCTCCAGCGCGAGTTGGATGATATGTGCCATCGGTTGCGCACAATGGAAAAAATGTACGCCGAGGCGACGCGGGCTAAC
CGAACAGCGTCTCCCTCCAGGGTCCCAGGCGCACCCAGAGAGAAGGAAGGTCGGGTTCCATCTTTCCACCCTGGCGACCGCGAGCCCGTTCCGAATAATGAGGGGGTGGA
TTATAGCTTGCGGGATAACGATCTGAGAAAGCACCTCACTTATAAGAAGAAGAGAGCATCTCGGGAGTCGGAAGACTCTCCGTCTTACTCTCGAGAGTTCTCCAATTCTG
ACCTCAAGGCTCAATCAAAATATAAGCCTCTAACACCGGAAGCTGTGATCACCAGGGAAGAGTTCGACCTGATGAAACACAGGTTCGACGAGCAGGTCGAGGCGCTCAAG
GCCAGGTGCGAGAAGAAAGAGTGCTCGTTTGACGATGGCGACTTGGGAGAATCTCCATTCACCTCGGACATCCTGGAGGCCCCAATTCCTCCAAAGTTCAAAACTCCCAC
TATGAATCCTTATGATGGGTCTAAGGACCCGAAGGATTATGTTGAGATCGCTCTCACCGGCAGCGCGCGCCTGTGGTACCGGAGACTGCCGGCCAGTTCGATCTCGACCT
ACTCCCAGCTGAGGAAAGAATTCATTAACCAATTCTCCTCTGGTCATTATGGTAGAAAGATAGCGACTCACCTCGCCGCCATCAGGCAGAAGGAGGGAGAGACGCTGAGA
GAGTACGTCACGATGTTCCAGGAGGAGCAGCTGAAGGTCGCGCACTGCTCCGATGATTCGGCCATGTGCTACTTCCTCACCGGCCTGGCCGACGAGACTCTCACCGTTAA
GCTCGGAGAGGAGGCTCGAGCAACCTTCGCCGAAGTTCTGCAAAAGGCGAAGAAAGTCATCGATGGCCAAGAACTCCTCCGAACCAAGACTGGCCGACCTGAAAAGCAAA
TCGACCAGAAGAAGCTAAGTCAAGAGAAGAGGAAGGCTGATTCTAAGTCTAAGGACAAGGGATCGTCCTCTTCTGGCAGTAGAACCGAGTATCGTCGGTCGGAGATCGGC
CATAATCGAAGCCGACCTTACGAGCGATATACTCCAACCACCATCCCCATCTCTGAGATACTTACGAACATCGAGGAAAGCGGGATGGAAAAGCTCCTCAAGCGACCTGA
GAAGCTCCGAGGAGACCCAGAAAAATGCAATAAAGATAAATATTGCCGTTTTCATCGCGATCACAACCATAATACAACAAATTGCTGGGAGCTGAAGCGCCAGATTGAAG
ACCTCATTCAAGATGGCTACTTCAAAAAGTTTGTTGGGAAACCGAGGTATAACTCGGTCGAAAAGAAAGAAGAGAGAAAGCGTTCAAGAACGCCGCCTCGACGAGAAGAC
CGATCTGCGGTCATCAACACTATTTTCGGAGGCCCAAGTGGTGGCCAGTCCGGAAACAAAAGAAAAGAGCTAGCTCGCGAGGCCAGACACGAGGTATGCATCATCAGGGA
GCAGAAACCTACTTGCTCCATCACCTTCGGCGATACCGATCTGGAAGGAGTCCACTTGCCTTATAATGATACGCTTGTGATCGCTCCCCTTATCGATCATGTCCTGGTCA
GAAGAGTGTTGGTAGATGGAGGCGCGTCTACCAACATCTTGTCCCTCCCGACATATCTTGCCTTGGGGTGGACCAGGTCTCAGTTGAAGAAAAGTCCAACACCCTTGGTT
GGATTTTCAGGAGAATCGGTCTCTCCTGAAGGGTGCATTGACTTTCCGGTCACAATCGGTGGGAACTTGATTTATACACTAAATCCCAACCTTGGGAAAGGTGAGTTAGG
ATTCGACCTTGGATTACTTCCGAGTATTACTTGTGACCGTGCGCGCTTGCACGAGTCTCTAATATTTTTAAAGGTAGAGAAAATCCCAACACCCATCATCCACTCATTCC
GGGCCGTCCCCTCCACACTTCATCAAGTCCTGAAGTACTCAACCCCTAATGGGGTCGGCACGGTCCGAGGTGAGCAAAAGACTTCAAGGGAGTGCTATGCATCCGCGCTC
AAAGGATCGTCAGTATGTGCCCTGGAAGAGCAAACCAGTCAAGACCACCTTTCGAGGGTAGGCAAAAGGCAGTTCTCCCCACCAACAGAGGAGCTCGAGCTTGTTCCCTT
GCTTAGCCCTGAAAAACAAACCGACTTGGCTAGATCGGTCCCGGTCGAGATCTTGGACAGTCCTTCAATCTTAGAGCCAGATGTGATGGAGGTTGACACTCCATCACCCT
CTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAATCCACTGCAAGATCCGAAGGAGCAAAAGAAGATGGCACGGAGAGCAGCTCGGTTCACACTCCGAGAAGGAGCG
TTGTACCGACGTGGCTTCTCCCTGCCTCTGCTTAAGTGTGTGACTCCCGAAGAAGAATGGCCACATTACAATGCCCGAGTTCGACCTCGAAGCTTCCAAGTTGAACATTT
GGTCTTAAGGAAAATTCAAAGTCATGTGGGCACCCTTGACCCAAGTTGGGAGGGACCGTTCGAGGTTAAAGGCATAGTCCGACCTGGAACGTATATGCTGGCCGAACTGG
AAGGAAGAGTGCTTGCGCATCCATGGAACGCAGAGCACTTGAAGCGCTATTACCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCGTCAAGGTCCACTCTAGTGTTCAGGACGGAACCGAAGACCGGGTTCGAGCTCGATTCGTGAAGAACCGTTGTCCAGCAAACTCTGCTAACACGACAGAACGGAG
GGGTTTGAATGCTGATAATGGCACTCAGCGAGACCTTGATGCGAGAATGGTCGAGGACCAGGTCAACCGAGGCCGAGGTGGGACCTCGAGAAAGACCTCCCGAAGGGCCA
ACCAGGTGGCAGACCCTGAAGCTTTGTCTACTCTCCAGCGCGAGTTGGATGATATGTGCCATCGGTTGCGCACAATGGAAAAAATGTACGCCGAGGCGACGCGGGCTAAC
CGAACAGCGTCTCCCTCCAGGGTCCCAGGCGCACCCAGAGAGAAGGAAGGTCGGGTTCCATCTTTCCACCCTGGCGACCGCGAGCCCGTTCCGAATAATGAGGGGGTGGA
TTATAGCTTGCGGGATAACGATCTGAGAAAGCACCTCACTTATAAGAAGAAGAGAGCATCTCGGGAGTCGGAAGACTCTCCGTCTTACTCTCGAGAGTTCTCCAATTCTG
ACCTCAAGGCTCAATCAAAATATAAGCCTCTAACACCGGAAGCTGTGATCACCAGGGAAGAGTTCGACCTGATGAAACACAGGTTCGACGAGCAGGTCGAGGCGCTCAAG
GCCAGGTGCGAGAAGAAAGAGTGCTCGTTTGACGATGGCGACTTGGGAGAATCTCCATTCACCTCGGACATCCTGGAGGCCCCAATTCCTCCAAAGTTCAAAACTCCCAC
TATGAATCCTTATGATGGGTCTAAGGACCCGAAGGATTATGTTGAGATCGCTCTCACCGGCAGCGCGCGCCTGTGGTACCGGAGACTGCCGGCCAGTTCGATCTCGACCT
ACTCCCAGCTGAGGAAAGAATTCATTAACCAATTCTCCTCTGGTCATTATGGTAGAAAGATAGCGACTCACCTCGCCGCCATCAGGCAGAAGGAGGGAGAGACGCTGAGA
GAGTACGTCACGATGTTCCAGGAGGAGCAGCTGAAGGTCGCGCACTGCTCCGATGATTCGGCCATGTGCTACTTCCTCACCGGCCTGGCCGACGAGACTCTCACCGTTAA
GCTCGGAGAGGAGGCTCGAGCAACCTTCGCCGAAGTTCTGCAAAAGGCGAAGAAAGTCATCGATGGCCAAGAACTCCTCCGAACCAAGACTGGCCGACCTGAAAAGCAAA
TCGACCAGAAGAAGCTAAGTCAAGAGAAGAGGAAGGCTGATTCTAAGTCTAAGGACAAGGGATCGTCCTCTTCTGGCAGTAGAACCGAGTATCGTCGGTCGGAGATCGGC
CATAATCGAAGCCGACCTTACGAGCGATATACTCCAACCACCATCCCCATCTCTGAGATACTTACGAACATCGAGGAAAGCGGGATGGAAAAGCTCCTCAAGCGACCTGA
GAAGCTCCGAGGAGACCCAGAAAAATGCAATAAAGATAAATATTGCCGTTTTCATCGCGATCACAACCATAATACAACAAATTGCTGGGAGCTGAAGCGCCAGATTGAAG
ACCTCATTCAAGATGGCTACTTCAAAAAGTTTGTTGGGAAACCGAGGTATAACTCGGTCGAAAAGAAAGAAGAGAGAAAGCGTTCAAGAACGCCGCCTCGACGAGAAGAC
CGATCTGCGGTCATCAACACTATTTTCGGAGGCCCAAGTGGTGGCCAGTCCGGAAACAAAAGAAAAGAGCTAGCTCGCGAGGCCAGACACGAGGTATGCATCATCAGGGA
GCAGAAACCTACTTGCTCCATCACCTTCGGCGATACCGATCTGGAAGGAGTCCACTTGCCTTATAATGATACGCTTGTGATCGCTCCCCTTATCGATCATGTCCTGGTCA
GAAGAGTGTTGGTAGATGGAGGCGCGTCTACCAACATCTTGTCCCTCCCGACATATCTTGCCTTGGGGTGGACCAGGTCTCAGTTGAAGAAAAGTCCAACACCCTTGGTT
GGATTTTCAGGAGAATCGGTCTCTCCTGAAGGGTGCATTGACTTTCCGGTCACAATCGGTGGGAACTTGATTTATACACTAAATCCCAACCTTGGGAAAGGTGAGTTAGG
ATTCGACCTTGGATTACTTCCGAGTATTACTTGTGACCGTGCGCGCTTGCACGAGTCTCTAATATTTTTAAAGGTAGAGAAAATCCCAACACCCATCATCCACTCATTCC
GGGCCGTCCCCTCCACACTTCATCAAGTCCTGAAGTACTCAACCCCTAATGGGGTCGGCACGGTCCGAGGTGAGCAAAAGACTTCAAGGGAGTGCTATGCATCCGCGCTC
AAAGGATCGTCAGTATGTGCCCTGGAAGAGCAAACCAGTCAAGACCACCTTTCGAGGGTAGGCAAAAGGCAGTTCTCCCCACCAACAGAGGAGCTCGAGCTTGTTCCCTT
GCTTAGCCCTGAAAAACAAACCGACTTGGCTAGATCGGTCCCGGTCGAGATCTTGGACAGTCCTTCAATCTTAGAGCCAGATGTGATGGAGGTTGACACTCCATCACCCT
CTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAATCCACTGCAAGATCCGAAGGAGCAAAAGAAGATGGCACGGAGAGCAGCTCGGTTCACACTCCGAGAAGGAGCG
TTGTACCGACGTGGCTTCTCCCTGCCTCTGCTTAAGTGTGTGACTCCCGAAGAAGAATGGCCACATTACAATGCCCGAGTTCGACCTCGAAGCTTCCAAGTTGAACATTT
GGTCTTAAGGAAAATTCAAAGTCATGTGGGCACCCTTGACCCAAGTTGGGAGGGACCGTTCGAGGTTAAAGGCATAGTCCGACCTGGAACGTATATGCTGGCCGAACTGG
AAGGAAGAGTGCTTGCGCATCCATGGAACGCAGAGCACTTGAAGCGCTATTACCCTTGA
Protein sequenceShow/hide protein sequence
MGVKVHSSVQDGTEDRVRARFVKNRCPANSANTTERRGLNADNGTQRDLDARMVEDQVNRGRGGTSRKTSRRANQVADPEALSTLQRELDDMCHRLRTMEKMYAEATRAN
RTASPSRVPGAPREKEGRVPSFHPGDREPVPNNEGVDYSLRDNDLRKHLTYKKKRASRESEDSPSYSREFSNSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALK
ARCEKKECSFDDGDLGESPFTSDILEAPIPPKFKTPTMNPYDGSKDPKDYVEIALTGSARLWYRRLPASSISTYSQLRKEFINQFSSGHYGRKIATHLAAIRQKEGETLR
EYVTMFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEARATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSKDKGSSSSGSRTEYRRSEIG
HNRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDHNHNTTNCWELKRQIEDLIQDGYFKKFVGKPRYNSVEKKEERKRSRTPPRRED
RSAVINTIFGGPSGGQSGNKRKELAREARHEVCIIREQKPTCSITFGDTDLEGVHLPYNDTLVIAPLIDHVLVRRVLVDGGASTNILSLPTYLALGWTRSQLKKSPTPLV
GFSGESVSPEGCIDFPVTIGGNLIYTLNPNLGKGELGFDLGLLPSITCDRARLHESLIFLKVEKIPTPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASAL
KGSSVCALEEQTSQDHLSRVGKRQFSPPTEELELVPLLSPEKQTDLARSVPVEILDSPSILEPDVMEVDTPSPSWMDPIVEFIKGNPLQDPKEQKKMARRAARFTLREGA
LYRRGFSLPLLKCVTPEEEWPHYNARVRPRSFQVEHLVLRKIQSHVGTLDPSWEGPFEVKGIVRPGTYMLAELEGRVLAHPWNAEHLKRYYP