CuGenDBv2

Moc08g34030 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g34030
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr8:24815001..24822090
RNA-Seq Expression: Moc08g34030
Synteny: Moc08g34030
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
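
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not something this record states), the string can be split into its parts and the locus span computed as follows. The names `Region` and `parse_region` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count toward the span.
        return self.end - self.start + 1


def parse_region(text: str) -> Region:
    """Parse a 'chrom:start..end' string into a Region (hypothetical helper)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized region string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Region(chrom, start, end)


region = parse_region("chr8:24815001..24822090")
print(region.chrom, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption, the locus of Moc08g34030 spans 7090 bp on chr8.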


Homology
GenBank top hits | e value | % identity | Alignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] | e value: 1.1e-217 | % identity: 75.09
Query:  LKAQSKYKPMAPETVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P  P  VITREEFD ++ + D QVEALKA+CE+K+   +DGDLGESPFT+D+LEAPIPPKFK P +KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPMAPETVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGEMLREYVTRFQEEQLKVVHYSHDSAMCYFLTGLADET
        DAIKCRAF+IALTGSARLWYRRLPA SISTYSQLR+EF++ FSSRHYD+KTATHLATIRQKEGE LREYVTRFQEEQLKV H S DSAMCYFLTGLADE 
Subjt:  DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGEMLREYVTRFQEEQLKVVHYSHDSAMCYFLTGLADET

Query:  LTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRTDVKSKDKGSSSSSGRTEYRRSEGGSIRNRPYERYTPTTIPVFEILTN
        LTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPE++I + +  ++    D KSKDKG S SSGR EYRR+E G  R+RPYER+TPTTIP+ EILTN
Subjt:  LTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRTDVKSKDKGSSSSSGRTEYRRSEGGSIRNRPYERYTPTTIPVFEILTN

Query:  IEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRQNDRPAVINTIFRGPS
        IEESGMEKLLKR EKLRG PE+R+KDKYCRFHR++GHNT+  WELKRQIE+LIQDGYFKKFVGKPR++S EKKEERKRSRTPPR+ DRPAVINTIF GPS
Subjt:  IEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRQNDRPAVINTIFRGPS

Query:  GGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVL-------------------------------KSPTPLVGFS
        GGQSG KRKELAR AR EVCIIREQ+PTC ITF  ADLE +HLPHNDALVIAPLIDHV+                               KSPTPLVGFS
Subjt:  GGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVL-------------------------------KSPTPLVGFS

Query:  GESVSPEGCINLPVTIGQDATQITQMAEFV
        GESV PEG I+LPVT+GQD TQ+TQMAEFV
Subjt:  GESVSPEGCINLPVTIGQDATQITQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] | e value: 3.2e-214 | % identity: 65.67
Query:  NSSLKAQSKYKPMAPETVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQ
        +S+ +A+S + P  P+ VITREEFD ++ K + QVEALKA+CE+K+   +DGDLGESPFT+D+LEA        P +K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSSLKAQSKYKPMAPETVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGEMLREYVTRFQEEQLKVVHYSHDSAMCYFLTGLA
        AA+DAIKCRAFQIALTGSARLW                                                     FQE+QLKV   S DSAMCYFLTGLA
Subjt:  AATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGEMLREYVTRFQEEQLKVVHYSHDSAMCYFLTGLA

Query:  DETLTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRTDVKSKDKGSSSSSGRTEYRRSEGGSIRNRPYERYTPTTIPVFEI
        DE LTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPE+ ID+ + S +  + D+KSKDKG S SSGR E+RR+  G  R+RPYER+TPTTIP+ EI
Subjt:  DETLTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRTDVKSKDKGSSSSSGRTEYRRSEGGSIRNRPYERYTPTTIPVFEI

Query:  LTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRQNDRPAVINTIFR
        LTNIEESGMEKLLKR EKLRG PE+RNKDKYCRFHR++ HNT+  WELKRQIEDLIQD YFKKFVGKPR++S EKKEERK SRTP R+ DRPAVINTIF 
Subjt:  LTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRQNDRPAVINTIFR

Query:  GPSGGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVL-------------------------------KSPTPLV
        GPSGGQSG+KRKELAR AR EVCIIREQ+PTC ITF  ADLE +HLPHNDALVIAPLIDHV+                               KS TPLV
Subjt:  GPSGGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVL-------------------------------KSPTPLV

Query:  GFSGESVSPEGCINLPVTIGQDATQITQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALKE
        GFS ESV PEGCI+LPVT+G D TQ+TQMAEFVV+DG+SAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ  SRECYASALKGSSVCAL+ 
Subjt:  GFSGESVSPEGCINLPVTIGQDATQITQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALKE

Query:  QTNRGKQQESETDLPKEGKRQFSLPTEELELVPLL
          +R    E + +LP   +R+F+ PTEELELVPLL
Subjt:  QTNRGKQQESETDLPKEGKRQFSLPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] | e value: 9.0e-249 | % identity: 64.89
Query:  SNSANTTEQRGVNADNGPQQDLGARIVEDHVRAGQEGDLPRRSARHANQELPLAHPKPSKANRGRGGTSRKTSQRANQEADPEALSTLQRELDDMRHRLS
        +NS NT ++R + A++G Q+++GA +VE         +   RSAR     LP AHPKPS                                         
Subjt:  SNSANTTEQRGVNADNGPQQDLGARIVEDHVRAGQEGDLPRRSARHANQELPLAHPKPSKANRGRGGTSRKTSQRANQEADPEALSTLQRELDDMRHRLS

Query:  TMEEMYAEATRANRTDLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSSLKAQSKYKPMAPETVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDG
                                                            KA+S Y P+ P  VITREEFD +K KFD QVEALKARCEKK+ SFDDG
Subjt:  TMEEMYAEATRANRTDLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSSLKAQSKYKPMAPETVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDG

Query:  DLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDR
        DLGE  F++DILEA IPPKFKTP MKPYDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIALTGSARLWYRRLPAR ISTYSQLRKEFISQFSSRHYDR
Subjt:  DLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDR

Query:  KTATHLATIRQKEGEMLREYVTRFQEEQLKVVHYSHDSAMCYFLTGLADETLTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQE
        KT THLATIRQKEGE LREYVTRF EEQLKV H S DSAMCYFLTGLADETLTVKL +EAPATFAEVLQK KKVIDGQELLRTKTGRPEK IDQ +  ++
Subjt:  KTATHLATIRQKEGEMLREYVTRFQEEQLKVVHYSHDSAMCYFLTGLADETLTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQE

Query:  RRRTDVKSKDKGSSSSSGRTEYRRSEGGSIRNRPYERYTPTTIPVFEILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQI
        + + D KS+DKG SSSS R +YRRS     ++RPYE YTPTTIP+FEILTNIEE+GMEKLLKR EKLRGDPEKRN DKYCRFHRD+GHNT++ WELKRQI
Subjt:  RRRTDVKSKDKGSSSSSGRTEYRRSEGGSIRNRPYERYTPTTIPVFEILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQI

Query:  EDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRQNDRPAVINTIFRGPSGGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDAL
        EDLIQDGYFKKFVGKPRSNSVEKKEERKR RTPPR++DRPAVI             NK+KELAREAR EVCIIREQ+PT SI F  ADLEG+HLPHNDAL
Subjt:  EDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRQNDRPAVINTIFRGPSGGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDAL

Query:  VIAPLIDHVL-------------------------------KSPTPLVGFSGESVSPEGCINLPVTIGQDATQITQMAEFVVVDGKSAYNAIFGRPIIHS
        VIAPLID VL                               KSPTPLVGFSGES+S EGCI+LPV+I QD TQ+TQMAEFVV+DG+SAYNAIFGRPIIHS
Subjt:  VIAPLIDHVL-------------------------------KSPTPLVGFSGESVSPEGCINLPVTIGQDATQITQMAEFVVVDGKSAYNAIFGRPIIHS

Query:  FRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALKEQTNR
        FRAVPSTLHQVLKYST NGVGTVRGE KTSRECYAS  K SSVCAL+EQT R
Subjt:  FRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALKEQTNR

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia] | e value: 5.7e-235 | % identity: 69.01
Query:  LRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSSLKAQSKYKPMAPETVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPI
        LRDNDLRKHLT+KKK+AS EPEDS SYSREFSNS+LKAQSKYKP+ PE VI REEFDLMKH+FDEQVEALKARCEKK+  FDD DLGESPFT+DI+EAPI
Subjt:  LRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSSLKAQSKYKPMAPETVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPI

Query:  PPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGEM
        PPKFKTP MKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW RRLPARSISTYSQLRKEFI QFS RHYDRKTATHLATIRQKE   
Subjt:  PPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGEM

Query:  LREYVTRFQEEQLKVVHYSHDSAMCYFLTGLADETLTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRTDVKSKDKGSSSS
                                        DETLTVKLG+EAPATFAEVLQ AKKVIDGQELLRTKT RPEKQIDQK+LSQ++R+ D KSKDKGSSSS
Subjt:  LREYVTRFQEEQLKVVHYSHDSAMCYFLTGLADETLTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRTDVKSKDKGSSSS

Query:  SGRTEYRRSEGGSIRNRPYERYTPTTIPVFEILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIEDLIQDGYFKKFVGKP
          RTEYRRSE G  R+RPYER                                                       CWELKRQIEDLIQD YFKKFVGKP
Subjt:  SGRTEYRRSEGGSIRNRPYERYTPTTIPVFEILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIEDLIQDGYFKKFVGKP

Query:  RSNSVEKKEERKRSRTPPRQNDRPAVINTIFRGPSGGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVL------
        RSNSVEKKEERKRSRTPPR+ DRPAVINTIF GPSGGQ  NKRKELA EAR +V IIREQKPTCSITF D DLEG+HLPHNDALVIAPLIDHVL      
Subjt:  RSNSVEKKEERKRSRTPPRQNDRPAVINTIFRGPSGGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVL------

Query:  -------------------------KSPTPLVGFSGESVSPEGCINLPVTIGQDATQITQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYST
                                 KSPTPLVGFS ESVSPEGCI+LPVTIGQD+TQ+TQMAEFVV+DG+ AYNAIF RPIIHSF+AVPS LHQVLKYST
Subjt:  -------------------------KSPTPLVGFSGESVSPEGCINLPVTIGQDATQITQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYST

Query:  PNGVGTVRGEQKTSRECYASALKGSSVCALKEQTNRGKQQESETDLPKEGKRQFSLPTEELELVPLLS
        PNGVGTVRGEQKTSRECYASALK SSVCAL+EQT       S+ DLP+E K           L P L+
Subjt:  PNGVGTVRGEQKTSRECYASALKGSSVCALKEQTNRGKQQESETDLPKEGKRQFSLPTEELELVPLLS

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia] | e value: 8.7e-244 | % identity: 86.74
Query:  MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGEMLREYVTRFQEEQLKVVHYSHDSAMCYFL
        MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSS HYDRKTATHLATIRQKE E LREYVTRFQEEQLKV H S DSAMCYFL
Subjt:  MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGEMLREYVTRFQEEQLKVVHYSHDSAMCYFL

Query:  TGLADETLTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRTDVKSKDKGSSSSSGRTEYRRSEGGSIRNRPYERYTPTTIP
        T LADETLTVKLG+EAP TF EVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQE+R+ D KS+DKGSSSS+ RTEYRR E G  R+RPYERYT +TIP
Subjt:  TGLADETLTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRTDVKSKDKGSSSSSGRTEYRRSEGGSIRNRPYERYTPTTIP

Query:  VFEILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRQNDRPAVIN
        + EILTNIEESGMEKLLKR EKLRGD EKRNK+KYCRFHRD+GHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPR+ DRPAVIN
Subjt:  VFEILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRQNDRPAVIN

Query:  TIFRGPSGGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLKSPTPLVGFSGESVSPEGCINLPVTIGQDATQIT
        TIF GP+GGQSGNKRKELAREAR EVCIIRE KPTCSITFGDADLEG+HLPHNDALVIA LIDH L     + G         GCI+LPVTIGQDATQ+T
Subjt:  TIFRGPSGGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLKSPTPLVGFSGESVSPEGCINLPVTIGQDATQIT

Query:  QMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALKEQTNRGKQQESETDLPKEGKRQFSLPTE
        QMAEFVV+DG+SAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPN VG VRGEQKTSRECYASALKGS+VCAL+EQTNRGK QESE DLPKEGKRQF  PTE
Subjt:  QMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALKEQTNRGKQQESETDLPKEGKRQFSLPTE

Query:  ELELVPLLSPEKQ
        ELELVPLLSPE+Q
Subjt:  ELELVPLLSPEKQ

TrEMBL top hits | e value | % identity | Alignment
A0A6J1C7X5 uncharacterized protein LOC111008813 | e value: 5.2e-218 | % identity: 75.09
Query:  LKAQSKYKPMAPETVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P  P  VITREEFD ++ + D QVEALKA+CE+K+   +DGDLGESPFT+D+LEAPIPPKFK P +KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPMAPETVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGEMLREYVTRFQEEQLKVVHYSHDSAMCYFLTGLADET
        DAIKCRAF+IALTGSARLWYRRLPA SISTYSQLR+EF++ FSSRHYD+KTATHLATIRQKEGE LREYVTRFQEEQLKV H S DSAMCYFLTGLADE 
Subjt:  DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGEMLREYVTRFQEEQLKVVHYSHDSAMCYFLTGLADET

Query:  LTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRTDVKSKDKGSSSSSGRTEYRRSEGGSIRNRPYERYTPTTIPVFEILTN
        LTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPE++I + +  ++    D KSKDKG S SSGR EYRR+E G  R+RPYER+TPTTIP+ EILTN
Subjt:  LTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRTDVKSKDKGSSSSSGRTEYRRSEGGSIRNRPYERYTPTTIPVFEILTN

Query:  IEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRQNDRPAVINTIFRGPS
        IEESGMEKLLKR EKLRG PE+R+KDKYCRFHR++GHNT+  WELKRQIE+LIQDGYFKKFVGKPR++S EKKEERKRSRTPPR+ DRPAVINTIF GPS
Subjt:  IEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRQNDRPAVINTIFRGPS

Query:  GGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVL-------------------------------KSPTPLVGFS
        GGQSG KRKELAR AR EVCIIREQ+PTC ITF  ADLE +HLPHNDALVIAPLIDHV+                               KSPTPLVGFS
Subjt:  GGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVL-------------------------------KSPTPLVGFS

Query:  GESVSPEGCINLPVTIGQDATQITQMAEFV
        GESV PEG I+LPVT+GQD TQ+TQMAEFV
Subjt:  GESVSPEGCINLPVTIGQDATQITQMAEFV

A0A6J1D9E1 uncharacterized protein LOC111018823 | e value: 1.6e-214 | % identity: 65.67
Query:  NSSLKAQSKYKPMAPETVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQ
        +S+ +A+S + P  P+ VITREEFD ++ K + QVEALKA+CE+K+   +DGDLGESPFT+D+LEA        P +K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSSLKAQSKYKPMAPETVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGEMLREYVTRFQEEQLKVVHYSHDSAMCYFLTGLA
        AA+DAIKCRAFQIALTGSARLW                                                     FQE+QLKV   S DSAMCYFLTGLA
Subjt:  AATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGEMLREYVTRFQEEQLKVVHYSHDSAMCYFLTGLA

Query:  DETLTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRTDVKSKDKGSSSSSGRTEYRRSEGGSIRNRPYERYTPTTIPVFEI
        DE LTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPE+ ID+ + S +  + D+KSKDKG S SSGR E+RR+  G  R+RPYER+TPTTIP+ EI
Subjt:  DETLTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRTDVKSKDKGSSSSSGRTEYRRSEGGSIRNRPYERYTPTTIPVFEI

Query:  LTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRQNDRPAVINTIFR
        LTNIEESGMEKLLKR EKLRG PE+RNKDKYCRFHR++ HNT+  WELKRQIEDLIQD YFKKFVGKPR++S EKKEERK SRTP R+ DRPAVINTIF 
Subjt:  LTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRQNDRPAVINTIFR

Query:  GPSGGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVL-------------------------------KSPTPLV
        GPSGGQSG+KRKELAR AR EVCIIREQ+PTC ITF  ADLE +HLPHNDALVIAPLIDHV+                               KS TPLV
Subjt:  GPSGGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVL-------------------------------KSPTPLV

Query:  GFSGESVSPEGCINLPVTIGQDATQITQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALKE
        GFS ESV PEGCI+LPVT+G D TQ+TQMAEFVV+DG+SAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ  SRECYASALKGSSVCAL+ 
Subjt:  GFSGESVSPEGCINLPVTIGQDATQITQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALKE

Query:  QTNRGKQQESETDLPKEGKRQFSLPTEELELVPLL
          +R    E + +LP   +R+F+ PTEELELVPLL
Subjt:  QTNRGKQQESETDLPKEGKRQFSLPTEELELVPLL

A0A6J1DHB3 uncharacterized protein LOC111020479 | e value: 4.3e-249 | % identity: 64.89
Query:  SNSANTTEQRGVNADNGPQQDLGARIVEDHVRAGQEGDLPRRSARHANQELPLAHPKPSKANRGRGGTSRKTSQRANQEADPEALSTLQRELDDMRHRLS
        +NS NT ++R + A++G Q+++GA +VE         +   RSAR     LP AHPKPS                                         
Subjt:  SNSANTTEQRGVNADNGPQQDLGARIVEDHVRAGQEGDLPRRSARHANQELPLAHPKPSKANRGRGGTSRKTSQRANQEADPEALSTLQRELDDMRHRLS

Query:  TMEEMYAEATRANRTDLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSSLKAQSKYKPMAPETVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDG
                                                            KA+S Y P+ P  VITREEFD +K KFD QVEALKARCEKK+ SFDDG
Subjt:  TMEEMYAEATRANRTDLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSSLKAQSKYKPMAPETVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDG

Query:  DLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDR
        DLGE  F++DILEA IPPKFKTP MKPYDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIALTGSARLWYRRLPAR ISTYSQLRKEFISQFSSRHYDR
Subjt:  DLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDR

Query:  KTATHLATIRQKEGEMLREYVTRFQEEQLKVVHYSHDSAMCYFLTGLADETLTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQE
        KT THLATIRQKEGE LREYVTRF EEQLKV H S DSAMCYFLTGLADETLTVKL +EAPATFAEVLQK KKVIDGQELLRTKTGRPEK IDQ +  ++
Subjt:  KTATHLATIRQKEGEMLREYVTRFQEEQLKVVHYSHDSAMCYFLTGLADETLTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQE

Query:  RRRTDVKSKDKGSSSSSGRTEYRRSEGGSIRNRPYERYTPTTIPVFEILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQI
        + + D KS+DKG SSSS R +YRRS     ++RPYE YTPTTIP+FEILTNIEE+GMEKLLKR EKLRGDPEKRN DKYCRFHRD+GHNT++ WELKRQI
Subjt:  RRRTDVKSKDKGSSSSSGRTEYRRSEGGSIRNRPYERYTPTTIPVFEILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQI

Query:  EDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRQNDRPAVINTIFRGPSGGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDAL
        EDLIQDGYFKKFVGKPRSNSVEKKEERKR RTPPR++DRPAVI             NK+KELAREAR EVCIIREQ+PT SI F  ADLEG+HLPHNDAL
Subjt:  EDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRQNDRPAVINTIFRGPSGGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDAL

Query:  VIAPLIDHVL-------------------------------KSPTPLVGFSGESVSPEGCINLPVTIGQDATQITQMAEFVVVDGKSAYNAIFGRPIIHS
        VIAPLID VL                               KSPTPLVGFSGES+S EGCI+LPV+I QD TQ+TQMAEFVV+DG+SAYNAIFGRPIIHS
Subjt:  VIAPLIDHVL-------------------------------KSPTPLVGFSGESVSPEGCINLPVTIGQDATQITQMAEFVVVDGKSAYNAIFGRPIIHS

Query:  FRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALKEQTNR
        FRAVPSTLHQVLKYST NGVGTVRGE KTSRECYAS  K SSVCAL+EQT R
Subjt:  FRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALKEQTNR

A0A6J1DPC9 uncharacterized protein LOC111022280 | e value: 2.7e-235 | % identity: 69.01
Query:  LRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSSLKAQSKYKPMAPETVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPI
        LRDNDLRKHLT+KKK+AS EPEDS SYSREFSNS+LKAQSKYKP+ PE VI REEFDLMKH+FDEQVEALKARCEKK+  FDD DLGESPFT+DI+EAPI
Subjt:  LRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSSLKAQSKYKPMAPETVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPI

Query:  PPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGEM
        PPKFKTP MKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW RRLPARSISTYSQLRKEFI QFS RHYDRKTATHLATIRQKE   
Subjt:  PPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGEM

Query:  LREYVTRFQEEQLKVVHYSHDSAMCYFLTGLADETLTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRTDVKSKDKGSSSS
                                        DETLTVKLG+EAPATFAEVLQ AKKVIDGQELLRTKT RPEKQIDQK+LSQ++R+ D KSKDKGSSSS
Subjt:  LREYVTRFQEEQLKVVHYSHDSAMCYFLTGLADETLTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRTDVKSKDKGSSSS

Query:  SGRTEYRRSEGGSIRNRPYERYTPTTIPVFEILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIEDLIQDGYFKKFVGKP
          RTEYRRSE G  R+RPYER                                                       CWELKRQIEDLIQD YFKKFVGKP
Subjt:  SGRTEYRRSEGGSIRNRPYERYTPTTIPVFEILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIEDLIQDGYFKKFVGKP

Query:  RSNSVEKKEERKRSRTPPRQNDRPAVINTIFRGPSGGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVL------
        RSNSVEKKEERKRSRTPPR+ DRPAVINTIF GPSGGQ  NKRKELA EAR +V IIREQKPTCSITF D DLEG+HLPHNDALVIAPLIDHVL      
Subjt:  RSNSVEKKEERKRSRTPPRQNDRPAVINTIFRGPSGGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVL------

Query:  -------------------------KSPTPLVGFSGESVSPEGCINLPVTIGQDATQITQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYST
                                 KSPTPLVGFS ESVSPEGCI+LPVTIGQD+TQ+TQMAEFVV+DG+ AYNAIF RPIIHSF+AVPS LHQVLKYST
Subjt:  -------------------------KSPTPLVGFSGESVSPEGCINLPVTIGQDATQITQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYST

Query:  PNGVGTVRGEQKTSRECYASALKGSSVCALKEQTNRGKQQESETDLPKEGKRQFSLPTEELELVPLLS
        PNGVGTVRGEQKTSRECYASALK SSVCAL+EQT       S+ DLP+E K           L P L+
Subjt:  PNGVGTVRGEQKTSRECYASALKGSSVCALKEQTNRGKQQESETDLPKEGKRQFSLPTEELELVPLLS

A0A6J1DZB9 uncharacterized protein LOC111024904 | e value: 4.2e-244 | % identity: 86.74
Query:  MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGEMLREYVTRFQEEQLKVVHYSHDSAMCYFL
        MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSS HYDRKTATHLATIRQKE E LREYVTRFQEEQLKV H S DSAMCYFL
Subjt:  MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGEMLREYVTRFQEEQLKVVHYSHDSAMCYFL

Query:  TGLADETLTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRTDVKSKDKGSSSSSGRTEYRRSEGGSIRNRPYERYTPTTIP
        T LADETLTVKLG+EAP TF EVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQE+R+ D KS+DKGSSSS+ RTEYRR E G  R+RPYERYT +TIP
Subjt:  TGLADETLTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRTDVKSKDKGSSSSSGRTEYRRSEGGSIRNRPYERYTPTTIP

Query:  VFEILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRQNDRPAVIN
        + EILTNIEESGMEKLLKR EKLRGD EKRNK+KYCRFHRD+GHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPR+ DRPAVIN
Subjt:  VFEILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRQNDRPAVIN

Query:  TIFRGPSGGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLKSPTPLVGFSGESVSPEGCINLPVTIGQDATQIT
        TIF GP+GGQSGNKRKELAREAR EVCIIRE KPTCSITFGDADLEG+HLPHNDALVIA LIDH L     + G         GCI+LPVTIGQDATQ+T
Subjt:  TIFRGPSGGQSGNKRKELAREARCEVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLKSPTPLVGFSGESVSPEGCINLPVTIGQDATQIT

Query:  QMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALKEQTNRGKQQESETDLPKEGKRQFSLPTE
        QMAEFVV+DG+SAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPN VG VRGEQKTSRECYASALKGS+VCAL+EQTNRGK QESE DLPKEGKRQF  PTE
Subjt:  QMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALKEQTNRGKQQESETDLPKEGKRQFSLPTE

Query:  ELELVPLLSPEKQ
        ELELVPLLSPE+Q
Subjt:  ELELVPLLSPEKQ

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGCGAACGCTCACACAAATTTGGAGTCCTCTGATTCTGCAACGAAGGGGGAAGGTGTAGTTCCCAATTTCCAAGAAGCAGCAACCAATCGCTCTGCTACTTCT
TCATTGGTGATTGATTTGGATGTCCACTCTAGTGTACAGGTCAGAATCGGAGACCGGGTTCGAGCTCGATTCGTGAAGAACCGTTCAAACTCTGCCAATACGACA
GAACAGAGGGGTGTGAACGCTGACAATGGCCCTCAGCAAGACCTCGGTGCAAGAATAGTCGAGGACCATGTTCGAGCAGGGCAAGAGGGAGATTTGCCGCGACGA
TCTGCCCGCCATGCGAACCAAGAGCTACCACTTGCTCACCCGAAACCCTCAAAAGCCAACAGAGGCCGAGGAGGGACGTCGAGAAAAACCTCCCAAAGGGCCAAT
CAGGAAGCAGACCCCGAAGCTCTGTCTACTCTCCAGCGCGAGTTGGATGATATGCGCCATCGGTTGAGCACAATGGAAGAAATGTACGCCGAGGCAACGCGTGCT
AACCGAACTGACTTGCGGGATAACGATTTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCCAGAAGACTCTCCTTCCTACTCCCGAGAATTC
TCCAACTCGAGCCTAAAGGCTCAATCAAAATACAAGCCTATGGCACCAGAAACTGTGATCACGAGGGAAGAGTTCGACCTGATGAAGCACAAGTTCGATGAGCAG
GTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGACTGTTCGTTCGACGATGGCGACTTAGGAGAATCGCCATTCACCGCGGACATCCTGGAGGCTCCAATCCCT
CCGAAGTTCAAGACTCCCGCCATGAAGCCCTATGACGGGTCTAAGGACCCTAAAGACTATGTTGAGGTTTTCGAGGGCCTCATGGACTTTCAAGCGGCGACAGAT
GCGATCAAATGCCGCGCCTTCCAGATAGCGCTTACCGGTAGCGCGCGCCTGTGGTACCGAAGACTGCCGGCCAGGTCGATCTCGACCTACTCCCAGCTGAGGAAA
GAGTTCATCAGTCAGTTCTCCTCTCGGCATTATGATAGAAAGACAGCGACTCACCTTGCCACCATCAGGCAGAAGGAAGGAGAGATGCTGAGAGAATATGTGACA
CGGTTCCAGGAAGAGCAGCTTAAGGTCGTGCACTACTCTCACGATTCGGCCATGTGCTACTTCCTCACCGGCTTGGCCGATGAGACCTTAACCGTAAAACTTGGA
AAGGAAGCTCCAGCCACCTTCGCCGAAGTGTTACAGAAGGCGAAGAAAGTCATTGATGGGCAGGAGCTCCTCCGAACCAAGACTGGCCGACCTGAAAAGCAGATC
GATCAGAAGAAACTTAGCCAGGAGAGGAGGAGGACCGATGTCAAGTCCAAAGATAAGGGATCATCCTCCTCCAGTGGCAGAACAGAGTACCGAAGGTCGGAGGGC
GGCTCCATCCGGAACCGACCTTATGAACGGTATACTCCAACCACCATCCCCGTCTTCGAGATACTCACGAACATCGAGGAAAGTGGGATGGAAAAGCTCCTCAAA
CGACTTGAGAAGCTCCGAGGAGACCCAGAGAAGCGCAACAAAGATAAGTACTGCCGTTTTCACCGCGACAACGGCCATAATACCACAAGCTGCTGGGAATTGAAG
CGTCAGATTGAAGATCTCATTCAAGATGGCTACTTCAAAAAATTTGTGGGCAAACCGAGGTCTAACTCGGTCGAAAAGAAAGAAGAGAGGAAGCGTTCAAGAACG
CCGCCTCGCCAGAATGACCGACCTGCGGTCATCAACACTATTTTCAGAGGTCCGAGCGGGGGCCAGTCCGGAAACAAGAGGAAGGAGCTAGCTCGCGAGGCCAGG
TGTGAGGTATGCATCATCAGGGAGCAGAAACCTACTTGCTCCATCACTTTCGGCGATGCCGATTTGGAGGGGATCCATTTGCCCCACAATGACGCGCTCGTGATC
GCCCCTCTCATTGATCACGTCTTGAAGAGTCCAACACCCTTGGTTGGATTCTCTGGAGAATCAGTCTCCCCAGAAGGATGCATCAACCTGCCGGTAACTATCGGG
CAAGATGCTACCCAGATAACGCAGATGGCCGAGTTCGTGGTAGTCGACGGCAAATCGGCCTACAACGCCATTTTTGGGAGACCCATTATCCACTCATTTCGAGCT
GTTCCTTCCACATTGCATCAAGTCTTGAAGTACTCAACCCCTAATGGAGTGGGCACGGTCCGAGGTGAGCAAAAAACTTCAAGAGAGTGCTACGCGTCCGCGCTC
AAGGGATCGTCGGTATGTGCCCTGAAGGAACAAACCAATCGTGGCAAGCAGCAGGAGTCAGAGACCGACCTGCCAAAGGAAGGCAAAAGGCAGTTCTCCCTGCCA
ACAGAAGAGCTCGAGCTTGTTCCTTTACTTAGCCCCGAAAAACAATTGGAGGACAGAGCCAAGGCTTATAGACCTTGTAGCTCTGCCCAAAAAGAGAGAATATAT
GTAAAGACAAAAGGAAAGAGAGCCTCAGCTACTTCCGAGGTGGGAGGACAGCTCGCCTTGGGAACCTAG
mRNA sequence
ATGGCGAACGCTCACACAAATTTGGAGTCCTCTGATTCTGCAACGAAGGGGGAAGGTGTAGTTCCCAATTTCCAAGAAGCAGCAACCAATCGCTCTGCTACTTCT
TCATTGGTGATTGATTTGGATGTCCACTCTAGTGTACAGGTCAGAATCGGAGACCGGGTTCGAGCTCGATTCGTGAAGAACCGTTCAAACTCTGCCAATACGACA
GAACAGAGGGGTGTGAACGCTGACAATGGCCCTCAGCAAGACCTCGGTGCAAGAATAGTCGAGGACCATGTTCGAGCAGGGCAAGAGGGAGATTTGCCGCGACGA
TCTGCCCGCCATGCGAACCAAGAGCTACCACTTGCTCACCCGAAACCCTCAAAAGCCAACAGAGGCCGAGGAGGGACGTCGAGAAAAACCTCCCAAAGGGCCAAT
CAGGAAGCAGACCCCGAAGCTCTGTCTACTCTCCAGCGCGAGTTGGATGATATGCGCCATCGGTTGAGCACAATGGAAGAAATGTACGCCGAGGCAACGCGTGCT
AACCGAACTGACTTGCGGGATAACGATTTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCCAGAAGACTCTCCTTCCTACTCCCGAGAATTC
TCCAACTCGAGCCTAAAGGCTCAATCAAAATACAAGCCTATGGCACCAGAAACTGTGATCACGAGGGAAGAGTTCGACCTGATGAAGCACAAGTTCGATGAGCAG
GTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGACTGTTCGTTCGACGATGGCGACTTAGGAGAATCGCCATTCACCGCGGACATCCTGGAGGCTCCAATCCCT
CCGAAGTTCAAGACTCCCGCCATGAAGCCCTATGACGGGTCTAAGGACCCTAAAGACTATGTTGAGGTTTTCGAGGGCCTCATGGACTTTCAAGCGGCGACAGAT
GCGATCAAATGCCGCGCCTTCCAGATAGCGCTTACCGGTAGCGCGCGCCTGTGGTACCGAAGACTGCCGGCCAGGTCGATCTCGACCTACTCCCAGCTGAGGAAA
GAGTTCATCAGTCAGTTCTCCTCTCGGCATTATGATAGAAAGACAGCGACTCACCTTGCCACCATCAGGCAGAAGGAAGGAGAGATGCTGAGAGAATATGTGACA
CGGTTCCAGGAAGAGCAGCTTAAGGTCGTGCACTACTCTCACGATTCGGCCATGTGCTACTTCCTCACCGGCTTGGCCGATGAGACCTTAACCGTAAAACTTGGA
AAGGAAGCTCCAGCCACCTTCGCCGAAGTGTTACAGAAGGCGAAGAAAGTCATTGATGGGCAGGAGCTCCTCCGAACCAAGACTGGCCGACCTGAAAAGCAGATC
GATCAGAAGAAACTTAGCCAGGAGAGGAGGAGGACCGATGTCAAGTCCAAAGATAAGGGATCATCCTCCTCCAGTGGCAGAACAGAGTACCGAAGGTCGGAGGGC
GGCTCCATCCGGAACCGACCTTATGAACGGTATACTCCAACCACCATCCCCGTCTTCGAGATACTCACGAACATCGAGGAAAGTGGGATGGAAAAGCTCCTCAAA
CGACTTGAGAAGCTCCGAGGAGACCCAGAGAAGCGCAACAAAGATAAGTACTGCCGTTTTCACCGCGACAACGGCCATAATACCACAAGCTGCTGGGAATTGAAG
CGTCAGATTGAAGATCTCATTCAAGATGGCTACTTCAAAAAATTTGTGGGCAAACCGAGGTCTAACTCGGTCGAAAAGAAAGAAGAGAGGAAGCGTTCAAGAACG
CCGCCTCGCCAGAATGACCGACCTGCGGTCATCAACACTATTTTCAGAGGTCCGAGCGGGGGCCAGTCCGGAAACAAGAGGAAGGAGCTAGCTCGCGAGGCCAGG
TGTGAGGTATGCATCATCAGGGAGCAGAAACCTACTTGCTCCATCACTTTCGGCGATGCCGATTTGGAGGGGATCCATTTGCCCCACAATGACGCGCTCGTGATC
GCCCCTCTCATTGATCACGTCTTGAAGAGTCCAACACCCTTGGTTGGATTCTCTGGAGAATCAGTCTCCCCAGAAGGATGCATCAACCTGCCGGTAACTATCGGG
CAAGATGCTACCCAGATAACGCAGATGGCCGAGTTCGTGGTAGTCGACGGCAAATCGGCCTACAACGCCATTTTTGGGAGACCCATTATCCACTCATTTCGAGCT
GTTCCTTCCACATTGCATCAAGTCTTGAAGTACTCAACCCCTAATGGAGTGGGCACGGTCCGAGGTGAGCAAAAAACTTCAAGAGAGTGCTACGCGTCCGCGCTC
AAGGGATCGTCGGTATGTGCCCTGAAGGAACAAACCAATCGTGGCAAGCAGCAGGAGTCAGAGACCGACCTGCCAAAGGAAGGCAAAAGGCAGTTCTCCCTGCCA
ACAGAAGAGCTCGAGCTTGTTCCTTTACTTAGCCCCGAAAAACAATTGGAGGACAGAGCCAAGGCTTATAGACCTTGTAGCTCTGCCCAAAAAGAGAGAATATAT
GTAAAGACAAAAGGAAAGAGAGCCTCAGCTACTTCCGAGGTGGGAGGACAGCTCGCCTTGGGAACCTAG
Protein sequence
MANAHTNLESSDSATKGEGVVPNFQEAATNRSATSSLVIDLDVHSSVQVRIGDRVRARFVKNRSNSANTTEQRGVNADNGPQQDLGARIVEDHVRAGQEGDLPRR
SARHANQELPLAHPKPSKANRGRGGTSRKTSQRANQEADPEALSTLQRELDDMRHRLSTMEEMYAEATRANRTDLRDNDLRKHLTEKKKRASREPEDSPSYSREF
SNSSLKAQSKYKPMAPETVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATD
AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGEMLREYVTRFQEEQLKVVHYSHDSAMCYFLTGLADETLTVKLG
KEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRTDVKSKDKGSSSSSGRTEYRRSEGGSIRNRPYERYTPTTIPVFEILTNIEESGMEKLLK
RLEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRQNDRPAVINTIFRGPSGGQSGNKRKELAREAR
CEVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLKSPTPLVGFSGESVSPEGCINLPVTIGQDATQITQMAEFVVVDGKSAYNAIFGRPIIHSFRA
VPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALKEQTNRGKQQESETDLPKEGKRQFSLPTEELELVPLLSPEKQLEDRAKAYRPCSSAQKERIY
VKTKGKRASATSEVGGQLALGT