; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g04790 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g04790
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:4076642..4082535
RNA-Seq ExpressionMoc07g04790
SyntenyMoc07g04790
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]4.8e-24887.88Show/hide
Query:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEALMDFQAASD
        +AESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE+LMDFQAASD
Subjt:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEALMDFQAASD

Query:  AIKCRAFQIALTGSARLSYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARL YRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQ EGETLREYVTRFQEEQLKV HCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLSYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELL+TKTGRPERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLR-EPRRGA--------------------ARTSIAASIGS------TAITRRTAGKKKEERKRSRTPPRRTNRPAVINTIFGGPSRG
        ESGMEKLLKRPEKLR  P R +                     +  I   I            R ++ +KKEERKRSRTPPRRT+RPAVINTIFGGPS G
Subjt:  ESGMEKLLKRPEKLR-EPRRGA--------------------ARTSIAASIGS------TAITRRTAGKKKEERKRSRTPPRRTNRPAVINTIFGGPSRG

Query:  QSGRKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSGRKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGRKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGYIDLPVTLGQDQTQVTQMAEFV
        SVIPEG+IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGYIDLPVTLGQDQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]2.7e-24376.55Show/hide
Query:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEALMDFQ
        SSNQQAESS NPATP GVITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFE LMDFQ
Subjt:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEALMDFQ

Query:  AASDAIKCRAFQIALTGSARLSYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARL                                                      FQE+QLKV   SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLSYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELL+TKTGRPER I RGRSGKD EKAD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLR-EPRR----------------GAARTSIAASIGS----------TAITRRTAGKKKEERKRSRTPPRRTNRPAVINTIFGG
        TNIEESGMEKLLKRPEKLR  P R                 + R  +   I                R ++ +KKEERK SRTP RR +RPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLR-EPRR----------------GAARTSIAASIGS----------TAITRRTAGKKKEERKRSRTPPRRTNRPAVINTIFGG

Query:  PSRGQSGRKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG
        PS GQSG KRKELARAARREVCIIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSRGQSGRKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIPEGYIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        FS ESVIPEG IDLPVTLG DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGYIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  ASRDGTLEFKADLPRREFAAPTEELELVPLL
         SRDGTLEFKA+LPRREFAAPTEELELVPLL
Subjt:  ASRDGTLEFKADLPRREFAAPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.3e-23761.99Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVEAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGSAPAPTSENLDALQREIEAM
        MVQPANSTNTADRR LAA+  HQREV A VVEGQGH+ L TEPL RSARIT PVLPPAHP                                        
Subjt:  MVQPANSTNTADRRTLAASDAHQREVEAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGSAPAPTSENLDALQREIEAM

Query:  HTKMRSMEEMYNEMILAAGARSRSENRVTRAGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPA
                                                                                         PS+        AESS NP 
Subjt:  HTKMRSMEEMYNEMILAAGARSRSENRVTRAGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPA

Query:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEALMDFQAASDAIKCRAFQI
        TP GVITREEFDQL+ + DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE+LMDFQAA+DAIKC AFQI
Subjt:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEALMDFQAASDAIKCRAFQI

Query:  ALTGSARLSYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLADEALTVKLGEEAP
        ALTGSARL YRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQ EGETLREYVTRF EEQLKV HCSDDSAMCYFLTGLADE LTVKL EEAP
Subjt:  ALTGSARLSYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLADEALTVKLGEEAP

Query:  ATFAEVLQKAKKVIDGQELLQTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLL
        ATFAEVLQK KKVIDGQELL+TKTGRPE+ I +GR+GKD  KAD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIEE+GMEKLL
Subjt:  ATFAEVLQKAKKVIDGQELLQTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLL

Query:  KRPEKLR---EPRRGAARTSIAASIGSTAIT------------------------RRTAGKKKEERKRSRTPPRRTNRPAVINTIFGGPSRGQSGRKRKE
        KRPEKLR   E R            G                             R  + +KKEERKR RTPPRR +RPAVIN             K+KE
Subjt:  KRPEKLR---EPRRGAARTSIAASIGSTAIT------------------------RRTAGKKKEERKRSRTPPRRTNRPAVINTIFGGPSRGQSGRKRKE

Query:  LARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYI
        LAR ARREVCIIREQRPT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EG I
Subjt:  LARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYI

Query:  DLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRD
        DLPV++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  DLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRD

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]6.0e-19863.41Show/hide
Query:  EQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQ
        E+    + P   E   ++E   ++ +  DLR+HL  K+  +  + +   S SR   +SN +A+S   P  P  VI REEFD ++ + D QVEALKA+CE+
Subjt:  EQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQ

Query:  KEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEALMDFQAASDAIKCRAFQIALTGSARLSYRRLPARSISTYSQLRREFLAQ
        KE P +D DLGESPFTSD++EAPIPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARL  RRLPARSISTYSQLR+EF+ Q
Subjt:  KEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEALMDFQAASDAIKCRAFQIALTGSARLSYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPERKI
        FS RHYD+KTATHLATIRQ E                                   DE LTVKLGEEAPATFAEVLQ AKKVIDGQELL+TKT RPE++I
Subjt:  FSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPERKI

Query:  GRGRSGKDIEKADPKSKDKGSFSSG-RAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLREPRRGAARTSIAASIGSTAITRRT
         + R  +   K D KSKDKGS SSG R EYRR+E+GP+RSRPYER       I ++   I++S  +K + +P                         R  
Subjt:  GRGRSGKDIEKADPKSKDKGSFSSG-RAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLREPRRGAARTSIAASIGSTAITRRT

Query:  AGKKKEERKRSRTPPRRTNRPAVINTIFGGPSRGQSGRKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGG
        + +KKEERKRSRTPPRR +RPAVINTIFGGPS GQ   KRKELA  ARR+V IIREQ+PTC ITF   DLE VHLPHNDALVIAPLIDHV+VRRVLVDGG
Subjt:  AGKKKEERKRSRTPPRRTNRPAVINTIFGGPSRGQSGRKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGG

Query:  ASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNG
        ASANILSLPTYLAL  TRSQLKKSPTPLVGFS ESV PEG IDLPVT+GQD TQVTQMAEFVVIDGR AYNAIF RPIIHSF+A+PS LHQVLKYSTPNG
Subjt:  ASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNG

Query:  VGTVRGEQTASRECYASALKGSSVCALETLASRDGTLEFKADLPR
        VGTVRGEQ  SRECYASALK SSVCALE   S+D       DLPR
Subjt:  VGTVRGEQTASRECYASALKGSSVCALETLASRDGTLEFKADLPR

XP_022157676.1 uncharacterized protein LOC111024332 [Momordica charantia]1.7e-19288.38Show/hide
Query:  LADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE
        +ADEALTVKLGEEAPATFAEVLQKAKKVIDGQELL+TKTGRPERKIGRGRSGKD E+ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE
Subjt:  LADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE

Query:  ILTNIEESGMEKLLKRPEKLR--EPRRGAARTSIAASIGSTAITRRTAGKKKEERKRSRTPPRRTNRPAVINTIFGGPSRGQSGRKRKELARAARREVCI
        ILTNIE+SGMEKLLKRPEKLR    RR   +TS A              +KKEERKRSRTPPRRT+RPAVINTIFGGPS GQSG KRKELAR ARREVCI
Subjt:  ILTNIEESGMEKLLKRPEKLR--EPRRGAARTSIAASIGSTAITRRTAGKKKEERKRSRTPPRRTNRPAVINTIFGGPSRGQSGRKRKELARAARREVCI

Query:  IREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPVTLGQDQT
        IREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLK+SPTPLVGFSGESVIPEG IDLPVTLGQDQT
Subjt:  IREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPVTLGQDQT

Query:  QVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDGTLEFKADLPRREFAAPTE
        +VTQM EFVV+DGRS YNAIFGRPIIHSFR IPSTLHQVLKYSTPNGVGTVRGEQT SRECYA+ALKGSSVCALETL  RDGTLE +ADLPR+EFAAPTE
Subjt:  QVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDGTLEFKADLPRREFAAPTE

Query:  ELELVPLLSPEKQ
        ELELVPLLSPEKQ
Subjt:  ELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088132.3e-24887.88Show/hide
Query:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEALMDFQAASD
        +AESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE+LMDFQAASD
Subjt:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEALMDFQAASD

Query:  AIKCRAFQIALTGSARLSYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARL YRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQ EGETLREYVTRFQEEQLKV HCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLSYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELL+TKTGRPERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLR-EPRRGA--------------------ARTSIAASIGS------TAITRRTAGKKKEERKRSRTPPRRTNRPAVINTIFGGPSRG
        ESGMEKLLKRPEKLR  P R +                     +  I   I            R ++ +KKEERKRSRTPPRRT+RPAVINTIFGGPS G
Subjt:  ESGMEKLLKRPEKLR-EPRRGA--------------------ARTSIAASIGS------TAITRRTAGKKKEERKRSRTPPRRTNRPAVINTIFGGPSRG

Query:  QSGRKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSGRKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGRKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGYIDLPVTLGQDQTQVTQMAEFV
        SVIPEG+IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGYIDLPVTLGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188231.3e-24376.55Show/hide
Query:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEALMDFQ
        SSNQQAESS NPATP GVITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFE LMDFQ
Subjt:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEALMDFQ

Query:  AASDAIKCRAFQIALTGSARLSYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARL                                                      FQE+QLKV   SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLSYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELL+TKTGRPER I RGRSGKD EKAD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLR-EPRR----------------GAARTSIAASIGS----------TAITRRTAGKKKEERKRSRTPPRRTNRPAVINTIFGG
        TNIEESGMEKLLKRPEKLR  P R                 + R  +   I                R ++ +KKEERK SRTP RR +RPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLR-EPRR----------------GAARTSIAASIGS----------TAITRRTAGKKKEERKRSRTPPRRTNRPAVINTIFGG

Query:  PSRGQSGRKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG
        PS GQSG KRKELARAARREVCIIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSRGQSGRKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIPEGYIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        FS ESVIPEG IDLPVTLG DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGYIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  ASRDGTLEFKADLPRREFAAPTEELELVPLL
         SRDGTLEFKA+LPRREFAAPTEELELVPLL
Subjt:  ASRDGTLEFKADLPRREFAAPTEELELVPLL

A0A6J1DHB3 uncharacterized protein LOC1110204796.4e-23861.99Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVEAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGSAPAPTSENLDALQREIEAM
        MVQPANSTNTADRR LAA+  HQREV A VVEGQGH+ L TEPL RSARIT PVLPPAHP                                        
Subjt:  MVQPANSTNTADRRTLAASDAHQREVEAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGSAPAPTSENLDALQREIEAM

Query:  HTKMRSMEEMYNEMILAAGARSRSENRVTRAGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPA
                                                                                         PS+        AESS NP 
Subjt:  HTKMRSMEEMYNEMILAAGARSRSENRVTRAGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPA

Query:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEALMDFQAASDAIKCRAFQI
        TP GVITREEFDQL+ + DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE+LMDFQAA+DAIKC AFQI
Subjt:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEALMDFQAASDAIKCRAFQI

Query:  ALTGSARLSYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLADEALTVKLGEEAP
        ALTGSARL YRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQ EGETLREYVTRF EEQLKV HCSDDSAMCYFLTGLADE LTVKL EEAP
Subjt:  ALTGSARLSYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLADEALTVKLGEEAP

Query:  ATFAEVLQKAKKVIDGQELLQTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLL
        ATFAEVLQK KKVIDGQELL+TKTGRPE+ I +GR+GKD  KAD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIEE+GMEKLL
Subjt:  ATFAEVLQKAKKVIDGQELLQTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLL

Query:  KRPEKLR---EPRRGAARTSIAASIGSTAIT------------------------RRTAGKKKEERKRSRTPPRRTNRPAVINTIFGGPSRGQSGRKRKE
        KRPEKLR   E R            G                             R  + +KKEERKR RTPPRR +RPAVIN             K+KE
Subjt:  KRPEKLR---EPRRGAARTSIAASIGSTAIT------------------------RRTAGKKKEERKRSRTPPRRTNRPAVINTIFGGPSRGQSGRKRKE

Query:  LARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYI
        LAR ARREVCIIREQRPT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EG I
Subjt:  LARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYI

Query:  DLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRD
        DLPV++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  DLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRD

A0A6J1DPC9 uncharacterized protein LOC1110222802.9e-19863.41Show/hide
Query:  EQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQ
        E+    + P   E   ++E   ++ +  DLR+HL  K+  +  + +   S SR   +SN +A+S   P  P  VI REEFD ++ + D QVEALKA+CE+
Subjt:  EQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQ

Query:  KEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEALMDFQAASDAIKCRAFQIALTGSARLSYRRLPARSISTYSQLRREFLAQ
        KE P +D DLGESPFTSD++EAPIPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARL  RRLPARSISTYSQLR+EF+ Q
Subjt:  KEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEALMDFQAASDAIKCRAFQIALTGSARLSYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPERKI
        FS RHYD+KTATHLATIRQ E                                   DE LTVKLGEEAPATFAEVLQ AKKVIDGQELL+TKT RPE++I
Subjt:  FSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPERKI

Query:  GRGRSGKDIEKADPKSKDKGSFSSG-RAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLREPRRGAARTSIAASIGSTAITRRT
         + R  +   K D KSKDKGS SSG R EYRR+E+GP+RSRPYER       I ++   I++S  +K + +P                         R  
Subjt:  GRGRSGKDIEKADPKSKDKGSFSSG-RAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLREPRRGAARTSIAASIGSTAITRRT

Query:  AGKKKEERKRSRTPPRRTNRPAVINTIFGGPSRGQSGRKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGG
        + +KKEERKRSRTPPRR +RPAVINTIFGGPS GQ   KRKELA  ARR+V IIREQ+PTC ITF   DLE VHLPHNDALVIAPLIDHV+VRRVLVDGG
Subjt:  AGKKKEERKRSRTPPRRTNRPAVINTIFGGPSRGQSGRKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGG

Query:  ASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNG
        ASANILSLPTYLAL  TRSQLKKSPTPLVGFS ESV PEG IDLPVT+GQD TQVTQMAEFVVIDGR AYNAIF RPIIHSF+A+PS LHQVLKYSTPNG
Subjt:  ASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNG

Query:  VGTVRGEQTASRECYASALKGSSVCALETLASRDGTLEFKADLPR
        VGTVRGEQ  SRECYASALK SSVCALE   S+D       DLPR
Subjt:  VGTVRGEQTASRECYASALKGSSVCALETLASRDGTLEFKADLPR

A0A6J1DYW5 uncharacterized protein LOC1110243328.2e-19388.38Show/hide
Query:  LADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE
        +ADEALTVKLGEEAPATFAEVLQKAKKVIDGQELL+TKTGRPERKIGRGRSGKD E+ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE
Subjt:  LADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE

Query:  ILTNIEESGMEKLLKRPEKLR--EPRRGAARTSIAASIGSTAITRRTAGKKKEERKRSRTPPRRTNRPAVINTIFGGPSRGQSGRKRKELARAARREVCI
        ILTNIE+SGMEKLLKRPEKLR    RR   +TS A              +KKEERKRSRTPPRRT+RPAVINTIFGGPS GQSG KRKELAR ARREVCI
Subjt:  ILTNIEESGMEKLLKRPEKLR--EPRRGAARTSIAASIGSTAITRRTAGKKKEERKRSRTPPRRTNRPAVINTIFGGPSRGQSGRKRKELARAARREVCI

Query:  IREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPVTLGQDQT
        IREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLK+SPTPLVGFSGESVIPEG IDLPVTLGQDQT
Subjt:  IREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGYIDLPVTLGQDQT

Query:  QVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDGTLEFKADLPRREFAAPTE
        +VTQM EFVV+DGRS YNAIFGRPIIHSFR IPSTLHQVLKYSTPNGVGTVRGEQT SRECYA+ALKGSSVCALETL  RDGTLE +ADLPR+EFAAPTE
Subjt:  QVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDGTLEFKADLPRREFAAPTE

Query:  ELELVPLLSPEKQ
        ELELVPLLSPEKQ
Subjt:  ELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCAGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGAAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCCCCAAGGACATCCAAGGCCACCCGTGGCCGAGGCGGAACCT
CTAAGAAGGGCGCCCGGGGTTCAGCCCCGGCCCCGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATTGAGGCAATGCACACAAAAATGCGGTCCATGGAGGAAATG
TATAACGAAATGATATTAGCTGCAGGCGCACGGTCCCGATCTGAGAACCGAGTGACGCGCGCTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGA
ACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCC
GCTCACACAGGAGCTCCAACCAACAGGCTGAATCCTCTCGCAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCT
CAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCC
GAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGCCCTCATGGATTTCCAAGCGGCATCAGACGCAATCA
AATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTCGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCC
CAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACTCATCTCGCCACCATCAGACAGAATGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCA
ATTGAAGGTCACACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCG
CCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCAAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATA
GAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGGCCTTACGAACGCTTCAC
CCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGAGCCCCGGAGAGGCGCAGCAA
GGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAAAAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTAACCGA
CCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCCGGGGTCAGTCCGGACGTAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCA
GAGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGA
GGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCACAATTGAAGAAAAGCCCGACACCGCTGGTTGGG
TTCTCTGGAGAATCGGTCATCCCAGAAGGTTACATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAG
ATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACAG
TCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCAGTAGGGATGGGACGCTCGAGTTCAAG
GCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATCAGCGTACGAGACCGACCTGGCCAG
GTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAAAGCTAGATCTGATGGAGATCGGCGCTCCAGAGTCCTCATGGATGGACCCGATTGTGGACTTTATTGGGG
GCAATTCACCACAAGACCCCAAGGAGCGCAGAAGGTTGGCCAGGCAAGCAGCTCGGTTCGTGGTCCGAGAAATTAAATGGGGGCCACGGACTCCCACACGATCACATTCC
AGCAGTCGGTTAAAATCCAATCCTCCAAAACCTAAGGGTACGAGGTGCGATGCCAAAACCACTGACGAACTTAAAATTCAAACCTTCAAGGTAAAGGGGCGATGTGAAAA
GTTCAAAATGATCAAGCCTCCGAACTTGAGGGTACGAGGTGCGATATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCAGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGAAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCCCCAAGGACATCCAAGGCCACCCGTGGCCGAGGCGGAACCT
CTAAGAAGGGCGCCCGGGGTTCAGCCCCGGCCCCGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATTGAGGCAATGCACACAAAAATGCGGTCCATGGAGGAAATG
TATAACGAAATGATATTAGCTGCAGGCGCACGGTCCCGATCTGAGAACCGAGTGACGCGCGCTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGA
ACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCC
GCTCACACAGGAGCTCCAACCAACAGGCTGAATCCTCTCGCAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCT
CAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCC
GAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGCCCTCATGGATTTCCAAGCGGCATCAGACGCAATCA
AATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTCGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCC
CAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACTCATCTCGCCACCATCAGACAGAATGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCA
ATTGAAGGTCACACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCG
CCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCAAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATA
GAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGGCCTTACGAACGCTTCAC
CCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGAGCCCCGGAGAGGCGCAGCAA
GGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAAAAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTAACCGA
CCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCCGGGGTCAGTCCGGACGTAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCA
GAGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGA
GGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCACAATTGAAGAAAAGCCCGACACCGCTGGTTGGG
TTCTCTGGAGAATCGGTCATCCCAGAAGGTTACATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAG
ATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACAG
TCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCAGTAGGGATGGGACGCTCGAGTTCAAG
GCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATCAGCGTACGAGACCGACCTGGCCAG
GTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAAAGCTAGATCTGATGGAGATCGGCGCTCCAGAGTCCTCATGGATGGACCCGATTGTGGACTTTATTGGGG
GCAATTCACCACAAGACCCCAAGGAGCGCAGAAGGTTGGCCAGGCAAGCAGCTCGGTTCGTGGTCCGAGAAATTAAATGGGGGCCACGGACTCCCACACGATCACATTCC
AGCAGTCGGTTAAAATCCAATCCTCCAAAACCTAAGGGTACGAGGTGCGATGCCAAAACCACTGACGAACTTAAAATTCAAACCTTCAAGGTAAAGGGGCGATGTGAAAA
GTTCAAAATGATCAAGCCTCCGAACTTGAGGGTACGAGGTGCGATATGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVEAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGSAPAPTSENLDALQREIEAMHTKMRSMEEM
YNEMILAAGARSRSENRVTRAGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDA
QVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEALMDFQAASDAIKCRAFQIALTGSARLSYRRLPARSISTYSQLRREFLA
QFSSRHYDKKTATHLATIRQNEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPERKIGRGRSGKDI
EKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLREPRRGAARTSIAASIGSTAITRRTAGKKKEERKRSRTPPRRTNR
PAVINTIFGGPSRGQSGRKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG
FSGESVIPEGYIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDGTLEFK
ADLPRREFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSISKLDLMEIGAPESSWMDPIVDFIGGNSPQDPKERRRLARQAARFVVREIKWGPRTPTRSHS
SSRLKSNPPKPKGTRCDAKTTDELKIQTFKVKGRCEKFKMIKPPNLRVRGAI