; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g06040 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g06040
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:4999238..5006382
RNA-Seq ExpressionMoc07g06040
SyntenyMoc07g06040
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]3.7e-19670Show/hide
Query:  LKAQSKYKPLTPEVVITREEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESSFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P TP  VITREEFD ++ + D QVEALKA+CE+KE   +DGDLGES FTSD+LEAPIPPKFK PT+K YDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLTPEVVITREEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESSFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCHAFQIALTGSARLWYRRLPT---------------------------------RQKEGETLKEYVTRFQEEQLKVVHCSDDSTMCYFHTGLADET
        DAIKC AF+IALTGSARLWYRRLP                                  RQKEGETL+EYVTRFQEEQLKV HCSDDS MCYF TGLADE 
Subjt:  DAIKCHAFQIALTGSARLWYRRLPT---------------------------------RQKEGETLKEYVTRFQEEQLKVVHCSDDSTMCYFHTGLADET

Query:  LTVKLGEEAPATFTKVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRSESGPNRSRPYERYTPTTIPISEILTN
        LTVKLGEEAPATF +VLQKAKKVIDGQELLRTKTGRPE++I + +  ++   AD KS+DKGS SS  R EYRR+E+GP RSRPYER+TPTTIPISEILTN
Subjt:  LTVKLGEEAPATFTKVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRSESGPNRSRPYERYTPTTIPISEILTN

Query:  IEESGMEKLLKRPEKLRGDPKKRNKDKYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK----------------------------------PS
        IEESGMEKLLKRPEKLRG P++R+KDKYCRFHR+HGHNT+  WELKRQIE+LIQDGYFKKFVGK                                  PS
Subjt:  IEESGMEKLLKRPEKLRGDPKKRNKDKYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK----------------------------------PS

Query:  GGQSENKRKVLAREARREVCIIREQKPTCSITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTKSQLKKSPTPLVGFS
        GGQS  KRK LAR ARREVCIIREQ+PTC ITF   +LE VHLPHNDALVIAPLIDHV+V RVL+DGG SANILSLPTYLALGWT+SQLKKSPTPLVGFS
Subjt:  GGQSENKRKVLAREARREVCIIREQKPTCSITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTKSQLKKSPTPLVGFS

Query:  GETVSPEGCIDLPVTIGQDATQVTQMVEFV
        GE+V PEG IDLPVT+GQD TQVTQM EFV
Subjt:  GETVSPEGCIDLPVTIGQDATQVTQMVEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]7.6e-21869.12Show/hide
Query:  NSNLKAQSKYKPLTPEVVITREEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESSFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQ
        +SN +A+S + P TP+ VITREEFD ++ K + QVEALKA+CE+KE   +DGDLGES FTSD+LEA        PT+KSYDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYKPLTPEVVITREEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESSFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCHAFQIALTGSARLWYRRLPTRQKEGETLKEYVTRFQEEQLKVVHCSDDSTMCYFHTGLADETLTVKLGEEAPATFTKVLQKAKKVIDGQELL
        AA+DAIKC AFQIALTGSARLW                    FQE+QLKV   SDDS MCYF TGLADE LTVKLG+EAPATF +VLQKAKKVIDGQELL
Subjt:  AATDAIKCHAFQIALTGSARLWYRRLPTRQKEGETLKEYVTRFQEEQLKVVHCSDDSTMCYFHTGLADETLTVKLGEEAPATFTKVLQKAKKVIDGQELL

Query:  RTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRSESGPNRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCR
        RTKTGRPE+ ID+ + S +  KAD KS+DKGS SS  R E+RR+ +GP RSRPYER+TPTTIPISEILTNIEESGMEKLLKRPEKLRG P++RNKDKYCR
Subjt:  RTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRSESGPNRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCR

Query:  FHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK----------------------------------PSGGQSENKRKVLAREARREVCIIREQKPTCS
        FHR+H HNT+  WELKRQIEDLIQD YFKKFVGK                                  PSGGQS +KRK LAR ARREVCIIREQ+PTC 
Subjt:  FHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK----------------------------------PSGGQSENKRKVLAREARREVCIIREQKPTCS

Query:  ITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTKSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQDATQVTQMVEFV
        ITF   +LE VHLPHNDALVIAPLIDHV+VRRVL+D G SANI+SL TYLALGWT+SQLKKS TPLVGFS E+V PEGCIDLPVT+G D TQVTQM EFV
Subjt:  ITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTKSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQDATQVTQMVEFV

Query:  VIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQD-------DLPRKSKIQFSPPTDELELVP
        VIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ  SRECYASALKGSSVCALE   S+D       +LPR+   +F+ PT+ELELVP
Subjt:  VIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQD-------DLPRKSKIQFSPPTDELELVP

Query:  LLSPEKQLPQDH
        LL  +     DH
Subjt:  LLSPEKQLPQDH

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.4e-24061.92Show/hide
Query:  MVQPANSANTTERRGVNADNGTQRDLDARIVEDQVRAGQEGDLPRRSARHANQELPPAHPKPSKANRGRGGTTRKTSQRASQAADPEALSTLQRELDDMR
        MVQPANS NT +RR + A++G QR++ A +VE Q       +   RSAR     LPPAHPKPS                                     
Subjt:  MVQPANSANTTERRGVNADNGTQRDLDARIVEDQVRAGQEGDLPRRSARHANQELPPAHPKPSKANRGRGGTTRKTSQRASQAADPEALSTLQRELDDMR

Query:  HRLRTMEEMYAEATRANRTASPSRVSGAPGEKGAPSIQPGDREPIPNDEGVDYSLQDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLT
                                                                                                  KA+S Y P+T
Subjt:  HRLRTMEEMYAEATRANRTASPSRVSGAPGEKGAPSIQPGDREPIPNDEGVDYSLQDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLT

Query:  PEVVITREEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESSFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQAATDAIKCHAFQIA
        P  VITREEFD +K KFD QVEALKARCEKKE SFDDGDLGE SF+SDILEA IPPKFKTPTMK YDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIA
Subjt:  PEVVITREEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESSFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQAATDAIKCHAFQIA

Query:  LTGSARLWYRRLPT---------------------------------RQKEGETLKEYVTRFQEEQLKVVHCSDDSTMCYFHTGLADETLTVKLGEEAPA
        LTGSARLWYRRLP                                  RQKEGETL+EYVTRF EEQLKV HCSDDS MCYF TGLADETLTVKL EEAPA
Subjt:  LTGSARLWYRRLPT---------------------------------RQKEGETLKEYVTRFQEEQLKVVHCSDDSTMCYFHTGLADETLTVKLGEEAPA

Query:  TFTKVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRSESGPNRSRPYERYTPTTIPISEILTNIEESGMEKLLK
        TF +VLQK KKVIDGQELLRTKTGRPEK IDQ +  ++K KADSKSRDKG SSS+SR +YRRS S  N+SRPYE YTPTTIPI EILTNIEE+GMEKLLK
Subjt:  TFTKVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRSESGPNRSRPYERYTPTTIPISEILTNIEESGMEKLLK

Query:  RPEKLRGDPKKRNKDKYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPSGGQSE---------------------NKRKVLAREARREVCIIRE
        RPEKLRGDP+KRN DKYCRFHRDHGHNT++ WELKRQIEDLIQDGYFKKFVGKP     E                     NK+K LAREARREVCIIRE
Subjt:  RPEKLRGDPKKRNKDKYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPSGGQSE---------------------NKRKVLAREARREVCIIRE

Query:  QKPTCSITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTKSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQDATQVT
        Q+PT SI F   +LEGVHLPHNDALVIAPLID VLVRR+L+DGGASANILSL TYLALGWT+SQLKKSPTPLVGFSGE++S EGCIDLPV+I QD TQVT
Subjt:  QKPTCSITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTKSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQDATQVT

Query:  QMVEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDL
        QM EFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYST NGVGTVRGE KTSRECYAS  K SSVCALEEQT +D+L
Subjt:  QMVEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDL

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]6.0e-23972.25Show/hide
Query:  VSGAPGEKGAPSIQPGDREPIPNDEGVDYSLQDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLTPEVVITREEFDLMKHKFDEQVEAL
        + GAPGEKGAPSIQPG+REPIPNDEGVDYSL+DNDLRKHLT+KKK+AS EPEDS SYSREFSNSNLKAQSKYKPL PE VI REEFDLMKH+FDEQVEAL
Subjt:  VSGAPGEKGAPSIQPGDREPIPNDEGVDYSLQDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLTPEVVITREEFDLMKHKFDEQVEAL

Query:  KARCEKKECSFDDGDLGESSFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQAATDAIKCHAFQIALTGSARLWYRRLPTR--QKEGETL
        KARCEKKE  FDD DLGES FTSDI+EAPIPPKFKTPTMK YDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW RRLP R      +  
Subjt:  KARCEKKECSFDDGDLGESSFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQAATDAIKCHAFQIALTGSARLWYRRLPTR--QKEGETL

Query:  KEYVTRFQEEQLKVVHCSDDSTMCYFHT--GLADETLTVKLGEEAPATFTKVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSS
        KE++ +F           D  T  +  T     DETLTVKLGEEAPATF +VLQ AKKVIDGQELLRTKT RPEKQIDQK+LSQ+KRK DSKS+DKGSSS
Subjt:  KEYVTRFQEEQLKVVHCSDDSTMCYFHT--GLADETLTVKLGEEAPATFTKVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSS

Query:  SASRTEYRRSESGPNRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK
        S SRTEYRRSESGP+RSRPYER                                                       CWELKRQIEDLIQD YFKKFVGK
Subjt:  SASRTEYRRSESGPNRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK

Query:  ----------------------------------PSGGQSENKRKVLAREARREVCIIREQKPTCSITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVL
                                          PSGGQ ENKRK LA EARR+V IIREQKPTCSITF DT+LEGVHLPHNDALVIAPLIDHVLVRRVL
Subjt:  ----------------------------------PSGGQSENKRKVLAREARREVCIIREQKPTCSITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVL

Query:  IDGGASANILSLPTYLALGWTKSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQDATQVTQMVEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYS
        +DGGASANILSLPTYLAL  T+SQLKKSPTPLVGFS E+VSPEGCIDLPVTIGQD+TQVTQM EFVVIDGR AYNAIF RPIIHSF+AVPS LHQVLKYS
Subjt:  IDGGASANILSLPTYLALGWTKSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQDATQVTQMVEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYS

Query:  TPNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDLPRKSK
        TPNGVGTVRGEQKTSRECYASALK SSVCALEEQTSQDDLPR++K
Subjt:  TPNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDLPRKSK

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]3.0e-19868.46Show/hide
Query:  MDFQAATDAIKCHAFQIALTGSARLWYRRLPT---------------------------------RQKEGETLKEYVTRFQEEQLKVVHCSDDSTMCYFH
        MDFQAATDAIKC AFQIALTGSARLWYRRLP                                  RQKE ETL+EYVTRFQEEQLKV HCSDDS MCYF 
Subjt:  MDFQAATDAIKCHAFQIALTGSARLWYRRLPT---------------------------------RQKEGETLKEYVTRFQEEQLKVVHCSDDSTMCYFH

Query:  TGLADETLTVKLGEEAPATFTKVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRSESGPNRSRPYERYTPTTIP
        T LADETLTVKLGEEAP TF +VLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR ESGP+RSRPYERYT +TIP
Subjt:  TGLADETLTVKLGEEAPATFTKVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRSESGPNRSRPYERYTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK-----------------------------
        ISEILTNIEESGMEKLLKRPEKLRGD +KRNK+KYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK                             
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK-----------------------------

Query:  -----PSGGQSENKRKVLAREARREVCIIREQKPTCSITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTKSQLKKSP
             P+GGQS NKRK LAREARREVCIIRE KPTCSITFGD +LEGVHLPHNDALVIA LIDH LVRRVLIDG                          
Subjt:  -----PSGGQSENKRKVLAREARREVCIIREQKPTCSITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTKSQLKKSP

Query:  TPLVGFSGETVSPEGCIDLPVTIGQDATQVTQMVEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVC
                      GCIDLPVTIGQDATQVTQM EFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPN VG VRGEQKTSRECYASALKGS+VC
Subjt:  TPLVGFSGETVSPEGCIDLPVTIGQDATQVTQMVEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVC

Query:  ALEEQT-------SQDDLPRKSKIQFSPPTDELELVPLLSPEKQLPQDHLGVQINQKKQKDGKTFMGGARRLGSLQK
        ALEEQT       S+ DLP++ K QF PPT+ELELVPLLSPE+Q   + +   +  +  K  K       R+ +L +
Subjt:  ALEEQT-------SQDDLPRKSKIQFSPPTDELELVPLLSPEKQLPQDHLGVQINQKKQKDGKTFMGGARRLGSLQK

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.8e-19670Show/hide
Query:  LKAQSKYKPLTPEVVITREEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESSFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P TP  VITREEFD ++ + D QVEALKA+CE+KE   +DGDLGES FTSD+LEAPIPPKFK PT+K YDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLTPEVVITREEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESSFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCHAFQIALTGSARLWYRRLPT---------------------------------RQKEGETLKEYVTRFQEEQLKVVHCSDDSTMCYFHTGLADET
        DAIKC AF+IALTGSARLWYRRLP                                  RQKEGETL+EYVTRFQEEQLKV HCSDDS MCYF TGLADE 
Subjt:  DAIKCHAFQIALTGSARLWYRRLPT---------------------------------RQKEGETLKEYVTRFQEEQLKVVHCSDDSTMCYFHTGLADET

Query:  LTVKLGEEAPATFTKVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRSESGPNRSRPYERYTPTTIPISEILTN
        LTVKLGEEAPATF +VLQKAKKVIDGQELLRTKTGRPE++I + +  ++   AD KS+DKGS SS  R EYRR+E+GP RSRPYER+TPTTIPISEILTN
Subjt:  LTVKLGEEAPATFTKVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRSESGPNRSRPYERYTPTTIPISEILTN

Query:  IEESGMEKLLKRPEKLRGDPKKRNKDKYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK----------------------------------PS
        IEESGMEKLLKRPEKLRG P++R+KDKYCRFHR+HGHNT+  WELKRQIE+LIQDGYFKKFVGK                                  PS
Subjt:  IEESGMEKLLKRPEKLRGDPKKRNKDKYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK----------------------------------PS

Query:  GGQSENKRKVLAREARREVCIIREQKPTCSITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTKSQLKKSPTPLVGFS
        GGQS  KRK LAR ARREVCIIREQ+PTC ITF   +LE VHLPHNDALVIAPLIDHV+V RVL+DGG SANILSLPTYLALGWT+SQLKKSPTPLVGFS
Subjt:  GGQSENKRKVLAREARREVCIIREQKPTCSITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTKSQLKKSPTPLVGFS

Query:  GETVSPEGCIDLPVTIGQDATQVTQMVEFV
        GE+V PEG IDLPVT+GQD TQVTQM EFV
Subjt:  GETVSPEGCIDLPVTIGQDATQVTQMVEFV

A0A6J1D9E1 uncharacterized protein LOC1110188233.7e-21869.12Show/hide
Query:  NSNLKAQSKYKPLTPEVVITREEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESSFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQ
        +SN +A+S + P TP+ VITREEFD ++ K + QVEALKA+CE+KE   +DGDLGES FTSD+LEA        PT+KSYDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYKPLTPEVVITREEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESSFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCHAFQIALTGSARLWYRRLPTRQKEGETLKEYVTRFQEEQLKVVHCSDDSTMCYFHTGLADETLTVKLGEEAPATFTKVLQKAKKVIDGQELL
        AA+DAIKC AFQIALTGSARLW                    FQE+QLKV   SDDS MCYF TGLADE LTVKLG+EAPATF +VLQKAKKVIDGQELL
Subjt:  AATDAIKCHAFQIALTGSARLWYRRLPTRQKEGETLKEYVTRFQEEQLKVVHCSDDSTMCYFHTGLADETLTVKLGEEAPATFTKVLQKAKKVIDGQELL

Query:  RTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRSESGPNRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCR
        RTKTGRPE+ ID+ + S +  KAD KS+DKGS SS  R E+RR+ +GP RSRPYER+TPTTIPISEILTNIEESGMEKLLKRPEKLRG P++RNKDKYCR
Subjt:  RTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRSESGPNRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCR

Query:  FHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK----------------------------------PSGGQSENKRKVLAREARREVCIIREQKPTCS
        FHR+H HNT+  WELKRQIEDLIQD YFKKFVGK                                  PSGGQS +KRK LAR ARREVCIIREQ+PTC 
Subjt:  FHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK----------------------------------PSGGQSENKRKVLAREARREVCIIREQKPTCS

Query:  ITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTKSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQDATQVTQMVEFV
        ITF   +LE VHLPHNDALVIAPLIDHV+VRRVL+D G SANI+SL TYLALGWT+SQLKKS TPLVGFS E+V PEGCIDLPVT+G D TQVTQM EFV
Subjt:  ITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTKSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQDATQVTQMVEFV

Query:  VIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQD-------DLPRKSKIQFSPPTDELELVP
        VIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ  SRECYASALKGSSVCALE   S+D       +LPR+   +F+ PT+ELELVP
Subjt:  VIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQD-------DLPRKSKIQFSPPTDELELVP

Query:  LLSPEKQLPQDH
        LL  +     DH
Subjt:  LLSPEKQLPQDH

A0A6J1DHB3 uncharacterized protein LOC1110204791.2e-24061.92Show/hide
Query:  MVQPANSANTTERRGVNADNGTQRDLDARIVEDQVRAGQEGDLPRRSARHANQELPPAHPKPSKANRGRGGTTRKTSQRASQAADPEALSTLQRELDDMR
        MVQPANS NT +RR + A++G QR++ A +VE Q       +   RSAR     LPPAHPKPS                                     
Subjt:  MVQPANSANTTERRGVNADNGTQRDLDARIVEDQVRAGQEGDLPRRSARHANQELPPAHPKPSKANRGRGGTTRKTSQRASQAADPEALSTLQRELDDMR

Query:  HRLRTMEEMYAEATRANRTASPSRVSGAPGEKGAPSIQPGDREPIPNDEGVDYSLQDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLT
                                                                                                  KA+S Y P+T
Subjt:  HRLRTMEEMYAEATRANRTASPSRVSGAPGEKGAPSIQPGDREPIPNDEGVDYSLQDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLT

Query:  PEVVITREEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESSFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQAATDAIKCHAFQIA
        P  VITREEFD +K KFD QVEALKARCEKKE SFDDGDLGE SF+SDILEA IPPKFKTPTMK YDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIA
Subjt:  PEVVITREEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESSFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQAATDAIKCHAFQIA

Query:  LTGSARLWYRRLPT---------------------------------RQKEGETLKEYVTRFQEEQLKVVHCSDDSTMCYFHTGLADETLTVKLGEEAPA
        LTGSARLWYRRLP                                  RQKEGETL+EYVTRF EEQLKV HCSDDS MCYF TGLADETLTVKL EEAPA
Subjt:  LTGSARLWYRRLPT---------------------------------RQKEGETLKEYVTRFQEEQLKVVHCSDDSTMCYFHTGLADETLTVKLGEEAPA

Query:  TFTKVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRSESGPNRSRPYERYTPTTIPISEILTNIEESGMEKLLK
        TF +VLQK KKVIDGQELLRTKTGRPEK IDQ +  ++K KADSKSRDKG SSS+SR +YRRS S  N+SRPYE YTPTTIPI EILTNIEE+GMEKLLK
Subjt:  TFTKVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRSESGPNRSRPYERYTPTTIPISEILTNIEESGMEKLLK

Query:  RPEKLRGDPKKRNKDKYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPSGGQSE---------------------NKRKVLAREARREVCIIRE
        RPEKLRGDP+KRN DKYCRFHRDHGHNT++ WELKRQIEDLIQDGYFKKFVGKP     E                     NK+K LAREARREVCIIRE
Subjt:  RPEKLRGDPKKRNKDKYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPSGGQSE---------------------NKRKVLAREARREVCIIRE

Query:  QKPTCSITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTKSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQDATQVT
        Q+PT SI F   +LEGVHLPHNDALVIAPLID VLVRR+L+DGGASANILSL TYLALGWT+SQLKKSPTPLVGFSGE++S EGCIDLPV+I QD TQVT
Subjt:  QKPTCSITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTKSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQDATQVT

Query:  QMVEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDL
        QM EFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYST NGVGTVRGE KTSRECYAS  K SSVCALEEQT +D+L
Subjt:  QMVEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDL

A0A6J1DPC9 uncharacterized protein LOC1110222802.9e-23972.25Show/hide
Query:  VSGAPGEKGAPSIQPGDREPIPNDEGVDYSLQDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLTPEVVITREEFDLMKHKFDEQVEAL
        + GAPGEKGAPSIQPG+REPIPNDEGVDYSL+DNDLRKHLT+KKK+AS EPEDS SYSREFSNSNLKAQSKYKPL PE VI REEFDLMKH+FDEQVEAL
Subjt:  VSGAPGEKGAPSIQPGDREPIPNDEGVDYSLQDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLTPEVVITREEFDLMKHKFDEQVEAL

Query:  KARCEKKECSFDDGDLGESSFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQAATDAIKCHAFQIALTGSARLWYRRLPTR--QKEGETL
        KARCEKKE  FDD DLGES FTSDI+EAPIPPKFKTPTMK YDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW RRLP R      +  
Subjt:  KARCEKKECSFDDGDLGESSFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQAATDAIKCHAFQIALTGSARLWYRRLPTR--QKEGETL

Query:  KEYVTRFQEEQLKVVHCSDDSTMCYFHT--GLADETLTVKLGEEAPATFTKVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSS
        KE++ +F           D  T  +  T     DETLTVKLGEEAPATF +VLQ AKKVIDGQELLRTKT RPEKQIDQK+LSQ+KRK DSKS+DKGSSS
Subjt:  KEYVTRFQEEQLKVVHCSDDSTMCYFHT--GLADETLTVKLGEEAPATFTKVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSS

Query:  SASRTEYRRSESGPNRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK
        S SRTEYRRSESGP+RSRPYER                                                       CWELKRQIEDLIQD YFKKFVGK
Subjt:  SASRTEYRRSESGPNRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK

Query:  ----------------------------------PSGGQSENKRKVLAREARREVCIIREQKPTCSITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVL
                                          PSGGQ ENKRK LA EARR+V IIREQKPTCSITF DT+LEGVHLPHNDALVIAPLIDHVLVRRVL
Subjt:  ----------------------------------PSGGQSENKRKVLAREARREVCIIREQKPTCSITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVL

Query:  IDGGASANILSLPTYLALGWTKSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQDATQVTQMVEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYS
        +DGGASANILSLPTYLAL  T+SQLKKSPTPLVGFS E+VSPEGCIDLPVTIGQD+TQVTQM EFVVIDGR AYNAIF RPIIHSF+AVPS LHQVLKYS
Subjt:  IDGGASANILSLPTYLALGWTKSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQDATQVTQMVEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYS

Query:  TPNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDLPRKSK
        TPNGVGTVRGEQKTSRECYASALK SSVCALEEQTSQDDLPR++K
Subjt:  TPNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDLPRKSK

A0A6J1DZB9 uncharacterized protein LOC1110249041.4e-19868.46Show/hide
Query:  MDFQAATDAIKCHAFQIALTGSARLWYRRLPT---------------------------------RQKEGETLKEYVTRFQEEQLKVVHCSDDSTMCYFH
        MDFQAATDAIKC AFQIALTGSARLWYRRLP                                  RQKE ETL+EYVTRFQEEQLKV HCSDDS MCYF 
Subjt:  MDFQAATDAIKCHAFQIALTGSARLWYRRLPT---------------------------------RQKEGETLKEYVTRFQEEQLKVVHCSDDSTMCYFH

Query:  TGLADETLTVKLGEEAPATFTKVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRSESGPNRSRPYERYTPTTIP
        T LADETLTVKLGEEAP TF +VLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR ESGP+RSRPYERYT +TIP
Subjt:  TGLADETLTVKLGEEAPATFTKVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRSESGPNRSRPYERYTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK-----------------------------
        ISEILTNIEESGMEKLLKRPEKLRGD +KRNK+KYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK                             
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGK-----------------------------

Query:  -----PSGGQSENKRKVLAREARREVCIIREQKPTCSITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTKSQLKKSP
             P+GGQS NKRK LAREARREVCIIRE KPTCSITFGD +LEGVHLPHNDALVIA LIDH LVRRVLIDG                          
Subjt:  -----PSGGQSENKRKVLAREARREVCIIREQKPTCSITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTKSQLKKSP

Query:  TPLVGFSGETVSPEGCIDLPVTIGQDATQVTQMVEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVC
                      GCIDLPVTIGQDATQVTQM EFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPN VG VRGEQKTSRECYASALKGS+VC
Subjt:  TPLVGFSGETVSPEGCIDLPVTIGQDATQVTQMVEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVC

Query:  ALEEQT-------SQDDLPRKSKIQFSPPTDELELVPLLSPEKQLPQDHLGVQINQKKQKDGKTFMGGARRLGSLQK
        ALEEQT       S+ DLP++ K QF PPT+ELELVPLLSPE+Q   + +   +  +  K  K       R+ +L +
Subjt:  ALEEQT-------SQDDLPRKSKIQFSPPTDELELVPLLSPEKQLPQDHLGVQINQKKQKDGKTFMGGARRLGSLQK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCAACCAGCAAACTCTGCTAATACGACAGAACGGAGGGGTGTGAATGCTGATAATGGCACTCAACGAGACCTCGACGCAAGAATAGTCGAGGACCAGGTCCGAGC
AGGGCAAGAGGGAGATCTGCCGCGTAGATCTGCCCGCCATGCGAACCAAGAGTTACCCCCTGCTCACCCAAAACCCTCAAAGGCCAACCGAGGCCGAGGTGGGACCACGA
GAAAGACCTCCCAAAGGGCCAGCCAGGCAGCAGACCCTGAAGCTTTGTCTACTCTTCAGCGCGAGCTGGATGATATGCGTCATCGGTTGCGTACAATGGAAGAAATGTAC
GCCGAGGCAACGCGTGCTAACCGAACTGCGTCTCCCTCTAGGGTCTCGGGCGCACCCGGTGAAAAGGGAGCTCCATCTATCCAACCTGGCGACCGCGAGCCCATTCCCAA
CGATGAAGGAGTGGATTACAGCTTGCAGGATAACGATCTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCCGGAAGACTCTCCTTCCTACTCCCGAG
AGTTCTCGAACTCGAACCTAAAGGCTCAGTCAAAATACAAGCCTCTGACGCCAGAAGTTGTGATAACTAGAGAAGAGTTCGACCTGATGAAGCACAAGTTCGATGAGCAG
GTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGAGTGCTCGTTCGACGATGGCGACTTGGGAGAATCGTCATTCACCTCAGACATCTTGGAGGCTCCAATCCCTCCGAA
GTTCAAAACTCCCACGATGAAGTCTTACGATGGGTCTAAGGATCCAAAAGATTATGTTGAGGTCTTCGAGGGCCTCATGGACTTTCAAGCGGCAACAGATGCAATTAAAT
GCCATGCCTTCCAGATCGCGCTTACCGGCAGCGCGCGCCTGTGGTATCGGAGACTGCCGACTAGACAGAAGGAGGGTGAGACGCTGAAAGAGTATGTCACAAGGTTCCAG
GAGGAGCAGCTGAAGGTTGTGCACTGCTCTGATGATTCGACCATGTGCTACTTCCACACCGGCCTGGCCGATGAGACCCTCACCGTGAAACTCGGAGAGGAGGCTCCAGC
AACCTTCACTAAAGTCCTGCAGAAGGCGAAGAAGGTTATTGATGGGCAAGAGCTCCTCCGAACCAAAACTGGCCGACCTGAGAAGCAGATCGACCAGAAGAAGTTGAGCC
AAGAGAAGAGGAAGGCTGATTCCAAGTCTAGAGATAAGGGATCGTCCTCTTCCGCCAGCAGAACAGAGTACCGTAGGTCGGAGAGCGGCCCCAACCGGAGCCGACCTTAT
GAACGGTATACCCCAACCACAATCCCCATCTCCGAGATACTCACGAACATCGAGGAGAGCGGGATGGAAAAGCTCCTCAAGCGACCTGAGAAGCTCCGAGGAGACCCAAA
AAAGCGCAACAAAGATAAGTACTGCCGTTTTCATCGCGATCACGGCCATAATACGACAAGTTGCTGGGAATTGAAGCGCCAGATTGAAGACCTCATTCAAGACGGCTACT
TCAAAAAATTTGTGGGCAAACCGAGTGGGGGCCAGTCCGAAAACAAAAGGAAAGTGCTAGCTCGCGAGGCCAGGCGCGAGGTATGTATCATTAGGGAGCAGAAACCTACT
TGCTCCATCACCTTTGGCGATACCAACCTAGAGGGGGTCCACTTGCCCCATAACGATGCACTGGTGATCGCCCCTCTGATCGATCATGTCCTGGTCAGAAGAGTGTTGAT
AGATGGAGGCGCGTCTGCCAACATCTTGTCCCTCCCAACATATCTTGCCTTGGGGTGGACCAAGTCTCAGTTGAAGAAAAGTCCAACACCTTTGGTTGGATTCTCTGGGG
AAACGGTCTCCCCTGAAGGGTGTATCGATCTACCAGTCACGATTGGACAAGATGCTACCCAAGTAACGCAGATGGTTGAGTTCGTGGTAATCGACGGCAGGTCGGCCTAC
AACGCTATCTTCGGGAGACCCATCATCCACTCATTCCGGGCCGTCCCCTCCACATTGCATCAAGTCCTGAAGTACTCAACCCCTAATGGAGTGGGCACGGTCCGGGGTGA
ACAAAAAACTTCAAGGGAGTGTTACGCGTCCGCGCTTAAAGGATCGTCAGTATGTGCCCTGGAAGAGCAAACCAGTCAAGACGACCTTCCGAGGAAAAGCAAAATCCAGT
TCTCTCCACCAACAGACGAGCTCGAGCTTGTTCCCTTACTTAGCCCTGAAAAACAACTTCCTCAGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCAAAAAGACGGA
AAAACTTTCATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAATGCGTCTTCCAATGCATTTTGGTGGTTCCAA
CCGATGCATACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGTCAGATCGCACCTCGCTCAGTTCGGGACTTACGAGGTGAGTCAAGTTCCAAGATCTG
AAAACTCTAATGCGGATGCCTTAGCCAAATTGGCATCAGCATATGAGACCGACCTGGCTAGATCGGTCCCGGTCGAGATCTTGGACAGTCCTTCAATCTTGGAGCCAGAT
GTAATGGAGGTTGATACTCCATCACCCTCTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAACTCACCGCAAGATCCGAAGGAGCAAAAGAAGATGGCTCGGAGGGC
AGCTCGGTTCACACTCCGGGAAGGAGCGTTGGTAGAACAGTACGAGCCAACAAAGAACGAAGACGAGCTACTCCTTAACCTGGACTTATTGGAAGGGAAAAGGGAAATGG
CTCAGCTGCGCTTAGTAGAGTATCAGAACAGAATGGCCAGACATTACAATGCCCGAGTTCGACCTCGAAGCTTCCAAGTTGGACATTTGGTCTTAAGGAAAATTCAGAGT
CATGTCGGCACCCTTGACCCAAGTTGGGAGGGACCGTTCGAAGTCAAAGGCATAGTCCGACCTGGAACTTACATGCTGGCCGACCTGGAAGGAAGAGTGCTTGCGCATCC
ATGGAACGCGGAGCACTTGAAGCGCTATTACCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGCAACCAGCAAACTCTGCTAATACGACAGAACGGAGGGGTGTGAATGCTGATAATGGCACTCAACGAGACCTCGACGCAAGAATAGTCGAGGACCAGGTCCGAGC
AGGGCAAGAGGGAGATCTGCCGCGTAGATCTGCCCGCCATGCGAACCAAGAGTTACCCCCTGCTCACCCAAAACCCTCAAAGGCCAACCGAGGCCGAGGTGGGACCACGA
GAAAGACCTCCCAAAGGGCCAGCCAGGCAGCAGACCCTGAAGCTTTGTCTACTCTTCAGCGCGAGCTGGATGATATGCGTCATCGGTTGCGTACAATGGAAGAAATGTAC
GCCGAGGCAACGCGTGCTAACCGAACTGCGTCTCCCTCTAGGGTCTCGGGCGCACCCGGTGAAAAGGGAGCTCCATCTATCCAACCTGGCGACCGCGAGCCCATTCCCAA
CGATGAAGGAGTGGATTACAGCTTGCAGGATAACGATCTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCCGGAAGACTCTCCTTCCTACTCCCGAG
AGTTCTCGAACTCGAACCTAAAGGCTCAGTCAAAATACAAGCCTCTGACGCCAGAAGTTGTGATAACTAGAGAAGAGTTCGACCTGATGAAGCACAAGTTCGATGAGCAG
GTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGAGTGCTCGTTCGACGATGGCGACTTGGGAGAATCGTCATTCACCTCAGACATCTTGGAGGCTCCAATCCCTCCGAA
GTTCAAAACTCCCACGATGAAGTCTTACGATGGGTCTAAGGATCCAAAAGATTATGTTGAGGTCTTCGAGGGCCTCATGGACTTTCAAGCGGCAACAGATGCAATTAAAT
GCCATGCCTTCCAGATCGCGCTTACCGGCAGCGCGCGCCTGTGGTATCGGAGACTGCCGACTAGACAGAAGGAGGGTGAGACGCTGAAAGAGTATGTCACAAGGTTCCAG
GAGGAGCAGCTGAAGGTTGTGCACTGCTCTGATGATTCGACCATGTGCTACTTCCACACCGGCCTGGCCGATGAGACCCTCACCGTGAAACTCGGAGAGGAGGCTCCAGC
AACCTTCACTAAAGTCCTGCAGAAGGCGAAGAAGGTTATTGATGGGCAAGAGCTCCTCCGAACCAAAACTGGCCGACCTGAGAAGCAGATCGACCAGAAGAAGTTGAGCC
AAGAGAAGAGGAAGGCTGATTCCAAGTCTAGAGATAAGGGATCGTCCTCTTCCGCCAGCAGAACAGAGTACCGTAGGTCGGAGAGCGGCCCCAACCGGAGCCGACCTTAT
GAACGGTATACCCCAACCACAATCCCCATCTCCGAGATACTCACGAACATCGAGGAGAGCGGGATGGAAAAGCTCCTCAAGCGACCTGAGAAGCTCCGAGGAGACCCAAA
AAAGCGCAACAAAGATAAGTACTGCCGTTTTCATCGCGATCACGGCCATAATACGACAAGTTGCTGGGAATTGAAGCGCCAGATTGAAGACCTCATTCAAGACGGCTACT
TCAAAAAATTTGTGGGCAAACCGAGTGGGGGCCAGTCCGAAAACAAAAGGAAAGTGCTAGCTCGCGAGGCCAGGCGCGAGGTATGTATCATTAGGGAGCAGAAACCTACT
TGCTCCATCACCTTTGGCGATACCAACCTAGAGGGGGTCCACTTGCCCCATAACGATGCACTGGTGATCGCCCCTCTGATCGATCATGTCCTGGTCAGAAGAGTGTTGAT
AGATGGAGGCGCGTCTGCCAACATCTTGTCCCTCCCAACATATCTTGCCTTGGGGTGGACCAAGTCTCAGTTGAAGAAAAGTCCAACACCTTTGGTTGGATTCTCTGGGG
AAACGGTCTCCCCTGAAGGGTGTATCGATCTACCAGTCACGATTGGACAAGATGCTACCCAAGTAACGCAGATGGTTGAGTTCGTGGTAATCGACGGCAGGTCGGCCTAC
AACGCTATCTTCGGGAGACCCATCATCCACTCATTCCGGGCCGTCCCCTCCACATTGCATCAAGTCCTGAAGTACTCAACCCCTAATGGAGTGGGCACGGTCCGGGGTGA
ACAAAAAACTTCAAGGGAGTGTTACGCGTCCGCGCTTAAAGGATCGTCAGTATGTGCCCTGGAAGAGCAAACCAGTCAAGACGACCTTCCGAGGAAAAGCAAAATCCAGT
TCTCTCCACCAACAGACGAGCTCGAGCTTGTTCCCTTACTTAGCCCTGAAAAACAACTTCCTCAGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCAAAAAGACGGA
AAAACTTTCATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAATGCGTCTTCCAATGCATTTTGGTGGTTCCAA
CCGATGCATACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGTCAGATCGCACCTCGCTCAGTTCGGGACTTACGAGGTGAGTCAAGTTCCAAGATCTG
AAAACTCTAATGCGGATGCCTTAGCCAAATTGGCATCAGCATATGAGACCGACCTGGCTAGATCGGTCCCGGTCGAGATCTTGGACAGTCCTTCAATCTTGGAGCCAGAT
GTAATGGAGGTTGATACTCCATCACCCTCTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAACTCACCGCAAGATCCGAAGGAGCAAAAGAAGATGGCTCGGAGGGC
AGCTCGGTTCACACTCCGGGAAGGAGCGTTGGTAGAACAGTACGAGCCAACAAAGAACGAAGACGAGCTACTCCTTAACCTGGACTTATTGGAAGGGAAAAGGGAAATGG
CTCAGCTGCGCTTAGTAGAGTATCAGAACAGAATGGCCAGACATTACAATGCCCGAGTTCGACCTCGAAGCTTCCAAGTTGGACATTTGGTCTTAAGGAAAATTCAGAGT
CATGTCGGCACCCTTGACCCAAGTTGGGAGGGACCGTTCGAAGTCAAAGGCATAGTCCGACCTGGAACTTACATGCTGGCCGACCTGGAAGGAAGAGTGCTTGCGCATCC
ATGGAACGCGGAGCACTTGAAGCGCTATTACCCCTGA
Protein sequenceShow/hide protein sequence
MVQPANSANTTERRGVNADNGTQRDLDARIVEDQVRAGQEGDLPRRSARHANQELPPAHPKPSKANRGRGGTTRKTSQRASQAADPEALSTLQRELDDMRHRLRTMEEMY
AEATRANRTASPSRVSGAPGEKGAPSIQPGDREPIPNDEGVDYSLQDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLTPEVVITREEFDLMKHKFDEQ
VEALKARCEKKECSFDDGDLGESSFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQAATDAIKCHAFQIALTGSARLWYRRLPTRQKEGETLKEYVTRFQ
EEQLKVVHCSDDSTMCYFHTGLADETLTVKLGEEAPATFTKVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRSESGPNRSRPY
ERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCRFHRDHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPSGGQSENKRKVLAREARREVCIIREQKPT
CSITFGDTNLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTKSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQDATQVTQMVEFVVIDGRSAY
NAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDLPRKSKIQFSPPTDELELVPLLSPEKQLPQDHLGVQINQKKQKDG
KTFMGGARRLGSLQKNWFSSNFALNEMRLPMHFGGSNRCIRVEEVFHYQFEHDLVRSHLAQFGTYEVSQVPRSENSNADALAKLASAYETDLARSVPVEILDSPSILEPD
VMEVDTPSPSWMDPIVEFIKGNSPQDPKEQKKMARRAARFTLREGALVEQYEPTKNEDELLLNLDLLEGKREMAQLRLVEYQNRMARHYNARVRPRSFQVGHLVLRKIQS
HVGTLDPSWEGPFEVKGIVRPGTYMLADLEGRVLAHPWNAEHLKRYYP