; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g18450 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g18450
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:13168637..13173916
RNA-Seq ExpressionMoc07g18450
SyntenyMoc07g18450
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.6e-23974.62Show/hide
Query:  NSNLKAQSKYKPLAPEAVITREELDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDRSKDPKDYVEVFEGLMDFQ
        +SN +A+S + P  P+ VITREE D ++ K + QVEALKA+CE+K+   +DGDLGESPFT+D+LEA        P +K YD SKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYKPLAPEAVITREELDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDRSKDPKDYVEVFEGLMDFQ

Query:  ATTDAIKCRAFQIVLTGSARLWQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP
        A +DAIKCRAFQI LTGSARLW             FQE+QLKV   SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Subjt:  ATTDAIKCRAFQIVLTGSARLWQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP

Query:  EKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGY
        E+ ID+ + S +  + D+KSKDKG S SSGR E+RR+  G  RSRPYER+TPTTIPISEILTNIEESGMEKLLKRPEKLRG PE+RNKDKYCRFHR++ +
Subjt:  EKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGY

Query:  NTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRSAVINTIFRGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDAD
        NT+  WELKRQIEDLIQD YFKKFVGKPR++S EKKEERK SRTP RR DR AVINTIF GPSGGQSG+KRKELAR ARREVCIIREQ+PTC ITF  AD
Subjt:  NTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRSAVINTIFRGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDAD

Query:  LEGIHLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSA
        LE +HLPHNDALVIAPLIDHV+VR+VLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESV PEGCI+LPVT+G D TQVTQMAEFVV+DG+SA
Subjt:  LEGIHLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSA

Query:  YNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVGAVRGEQKTSRECYASALRGSLVCALEEQINRGKQQESGTDLPKEGKRQFSPPTEELELVPLL
        YNAIFGRPIIHSFR +PSTLHQVLKYSTPNGVG VRGEQ  SRECYASAL+GS VCALE  ++R    E   +LP   +R+F+ PTEELELVPLL
Subjt:  YNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVGAVRGEQKTSRECYASALRGSLVCALEEQINRGKQQESGTDLPKEGKRQFSPPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.2e-24475.5Show/hide
Query:  KAQSKYKPLAPEAVITREELDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDRSKDPKDYVEVFEGLMDFQATTD
        KA+S Y P+ P  VITREE D +K KFD QVEALKARCEKK+ SFDDGDLGE  F++DILEA IPPKFKTP MKPYD SKDPKDYVEVFE LMDFQA TD
Subjt:  KAQSKYKPLAPEAVITREELDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDRSKDPKDYVEVFEGLMDFQATTD

Query:  AIKCRAFQIVLTGSARLW----------------------------------------QKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEAL
        AIKC AFQI LTGSARLW                                        QKEGETLREYVTRF EEQLKV HCSDDSAMCYFLTGLADE L
Subjt:  AIKCRAFQIVLTGSARLW----------------------------------------QKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNI
        TVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPEK IDQ +  +++ + D KS+DKGPSSSS R +YRRS     +SRPYE YTPTTIPI EILTNI
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGYNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRSAVINTIFRGPSG
        EE+GMEKLLKRPEKLRGDPEKRN DKYCRFHRD+G+NT++ WELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKR RTPPRR+DR AVI         
Subjt:  EESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGYNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRSAVINTIFRGPSG

Query:  GQSGNKRKELAREARREVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG
            NK+KELAREARREVCIIREQ+PT SI F  ADLEG+HLPHNDALVIAPLID VLVR++LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSG
Subjt:  GQSGNKRKELAREARREVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG

Query:  ESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVGAVRGEQKTSRECYASALRGSLVCALEEQINR
        ES+S EGCI+LPV+I QD TQVTQMAEFVV+DG+SAYNAIFGRPIIHSFR VPSTLHQVLKYST NGVG VRGE KTSRECYAS  + S VCALEEQ  R
Subjt:  ESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVGAVRGEQKTSRECYASALRGSLVCALEEQINR

XP_022154846.1 uncharacterized protein LOC111022006 [Momordica charantia]3.7e-22383.85Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYT
        M YFL GLADE LT++LGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPEKQ+DQKK  Q + R DV+SKDKGPSSSS RTEYRR+E G  RSRP+ERYT
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYT

Query:  PTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGYNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDR
        PTTIPISE+LTNIEESGMEKLLKRPEKLRGDPEK NKD              +CWELKRQIE+LIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRR+DR
Subjt:  PTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGYNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDR

Query:  SAVINTIFRGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLPTYLALGWTRSQ
         AVINTIF GPSGGQ GNKR +LAR  RREVCIIREQKPTC ITFGDADLEG+HLPHNDALVIAPLIDH+LVR+VL+DGGASANI SLPTYLALGWTRSQ
Subjt:  SAVINTIFRGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLPTYLALGWTRSQ

Query:  LKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVGAVRGEQKTSRECYASALR
        LKKSPTPLVGFSGESVSPEGCI+L VTIGQDATQVTQMAEFVV+D KSAYNAIFGRPIIHSF  V STLHQVLKYST NGVG VRGEQKTSR+CYAS L+
Subjt:  LKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVGAVRGEQKTSRECYASALR

Query:  GSLVCALEEQINRGKQQESGTDLPKEGKRQFSPPTEELELVPLLSPEKQVSIGTKLGATDREELINFLRSNSDVFAWSHEDMP
        G  VC LEEQ NRGK Q S  DLPK+ KRQFSPPTEELELVPLLSPEK V+IGTKL ATDR+ELINFLRSNSDVFAWSHEDMP
Subjt:  GSLVCALEEQINRGKQQESGTDLPKEGKRQFSPPTEELELVPLLSPEKQVSIGTKLGATDREELINFLRSNSDVFAWSHEDMP

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]2.8e-24771.15Show/hide
Query:  PGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREELDLMKHKFDEQVEALK
        PGAPGEKGAPSIQPG+REPIPND GVDYSLRDNDLRKHLT+KKK+AS EPEDS SYSREFSNSNLKAQSKYKPL PEAVI REE DLMKH+FDEQVEALK
Subjt:  PGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREELDLMKHKFDEQVEALK

Query:  ARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDRSKDPKDYVEVFEGLMDFQATTDAIKCRAFQIVLTGSARLWQKE---------GETLR
        ARCEKK+  FDD DLGESPFT+DI+EAPIPPKFKTP MKPYD SKDPKDYVEVFEGLMDFQA TDAIKC AFQI LTGSARLW +           +  +
Subjt:  ARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDRSKDPKDYVEVFEGLMDFQATTDAIKCRAFQIVLTGSARLWQKE---------GETLR

Query:  EYVTRF---QEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSS
        E++ +F     ++    H +        +    DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPEKQIDQK+LSQ++R+ D KSKDKG SS
Subjt:  EYVTRF---QEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSS

Query:  SSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGYNTTSCWELKRQIEDLIQDGYFKKFVGK
        S  RTEYRRSE G  RSRPYER                                                       CWELKRQIEDLIQD YFKKFVGK
Subjt:  SSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGYNTTSCWELKRQIEDLIQDGYFKKFVGK

Query:  PRSNSVEKKEERKRSRTPPRRNDRSAVINTIFRGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLVRKVL
        PRSNSVEKKEERKRSRTPPRR DR AVINTIF GPSGGQ  NKRKELA EARR+V IIREQKPTCSITF D DLEG+HLPHNDALVIAPLIDHVLVR+VL
Subjt:  PRSNSVEKKEERKRSRTPPRRNDRSAVINTIFRGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLVRKVL

Query:  VDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYS
        VDGGASANILSLPTYLAL  TRSQLKKSPTPLVGFS ESVSPEGCI+LPVTIGQD+TQVTQMAEFVV+DG+ AYNAIF RPIIHSF+ VPS LHQVLKYS
Subjt:  VDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYS

Query:  TPNGVGAVRGEQKTSRECYASALRGSLVCALEEQINRGKQQESGTDLPKEGKRQFSPPTEELELVPLLS
        TPNGVG VRGEQKTSRECYASAL+ S VCALEE       Q S  DLP+E K           L P L+
Subjt:  TPNGVGAVRGEQKTSRECYASALRGSLVCALEEQINRGKQQESGTDLPKEGKRQFSPPTEELELVPLLS

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]1.0e-22075.82Show/hide
Query:  MDFQATTDAIKCRAFQIVLTGSARLW----------------------------------------QKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL
        MDFQA TDAIKCRAFQI LTGSARLW                                        QKE ETLREYVTRFQEEQLKV HCSDDSAMCYFL
Subjt:  MDFQATTDAIKCRAFQIVLTGSARLW----------------------------------------QKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQE+R+ D KS+DKG SSS+ RTEYRR E G  RSRPYERYT +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGYNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRSAVIN
        ISEILTNIEESGMEKLLKRPEKLRGD EKRNK+KYCRFHRD+G+NTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRR DR AVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGYNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRSAVIN

Query:  TIFRGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLPTYLALGWTRSQLKKSP
        TIF GP+GGQSGNKRKELAREARREVCIIRE KPTCSITFGDADLEG+HLPHNDALVIA LIDH LVR+VL+DG                          
Subjt:  TIFRGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVGAVRGEQKTSRECYASALRGSLVC
                      GCI+LPVTIGQDATQVTQMAEFVV+DG+SAYNAIFGRPIIHSFR VPSTLHQVLKYSTPN VG VRGEQKTSRECYASAL+GS VC
Subjt:  TPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVGAVRGEQKTSRECYASALRGSLVC

Query:  ALEEQINRGKQQESGTDLPKEGKRQFSPPTEELELVPLLSPEKQVS
        ALEEQ NRGK QES  DLPKEGKRQF PPTEELELVPLLSPE+Q +
Subjt:  ALEEQINRGKQQESGTDLPKEGKRQFSPPTEELELVPLLSPEKQVS

TrEMBL top hitse value%identityAlignment
A0A6J1D9E1 uncharacterized protein LOC1110188238.0e-24074.62Show/hide
Query:  NSNLKAQSKYKPLAPEAVITREELDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDRSKDPKDYVEVFEGLMDFQ
        +SN +A+S + P  P+ VITREE D ++ K + QVEALKA+CE+K+   +DGDLGESPFT+D+LEA        P +K YD SKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYKPLAPEAVITREELDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDRSKDPKDYVEVFEGLMDFQ

Query:  ATTDAIKCRAFQIVLTGSARLWQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP
        A +DAIKCRAFQI LTGSARLW             FQE+QLKV   SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Subjt:  ATTDAIKCRAFQIVLTGSARLWQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP

Query:  EKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGY
        E+ ID+ + S +  + D+KSKDKG S SSGR E+RR+  G  RSRPYER+TPTTIPISEILTNIEESGMEKLLKRPEKLRG PE+RNKDKYCRFHR++ +
Subjt:  EKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGY

Query:  NTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRSAVINTIFRGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDAD
        NT+  WELKRQIEDLIQD YFKKFVGKPR++S EKKEERK SRTP RR DR AVINTIF GPSGGQSG+KRKELAR ARREVCIIREQ+PTC ITF  AD
Subjt:  NTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRSAVINTIFRGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDAD

Query:  LEGIHLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSA
        LE +HLPHNDALVIAPLIDHV+VR+VLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESV PEGCI+LPVT+G D TQVTQMAEFVV+DG+SA
Subjt:  LEGIHLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSA

Query:  YNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVGAVRGEQKTSRECYASALRGSLVCALEEQINRGKQQESGTDLPKEGKRQFSPPTEELELVPLL
        YNAIFGRPIIHSFR +PSTLHQVLKYSTPNGVG VRGEQ  SRECYASAL+GS VCALE  ++R    E   +LP   +R+F+ PTEELELVPLL
Subjt:  YNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVGAVRGEQKTSRECYASALRGSLVCALEEQINRGKQQESGTDLPKEGKRQFSPPTEELELVPLL

A0A6J1DHB3 uncharacterized protein LOC1110204791.1e-24475.5Show/hide
Query:  KAQSKYKPLAPEAVITREELDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDRSKDPKDYVEVFEGLMDFQATTD
        KA+S Y P+ P  VITREE D +K KFD QVEALKARCEKK+ SFDDGDLGE  F++DILEA IPPKFKTP MKPYD SKDPKDYVEVFE LMDFQA TD
Subjt:  KAQSKYKPLAPEAVITREELDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDRSKDPKDYVEVFEGLMDFQATTD

Query:  AIKCRAFQIVLTGSARLW----------------------------------------QKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEAL
        AIKC AFQI LTGSARLW                                        QKEGETLREYVTRF EEQLKV HCSDDSAMCYFLTGLADE L
Subjt:  AIKCRAFQIVLTGSARLW----------------------------------------QKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNI
        TVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPEK IDQ +  +++ + D KS+DKGPSSSS R +YRRS     +SRPYE YTPTTIPI EILTNI
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGYNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRSAVINTIFRGPSG
        EE+GMEKLLKRPEKLRGDPEKRN DKYCRFHRD+G+NT++ WELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKR RTPPRR+DR AVI         
Subjt:  EESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGYNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRSAVINTIFRGPSG

Query:  GQSGNKRKELAREARREVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG
            NK+KELAREARREVCIIREQ+PT SI F  ADLEG+HLPHNDALVIAPLID VLVR++LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSG
Subjt:  GQSGNKRKELAREARREVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG

Query:  ESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVGAVRGEQKTSRECYASALRGSLVCALEEQINR
        ES+S EGCI+LPV+I QD TQVTQMAEFVV+DG+SAYNAIFGRPIIHSFR VPSTLHQVLKYST NGVG VRGE KTSRECYAS  + S VCALEEQ  R
Subjt:  ESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVGAVRGEQKTSRECYASALRGSLVCALEEQINR

A0A6J1DPC9 uncharacterized protein LOC1110222801.4e-24771.15Show/hide
Query:  PGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREELDLMKHKFDEQVEALK
        PGAPGEKGAPSIQPG+REPIPND GVDYSLRDNDLRKHLT+KKK+AS EPEDS SYSREFSNSNLKAQSKYKPL PEAVI REE DLMKH+FDEQVEALK
Subjt:  PGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREELDLMKHKFDEQVEALK

Query:  ARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDRSKDPKDYVEVFEGLMDFQATTDAIKCRAFQIVLTGSARLWQKE---------GETLR
        ARCEKK+  FDD DLGESPFT+DI+EAPIPPKFKTP MKPYD SKDPKDYVEVFEGLMDFQA TDAIKC AFQI LTGSARLW +           +  +
Subjt:  ARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDRSKDPKDYVEVFEGLMDFQATTDAIKCRAFQIVLTGSARLWQKE---------GETLR

Query:  EYVTRF---QEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSS
        E++ +F     ++    H +        +    DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPEKQIDQK+LSQ++R+ D KSKDKG SS
Subjt:  EYVTRF---QEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSS

Query:  SSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGYNTTSCWELKRQIEDLIQDGYFKKFVGK
        S  RTEYRRSE G  RSRPYER                                                       CWELKRQIEDLIQD YFKKFVGK
Subjt:  SSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGYNTTSCWELKRQIEDLIQDGYFKKFVGK

Query:  PRSNSVEKKEERKRSRTPPRRNDRSAVINTIFRGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLVRKVL
        PRSNSVEKKEERKRSRTPPRR DR AVINTIF GPSGGQ  NKRKELA EARR+V IIREQKPTCSITF D DLEG+HLPHNDALVIAPLIDHVLVR+VL
Subjt:  PRSNSVEKKEERKRSRTPPRRNDRSAVINTIFRGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLVRKVL

Query:  VDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYS
        VDGGASANILSLPTYLAL  TRSQLKKSPTPLVGFS ESVSPEGCI+LPVTIGQD+TQVTQMAEFVV+DG+ AYNAIF RPIIHSF+ VPS LHQVLKYS
Subjt:  VDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYS

Query:  TPNGVGAVRGEQKTSRECYASALRGSLVCALEEQINRGKQQESGTDLPKEGKRQFSPPTEELELVPLLS
        TPNGVG VRGEQKTSRECYASAL+ S VCALEE       Q S  DLP+E K           L P L+
Subjt:  TPNGVGAVRGEQKTSRECYASALRGSLVCALEEQINRGKQQESGTDLPKEGKRQFSPPTEELELVPLLS

A0A6J1DPX9 uncharacterized protein LOC1110220061.8e-22383.85Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYT
        M YFL GLADE LT++LGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPEKQ+DQKK  Q + R DV+SKDKGPSSSS RTEYRR+E G  RSRP+ERYT
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYT

Query:  PTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGYNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDR
        PTTIPISE+LTNIEESGMEKLLKRPEKLRGDPEK NKD              +CWELKRQIE+LIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRR+DR
Subjt:  PTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGYNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDR

Query:  SAVINTIFRGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLPTYLALGWTRSQ
         AVINTIF GPSGGQ GNKR +LAR  RREVCIIREQKPTC ITFGDADLEG+HLPHNDALVIAPLIDH+LVR+VL+DGGASANI SLPTYLALGWTRSQ
Subjt:  SAVINTIFRGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLPTYLALGWTRSQ

Query:  LKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVGAVRGEQKTSRECYASALR
        LKKSPTPLVGFSGESVSPEGCI+L VTIGQDATQVTQMAEFVV+D KSAYNAIFGRPIIHSF  V STLHQVLKYST NGVG VRGEQKTSR+CYAS L+
Subjt:  LKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVGAVRGEQKTSRECYASALR

Query:  GSLVCALEEQINRGKQQESGTDLPKEGKRQFSPPTEELELVPLLSPEKQVSIGTKLGATDREELINFLRSNSDVFAWSHEDMP
        G  VC LEEQ NRGK Q S  DLPK+ KRQFSPPTEELELVPLLSPEK V+IGTKL ATDR+ELINFLRSNSDVFAWSHEDMP
Subjt:  GSLVCALEEQINRGKQQESGTDLPKEGKRQFSPPTEELELVPLLSPEKQVSIGTKLGATDREELINFLRSNSDVFAWSHEDMP

A0A6J1DZB9 uncharacterized protein LOC1110249044.9e-22175.82Show/hide
Query:  MDFQATTDAIKCRAFQIVLTGSARLW----------------------------------------QKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL
        MDFQA TDAIKCRAFQI LTGSARLW                                        QKE ETLREYVTRFQEEQLKV HCSDDSAMCYFL
Subjt:  MDFQATTDAIKCRAFQIVLTGSARLW----------------------------------------QKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQE+R+ D KS+DKG SSS+ RTEYRR E G  RSRPYERYT +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGYNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRSAVIN
        ISEILTNIEESGMEKLLKRPEKLRGD EKRNK+KYCRFHRD+G+NTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRR DR AVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGYNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRSAVIN

Query:  TIFRGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLPTYLALGWTRSQLKKSP
        TIF GP+GGQSGNKRKELAREARREVCIIRE KPTCSITFGDADLEG+HLPHNDALVIA LIDH LVR+VL+DG                          
Subjt:  TIFRGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLVRKVLVDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVGAVRGEQKTSRECYASALRGSLVC
                      GCI+LPVTIGQDATQVTQMAEFVV+DG+SAYNAIFGRPIIHSFR VPSTLHQVLKYSTPN VG VRGEQKTSRECYASAL+GS VC
Subjt:  TPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVGAVRGEQKTSRECYASALRGSLVC

Query:  ALEEQINRGKQQESGTDLPKEGKRQFSPPTEELELVPLLSPEKQVS
        ALEEQ NRGK QES  DLPKEGKRQF PPTEELELVPLLSPE+Q +
Subjt:  ALEEQINRGKQQESGTDLPKEGKRQFSPPTEELELVPLLSPEKQVS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCCATCAGTTGCGCACAATGGAAGAGATGTACGCCGAGGCAACGCGCGCTAACCGAACTGCGTCTCCCTCTATGGCTCCGGGAGCACCCGGTGAAAAGGGA
GCTCCATCTATCCAACCTGGCGATCGCGAGCCCATTCCAAACGATGGGGGAGTGGATTACAGCTTGCGAGACAACGATTTGAGAAAGCATCTCACTGAAAAGAAG
AAGAGAGCATCTCGGGAGCCGGAAGACTCTCCTTCCTACTCCCGAGAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAATACAAGCCCCTGGCACCAGAAGCT
GTGATCACGAGGGAAGAGTTAGACCTGATGAAGCACAAGTTCGATGAGCAGGTCGAGGCGCTTAAAGCCAGGTGCGAGAAGAAAGACTGTTCGTTCGACGATGGC
GACTTGGGAGAATCGCCATTCACCGCGGACATCTTGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCAGCCATGAAGCCCTATGACCGGTCTAAGGACCCTAAA
GATTATGTTGAGGTTTTCGAGGGCCTCATGGACTTTCAAGCGACGACGGATGCGATCAAATGCCGCGCCTTCCAGATCGTGCTTACCGGTAGCGCGCGCCTGTGG
CAGAAGGAAGGAGAGACGCTGAGAGAATATGTCACACGGTTCCAGGAAGAGCAGCTTAAGGTCGTGCACTGCTCTGACGATTCGGCTATGTGCTACTTCCTCACC
GGCCTGGCCGATGAGGCCTTAACTGTGAAACTTGGAGAGGAAGCTCCAGCCACTTTCGCCGAAGTGTTACAGAAGGCGAAGAAAGTCATTGATGGGCAGGAGCTC
CTCCGAACCAAGACTGGTCGACCTGAAAAGCAGATCGATCAGAAGAAACTTAGCCAGGAGAGGAGGAGGATTGATGTCAAGTCCAAAGATAAGGGACCATCCTCC
TCCAGTGGCAGAACAGAGTACCGAAGGTCGGAGGGCGGCTCCATCCGGAGCCGACCTTATGAGCGGTATACTCCAACCACCATCCCCATCTCCGAGATACTCACG
AACATCGAGGAAAGTGGGATGGAAAAGCTCCTCAAGCGACCTGAGAAGCTCCGAGGAGACCCAGAGAAGCGCAACAAAGATAAGTACTGCCGTTTTCACCGCGAT
AACGGCTATAATACGACAAGCTGCTGGGAATTGAAGCGCCAGATTGAAGATCTCATTCAAGATGGCTACTTCAAAAAATTTGTGGGCAAACCGAGGTCTAACTCG
GTCGAAAAGAAAGAAGAGAGGAAGCGTTCAAGAACGCCGCCTCGCCGGAATGACCGATCTGCAGTCATCAACACTATTTTCAGAGGCCCGAGCGGGGGCCAGTCC
GGAAACAAGAGGAAGGAGCTAGCTCGCGAAGCCAGGCGCGAGGTATGCATCATCAGGGAGCAGAAACCTACTTGCTCCATCACTTTCGGCGATGCCGATTTGGAG
GGGATTCATTTGCCCCACAATGACGCGCTCGTGATCGCACCTCTTATTGATCACGTCCTGGTCCGAAAGGTATTGGTTGATGGAGGCGCATCTGCAAACATCTTG
TCCCTCCCAACATATTTAGCATTGGGATGGACCAGGTCACAATTGAAGAAGAGTCCAACACCCCTGGTTGGATTCTCTGGAGAATCGGTCTCCCCAGAGGGGTGC
ATCAACCTGCCGGTAACTATAGGGCAAGATGCCACCCAAGTAACGCAGATGGCTGAGTTCGTGGTAGTCGACGGCAAATCGGCCTACAACGCCATTTTTGGGAGA
CCCATTATCCACTCATTTCGAGTTGTTCCTTCCACCTTGCATCAAGTCTTGAAGTACTCAACCCCTAATGGAGTGGGCGCGGTCCGAGGTGAACAAAAAACTTCA
AGAGAGTGCTACGCGTCCGCGCTCAGGGGATCGTTGGTATGTGCCCTGGAGGAACAAATCAATCGTGGCAAGCAGCAGGAGTCAGGGACCGACCTGCCAAAAGAA
GGTAAAAGGCAGTTCTCCCCGCCAACAGAAGAGCTCGAGCTTGTTCCTTTACTTAGCCCCGAAAAACAAGTAAGTATAGGAACCAAGCTGGGGGCCACTGACAGG
GAAGAACTGATCAACTTCCTCAGGTCTAACTCAGACGTCTTCGCATGGTCTCACGAGGACATGCCTGACAGGGTAGAACAGTATGAGCCAACGAAGAACGAGGAA
GAGCTACTCCTTAACCTGGACTTGTTGGAAGGGAAAAGGAAAATGGCTCAGCTGCGCTTAGCAGAGTATCAGAACAGAATGGCCAGACATTATAATGCCCGAGTT
CGACCTCGAAGCTTCCAAGTTGGACATTTGGTCTTGAGAAAAATTCAGAGTCATGTTGGCACCATTGACCCAAGTTGGGAGGGACCGTTCGAAGTCAAAGGCATA
GTCCGACCTGGAACTTATATGCTGGCCGACTTGGAAGGAAGAGTGCTTGCGCATCCATGGAACACGGAGCACTTGAAGCGCTATTACCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGCCATCAGTTGCGCACAATGGAAGAGATGTACGCCGAGGCAACGCGCGCTAACCGAACTGCGTCTCCCTCTATGGCTCCGGGAGCACCCGGTGAAAAGGGA
GCTCCATCTATCCAACCTGGCGATCGCGAGCCCATTCCAAACGATGGGGGAGTGGATTACAGCTTGCGAGACAACGATTTGAGAAAGCATCTCACTGAAAAGAAG
AAGAGAGCATCTCGGGAGCCGGAAGACTCTCCTTCCTACTCCCGAGAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAATACAAGCCCCTGGCACCAGAAGCT
GTGATCACGAGGGAAGAGTTAGACCTGATGAAGCACAAGTTCGATGAGCAGGTCGAGGCGCTTAAAGCCAGGTGCGAGAAGAAAGACTGTTCGTTCGACGATGGC
GACTTGGGAGAATCGCCATTCACCGCGGACATCTTGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCAGCCATGAAGCCCTATGACCGGTCTAAGGACCCTAAA
GATTATGTTGAGGTTTTCGAGGGCCTCATGGACTTTCAAGCGACGACGGATGCGATCAAATGCCGCGCCTTCCAGATCGTGCTTACCGGTAGCGCGCGCCTGTGG
CAGAAGGAAGGAGAGACGCTGAGAGAATATGTCACACGGTTCCAGGAAGAGCAGCTTAAGGTCGTGCACTGCTCTGACGATTCGGCTATGTGCTACTTCCTCACC
GGCCTGGCCGATGAGGCCTTAACTGTGAAACTTGGAGAGGAAGCTCCAGCCACTTTCGCCGAAGTGTTACAGAAGGCGAAGAAAGTCATTGATGGGCAGGAGCTC
CTCCGAACCAAGACTGGTCGACCTGAAAAGCAGATCGATCAGAAGAAACTTAGCCAGGAGAGGAGGAGGATTGATGTCAAGTCCAAAGATAAGGGACCATCCTCC
TCCAGTGGCAGAACAGAGTACCGAAGGTCGGAGGGCGGCTCCATCCGGAGCCGACCTTATGAGCGGTATACTCCAACCACCATCCCCATCTCCGAGATACTCACG
AACATCGAGGAAAGTGGGATGGAAAAGCTCCTCAAGCGACCTGAGAAGCTCCGAGGAGACCCAGAGAAGCGCAACAAAGATAAGTACTGCCGTTTTCACCGCGAT
AACGGCTATAATACGACAAGCTGCTGGGAATTGAAGCGCCAGATTGAAGATCTCATTCAAGATGGCTACTTCAAAAAATTTGTGGGCAAACCGAGGTCTAACTCG
GTCGAAAAGAAAGAAGAGAGGAAGCGTTCAAGAACGCCGCCTCGCCGGAATGACCGATCTGCAGTCATCAACACTATTTTCAGAGGCCCGAGCGGGGGCCAGTCC
GGAAACAAGAGGAAGGAGCTAGCTCGCGAAGCCAGGCGCGAGGTATGCATCATCAGGGAGCAGAAACCTACTTGCTCCATCACTTTCGGCGATGCCGATTTGGAG
GGGATTCATTTGCCCCACAATGACGCGCTCGTGATCGCACCTCTTATTGATCACGTCCTGGTCCGAAAGGTATTGGTTGATGGAGGCGCATCTGCAAACATCTTG
TCCCTCCCAACATATTTAGCATTGGGATGGACCAGGTCACAATTGAAGAAGAGTCCAACACCCCTGGTTGGATTCTCTGGAGAATCGGTCTCCCCAGAGGGGTGC
ATCAACCTGCCGGTAACTATAGGGCAAGATGCCACCCAAGTAACGCAGATGGCTGAGTTCGTGGTAGTCGACGGCAAATCGGCCTACAACGCCATTTTTGGGAGA
CCCATTATCCACTCATTTCGAGTTGTTCCTTCCACCTTGCATCAAGTCTTGAAGTACTCAACCCCTAATGGAGTGGGCGCGGTCCGAGGTGAACAAAAAACTTCA
AGAGAGTGCTACGCGTCCGCGCTCAGGGGATCGTTGGTATGTGCCCTGGAGGAACAAATCAATCGTGGCAAGCAGCAGGAGTCAGGGACCGACCTGCCAAAAGAA
GGTAAAAGGCAGTTCTCCCCGCCAACAGAAGAGCTCGAGCTTGTTCCTTTACTTAGCCCCGAAAAACAAGTAAGTATAGGAACCAAGCTGGGGGCCACTGACAGG
GAAGAACTGATCAACTTCCTCAGGTCTAACTCAGACGTCTTCGCATGGTCTCACGAGGACATGCCTGACAGGGTAGAACAGTATGAGCCAACGAAGAACGAGGAA
GAGCTACTCCTTAACCTGGACTTGTTGGAAGGGAAAAGGAAAATGGCTCAGCTGCGCTTAGCAGAGTATCAGAACAGAATGGCCAGACATTATAATGCCCGAGTT
CGACCTCGAAGCTTCCAAGTTGGACATTTGGTCTTGAGAAAAATTCAGAGTCATGTTGGCACCATTGACCCAAGTTGGGAGGGACCGTTCGAAGTCAAAGGCATA
GTCCGACCTGGAACTTATATGCTGGCCGACTTGGAAGGAAGAGTGCTTGCGCATCCATGGAACACGGAGCACTTGAAGCGCTATTACCCCTGA
Protein sequenceShow/hide protein sequence
MRHQLRTMEEMYAEATRANRTASPSMAPGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLAPEA
VITREELDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDRSKDPKDYVEVFEGLMDFQATTDAIKCRAFQIVLTGSARLW
QKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSS
SSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGYNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNS
VEKKEERKRSRTPPRRNDRSAVINTIFRGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLEGIHLPHNDALVIAPLIDHVLVRKVLVDGGASANIL
SLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRVVPSTLHQVLKYSTPNGVGAVRGEQKTS
RECYASALRGSLVCALEEQINRGKQQESGTDLPKEGKRQFSPPTEELELVPLLSPEKQVSIGTKLGATDREELINFLRSNSDVFAWSHEDMPDRVEQYEPTKNEE
ELLLNLDLLEGKRKMAQLRLAEYQNRMARHYNARVRPRSFQVGHLVLRKIQSHVGTIDPSWEGPFEVKGIVRPGTYMLADLEGRVLAHPWNTEHLKRYYP