; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g20250 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g20250
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionEnzymatic polyprotein
Genome locationchr6:15844467..15850982
RNA-Seq ExpressionMoc06g20250
SyntenyMoc06g20250
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001995 - Peptidase A2A, retrovirus, catalytic
IPR018061 - Retropepsins


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]6.5e-16061.7Show/hide
Query:  LKAQSKYKPLTPEAVITREEFDLMKHRFDEQ-------------------------------ALIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P TP  VITREEFD ++ + D Q                               A IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLTPEAVITREEFDLMKHRFDEQ-------------------------------ALIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFKIALTGSVRL-------------------------------KTTTHLATIRQKEGETLREYATRFQEERLKVAHCSDDSAMCYFLTGLTDET
        DAIKCRAF+IALTGS RL                               KT THLATIRQKEGETLREY TRFQEE+LKVAHCSDDSAMCYFLTGL DE 
Subjt:  DAIKCRAFKIALTGSVRL-------------------------------KTTTHLATIRQKEGETLREYATRFQEERLKVAHCSDDSAMCYFLTGLTDET

Query:  LT-----------------------------------------GRTDKDKGKTDAKSKDKGPSSSSGRTEYRRSENGPSRSRPYERYTPTTIPISEILTN
        LT                                         GR+ KD    D KSKDKG S SSGR EYRR+ENGP+RSRPYER+TPTTIPISEILTN
Subjt:  LT-----------------------------------------GRTDKDKGKTDAKSKDKGPSSSSGRTEYRRSENGPSRSRPYERYTPTTIPISEILTN

Query:  IKENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSGHNTSNCWELKRQIEDLIQDGYCKKFVGKPRSNSVEKKEERTRSRTPPRRDDRPAVINTIFGGPS
        I+E+GMEKLLKRPEKLRG PE+ +KDKYCRFHR+ GHNTS+ WELKRQIE+LIQDGY KKFVGKPR++S EKKEER RSRTPPRR DRPAVINTIFGGPS
Subjt:  IKENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSGHNTSNCWELKRQIEDLIQDGYCKKFVGKPRSNSVEKKEERTRSRTPPRRDDRPAVINTIFGGPS

Query:  GGQSRNKRKELAREARREVCIIREQKPTCSITFGDADLEGVHLPHNDALVIAPLIDHVL-----------------------------LKKSPSPLVGFS
        GGQS  KRKELAR ARREVCIIREQ+PTC ITF  ADLE VHLPHNDALVIAPLIDHV+                             LKKSP+PLVGFS
Subjt:  GGQSRNKRKELAREARREVCIIREQKPTCSITFGDADLEGVHLPHNDALVIAPLIDHVL-----------------------------LKKSPSPLVGFS

Query:  GESVSPEGCIDLPVTIGQDDTQVTQMAEFV
        GESV PEG IDLPVT+GQD TQVTQMAEFV
Subjt:  GESVSPEGCIDLPVTIGQDDTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]2.5e-18058.8Show/hide
Query:  NSDLKAQSKYKPLTPEAVITREEFDLMKHRFD-----------------------EQALIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC
        +S+ +A+S + P TP+ VITREEFD ++ + +                       E        + PT+K YDGSKDPKDYVEVFEGLMDFQAA+DAIKC
Subjt:  NSDLKAQSKYKPLTPEAVITREEFDLMKHRFD-----------------------EQALIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC

Query:  RAFKIALTGSVRLKTTTHLATIRQKEGETLREYATRFQEERLKVAHCSDDSAMCYFLTGLTDETLT----------------------------------
        RAF+IALTGS RL                       FQE++LKVA  SDDSAMCYFLTGL DE LT                                  
Subjt:  RAFKIALTGSVRLKTTTHLATIRQKEGETLREYATRFQEERLKVAHCSDDSAMCYFLTGLTDETLT----------------------------------

Query:  -------GRTDKDKGKTDAKSKDKGPSSSSGRTEYRRSENGPSRSRPYERYTPTTIPISEILTNIKENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSG
               GR+ KD+ K D KSKDKG S SSGR E+RR+ NGP+RSRPYER+TPTTIPISEILTNI+E+GMEKLLKRPEKLRG PE+ NKDKYCRFHR+  
Subjt:  -------GRTDKDKGKTDAKSKDKGPSSSSGRTEYRRSENGPSRSRPYERYTPTTIPISEILTNIKENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSG

Query:  HNTSNCWELKRQIEDLIQDGYCKKFVGKPRSNSVEKKEERTRSRTPPRRDDRPAVINTIFGGPSGGQSRNKRKELAREARREVCIIREQKPTCSITFGDA
        HNTS+ WELKRQIEDLIQD Y KKFVGKPR++S EKKEER  SRTP RR DRPAVINTIFGGPSGGQS +KRKELAR ARREVCIIREQ+PTC ITF  A
Subjt:  HNTSNCWELKRQIEDLIQDGYCKKFVGKPRSNSVEKKEERTRSRTPPRRDDRPAVINTIFGGPSGGQSRNKRKELAREARREVCIIREQKPTCSITFGDA

Query:  DLEGVHLPHNDALVIAPLIDHVL-----------------------------LKKSPSPLVGFSGESVSPEGCIDLPVTIGQDDTQVTQMAEFVVIDGRS
        DLE VHLPHNDALVIAPLIDHV+                             LKKS +PLVGFS ESV PEGCIDLPVT+G D TQVTQMAEFVVIDGRS
Subjt:  DLEGVHLPHNDALVIAPLIDHVL-----------------------------LKKSPSPLVGFSGESVSPEGCIDLPVTIGQDDTQVTQMAEFVVIDGRS

Query:  AYNAIFGRPIIHSFRAVPSILHQVLKYSTPNGAGTVRGEQRTSRECYASALKGSLVCALEGQANRDELPKLEADLSKSDKREFSAPTKELELVPLLSPER
        AYNAIFGRPIIHSFRA+PS LHQVLKYSTPNG G VRGEQ  SRECYASALKGS VCALE   +RD   + +A+L    +REF+APT+ELELVPLL  + 
Subjt:  AYNAIFGRPIIHSFRAVPSILHQVLKYSTPNGAGTVRGEQRTSRECYASALKGSLVCALEGQANRDELPKLEADLSKSDKREFSAPTKELELVPLLSPER

Query:  QTDLARSVPVEILDNPSILDPDVMEIDTPSP
          ++     ++   + + +D D+     P P
Subjt:  QTDLARSVPVEILDNPSILDPDVMEIDTPSP

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]3.8e-19264.51Show/hide
Query:  KAQSKYKPLTPEAVITREEFDLMKHRFDEQ-------------------------------ALIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATD
        KA+S Y P+TP  VITREEFD +K +FD Q                               ALIPPKFKTPTMKPYDGSKDPKDYVEVFE LMDFQAATD
Subjt:  KAQSKYKPLTPEAVITREEFDLMKHRFDEQ-------------------------------ALIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATD

Query:  AIKCRAFKIALTGSVRL-------------------------------KTTTHLATIRQKEGETLREYATRFQEERLKVAHCSDDSAMCYFLTGLTDETL
        AIKC AF+IALTGS RL                               KT THLATIRQKEGETLREY TRF EE+LKVAHCSDDSAMCYFLTGL DETL
Subjt:  AIKCRAFKIALTGSVRL-------------------------------KTTTHLATIRQKEGETLREYATRFQEERLKVAHCSDDSAMCYFLTGLTDETL

Query:  T-----------------------------------------GRTDKDKGKTDAKSKDKGPSSSSGRTEYRRSENGPSRSRPYERYTPTTIPISEILTNI
        T                                         GR  KDKGK D+KS+DKGPSSSS R +YRRS +  ++SRPYE YTPTTIPI EILTNI
Subjt:  T-----------------------------------------GRTDKDKGKTDAKSKDKGPSSSSGRTEYRRSENGPSRSRPYERYTPTTIPISEILTNI

Query:  KENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSGHNTSNCWELKRQIEDLIQDGYCKKFVGKPRSNSVEKKEERTRSRTPPRRDDRPAVINTIFGGPSG
        +E GMEKLLKRPEKLRGDPEK N DKYCRFHRD GHNTSN WELKRQIEDLIQDGY KKFVGKPRSNSVEKKEER R RTPPRRDDRPAVI         
Subjt:  KENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSGHNTSNCWELKRQIEDLIQDGYCKKFVGKPRSNSVEKKEERTRSRTPPRRDDRPAVINTIFGGPSG

Query:  GQSRNKRKELAREARREVCIIREQKPTCSITFGDADLEGVHLPHNDALVIAPLIDHVL-----------------------------LKKSPSPLVGFSG
            NK+KELAREARREVCIIREQ+PT SI F  ADLEGVHLPHNDALVIAPLID VL                             LKKSP+PLVGFSG
Subjt:  GQSRNKRKELAREARREVCIIREQKPTCSITFGDADLEGVHLPHNDALVIAPLIDHVL-----------------------------LKKSPSPLVGFSG

Query:  ESVSPEGCIDLPVTIGQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSILHQVLKYSTPNGAGTVRGEQRTSRECYASALKGSLVCALEGQANR
        ES+S EGCIDLPV+I QDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPS LHQVLKYST NG GTVRGE +TSRECYAS  K S VCALE Q  R
Subjt:  ESVSPEGCIDLPVTIGQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSILHQVLKYSTPNGAGTVRGEQRTSRECYASALKGSLVCALEGQANR

Query:  DEL
        DEL
Subjt:  DEL

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]6.2e-18760.43Show/hide
Query:  VPGVLGEKEDQVPSLHPGDRETIPNNEGVDYSLRDNDLRKHLADKKKRASREPEDSPSYSREFSNSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQ--
        +PG  GEK    PS+ PG+RE IPN+EGVDYSLRDNDLRKHL DKKK+AS EPEDS SYSREFSNS+LKAQSKYKPL PEAVI REEFDLMKHRFDEQ  
Subjt:  VPGVLGEKEDQVPSLHPGDRETIPNNEGVDYSLRDNDLRKHLADKKKRASREPEDSPSYSREFSNSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQ--

Query:  -----------------------------ALIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFKIALTGSVRL---------------
                                     A IPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AF+IALTGS RL               
Subjt:  -----------------------------ALIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFKIALTGSVRL---------------

Query:  ----------------KTTTHLATIRQKEGETL-----REYATRFQEERLKVAHCSDDSAMCYFLTGLTDETL-TGRTDKDKGKTDAKSKDKGPSSSSGR
                        KT THLATIRQKE ETL      E    F E         D   +    T   ++ +   R  + K K D+KSKDKG SSS  R
Subjt:  ----------------KTTTHLATIRQKEGETL-----REYATRFQEERLKVAHCSDDSAMCYFLTGLTDETL-TGRTDKDKGKTDAKSKDKGPSSSSGR

Query:  TEYRRSENGPSRSRPYERYTPTTIPISEILTNIKENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSGHNTSNCWELKRQIEDLIQDGYCKKFVGKPRSN
        TEYRRSE+GPSRSRPYER                                                       CWELKRQIEDLIQD Y KKFVGKPRSN
Subjt:  TEYRRSENGPSRSRPYERYTPTTIPISEILTNIKENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSGHNTSNCWELKRQIEDLIQDGYCKKFVGKPRSN

Query:  SVEKKEERTRSRTPPRRDDRPAVINTIFGGPSGGQSRNKRKELAREARREVCIIREQKPTCSITFGDADLEGVHLPHNDALVIAPLIDHVL---------
        SVEKKEER RSRTPPRR+DRPAVINTIFGGPSGGQ  NKRKELA EARR+V IIREQKPTCSITF D DLEGVHLPHNDALVIAPLIDHVL         
Subjt:  SVEKKEERTRSRTPPRRDDRPAVINTIFGGPSGGQSRNKRKELAREARREVCIIREQKPTCSITFGDADLEGVHLPHNDALVIAPLIDHVL---------

Query:  --------------------LKKSPSPLVGFSGESVSPEGCIDLPVTIGQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSILHQVLKYSTPNG
                            LKKSP+PLVGFS ESVSPEGCIDLPVTIGQD TQVTQMAEFVVIDGR AYNAIF RPIIHSF+AVPSILHQVLKYSTPNG
Subjt:  --------------------LKKSPSPLVGFSGESVSPEGCIDLPVTIGQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSILHQVLKYSTPNG

Query:  AGTVRGEQRTSRECYASALKGSLVCALEGQANRDELPKLEADLSKSDKREFS
         GTVRGEQ+TSRECYASALK S VCALE Q ++D+LP+     S   +R  S
Subjt:  AGTVRGEQRTSRECYASALKGSLVCALEGQANRDELPKLEADLSKSDKREFS

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]2.1e-17972.75Show/hide
Query:  KTTTHLATIRQKEGETLREYATRFQEERLKVAHCSDDSAMCYFLTGLTDETL--------------------------------TGRTDK---------D
        KT THLATIRQKE ETLREY TRFQEE+LKVAHCSDDSAMCYFLT L DETL                                TGR +K         +
Subjt:  KTTTHLATIRQKEGETLREYATRFQEERLKVAHCSDDSAMCYFLTGLTDETL--------------------------------TGRTDK---------D

Query:  KGKTDAKSKDKGPSSSSGRTEYRRSENGPSRSRPYERYTPTTIPISEILTNIKENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSGHNTSNCWELKRQI
        K K D+KS+DKG SSS+ RTEYRR E+GPSRSRPYERYT +TIPISEILTNI+E+GMEKLLKRPEKLRGD EK NK+KYCRFHRD GHNT++CWELKRQI
Subjt:  KGKTDAKSKDKGPSSSSGRTEYRRSENGPSRSRPYERYTPTTIPISEILTNIKENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSGHNTSNCWELKRQI

Query:  EDLIQDGYCKKFVGKPRSNSVEKKEERTRSRTPPRRDDRPAVINTIFGGPSGGQSRNKRKELAREARREVCIIREQKPTCSITFGDADLEGVHLPHNDAL
        EDLIQDGY KKFVGKPRSNSVEKKEER RSRTPPRR+DRPAVINTIFGGP+GGQS NKRKELAREARREVCIIRE KPTCSITFGDADLEGVHLPHNDAL
Subjt:  EDLIQDGYCKKFVGKPRSNSVEKKEERTRSRTPPRRDDRPAVINTIFGGPSGGQSRNKRKELAREARREVCIIREQKPTCSITFGDADLEGVHLPHNDAL

Query:  VIAPLIDHVLLKKSPSPLVGFSGESVSPEGCIDLPVTIGQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSILHQVLKYSTPNGAGTVRGEQRT
        VIA LIDH L+++     V   G      GCIDLPVTIGQD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPS LHQVLKYSTPN  G VRGEQ+T
Subjt:  VIAPLIDHVLLKKSPSPLVGFSGESVSPEGCIDLPVTIGQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSILHQVLKYSTPNGAGTVRGEQRT

Query:  SRECYASALKGSLVCALEGQANRDELPKLEADLSKSDKREFSAPTKELELVPLLSPERQTDLARSVPVEILDNPSIL
        SRECYASALKGS VCALE Q NR +L + EADL K  KR+F  PT+ELELVPLLSPERQ +  +   V  ++ P  L
Subjt:  SRECYASALKGSLVCALEGQANRDELPKLEADLSKSDKREFSAPTKELELVPLLSPERQTDLARSVPVEILDNPSIL

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.1e-16061.7Show/hide
Query:  LKAQSKYKPLTPEAVITREEFDLMKHRFDEQ-------------------------------ALIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P TP  VITREEFD ++ + D Q                               A IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLTPEAVITREEFDLMKHRFDEQ-------------------------------ALIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFKIALTGSVRL-------------------------------KTTTHLATIRQKEGETLREYATRFQEERLKVAHCSDDSAMCYFLTGLTDET
        DAIKCRAF+IALTGS RL                               KT THLATIRQKEGETLREY TRFQEE+LKVAHCSDDSAMCYFLTGL DE 
Subjt:  DAIKCRAFKIALTGSVRL-------------------------------KTTTHLATIRQKEGETLREYATRFQEERLKVAHCSDDSAMCYFLTGLTDET

Query:  LT-----------------------------------------GRTDKDKGKTDAKSKDKGPSSSSGRTEYRRSENGPSRSRPYERYTPTTIPISEILTN
        LT                                         GR+ KD    D KSKDKG S SSGR EYRR+ENGP+RSRPYER+TPTTIPISEILTN
Subjt:  LT-----------------------------------------GRTDKDKGKTDAKSKDKGPSSSSGRTEYRRSENGPSRSRPYERYTPTTIPISEILTN

Query:  IKENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSGHNTSNCWELKRQIEDLIQDGYCKKFVGKPRSNSVEKKEERTRSRTPPRRDDRPAVINTIFGGPS
        I+E+GMEKLLKRPEKLRG PE+ +KDKYCRFHR+ GHNTS+ WELKRQIE+LIQDGY KKFVGKPR++S EKKEER RSRTPPRR DRPAVINTIFGGPS
Subjt:  IKENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSGHNTSNCWELKRQIEDLIQDGYCKKFVGKPRSNSVEKKEERTRSRTPPRRDDRPAVINTIFGGPS

Query:  GGQSRNKRKELAREARREVCIIREQKPTCSITFGDADLEGVHLPHNDALVIAPLIDHVL-----------------------------LKKSPSPLVGFS
        GGQS  KRKELAR ARREVCIIREQ+PTC ITF  ADLE VHLPHNDALVIAPLIDHV+                             LKKSP+PLVGFS
Subjt:  GGQSRNKRKELAREARREVCIIREQKPTCSITFGDADLEGVHLPHNDALVIAPLIDHVL-----------------------------LKKSPSPLVGFS

Query:  GESVSPEGCIDLPVTIGQDDTQVTQMAEFV
        GESV PEG IDLPVT+GQD TQVTQMAEFV
Subjt:  GESVSPEGCIDLPVTIGQDDTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188231.2e-18058.8Show/hide
Query:  NSDLKAQSKYKPLTPEAVITREEFDLMKHRFD-----------------------EQALIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC
        +S+ +A+S + P TP+ VITREEFD ++ + +                       E        + PT+K YDGSKDPKDYVEVFEGLMDFQAA+DAIKC
Subjt:  NSDLKAQSKYKPLTPEAVITREEFDLMKHRFD-----------------------EQALIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC

Query:  RAFKIALTGSVRLKTTTHLATIRQKEGETLREYATRFQEERLKVAHCSDDSAMCYFLTGLTDETLT----------------------------------
        RAF+IALTGS RL                       FQE++LKVA  SDDSAMCYFLTGL DE LT                                  
Subjt:  RAFKIALTGSVRLKTTTHLATIRQKEGETLREYATRFQEERLKVAHCSDDSAMCYFLTGLTDETLT----------------------------------

Query:  -------GRTDKDKGKTDAKSKDKGPSSSSGRTEYRRSENGPSRSRPYERYTPTTIPISEILTNIKENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSG
               GR+ KD+ K D KSKDKG S SSGR E+RR+ NGP+RSRPYER+TPTTIPISEILTNI+E+GMEKLLKRPEKLRG PE+ NKDKYCRFHR+  
Subjt:  -------GRTDKDKGKTDAKSKDKGPSSSSGRTEYRRSENGPSRSRPYERYTPTTIPISEILTNIKENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSG

Query:  HNTSNCWELKRQIEDLIQDGYCKKFVGKPRSNSVEKKEERTRSRTPPRRDDRPAVINTIFGGPSGGQSRNKRKELAREARREVCIIREQKPTCSITFGDA
        HNTS+ WELKRQIEDLIQD Y KKFVGKPR++S EKKEER  SRTP RR DRPAVINTIFGGPSGGQS +KRKELAR ARREVCIIREQ+PTC ITF  A
Subjt:  HNTSNCWELKRQIEDLIQDGYCKKFVGKPRSNSVEKKEERTRSRTPPRRDDRPAVINTIFGGPSGGQSRNKRKELAREARREVCIIREQKPTCSITFGDA

Query:  DLEGVHLPHNDALVIAPLIDHVL-----------------------------LKKSPSPLVGFSGESVSPEGCIDLPVTIGQDDTQVTQMAEFVVIDGRS
        DLE VHLPHNDALVIAPLIDHV+                             LKKS +PLVGFS ESV PEGCIDLPVT+G D TQVTQMAEFVVIDGRS
Subjt:  DLEGVHLPHNDALVIAPLIDHVL-----------------------------LKKSPSPLVGFSGESVSPEGCIDLPVTIGQDDTQVTQMAEFVVIDGRS

Query:  AYNAIFGRPIIHSFRAVPSILHQVLKYSTPNGAGTVRGEQRTSRECYASALKGSLVCALEGQANRDELPKLEADLSKSDKREFSAPTKELELVPLLSPER
        AYNAIFGRPIIHSFRA+PS LHQVLKYSTPNG G VRGEQ  SRECYASALKGS VCALE   +RD   + +A+L    +REF+APT+ELELVPLL  + 
Subjt:  AYNAIFGRPIIHSFRAVPSILHQVLKYSTPNGAGTVRGEQRTSRECYASALKGSLVCALEGQANRDELPKLEADLSKSDKREFSAPTKELELVPLLSPER

Query:  QTDLARSVPVEILDNPSILDPDVMEIDTPSP
          ++     ++   + + +D D+     P P
Subjt:  QTDLARSVPVEILDNPSILDPDVMEIDTPSP

A0A6J1DHB3 uncharacterized protein LOC1110204791.8e-19264.51Show/hide
Query:  KAQSKYKPLTPEAVITREEFDLMKHRFDEQ-------------------------------ALIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATD
        KA+S Y P+TP  VITREEFD +K +FD Q                               ALIPPKFKTPTMKPYDGSKDPKDYVEVFE LMDFQAATD
Subjt:  KAQSKYKPLTPEAVITREEFDLMKHRFDEQ-------------------------------ALIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATD

Query:  AIKCRAFKIALTGSVRL-------------------------------KTTTHLATIRQKEGETLREYATRFQEERLKVAHCSDDSAMCYFLTGLTDETL
        AIKC AF+IALTGS RL                               KT THLATIRQKEGETLREY TRF EE+LKVAHCSDDSAMCYFLTGL DETL
Subjt:  AIKCRAFKIALTGSVRL-------------------------------KTTTHLATIRQKEGETLREYATRFQEERLKVAHCSDDSAMCYFLTGLTDETL

Query:  T-----------------------------------------GRTDKDKGKTDAKSKDKGPSSSSGRTEYRRSENGPSRSRPYERYTPTTIPISEILTNI
        T                                         GR  KDKGK D+KS+DKGPSSSS R +YRRS +  ++SRPYE YTPTTIPI EILTNI
Subjt:  T-----------------------------------------GRTDKDKGKTDAKSKDKGPSSSSGRTEYRRSENGPSRSRPYERYTPTTIPISEILTNI

Query:  KENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSGHNTSNCWELKRQIEDLIQDGYCKKFVGKPRSNSVEKKEERTRSRTPPRRDDRPAVINTIFGGPSG
        +E GMEKLLKRPEKLRGDPEK N DKYCRFHRD GHNTSN WELKRQIEDLIQDGY KKFVGKPRSNSVEKKEER R RTPPRRDDRPAVI         
Subjt:  KENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSGHNTSNCWELKRQIEDLIQDGYCKKFVGKPRSNSVEKKEERTRSRTPPRRDDRPAVINTIFGGPSG

Query:  GQSRNKRKELAREARREVCIIREQKPTCSITFGDADLEGVHLPHNDALVIAPLIDHVL-----------------------------LKKSPSPLVGFSG
            NK+KELAREARREVCIIREQ+PT SI F  ADLEGVHLPHNDALVIAPLID VL                             LKKSP+PLVGFSG
Subjt:  GQSRNKRKELAREARREVCIIREQKPTCSITFGDADLEGVHLPHNDALVIAPLIDHVL-----------------------------LKKSPSPLVGFSG

Query:  ESVSPEGCIDLPVTIGQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSILHQVLKYSTPNGAGTVRGEQRTSRECYASALKGSLVCALEGQANR
        ES+S EGCIDLPV+I QDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPS LHQVLKYST NG GTVRGE +TSRECYAS  K S VCALE Q  R
Subjt:  ESVSPEGCIDLPVTIGQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSILHQVLKYSTPNGAGTVRGEQRTSRECYASALKGSLVCALEGQANR

Query:  DEL
        DEL
Subjt:  DEL

A0A6J1DPC9 uncharacterized protein LOC1110222803.0e-18760.43Show/hide
Query:  VPGVLGEKEDQVPSLHPGDRETIPNNEGVDYSLRDNDLRKHLADKKKRASREPEDSPSYSREFSNSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQ--
        +PG  GEK    PS+ PG+RE IPN+EGVDYSLRDNDLRKHL DKKK+AS EPEDS SYSREFSNS+LKAQSKYKPL PEAVI REEFDLMKHRFDEQ  
Subjt:  VPGVLGEKEDQVPSLHPGDRETIPNNEGVDYSLRDNDLRKHLADKKKRASREPEDSPSYSREFSNSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQ--

Query:  -----------------------------ALIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFKIALTGSVRL---------------
                                     A IPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AF+IALTGS RL               
Subjt:  -----------------------------ALIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFKIALTGSVRL---------------

Query:  ----------------KTTTHLATIRQKEGETL-----REYATRFQEERLKVAHCSDDSAMCYFLTGLTDETL-TGRTDKDKGKTDAKSKDKGPSSSSGR
                        KT THLATIRQKE ETL      E    F E         D   +    T   ++ +   R  + K K D+KSKDKG SSS  R
Subjt:  ----------------KTTTHLATIRQKEGETL-----REYATRFQEERLKVAHCSDDSAMCYFLTGLTDETL-TGRTDKDKGKTDAKSKDKGPSSSSGR

Query:  TEYRRSENGPSRSRPYERYTPTTIPISEILTNIKENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSGHNTSNCWELKRQIEDLIQDGYCKKFVGKPRSN
        TEYRRSE+GPSRSRPYER                                                       CWELKRQIEDLIQD Y KKFVGKPRSN
Subjt:  TEYRRSENGPSRSRPYERYTPTTIPISEILTNIKENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSGHNTSNCWELKRQIEDLIQDGYCKKFVGKPRSN

Query:  SVEKKEERTRSRTPPRRDDRPAVINTIFGGPSGGQSRNKRKELAREARREVCIIREQKPTCSITFGDADLEGVHLPHNDALVIAPLIDHVL---------
        SVEKKEER RSRTPPRR+DRPAVINTIFGGPSGGQ  NKRKELA EARR+V IIREQKPTCSITF D DLEGVHLPHNDALVIAPLIDHVL         
Subjt:  SVEKKEERTRSRTPPRRDDRPAVINTIFGGPSGGQSRNKRKELAREARREVCIIREQKPTCSITFGDADLEGVHLPHNDALVIAPLIDHVL---------

Query:  --------------------LKKSPSPLVGFSGESVSPEGCIDLPVTIGQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSILHQVLKYSTPNG
                            LKKSP+PLVGFS ESVSPEGCIDLPVTIGQD TQVTQMAEFVVIDGR AYNAIF RPIIHSF+AVPSILHQVLKYSTPNG
Subjt:  --------------------LKKSPSPLVGFSGESVSPEGCIDLPVTIGQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSILHQVLKYSTPNG

Query:  AGTVRGEQRTSRECYASALKGSLVCALEGQANRDELPKLEADLSKSDKREFS
         GTVRGEQ+TSRECYASALK S VCALE Q ++D+LP+     S   +R  S
Subjt:  AGTVRGEQRTSRECYASALKGSLVCALEGQANRDELPKLEADLSKSDKREFS

A0A6J1DZB9 uncharacterized protein LOC1110249041.0e-17972.75Show/hide
Query:  KTTTHLATIRQKEGETLREYATRFQEERLKVAHCSDDSAMCYFLTGLTDETL--------------------------------TGRTDK---------D
        KT THLATIRQKE ETLREY TRFQEE+LKVAHCSDDSAMCYFLT L DETL                                TGR +K         +
Subjt:  KTTTHLATIRQKEGETLREYATRFQEERLKVAHCSDDSAMCYFLTGLTDETL--------------------------------TGRTDK---------D

Query:  KGKTDAKSKDKGPSSSSGRTEYRRSENGPSRSRPYERYTPTTIPISEILTNIKENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSGHNTSNCWELKRQI
        K K D+KS+DKG SSS+ RTEYRR E+GPSRSRPYERYT +TIPISEILTNI+E+GMEKLLKRPEKLRGD EK NK+KYCRFHRD GHNT++CWELKRQI
Subjt:  KGKTDAKSKDKGPSSSSGRTEYRRSENGPSRSRPYERYTPTTIPISEILTNIKENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSGHNTSNCWELKRQI

Query:  EDLIQDGYCKKFVGKPRSNSVEKKEERTRSRTPPRRDDRPAVINTIFGGPSGGQSRNKRKELAREARREVCIIREQKPTCSITFGDADLEGVHLPHNDAL
        EDLIQDGY KKFVGKPRSNSVEKKEER RSRTPPRR+DRPAVINTIFGGP+GGQS NKRKELAREARREVCIIRE KPTCSITFGDADLEGVHLPHNDAL
Subjt:  EDLIQDGYCKKFVGKPRSNSVEKKEERTRSRTPPRRDDRPAVINTIFGGPSGGQSRNKRKELAREARREVCIIREQKPTCSITFGDADLEGVHLPHNDAL

Query:  VIAPLIDHVLLKKSPSPLVGFSGESVSPEGCIDLPVTIGQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSILHQVLKYSTPNGAGTVRGEQRT
        VIA LIDH L+++     V   G      GCIDLPVTIGQD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPS LHQVLKYSTPN  G VRGEQ+T
Subjt:  VIAPLIDHVLLKKSPSPLVGFSGESVSPEGCIDLPVTIGQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSILHQVLKYSTPNGAGTVRGEQRT

Query:  SRECYASALKGSLVCALEGQANRDELPKLEADLSKSDKREFSAPTKELELVPLLSPERQTDLARSVPVEILDNPSIL
        SRECYASALKGS VCALE Q NR +L + EADL K  KR+F  PT+ELELVPLLSPERQ +  +   V  ++ P  L
Subjt:  SRECYASALKGSLVCALEGQANRDELPKLEADLSKSDKREFSAPTKELELVPLLSPERQTDLARSVPVEILDNPSIL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATCAATCTCTACCTAGAGATAAGCATACAACCAAAGTTATCAAAGATCCTGACTACCGAAAAGAATTGGGGACTTTCTGCAAACAATACGGTCTTGACTAT
GGACCCAAAGATGAAAGGAAGAAGAAAAAGAAATCTTCCAACAAGCGACTCTTCAGCAAGAGTAAATCTAAAGATTCCGAATTACCACGGCGTAAAAGGAAGAAC
AAAATCAATTCTTTGACCATAGATGAAGAAACGCGGCAATCTCTTCTCAATGCCATCAAAAGCGAAGAAGAATACTCTTCATACTTCGAGTCTTCAACTGATAAT
GATGAGATTAATCTCATAAATGAAGAAGATTCAGATGAAGAAACCTTTTTCTCTCAAAGTGATTCCTCTGAGGAAGATGGAATTATTCCCTGCACTGGCCACTGC
GCTGGAAAATGTCACGGCCATATCAATGTCATCAGTAAGGATCAAGAGGCTCTCTTTGATCTAATTGAGCAATTACCCGATGAGGACTCTAAACGAATGTGTCTT
ATCAAACTTCGGGAAAGCCTTGAAACAGAAGCTCTTCAAAGGAAGCCAGAATTAAACCTGATAGAATATTCATTCCAAGATATTCTAAAAAGGGTCAAAGGAGAA
GCTAAGAAGCCTATCCAAATTGAAGATCTCCACAATGAAGTGAAGACTCTCAAAAGGGAAGTTGCTGAAAATAAGCAACGTCTTTCTACTCTTGAATACGCCTTC
AAAAGATTTCAAGAGTCAGAACCCACGGAAGGAGAAACCTCCTCAACACCTGAGAAAACTCTACTGACTCGTTCACCAAGTGGAATCAATTATATTAGTAAAGTT
CAAAACCAGAAGTGTATGTCTAAGATTATCTTCAAAATCAGAGACTTCCAGCTGGAGACATTCGCTCTTATCGACTCTGGAGCCGATCAGAACGTCATTCAAGAA
GGTTTAGTTCCTTCTAAATACTTCGAGAAAACTAAAGAAGTTGTCAGCGGAGCCGTCACTGATAAAGAAATAGTTTCTAAGAAGTTCAACAAAGAGATTATCTTC
GAATTCAGTCATTCGATGATTCCGAGATATCTTTCATCAATTGAAGAAGATATTAGTCTCTATATCAATAGTATCGGAAAGAAGGAAAAACAGATTGAATTCCTT
CAAGATGATATCAAGACCTGCAATGTAACGACCCAGAATTTTGGCTCGATCTGCTTGGAACCCGACAGGGGTTTGAACGCTGATAACGACACTCAACGAGACCTC
GAAGCTAGAATAGTTGAGGACCAGGTCCGAGCAGGGCAAGAGGGAGATCTGCCACGGAGATCTACCCGCCATACGAATCAGGAGTTACCCCCTGCTCACCCGAAA
CGCTCAAAAGCCAACCGAGGCCGAGGTGGGACCTCAAGAAAGGCCTCCCGAAGGGCTGCCCCGACGGCAGACCCTGAGGCTCTGGCTACCCTCCAACGCGAGTTG
GATGACATGCGCCATCGACTGCGCACAATGGAAGAAATGTACACCGAGGCAACGCGGGCTAACCGAACAGTGTCTCCCTTAAGAGTCCCAGGCGTACTCGGAGAG
AAGGAAGACCAAGTTCCATCTCTCCACCCTGGTGACCGCGAGACCATTCCCAACAATGAGGGGGTGGATTATAGCTTGCGGGACAACGATCTTCGGAAGCACCTT
GCTGATAAAAAGAAGAGAGCATCGCGAGAACCGGAAGACTCTCCGTCTTACTCCCGAGAGTTCTCCAACTCCGACCTCAAAGCTCAATCAAAGTATAAGCCTTTG
ACACCAGAAGCTGTGATCACCAGGGAAGAGTTCGACCTGATGAAACACAGGTTCGACGAACAGGCTCTAATCCCTCCAAAGTTCAAGACTCCTACCATGAAGCCT
TATGATGGGTCTAAGGACCCAAAAGACTATGTTGAGGTATTCGAAGGCCTCATGGACTTTCAAGCGGCAACAGATGCAATCAAGTGCCGCGCCTTCAAGATCGCG
CTCACCGGCAGCGTGCGCCTAAAGACAACAACTCACCTTGCCACCATCAGACAGAAGGAAGGTGAGACGCTGAGAGAATATGCCACAAGGTTCCAGGAGGAGCGG
CTGAAAGTCGCGCACTGCTCCGATGATTCGGCCATGTGCTACTTCCTCACCGGTCTGACCGATGAGACCCTCACCGGGAGAACTGACAAAGATAAGGGAAAGACT
GATGCCAAGTCCAAAGACAAGGGACCATCCTCCTCCAGTGGCCGAACTGAGTATCGTAGGTCGGAGAACGGCCCCAGTCGAAGCCGACCTTACGAACGTTATACT
CCGACCACCATCCCCATCTCTGAAATACTTACAAACATCAAGGAAAATGGGATGGAGAAGCTCCTCAAGCGACCTGAGAAGCTCCGAGGAGACCCAGAAAAGTGT
AATAAGGATAAATATTGTCGTTTTCATCGCGATAGCGGCCATAATACATCAAATTGTTGGGAGTTAAAACGCCAAATTGAAGACCTCATTCAAGATGGTTACTGC
AAAAAATTTGTTGGAAAACCGAGGTCCAACTCGGTAGAGAAGAAAGAAGAAAGGACGCGTTCAAGGACGCCACCTCGTCGAGATGACCGACCTGCAGTCATCAAC
ACTATTTTCGGGGGCCCTAGTGGGGGCCAGTCTAGAAATAAAAGGAAAGAGCTAGCTCGTGAGGCCAGGCGCGAGGTATGCATCATCAGGGAGCAGAAACCAACT
TGCTCCATTACCTTCGGCGATGCCGATCTGGAGGGGGTACACTTGCCCCACAATGATGCACTCGTGATCGCTCCCCTCATCGATCATGTTCTGCTGAAGAAAAGT
CCATCTCCCTTGGTTGGATTCTCTGGAGAATCGGTCTCTCCAGAAGGGTGTATCGACTTGCCGGTTACGATTGGGCAAGATGATACACAGGTAACCCAGATGGCC
GAGTTCGTTGTGATCGACGGAAGGTCGGCCTACAATGCCATCTTCGGGAGACCCATCATCCATTCGTTCCGGGCCGTTCCCTCAATACTTCATCAAGTCCTGAAA
TACTCAACTCCTAATGGTGCGGGCACGGTCCGAGGAGAGCAGAGAACTTCAAGGGAGTGCTACGCCTCCGCACTCAAGGGGTCATTAGTATGCGCCCTGGAAGGA
CAAGCTAACAGGGACGAGTTGCCGAAGCTCGAGGCTGACCTATCGAAATCTGATAAAAGGGAGTTCTCAGCACCAACCAAGGAACTCGAGCTTGTTCCTTTGCTT
AGCCCTGAAAGACAAACCGACCTGGCCAGGTCAGTCCCAGTGGAGATCTTGGACAATCCTTCAATCCTGGATCCAGACGTGATGGAGATTGACACTCCATCACCC
TCATGGATGGATCCAATCGTGGAGTTCATCAACGGAAATCCGCCGCAAGATCTGAAGAAGCAAAAGAAGATGGCACGGAAAGCAGCTCGGTTCATACTCCGAGAA
GGGGCGTTGTACCGACGTGGCTTCTCCCTGCCTCTGCTTAAATGTGTGACTTCCGAAGAAGGCTTTTACATCCTTAGGGAAATCCATGAAGGAGTGTGTGCGAAC
TACTCTGGCTCCAGGTCGTTGTCGGCCAAGGTGGTTCGACAAGGGTACTATTGGTCCACTGTCGAGCAGGATGCGAAGCAATTTGTGAAAACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCATCAATCTCTACCTAGAGATAAGCATACAACCAAAGTTATCAAAGATCCTGACTACCGAAAAGAATTGGGGACTTTCTGCAAACAATACGGTCTTGACTAT
GGACCCAAAGATGAAAGGAAGAAGAAAAAGAAATCTTCCAACAAGCGACTCTTCAGCAAGAGTAAATCTAAAGATTCCGAATTACCACGGCGTAAAAGGAAGAAC
AAAATCAATTCTTTGACCATAGATGAAGAAACGCGGCAATCTCTTCTCAATGCCATCAAAAGCGAAGAAGAATACTCTTCATACTTCGAGTCTTCAACTGATAAT
GATGAGATTAATCTCATAAATGAAGAAGATTCAGATGAAGAAACCTTTTTCTCTCAAAGTGATTCCTCTGAGGAAGATGGAATTATTCCCTGCACTGGCCACTGC
GCTGGAAAATGTCACGGCCATATCAATGTCATCAGTAAGGATCAAGAGGCTCTCTTTGATCTAATTGAGCAATTACCCGATGAGGACTCTAAACGAATGTGTCTT
ATCAAACTTCGGGAAAGCCTTGAAACAGAAGCTCTTCAAAGGAAGCCAGAATTAAACCTGATAGAATATTCATTCCAAGATATTCTAAAAAGGGTCAAAGGAGAA
GCTAAGAAGCCTATCCAAATTGAAGATCTCCACAATGAAGTGAAGACTCTCAAAAGGGAAGTTGCTGAAAATAAGCAACGTCTTTCTACTCTTGAATACGCCTTC
AAAAGATTTCAAGAGTCAGAACCCACGGAAGGAGAAACCTCCTCAACACCTGAGAAAACTCTACTGACTCGTTCACCAAGTGGAATCAATTATATTAGTAAAGTT
CAAAACCAGAAGTGTATGTCTAAGATTATCTTCAAAATCAGAGACTTCCAGCTGGAGACATTCGCTCTTATCGACTCTGGAGCCGATCAGAACGTCATTCAAGAA
GGTTTAGTTCCTTCTAAATACTTCGAGAAAACTAAAGAAGTTGTCAGCGGAGCCGTCACTGATAAAGAAATAGTTTCTAAGAAGTTCAACAAAGAGATTATCTTC
GAATTCAGTCATTCGATGATTCCGAGATATCTTTCATCAATTGAAGAAGATATTAGTCTCTATATCAATAGTATCGGAAAGAAGGAAAAACAGATTGAATTCCTT
CAAGATGATATCAAGACCTGCAATGTAACGACCCAGAATTTTGGCTCGATCTGCTTGGAACCCGACAGGGGTTTGAACGCTGATAACGACACTCAACGAGACCTC
GAAGCTAGAATAGTTGAGGACCAGGTCCGAGCAGGGCAAGAGGGAGATCTGCCACGGAGATCTACCCGCCATACGAATCAGGAGTTACCCCCTGCTCACCCGAAA
CGCTCAAAAGCCAACCGAGGCCGAGGTGGGACCTCAAGAAAGGCCTCCCGAAGGGCTGCCCCGACGGCAGACCCTGAGGCTCTGGCTACCCTCCAACGCGAGTTG
GATGACATGCGCCATCGACTGCGCACAATGGAAGAAATGTACACCGAGGCAACGCGGGCTAACCGAACAGTGTCTCCCTTAAGAGTCCCAGGCGTACTCGGAGAG
AAGGAAGACCAAGTTCCATCTCTCCACCCTGGTGACCGCGAGACCATTCCCAACAATGAGGGGGTGGATTATAGCTTGCGGGACAACGATCTTCGGAAGCACCTT
GCTGATAAAAAGAAGAGAGCATCGCGAGAACCGGAAGACTCTCCGTCTTACTCCCGAGAGTTCTCCAACTCCGACCTCAAAGCTCAATCAAAGTATAAGCCTTTG
ACACCAGAAGCTGTGATCACCAGGGAAGAGTTCGACCTGATGAAACACAGGTTCGACGAACAGGCTCTAATCCCTCCAAAGTTCAAGACTCCTACCATGAAGCCT
TATGATGGGTCTAAGGACCCAAAAGACTATGTTGAGGTATTCGAAGGCCTCATGGACTTTCAAGCGGCAACAGATGCAATCAAGTGCCGCGCCTTCAAGATCGCG
CTCACCGGCAGCGTGCGCCTAAAGACAACAACTCACCTTGCCACCATCAGACAGAAGGAAGGTGAGACGCTGAGAGAATATGCCACAAGGTTCCAGGAGGAGCGG
CTGAAAGTCGCGCACTGCTCCGATGATTCGGCCATGTGCTACTTCCTCACCGGTCTGACCGATGAGACCCTCACCGGGAGAACTGACAAAGATAAGGGAAAGACT
GATGCCAAGTCCAAAGACAAGGGACCATCCTCCTCCAGTGGCCGAACTGAGTATCGTAGGTCGGAGAACGGCCCCAGTCGAAGCCGACCTTACGAACGTTATACT
CCGACCACCATCCCCATCTCTGAAATACTTACAAACATCAAGGAAAATGGGATGGAGAAGCTCCTCAAGCGACCTGAGAAGCTCCGAGGAGACCCAGAAAAGTGT
AATAAGGATAAATATTGTCGTTTTCATCGCGATAGCGGCCATAATACATCAAATTGTTGGGAGTTAAAACGCCAAATTGAAGACCTCATTCAAGATGGTTACTGC
AAAAAATTTGTTGGAAAACCGAGGTCCAACTCGGTAGAGAAGAAAGAAGAAAGGACGCGTTCAAGGACGCCACCTCGTCGAGATGACCGACCTGCAGTCATCAAC
ACTATTTTCGGGGGCCCTAGTGGGGGCCAGTCTAGAAATAAAAGGAAAGAGCTAGCTCGTGAGGCCAGGCGCGAGGTATGCATCATCAGGGAGCAGAAACCAACT
TGCTCCATTACCTTCGGCGATGCCGATCTGGAGGGGGTACACTTGCCCCACAATGATGCACTCGTGATCGCTCCCCTCATCGATCATGTTCTGCTGAAGAAAAGT
CCATCTCCCTTGGTTGGATTCTCTGGAGAATCGGTCTCTCCAGAAGGGTGTATCGACTTGCCGGTTACGATTGGGCAAGATGATACACAGGTAACCCAGATGGCC
GAGTTCGTTGTGATCGACGGAAGGTCGGCCTACAATGCCATCTTCGGGAGACCCATCATCCATTCGTTCCGGGCCGTTCCCTCAATACTTCATCAAGTCCTGAAA
TACTCAACTCCTAATGGTGCGGGCACGGTCCGAGGAGAGCAGAGAACTTCAAGGGAGTGCTACGCCTCCGCACTCAAGGGGTCATTAGTATGCGCCCTGGAAGGA
CAAGCTAACAGGGACGAGTTGCCGAAGCTCGAGGCTGACCTATCGAAATCTGATAAAAGGGAGTTCTCAGCACCAACCAAGGAACTCGAGCTTGTTCCTTTGCTT
AGCCCTGAAAGACAAACCGACCTGGCCAGGTCAGTCCCAGTGGAGATCTTGGACAATCCTTCAATCCTGGATCCAGACGTGATGGAGATTGACACTCCATCACCC
TCATGGATGGATCCAATCGTGGAGTTCATCAACGGAAATCCGCCGCAAGATCTGAAGAAGCAAAAGAAGATGGCACGGAAAGCAGCTCGGTTCATACTCCGAGAA
GGGGCGTTGTACCGACGTGGCTTCTCCCTGCCTCTGCTTAAATGTGTGACTTCCGAAGAAGGCTTTTACATCCTTAGGGAAATCCATGAAGGAGTGTGTGCGAAC
TACTCTGGCTCCAGGTCGTTGTCGGCCAAGGTGGTTCGACAAGGGTACTATTGGTCCACTGTCGAGCAGGATGCGAAGCAATTTGTGAAAACCTGA
Protein sequenceShow/hide protein sequence
MHQSLPRDKHTTKVIKDPDYRKELGTFCKQYGLDYGPKDERKKKKKSSNKRLFSKSKSKDSELPRRKRKNKINSLTIDEETRQSLLNAIKSEEEYSSYFESSTDN
DEINLINEEDSDEETFFSQSDSSEEDGIIPCTGHCAGKCHGHINVISKDQEALFDLIEQLPDEDSKRMCLIKLRESLETEALQRKPELNLIEYSFQDILKRVKGE
AKKPIQIEDLHNEVKTLKREVAENKQRLSTLEYAFKRFQESEPTEGETSSTPEKTLLTRSPSGINYISKVQNQKCMSKIIFKIRDFQLETFALIDSGADQNVIQE
GLVPSKYFEKTKEVVSGAVTDKEIVSKKFNKEIIFEFSHSMIPRYLSSIEEDISLYINSIGKKEKQIEFLQDDIKTCNVTTQNFGSICLEPDRGLNADNDTQRDL
EARIVEDQVRAGQEGDLPRRSTRHTNQELPPAHPKRSKANRGRGGTSRKASRRAAPTADPEALATLQRELDDMRHRLRTMEEMYTEATRANRTVSPLRVPGVLGE
KEDQVPSLHPGDRETIPNNEGVDYSLRDNDLRKHLADKKKRASREPEDSPSYSREFSNSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQALIPPKFKTPTMKP
YDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFKIALTGSVRLKTTTHLATIRQKEGETLREYATRFQEERLKVAHCSDDSAMCYFLTGLTDETLTGRTDKDKGKT
DAKSKDKGPSSSSGRTEYRRSENGPSRSRPYERYTPTTIPISEILTNIKENGMEKLLKRPEKLRGDPEKCNKDKYCRFHRDSGHNTSNCWELKRQIEDLIQDGYC
KKFVGKPRSNSVEKKEERTRSRTPPRRDDRPAVINTIFGGPSGGQSRNKRKELAREARREVCIIREQKPTCSITFGDADLEGVHLPHNDALVIAPLIDHVLLKKS
PSPLVGFSGESVSPEGCIDLPVTIGQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSILHQVLKYSTPNGAGTVRGEQRTSRECYASALKGSLVCALEG
QANRDELPKLEADLSKSDKREFSAPTKELELVPLLSPERQTDLARSVPVEILDNPSILDPDVMEIDTPSPSWMDPIVEFINGNPPQDLKKQKKMARKAARFILRE
GALYRRGFSLPLLKCVTSEEGFYILREIHEGVCANYSGSRSLSAKVVRQGYYWSTVEQDAKQFVKT