CuGenDBv2

Moc04g10500 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g10500
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: MuDRA-like transposase
Genome location: chr4:7770200..7778472
RNA-Seq Expression: Moc04g10500
Synteny: Moc04g10500
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR004332 - Transposase, MuDR, plant
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR015410 - Domain of unknown function DUF1985
IPR038765 - Papain-like cysteine peptidase superfamily
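The genome location given above uses a `chromosome:start..end` notation. A minimal Python sketch of how such a string could be split and the locus span computed follows. The function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into chromosome, start and end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval in bases.

    Assumption: coordinates are 1-based and inclusive, so both
    endpoints count towards the length.
    """
    return end - start + 1


chrom, start, end = parse_location("chr4:7770200..7778472")
print(chrom, start, end, span_length(start, end))
```

If the coordinates turned out to be 0-based or half-open, only `span_length` would need to change.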


Homology
GenBank top hits (columns: hit | e value | % identity; the alignment follows each hit)
XP_022145823.1 uncharacterized protein LOC111015183 [Momordica charantia] | 6.8e-94 | 94.21
Query:  MDHQLRIKENDRFLAQATSMSHLSNVNKLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLLREVAGSSDDSMSFLIGGNVLTFSKDQFMLITG
        MDHQLRIKENDRF  QATSMSHLSNVN+LIKDKLT DQLDMFRRRTIFGRFVDLEMMFCSGVVHHFL REVAGSSDDS+  LIGGNV TFSKDQFMLITG
Subjt:  MDHQLRIKENDRFLAQATSMSHLSNVNKLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLLREVAGSSDDSMSFLIGGNVLTFSKDQFMLITG

Query:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLN
        LWRLPGKVVQKKIGKNRLRRKYFNDEASM+LEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHL+
Subjt:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLN

XP_022154925.1 uncharacterized protein LOC111022071 [Momordica charantia] | 9.6e-181 | 64.17
Query:  MKIGKSLFIRFGGTWEDNTNSYVGGELKGIIVPLTITYKELKNRLYRLMKVDQNGYDRVIRVLYHLACDSPPMFVTDDDDLQFALVQEQVSKVPLFVSTV
        M+IGKSLFIRFGGTWEDNTNSYVGGELKGIIVPLT+TYKELKNRL+RLMKVDQNGYD VIRV YHLACDSPPMFVTDDDDL+FALVQEQVSKVPLFVST+
Subjt:  MKIGKSLFIRFGGTWEDNTNSYVGGELKGIIVPLTITYKELKNRLYRLMKVDQNGYDRVIRVLYHLACDSPPMFVTDDDDLQFALVQEQVSKVPLFVSTV

Query:  PRESIDIQSSRLNQDGAQSIPNGGSVPEEGSTGVDEWGLNDVYMQDMYSSDCIYTQDSELVAPTTRMPPVMHVSSDEINNTAQVQHASPRTVHVFPFDEL
        PRESIDIQSSRLNQDG QSIPNGGSVPEEGST VDEWGLNDVYMQDMYSSD                                                 
Subjt:  PRESIDIQSSRLNQDGAQSIPNGGSVPEEGSTGVDEWGLNDVYMQDMYSSDCIYTQDSELVAPTTRMPPVMHVSSDEINNTAQVQHASPRTVHVFPFDEL

Query:  NNTSQVRHASPLTEHVFPSVEPSSSDPRNVGDHQIPVPQSPSTVTIEQARRYNLLPGGSELHVGKIFVSKQDLRMVLSNAAMRSNREYKVSRSTKSKFTV
             +RHASPLT      VEPSSSDPRNV   QIPVPQSPSTVTIEQA+RYNLLPGGSELHVGKIFVSKQDLRMVLSNAAMRSNREYKVSRSTKSKF V
Subjt:  NNTSQVRHASPLTEHVFPSVEPSSSDPRNVGDHQIPVPQSPSTVTIEQARRYNLLPGGSELHVGKIFVSKQDLRMVLSNAAMRSNREYKVSRSTKSKFTV

Query:  PCIDNTCNWRVAAHSVGKSSIFCISKYVDAHTCTIDSVNHDHKQASRWVVANLINDKVAGTGRIYKIKHIKEDVRKEYGVNISYDKAHRARELAYAIVR-
         CIDNTCNWRVAAHSVGKSSIFCISKYVDAHTCTIDSVNHDHKQAS WVVANLI D VAGTGRIYKIKHIKEDVRKEYGVNISYDKAHRARELAY IVR 
Subjt:  PCIDNTCNWRVAAHSVGKSSIFCISKYVDAHTCTIDSVNHDHKQASRWVVANLINDKVAGTGRIYKIKHIKEDVRKEYGVNISYDKAHRARELAYAIVR-

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------ADRKTPTYISMRTDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIER
                    D++   +IS R DAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIER
Subjt:  -----------ADRKTPTYISMRTDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIER

XP_022154997.1 uncharacterized protein LOC111022140 [Momordica charantia] | 8.5e-105 | 36.3
Query:  MDAVLGQVEDFIPSWVDVDVVYSPLCIKDHWVLVAIDMTQSEIFVYDSLPGHISTLKLLTDMRLLSHTIPSLLYACGLMDTADCKMKRTPWRVYRPTTDT
        MD VLGQVEDFIP+WVDVDVVYS L I+DHWVLVAIDMTQSEIFVYDSLPGHIST KL+ D+R LSHTIPSLLYACGLMDTADCK+++TPW VYRPTTDT
Subjt:  MDAVLGQVEDFIPSWVDVDVVYSPLCIKDHWVLVAIDMTQSEIFVYDSLPGHISTLKLLTDMRLLSHTIPSLLYACGLMDTADCKMKRTPWRVYRPTTDT

Query:  RQKGSIDCGIFACKFLEYLVSGNSLETLVHAEVSHIRRQMKIGKSLFIRFGGTWEDNTNSYVGGELKGIIVPLTITYKELKNRLYRLMKVDQNGYDRVIR
        RQK                                                                                                 
Subjt:  RQKGSIDCGIFACKFLEYLVSGNSLETLVHAEVSHIRRQMKIGKSLFIRFGGTWEDNTNSYVGGELKGIIVPLTITYKELKNRLYRLMKVDQNGYDRVIR

Query:  VLYHLACDSPPMFVTDDDDLQFALVQEQVSKVPLFVSTVPRESIDIQSSRLNQDGAQSIPNGGSVPEEGSTGVDEWGLNDVYMQDMYSSDCIYTQDSELV
                                                                                                            
Subjt:  VLYHLACDSPPMFVTDDDDLQFALVQEQVSKVPLFVSTVPRESIDIQSSRLNQDGAQSIPNGGSVPEEGSTGVDEWGLNDVYMQDMYSSDCIYTQDSELV

Query:  APTTRMPPVMHVSSDEINNTAQVQHASPRTVHVFPFDELNNTSQVRHASPLTEHVFPSVEPSSSDPRNVGDHQIPVPQSPSTVTIEQARRYNLLPGGSEL
                                                                                                       GGSEL
Subjt:  APTTRMPPVMHVSSDEINNTAQVQHASPRTVHVFPFDELNNTSQVRHASPLTEHVFPSVEPSSSDPRNVGDHQIPVPQSPSTVTIEQARRYNLLPGGSEL

Query:  HVGKIFVSKQDLRMVLSNAAMRSNREYKVSRSTKSKFTVPCIDNTCNWRVAAHSVGKSSIFCISKYVDAHTCTIDSVNHDHKQASRWVVANLINDKVAGT
        HVGKIFVSKQDL MV+SNAAMRSNREYKVSRSTKSKF V CI+NTCN RVA HSVGKSSIFCISKYVDAHTC ID+VNHDHKQAS  +VANLI D+VAG 
Subjt:  HVGKIFVSKQDLRMVLSNAAMRSNREYKVSRSTKSKFTVPCIDNTCNWRVAAHSVGKSSIFCISKYVDAHTCTIDSVNHDHKQASRWVVANLINDKVAGT

Query:  GRIYKIKHIKEDVRKEYGVNISYDKAHRARELAYAIVR--------------------------------------------------------------
        GRIY IKHIKEDVRKE+GVN SYDKAHRARELAYAIVR                                                              
Subjt:  GRIYKIKHIKEDVRKEYGVNISYDKAHRARELAYAIVR--------------------------------------------------------------

Query:  --------------------------------------------------ADRKTPTYISMRTDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIER
                                                           D +   +I  R DAAYAYRKSQFTYYWNQILS+GSGSLAKYLQEIG+ER
Subjt:  --------------------------------------------------ADRKTPTYISMRTDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIER

Query:  -----------DGSTKEGLTGPPKTP-LIQTMQKSDLQYSLRSLVATQSNQST-----GACFMLRTVAWDGTVDLNARTCTCMEFQYMGIPCSHAIAAA
                   +  T        +T    Q    SD      +L   +S + T        F +     DGTVDLNA TCTCMEFQYMGIPCSHAIAAA
Subjt:  -----------DGSTKEGLTGPPKTP-LIQTMQKSDLQYSLRSLVATQSNQST-----GACFMLRTVAWDGTVDLNARTCTCMEFQYMGIPCSHAIAAA

XP_022155155.1 uncharacterized protein LOC111022298 [Momordica charantia] | 2.1e-111 | 94.5
Query:  MDHQLRIKENDRFLAQATSMSHLSNVNKLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLLREVAGSSDDSMSFLIGGNVLTFSKDQFMLITG
        MDHQLRIKEND F AQAT MSHLSNVN+LIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFL REVAGSSD++MSFLIGGNVLTFSKDQFMLITG
Subjt:  MDHQLRIKENDRFLAQATSMSHLSNVNKLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLLREVAGSSDDSMSFLIGGNVLTFSKDQFMLITG

Query:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLNWGSDVWSRTI
        LWRLPGK+VQKKIGKN LRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVK+TLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHL+WGSDVWSRT+
Subjt:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLNWGSDVWSRTI

Query:  NGLKRAMNGKVALYKNKV
        NGLKRAMNGKVALYKNKV
Subjt:  NGLKRAMNGKVALYKNKV

XP_022159086.1 uncharacterized protein LOC111025530 [Momordica charantia] | 2.0e-122 | 52.66
Query:  IPVPQSPSTVTIEQARRYNLLPGGSELHVGKIFVSKQDLRMVLSNAAMRSNREYKVSRSTKSKFTVPCIDNTCNWRVAAHSVGKSSIFCISKYVDAHTCT
        IP+  SPS+    + RR       S+L++G+I   K +L   +       NREYK SRSTKSKF V CIDNTCNWRVAAHSVGKSSIFC SKYVDAHTCT
Subjt:  IPVPQSPSTVTIEQARRYNLLPGGSELHVGKIFVSKQDLRMVLSNAAMRSNREYKVSRSTKSKFTVPCIDNTCNWRVAAHSVGKSSIFCISKYVDAHTCT

Query:  IDSVNHDHKQASRWVVANLINDKVAGTGRIYKIKHIKEDVRKEYGVNISYDKAHRARELAYAIVR-----------------------------------
        ID+VNHDHKQAS WVVANLI D+VA T RIYKIKHIKEDVR+E+ VNISYDKAHRARELAYAIVR                                   
Subjt:  IDSVNHDHKQASRWVVANLINDKVAGTGRIYKIKHIKEDVRKEYGVNISYDKAHRARELAYAIVR-----------------------------------

Query:  ----------------------ADRKTP--------------------TYISMRTDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIER------DG
                               D ++                      +IS R DAAYAYRKSQFTYYWNQILS+GSGSLAKYLQEIG+ER       G
Subjt:  ----------------------ADRKTP--------------------TYISMRTDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIER------DG

Query:  STKEGLTGP---------------PKTPLIQ------------------TMQKSDLQYSLRSLVATQSNQSTGACFMLRTVAW----------DGTVDLN
           E +T                 P T +++                  T   S   Y+   L A Q  +S    + ++ V W          DGTVDLN
Subjt:  STKEGLTGP---------------PKTPLIQ------------------TMQKSDLQYSLRSLVATQSNQSTGACFMLRTVAW----------DGTVDLN

Query:  ARTCTCMEFQYMGIPCSHAIAAARHKNINCHTLIDPCYSVDSLISAYAEPILPVGHMSEWKRPADYQPIPVQPPRLVKRAGRRRTQWIASTGERRVVNKC
        ARTCTCMEFQYMGIPCSHAIAAARHKNINCHTLIDPCYSVDSLISAYAEPILP+GHMSEWKRPA+YQ IPVQPP LVKRAGRRRTQ IASTGERRVVNKC
Subjt:  ARTCTCMEFQYMGIPCSHAIAAARHKNINCHTLIDPCYSVDSLISAYAEPILPVGHMSEWKRPADYQPIPVQPPRLVKRAGRRRTQWIASTGERRVVNKC

Query:  SRCEANMSTSYESDGSSSDGATSPIS
        SRC         + G +    T+PI+
Subjt:  SRCEANMSTSYESDGSSSDGATSPIS

TrEMBL top hits (columns: hit | e value | % identity; the alignment follows each hit)
A0A6J1CX02 uncharacterized protein LOC111015183 | 3.3e-94 | 94.21
Query:  MDHQLRIKENDRFLAQATSMSHLSNVNKLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLLREVAGSSDDSMSFLIGGNVLTFSKDQFMLITG
        MDHQLRIKENDRF  QATSMSHLSNVN+LIKDKLT DQLDMFRRRTIFGRFVDLEMMFCSGVVHHFL REVAGSSDDS+  LIGGNV TFSKDQFMLITG
Subjt:  MDHQLRIKENDRFLAQATSMSHLSNVNKLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLLREVAGSSDDSMSFLIGGNVLTFSKDQFMLITG

Query:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLN
        LWRLPGKVVQKKIGKNRLRRKYFNDEASM+LEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHL+
Subjt:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLN

A0A6J1DL08 uncharacterized protein LOC111022071 | 4.6e-181 | 64.17
Query:  MKIGKSLFIRFGGTWEDNTNSYVGGELKGIIVPLTITYKELKNRLYRLMKVDQNGYDRVIRVLYHLACDSPPMFVTDDDDLQFALVQEQVSKVPLFVSTV
        M+IGKSLFIRFGGTWEDNTNSYVGGELKGIIVPLT+TYKELKNRL+RLMKVDQNGYD VIRV YHLACDSPPMFVTDDDDL+FALVQEQVSKVPLFVST+
Subjt:  MKIGKSLFIRFGGTWEDNTNSYVGGELKGIIVPLTITYKELKNRLYRLMKVDQNGYDRVIRVLYHLACDSPPMFVTDDDDLQFALVQEQVSKVPLFVSTV

Query:  PRESIDIQSSRLNQDGAQSIPNGGSVPEEGSTGVDEWGLNDVYMQDMYSSDCIYTQDSELVAPTTRMPPVMHVSSDEINNTAQVQHASPRTVHVFPFDEL
        PRESIDIQSSRLNQDG QSIPNGGSVPEEGST VDEWGLNDVYMQDMYSSD                                                 
Subjt:  PRESIDIQSSRLNQDGAQSIPNGGSVPEEGSTGVDEWGLNDVYMQDMYSSDCIYTQDSELVAPTTRMPPVMHVSSDEINNTAQVQHASPRTVHVFPFDEL

Query:  NNTSQVRHASPLTEHVFPSVEPSSSDPRNVGDHQIPVPQSPSTVTIEQARRYNLLPGGSELHVGKIFVSKQDLRMVLSNAAMRSNREYKVSRSTKSKFTV
             +RHASPLT      VEPSSSDPRNV   QIPVPQSPSTVTIEQA+RYNLLPGGSELHVGKIFVSKQDLRMVLSNAAMRSNREYKVSRSTKSKF V
Subjt:  NNTSQVRHASPLTEHVFPSVEPSSSDPRNVGDHQIPVPQSPSTVTIEQARRYNLLPGGSELHVGKIFVSKQDLRMVLSNAAMRSNREYKVSRSTKSKFTV

Query:  PCIDNTCNWRVAAHSVGKSSIFCISKYVDAHTCTIDSVNHDHKQASRWVVANLINDKVAGTGRIYKIKHIKEDVRKEYGVNISYDKAHRARELAYAIVR-
         CIDNTCNWRVAAHSVGKSSIFCISKYVDAHTCTIDSVNHDHKQAS WVVANLI D VAGTGRIYKIKHIKEDVRKEYGVNISYDKAHRARELAY IVR 
Subjt:  PCIDNTCNWRVAAHSVGKSSIFCISKYVDAHTCTIDSVNHDHKQASRWVVANLINDKVAGTGRIYKIKHIKEDVRKEYGVNISYDKAHRARELAYAIVR-

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------ADRKTPTYISMRTDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIER
                    D++   +IS R DAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIER
Subjt:  -----------ADRKTPTYISMRTDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIER

A0A6J1DLM5 uncharacterized protein LOC111022298 | 1.0e-111 | 94.5
Query:  MDHQLRIKENDRFLAQATSMSHLSNVNKLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLLREVAGSSDDSMSFLIGGNVLTFSKDQFMLITG
        MDHQLRIKEND F AQAT MSHLSNVN+LIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFL REVAGSSD++MSFLIGGNVLTFSKDQFMLITG
Subjt:  MDHQLRIKENDRFLAQATSMSHLSNVNKLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLLREVAGSSDDSMSFLIGGNVLTFSKDQFMLITG

Query:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLNWGSDVWSRTI
        LWRLPGK+VQKKIGKN LRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVK+TLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHL+WGSDVWSRT+
Subjt:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLNWGSDVWSRTI

Query:  NGLKRAMNGKVALYKNKV
        NGLKRAMNGKVALYKNKV
Subjt:  NGLKRAMNGKVALYKNKV

A0A6J1DN69 uncharacterized protein LOC111022140 | 4.1e-105 | 36.3
Query:  MDAVLGQVEDFIPSWVDVDVVYSPLCIKDHWVLVAIDMTQSEIFVYDSLPGHISTLKLLTDMRLLSHTIPSLLYACGLMDTADCKMKRTPWRVYRPTTDT
        MD VLGQVEDFIP+WVDVDVVYS L I+DHWVLVAIDMTQSEIFVYDSLPGHIST KL+ D+R LSHTIPSLLYACGLMDTADCK+++TPW VYRPTTDT
Subjt:  MDAVLGQVEDFIPSWVDVDVVYSPLCIKDHWVLVAIDMTQSEIFVYDSLPGHISTLKLLTDMRLLSHTIPSLLYACGLMDTADCKMKRTPWRVYRPTTDT

Query:  RQKGSIDCGIFACKFLEYLVSGNSLETLVHAEVSHIRRQMKIGKSLFIRFGGTWEDNTNSYVGGELKGIIVPLTITYKELKNRLYRLMKVDQNGYDRVIR
        RQK                                                                                                 
Subjt:  RQKGSIDCGIFACKFLEYLVSGNSLETLVHAEVSHIRRQMKIGKSLFIRFGGTWEDNTNSYVGGELKGIIVPLTITYKELKNRLYRLMKVDQNGYDRVIR

Query:  VLYHLACDSPPMFVTDDDDLQFALVQEQVSKVPLFVSTVPRESIDIQSSRLNQDGAQSIPNGGSVPEEGSTGVDEWGLNDVYMQDMYSSDCIYTQDSELV
                                                                                                            
Subjt:  VLYHLACDSPPMFVTDDDDLQFALVQEQVSKVPLFVSTVPRESIDIQSSRLNQDGAQSIPNGGSVPEEGSTGVDEWGLNDVYMQDMYSSDCIYTQDSELV

Query:  APTTRMPPVMHVSSDEINNTAQVQHASPRTVHVFPFDELNNTSQVRHASPLTEHVFPSVEPSSSDPRNVGDHQIPVPQSPSTVTIEQARRYNLLPGGSEL
                                                                                                       GGSEL
Subjt:  APTTRMPPVMHVSSDEINNTAQVQHASPRTVHVFPFDELNNTSQVRHASPLTEHVFPSVEPSSSDPRNVGDHQIPVPQSPSTVTIEQARRYNLLPGGSEL

Query:  HVGKIFVSKQDLRMVLSNAAMRSNREYKVSRSTKSKFTVPCIDNTCNWRVAAHSVGKSSIFCISKYVDAHTCTIDSVNHDHKQASRWVVANLINDKVAGT
        HVGKIFVSKQDL MV+SNAAMRSNREYKVSRSTKSKF V CI+NTCN RVA HSVGKSSIFCISKYVDAHTC ID+VNHDHKQAS  +VANLI D+VAG 
Subjt:  HVGKIFVSKQDLRMVLSNAAMRSNREYKVSRSTKSKFTVPCIDNTCNWRVAAHSVGKSSIFCISKYVDAHTCTIDSVNHDHKQASRWVVANLINDKVAGT

Query:  GRIYKIKHIKEDVRKEYGVNISYDKAHRARELAYAIVR--------------------------------------------------------------
        GRIY IKHIKEDVRKE+GVN SYDKAHRARELAYAIVR                                                              
Subjt:  GRIYKIKHIKEDVRKEYGVNISYDKAHRARELAYAIVR--------------------------------------------------------------

Query:  --------------------------------------------------ADRKTPTYISMRTDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIER
                                                           D +   +I  R DAAYAYRKSQFTYYWNQILS+GSGSLAKYLQEIG+ER
Subjt:  --------------------------------------------------ADRKTPTYISMRTDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIER

Query:  -----------DGSTKEGLTGPPKTP-LIQTMQKSDLQYSLRSLVATQSNQST-----GACFMLRTVAWDGTVDLNARTCTCMEFQYMGIPCSHAIAAA
                   +  T        +T    Q    SD      +L   +S + T        F +     DGTVDLNA TCTCMEFQYMGIPCSHAIAAA
Subjt:  -----------DGSTKEGLTGPPKTP-LIQTMQKSDLQYSLRSLVATQSNQST-----GACFMLRTVAWDGTVDLNARTCTCMEFQYMGIPCSHAIAAA

A0A6J1E2V3 uncharacterized protein LOC111025530 | 9.8e-123 | 52.66
Query:  IPVPQSPSTVTIEQARRYNLLPGGSELHVGKIFVSKQDLRMVLSNAAMRSNREYKVSRSTKSKFTVPCIDNTCNWRVAAHSVGKSSIFCISKYVDAHTCT
        IP+  SPS+    + RR       S+L++G+I   K +L   +       NREYK SRSTKSKF V CIDNTCNWRVAAHSVGKSSIFC SKYVDAHTCT
Subjt:  IPVPQSPSTVTIEQARRYNLLPGGSELHVGKIFVSKQDLRMVLSNAAMRSNREYKVSRSTKSKFTVPCIDNTCNWRVAAHSVGKSSIFCISKYVDAHTCT

Query:  IDSVNHDHKQASRWVVANLINDKVAGTGRIYKIKHIKEDVRKEYGVNISYDKAHRARELAYAIVR-----------------------------------
        ID+VNHDHKQAS WVVANLI D+VA T RIYKIKHIKEDVR+E+ VNISYDKAHRARELAYAIVR                                   
Subjt:  IDSVNHDHKQASRWVVANLINDKVAGTGRIYKIKHIKEDVRKEYGVNISYDKAHRARELAYAIVR-----------------------------------

Query:  ----------------------ADRKTP--------------------TYISMRTDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIER------DG
                               D ++                      +IS R DAAYAYRKSQFTYYWNQILS+GSGSLAKYLQEIG+ER       G
Subjt:  ----------------------ADRKTP--------------------TYISMRTDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIER------DG

Query:  STKEGLTGP---------------PKTPLIQ------------------TMQKSDLQYSLRSLVATQSNQSTGACFMLRTVAW----------DGTVDLN
           E +T                 P T +++                  T   S   Y+   L A Q  +S    + ++ V W          DGTVDLN
Subjt:  STKEGLTGP---------------PKTPLIQ------------------TMQKSDLQYSLRSLVATQSNQSTGACFMLRTVAW----------DGTVDLN

Query:  ARTCTCMEFQYMGIPCSHAIAAARHKNINCHTLIDPCYSVDSLISAYAEPILPVGHMSEWKRPADYQPIPVQPPRLVKRAGRRRTQWIASTGERRVVNKC
        ARTCTCMEFQYMGIPCSHAIAAARHKNINCHTLIDPCYSVDSLISAYAEPILP+GHMSEWKRPA+YQ IPVQPP LVKRAGRRRTQ IASTGERRVVNKC
Subjt:  ARTCTCMEFQYMGIPCSHAIAAARHKNINCHTLIDPCYSVDSLISAYAEPILPVGHMSEWKRPADYQPIPVQPPRLVKRAGRRRTQWIASTGERRVVNKC

Query:  SRCEANMSTSYESDGSSSDGATSPIS
        SRC         + G +    T+PI+
Subjt:  SRCEANMSTSYESDGSSSDGATSPIS

SwissProt top hits (columns: hit | e value | % identity; the alignment follows each hit)
No hits found

Arabidopsis top hits (columns: hit | e value | % identity; the alignment follows each hit)
AT1G37020.1 Cysteine proteinases superfamily protein | 3.3e-06 | 26.45
Query:  WVDVDVVYSPLCI-KDHWVLVAIDMTQSEIFVYDSLPGHISTLKLLTDMRLLSHTIPSLLYACGL-----MDTADCKMKRTPWRVYRPTTDTRQKGSIDC
        + +VD +Y  L + ++HWV +  ++  + I+VYDS+P  +  L+++     L   IP++L    L        A  ++KR   +   P  D R     DC
Subjt:  WVDVDVVYSPLCI-KDHWVLVAIDMTQSEIFVYDSLPGHISTLKLLTDMRLLSHTIPSLLYACGL-----MDTADCKMKRTPWRVYRPTTDTRQKGSIDC

Query:  GIFACKFLEYLVSGNSLETLVHAEVSHIRRQMKIGKSLFIRFGGTWEDNTNSYVG
         I+A K++E L  G S + L    +          ++L+I  G    D    +VG
Subjt:  GIFACKFLEYLVSGNSLETLVHAEVSHIRRQMKIGKSLFIRFGGTWEDNTNSYVG

AT1G49920.1 MuDR family transposase | 4.2e-09 | 30.6
Query:  TPLIQTMQKSDLQYSLRSLVATQSNQSTGACFMLRTVAWDGTVDLNARTCTCMEFQYMGIPCSHAIAAARHKNINCHTLIDPCYSVDSLISAYAEPILPV
        TPL +   +  +    ++ +  QSN ST            G V LN  TCTC EFQ    PC HA+A      IN    +D CY+V+     Y+    PV
Subjt:  TPLIQTMQKSDLQYSLRSLVATQSNQSTGACFMLRTVAWDGTVDLNARTCTCMEFQYMGIPCSHAIAAARHKNINCHTLIDPCYSVDSLISAYAEPILPV

Query:  GHMSEWKR----PADYQPIPVQPPRLVKRAGRRR
          +S W      P    P+   PP  V   G+ +
Subjt:  GHMSEWKR----PADYQPIPVQPPRLVKRAGRRR

AT5G45570.1 Ulp1 protease family protein | 3.9e-07 | 24.81
Query:  WVDVDVVYSPLCIK-DHWVLVAIDMTQSEIFVYDSLPGHISTLKLLTDMRLLSHTIPSLLYACGLMDTADCKMKRTPWRVYRPTTDTRQKGSIDCGIFAC
        +VDVD +Y+ L +  +HWV + ID+T   + VYDS+P   +  ++      +   IP++L +            +  W+      +    G  DC I++ 
Subjt:  WVDVDVVYSPLCIK-DHWVLVAIDMTQSEIFVYDSLPGHISTLKLLTDMRLLSHTIPSLLYACGLMDTADCKMKRTPWRVYRPTTDTRQKGSIDCGIFAC

Query:  KFLEYLVSGNSLETLVHAEVSHIRRQMKI
        K++E L  G S + L    +  +R ++ +
Subjt:  KFLEYLVSGNSLETLVHAEVSHIRRQMKI
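
In the alignments above, the unlabeled middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The following Python sketch shows one way a percent-identity figure could be derived from such a midline. The function name `percent_identity` is hypothetical, and dividing by the number of alignment columns is an assumption; the page does not say how its % identity column is computed.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of alignment columns marked as identical in the midline.

    Assumption: identity = identical columns / total alignment columns,
    where the column count is taken from the aligned query line
    (gap characters included).
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment")
    # The midline may have lost trailing spaces; pad it to full width.
    padded = midline.ljust(columns)[:columns]
    # Letters mark identities; '+' and spaces do not count.
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / columns


# First 16 columns of the first GenBank alignment on this page.
query = "MDHQLRIKENDRFLAQ"
midline = "MDHQLRIKENDRF  Q"
print(percent_identity(query, midline))
```

To score a complete hit, the query and midline segments of every alignment block would be concatenated before calling the function.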


Sequences
CDS sequence
ATGGACCATCAGTTGAGGATTAAGGAGAATGACCGCTTTCTGGCTCAAGCTACCAGCATGTCTCACTTGAGCAATGTCAACAAGCTTATCAAGGATAAACTCACA
GCGGACCAACTTGATATGTTCCGTAGAAGAACAATATTTGGTCGATTTGTCGACTTGGAGATGATGTTCTGCAGTGGTGTAGTTCATCACTTTCTGTTAAGGGAG
GTTGCTGGGAGCAGTGACGACAGCATGAGTTTCTTAATTGGTGGCAACGTGTTGACATTCTCGAAGGATCAATTCATGCTTATAACGGGATTGTGGCGGCTGCCC
GGTAAGGTGGTCCAGAAAAAGATTGGAAAGAATAGGTTGCGGAGGAAGTACTTCAACGATGAAGCCTCCATGCTGCTCGAAGAGTTTGTGGAGGTTTACAAGCAG
ACTGATTTCGAGGACGACGAGGATGCCGTTAAAGTGACATTAATTTTGTACACGGAGCTTGTGATGATGGGAAAGAGCAAGAGCAAGTCGAAGGTTGACATCGAC
TTGTACAACCAAGTCGATGACTTGGACTACTTCAACCATTTGAACTGGGGTTCTGATGTCTGGAGTAGGACAATTAACGGTCTGAAGCGTGCGATGAATGGAAAA
GTTGCGCTATACAAGAACAAAGTGTGGATATACGAGGTTGTCCCATCTCTCATCACTCCCGGTGTCAATCGTTTGAGGCGAGACCGCCATTCCCCGGATAATTCG
GTATTCGTGCATTGTAATGGTATTTACACTTCCTTATGTTCTTGTGATTTACGTTCAGTTGGTCATACTTATCCACTCGTGGAGACAGAGCTGGATAAGGACTAT
CAAAGGTGTCCATTGGACGAAAGAGAGGTGGTTGATTTAATTGCGCCTGGGTGTTCCACCTCCGACAGTGATGATGGACACAATCCTTCCCCCATCACCGACAAT
CTTGGCGCCGAAGACGATCTCCCACTCGACGATGCGCATTCGTTGGAAACGAATGTACAGGCAATACCGGATGAATCTTCGGATATGCCACATACAGAGGCCGCA
TCTGAAGGTGGGCAACGGACACCGGTCAAAGTACTTCGACCAAGTACTTCTATTCTGTCGAATGTGGGGCAAAGCACGCGGCAATCACCGCGAGCGACGTCACGC
GATGCTTGCCCTACACAACAGCAAGACACCCGTCGATCAAATGATAAATTCGGGGCTATGGAGAGAAGGCTGGATCTTTTAGCTTCGGACATAGCGGAGGTGAAG
ACAGATTTGGCGAAGGTCAAGTCCGACTTGAGTGAAATGAAACTCATGCTTCAACGGTTGTGTCAGATCGATAGGCGAGAGGTGAATATTGGTGTCTCTCCGTCG
GATACAGTCCACGTGTCACATCCATTGGTATCCAATGTTATCCCCGAGCATGATGGGGATGCTGATGACCATAAACCTGGAGGTTCCGATGCTGGAAAGGAGGAC
GATGTGGTTCCTGTAGAAGCATCGTTGCATGAAAAGGCAACGGATGGAGTAGAGATGACCATACCCCCATCCGATCTTGGAGATGCAGAACTAGCCAATCCCGCG
TGTATTGTCCATTCGGTGGAGTTGGACATTGCAGTGGTGACACCCATTGTTTCGACAGAGATGGTGGAACTCGAAATGGCACCACCAATAGTACAGGATCCACAA
GCAGAGACGACGTCCGATCCAACCTTCGAGCCTCCTGCCTCAATCAACATTGATGGTCCGTGTGGCATGATCCATGGGCCTCGTCAAGCAGAGCATATTGAGTTG
GCCCTTACACCAGCGGATACAAGCCCTACTACTAAACCTATTCCCACACCTGCACCAGCATATACAACTTCCACCCCTCAACCTATTCCCACCCTTACACCAGCT
GAAAACCCCACCACCCGTCATCTGAGTGATCCTGTGGGTTCTATTAACCTCGCGTTAGACAAAATTCCTGAACCATTGGTCATCGTGCACCAGCCAACTAAGGAG
AAGAACCCCCCTCATAGCAAAAAATCCACCACAATCCGGTTTACGGCACCGCAAGAAGCCCCACTCGTTGTCAGCGGTTTCGCTGTTCACGAACCCACTAAGCTG
AAGAAAATCGAACAACAAACCGCTCCTAAGCAGTCGGCCAGGAAAATCGAGGTTTCGTATCCCGACGAAACAAGAAAAACTGAGCGTAAGCGGACGGAAACGAAA
CCATTCAGTCCGGAGGACACGCATCGGCAGAAGAAGAAGCAGAAAATGATGGATGTGGATCCCGTACTTGCCAGCCAGGTTAGGCCATTTCGTCCCAAATACAAC
CCGTTGCATAACTTTTCGGATGCTAAGTTTAGGGAGATGATGCGTTGGGTACGGGACCCTAGGAATGACAATACAACGCGGCCGTCTACAACTTGGAATGTGCAG
AGCGGATATTCCAGAAGATTCTTCATTAACATCCTCAATCCTAAGGAGAAGGTGGAAGACCCGGAAGCTGCTGTCATTCTATATTTCATTATGAGGAAGCTCAGT
AGTCGGCCGCACCTGTGCGTTCATAAGTTTTCTGTCCTGGACCCACTACAAATGCAAGTTCTTACCGCTGCAGGTGGTCCCTACACACGAATCAAGGGGAAGGTC
GTCCAGGACACGACTAATGCTTGGGACGAGTATAAGGAGTGCATGGATGCCGTGTTGGGTCAGGTGGAAGATTTCATTCCATCCTGGGTGGACGTCGACGTAGTG
TACAGCCCGCTCTGTATTAAGGATCACTGGGTCCTGGTTGCGATAGATATGACCCAGTCCGAGATTTTTGTATACGACTCATTGCCAGGCCACATTTCCACGTTG
AAGTTGCTGACAGACATGCGGCTGTTGAGTCATACAATCCCATCGCTTTTGTACGCATGTGGGCTGATGGATACGGCCGATTGCAAGATGAAGAGGACTCCGTGG
CGTGTATACCGTCCTACGACCGACACGAGGCAGAAAGGTAGTATAGACTGTGGTATTTTTGCATGTAAATTTTTGGAATATCTTGTGTCGGGTAATAGTTTAGAA
ACTCTTGTTCATGCTGAAGTGTCGCACATTAGAAGGCAGATGAAGATTGGTAAAAGTTTATTCATAAGATTTGGTGGTACGTGGGAAGACAACACAAACTCGTAT
GTCGGTGGTGAGCTGAAAGGAATAATTGTCCCACTTACAATAACGTATAAAGAACTGAAAAATCGGTTGTACAGACTGATGAAAGTCGACCAGAATGGGTACGAC
CGGGTAATTAGGGTACTGTACCACTTGGCATGCGATTCACCGCCAATGTTCGTAACGGATGACGATGACCTCCAATTTGCATTAGTCCAAGAACAGGTTTCTAAA
GTCCCACTGTTTGTATCGACCGTCCCTCGCGAAAGTATTGACATACAGTCATCTAGATTAAACCAAGATGGAGCGCAGTCCATTCCTAATGGAGGATCTGTACCG
GAAGAAGGTTCCACGGGGGTAGATGAGTGGGGGTTGAACGACGTATACATGCAGGACATGTACAGTTCAGACTGTATATACACTCAAGACTCGGAATTGGTCGCC
CCCACGACGCGCATGCCTCCCGTAATGCATGTCTCATCAGATGAGATAAATAATACAGCACAAGTACAACATGCAAGTCCTCGTACGGTACATGTATTCCCATTT
GATGAATTAAATAACACATCACAAGTACGACATGCAAGTCCTTTGACAGAACATGTTTTCCCATCAGTAGAACCCTCATCCAGTGACCCTCGTAATGTTGGTGAT
CACCAAATACCAGTACCTCAATCCCCAAGCACCGTGACTATAGAACAGGCCAGAAGGTACAACCTTTTACCAGGCGGATCAGAACTGCACGTGGGGAAGATATTC
GTTTCAAAGCAAGACCTGCGTATGGTGCTGTCAAATGCAGCGATGCGATCTAATCGTGAATACAAGGTCAGTAGGTCAACGAAATCAAAGTTTACTGTTCCCTGC
ATCGACAATACATGCAATTGGAGGGTTGCAGCCCACTCCGTCGGTAAATCTTCGATATTTTGTATATCAAAGTATGTAGATGCACACACCTGCACGATTGACAGT
GTGAACCACGACCACAAACAGGCGAGCAGATGGGTCGTAGCGAATCTTATTAATGATAAAGTTGCTGGAACCGGTCGTATTTACAAGATAAAGCATATTAAGGAG
GATGTCCGTAAAGAGTACGGGGTGAACATAAGTTATGACAAGGCCCATCGAGCACGAGAGTTAGCATACGCTATTGTTAGGGCAGACCGGAAGACTCCTACATAC
ATCTCCATGCGTACGGATGCAGCGTATGCGTATCGGAAGTCACAGTTCACGTACTACTGGAACCAGATACTGTCGATTGGATCGGGTTCACTTGCCAAATACTTA
CAAGAAATTGGGATAGAACGAGATGGTTCCACGAAAGAAGGACTAACTGGTCCACCCAAAACACCTCTCATTCAGACTATGCAGAAGAGCGACTTGCAGTACAGT
TTGAGAAGTCTCGTCGCTACACAGTCAAACCAGTCGACTGGTGCATGTTTCATGTTGAGGACGGTGGCTTGGGACGGGACGGTTGATTTGAATGCCCGTACATGT
ACATGCATGGAGTTCCAGTACATGGGCATTCCATGTTCGCACGCAATTGCAGCAGCGAGGCACAAGAATATAAATTGCCACACGTTGATCGATCCATGCTACAGT
GTGGACTCCCTAATTAGTGCCTACGCTGAACCAATCTTACCGGTAGGCCACATGTCGGAATGGAAAAGGCCAGCTGATTACCAGCCTATTCCCGTCCAACCACCA
CGTCTTGTTAAACGTGCAGGCCGGCGTAGAACGCAATGGATCGCCTCAACCGGTGAGCGTCGTGTTGTGAACAAGTGTTCTCGATGCGAAGCAAACATGTCAACA
TCTTACGAGTCGGACGGTTCCTCTTCCGACGGAGCAACCAGTCCTATCTCAAACCATCCCTCCTCCTTCGCGGAGAACGGAGTGCCAATTAACGGTCCTCTTCAA
CAAGCACGGGAAGAAAACGACCAGCTCAGGAGAGAGCTACGTCGAACGCAACACGAGCTCAACGACACGAGGTATAAGTTAGCCCGGGTTGAAGAAACGCAGGAC
TTGCTGGAGGGACTGCTGAAGGAGGAGAAGGAGGAACGACTTCGTCTGGAGGACAGGATGGATCAGTTATTGGCCCGTCTACGCCGATACCGTAATAATAATAAT
GATTTGTAA
mRNA sequence
ATGGACCATCAGTTGAGGATTAAGGAGAATGACCGCTTTCTGGCTCAAGCTACCAGCATGTCTCACTTGAGCAATGTCAACAAGCTTATCAAGGATAAACTCACA
GCGGACCAACTTGATATGTTCCGTAGAAGAACAATATTTGGTCGATTTGTCGACTTGGAGATGATGTTCTGCAGTGGTGTAGTTCATCACTTTCTGTTAAGGGAG
GTTGCTGGGAGCAGTGACGACAGCATGAGTTTCTTAATTGGTGGCAACGTGTTGACATTCTCGAAGGATCAATTCATGCTTATAACGGGATTGTGGCGGCTGCCC
GGTAAGGTGGTCCAGAAAAAGATTGGAAAGAATAGGTTGCGGAGGAAGTACTTCAACGATGAAGCCTCCATGCTGCTCGAAGAGTTTGTGGAGGTTTACAAGCAG
ACTGATTTCGAGGACGACGAGGATGCCGTTAAAGTGACATTAATTTTGTACACGGAGCTTGTGATGATGGGAAAGAGCAAGAGCAAGTCGAAGGTTGACATCGAC
TTGTACAACCAAGTCGATGACTTGGACTACTTCAACCATTTGAACTGGGGTTCTGATGTCTGGAGTAGGACAATTAACGGTCTGAAGCGTGCGATGAATGGAAAA
GTTGCGCTATACAAGAACAAAGTGTGGATATACGAGGTTGTCCCATCTCTCATCACTCCCGGTGTCAATCGTTTGAGGCGAGACCGCCATTCCCCGGATAATTCG
GTATTCGTGCATTGTAATGGTATTTACACTTCCTTATGTTCTTGTGATTTACGTTCAGTTGGTCATACTTATCCACTCGTGGAGACAGAGCTGGATAAGGACTAT
CAAAGGTGTCCATTGGACGAAAGAGAGGTGGTTGATTTAATTGCGCCTGGGTGTTCCACCTCCGACAGTGATGATGGACACAATCCTTCCCCCATCACCGACAAT
CTTGGCGCCGAAGACGATCTCCCACTCGACGATGCGCATTCGTTGGAAACGAATGTACAGGCAATACCGGATGAATCTTCGGATATGCCACATACAGAGGCCGCA
TCTGAAGGTGGGCAACGGACACCGGTCAAAGTACTTCGACCAAGTACTTCTATTCTGTCGAATGTGGGGCAAAGCACGCGGCAATCACCGCGAGCGACGTCACGC
GATGCTTGCCCTACACAACAGCAAGACACCCGTCGATCAAATGATAAATTCGGGGCTATGGAGAGAAGGCTGGATCTTTTAGCTTCGGACATAGCGGAGGTGAAG
ACAGATTTGGCGAAGGTCAAGTCCGACTTGAGTGAAATGAAACTCATGCTTCAACGGTTGTGTCAGATCGATAGGCGAGAGGTGAATATTGGTGTCTCTCCGTCG
GATACAGTCCACGTGTCACATCCATTGGTATCCAATGTTATCCCCGAGCATGATGGGGATGCTGATGACCATAAACCTGGAGGTTCCGATGCTGGAAAGGAGGAC
GATGTGGTTCCTGTAGAAGCATCGTTGCATGAAAAGGCAACGGATGGAGTAGAGATGACCATACCCCCATCCGATCTTGGAGATGCAGAACTAGCCAATCCCGCG
TGTATTGTCCATTCGGTGGAGTTGGACATTGCAGTGGTGACACCCATTGTTTCGACAGAGATGGTGGAACTCGAAATGGCACCACCAATAGTACAGGATCCACAA
GCAGAGACGACGTCCGATCCAACCTTCGAGCCTCCTGCCTCAATCAACATTGATGGTCCGTGTGGCATGATCCATGGGCCTCGTCAAGCAGAGCATATTGAGTTG
GCCCTTACACCAGCGGATACAAGCCCTACTACTAAACCTATTCCCACACCTGCACCAGCATATACAACTTCCACCCCTCAACCTATTCCCACCCTTACACCAGCT
GAAAACCCCACCACCCGTCATCTGAGTGATCCTGTGGGTTCTATTAACCTCGCGTTAGACAAAATTCCTGAACCATTGGTCATCGTGCACCAGCCAACTAAGGAG
AAGAACCCCCCTCATAGCAAAAAATCCACCACAATCCGGTTTACGGCACCGCAAGAAGCCCCACTCGTTGTCAGCGGTTTCGCTGTTCACGAACCCACTAAGCTG
AAGAAAATCGAACAACAAACCGCTCCTAAGCAGTCGGCCAGGAAAATCGAGGTTTCGTATCCCGACGAAACAAGAAAAACTGAGCGTAAGCGGACGGAAACGAAA
CCATTCAGTCCGGAGGACACGCATCGGCAGAAGAAGAAGCAGAAAATGATGGATGTGGATCCCGTACTTGCCAGCCAGGTTAGGCCATTTCGTCCCAAATACAAC
CCGTTGCATAACTTTTCGGATGCTAAGTTTAGGGAGATGATGCGTTGGGTACGGGACCCTAGGAATGACAATACAACGCGGCCGTCTACAACTTGGAATGTGCAG
AGCGGATATTCCAGAAGATTCTTCATTAACATCCTCAATCCTAAGGAGAAGGTGGAAGACCCGGAAGCTGCTGTCATTCTATATTTCATTATGAGGAAGCTCAGT
AGTCGGCCGCACCTGTGCGTTCATAAGTTTTCTGTCCTGGACCCACTACAAATGCAAGTTCTTACCGCTGCAGGTGGTCCCTACACACGAATCAAGGGGAAGGTC
GTCCAGGACACGACTAATGCTTGGGACGAGTATAAGGAGTGCATGGATGCCGTGTTGGGTCAGGTGGAAGATTTCATTCCATCCTGGGTGGACGTCGACGTAGTG
TACAGCCCGCTCTGTATTAAGGATCACTGGGTCCTGGTTGCGATAGATATGACCCAGTCCGAGATTTTTGTATACGACTCATTGCCAGGCCACATTTCCACGTTG
AAGTTGCTGACAGACATGCGGCTGTTGAGTCATACAATCCCATCGCTTTTGTACGCATGTGGGCTGATGGATACGGCCGATTGCAAGATGAAGAGGACTCCGTGG
CGTGTATACCGTCCTACGACCGACACGAGGCAGAAAGGTAGTATAGACTGTGGTATTTTTGCATGTAAATTTTTGGAATATCTTGTGTCGGGTAATAGTTTAGAA
ACTCTTGTTCATGCTGAAGTGTCGCACATTAGAAGGCAGATGAAGATTGGTAAAAGTTTATTCATAAGATTTGGTGGTACGTGGGAAGACAACACAAACTCGTAT
GTCGGTGGTGAGCTGAAAGGAATAATTGTCCCACTTACAATAACGTATAAAGAACTGAAAAATCGGTTGTACAGACTGATGAAAGTCGACCAGAATGGGTACGAC
CGGGTAATTAGGGTACTGTACCACTTGGCATGCGATTCACCGCCAATGTTCGTAACGGATGACGATGACCTCCAATTTGCATTAGTCCAAGAACAGGTTTCTAAA
GTCCCACTGTTTGTATCGACCGTCCCTCGCGAAAGTATTGACATACAGTCATCTAGATTAAACCAAGATGGAGCGCAGTCCATTCCTAATGGAGGATCTGTACCG
GAAGAAGGTTCCACGGGGGTAGATGAGTGGGGGTTGAACGACGTATACATGCAGGACATGTACAGTTCAGACTGTATATACACTCAAGACTCGGAATTGGTCGCC
CCCACGACGCGCATGCCTCCCGTAATGCATGTCTCATCAGATGAGATAAATAATACAGCACAAGTACAACATGCAAGTCCTCGTACGGTACATGTATTCCCATTT
GATGAATTAAATAACACATCACAAGTACGACATGCAAGTCCTTTGACAGAACATGTTTTCCCATCAGTAGAACCCTCATCCAGTGACCCTCGTAATGTTGGTGAT
CACCAAATACCAGTACCTCAATCCCCAAGCACCGTGACTATAGAACAGGCCAGAAGGTACAACCTTTTACCAGGCGGATCAGAACTGCACGTGGGGAAGATATTC
GTTTCAAAGCAAGACCTGCGTATGGTGCTGTCAAATGCAGCGATGCGATCTAATCGTGAATACAAGGTCAGTAGGTCAACGAAATCAAAGTTTACTGTTCCCTGC
ATCGACAATACATGCAATTGGAGGGTTGCAGCCCACTCCGTCGGTAAATCTTCGATATTTTGTATATCAAAGTATGTAGATGCACACACCTGCACGATTGACAGT
GTGAACCACGACCACAAACAGGCGAGCAGATGGGTCGTAGCGAATCTTATTAATGATAAAGTTGCTGGAACCGGTCGTATTTACAAGATAAAGCATATTAAGGAG
GATGTCCGTAAAGAGTACGGGGTGAACATAAGTTATGACAAGGCCCATCGAGCACGAGAGTTAGCATACGCTATTGTTAGGGCAGACCGGAAGACTCCTACATAC
ATCTCCATGCGTACGGATGCAGCGTATGCGTATCGGAAGTCACAGTTCACGTACTACTGGAACCAGATACTGTCGATTGGATCGGGTTCACTTGCCAAATACTTA
CAAGAAATTGGGATAGAACGAGATGGTTCCACGAAAGAAGGACTAACTGGTCCACCCAAAACACCTCTCATTCAGACTATGCAGAAGAGCGACTTGCAGTACAGT
TTGAGAAGTCTCGTCGCTACACAGTCAAACCAGTCGACTGGTGCATGTTTCATGTTGAGGACGGTGGCTTGGGACGGGACGGTTGATTTGAATGCCCGTACATGT
ACATGCATGGAGTTCCAGTACATGGGCATTCCATGTTCGCACGCAATTGCAGCAGCGAGGCACAAGAATATAAATTGCCACACGTTGATCGATCCATGCTACAGT
GTGGACTCCCTAATTAGTGCCTACGCTGAACCAATCTTACCGGTAGGCCACATGTCGGAATGGAAAAGGCCAGCTGATTACCAGCCTATTCCCGTCCAACCACCA
CGTCTTGTTAAACGTGCAGGCCGGCGTAGAACGCAATGGATCGCCTCAACCGGTGAGCGTCGTGTTGTGAACAAGTGTTCTCGATGCGAAGCAAACATGTCAACA
TCTTACGAGTCGGACGGTTCCTCTTCCGACGGAGCAACCAGTCCTATCTCAAACCATCCCTCCTCCTTCGCGGAGAACGGAGTGCCAATTAACGGTCCTCTTCAA
CAAGCACGGGAAGAAAACGACCAGCTCAGGAGAGAGCTACGTCGAACGCAACACGAGCTCAACGACACGAGGTATAAGTTAGCCCGGGTTGAAGAAACGCAGGAC
TTGCTGGAGGGACTGCTGAAGGAGGAGAAGGAGGAACGACTTCGTCTGGAGGACAGGATGGATCAGTTATTGGCCCGTCTACGCCGATACCGTAATAATAATAAT
GATTTGTAA
Protein sequence
MDHQLRIKENDRFLAQATSMSHLSNVNKLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLLREVAGSSDDSMSFLIGGNVLTFSKDQFMLITGLWRLP
GKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLNWGSDVWSRTINGLKRAMNGK
VALYKNKVWIYEVVPSLITPGVNRLRRDRHSPDNSVFVHCNGIYTSLCSCDLRSVGHTYPLVETELDKDYQRCPLDEREVVDLIAPGCSTSDSDDGHNPSPITDN
LGAEDDLPLDDAHSLETNVQAIPDESSDMPHTEAASEGGQRTPVKVLRPSTSILSNVGQSTRQSPRATSRDACPTQQQDTRRSNDKFGAMERRLDLLASDIAEVK
TDLAKVKSDLSEMKLMLQRLCQIDRREVNIGVSPSDTVHVSHPLVSNVIPEHDGDADDHKPGGSDAGKEDDVVPVEASLHEKATDGVEMTIPPSDLGDAELANPA
CIVHSVELDIAVVTPIVSTEMVELEMAPPIVQDPQAETTSDPTFEPPASINIDGPCGMIHGPRQAEHIELALTPADTSPTTKPIPTPAPAYTTSTPQPIPTLTPA
ENPTTRHLSDPVGSINLALDKIPEPLVIVHQPTKEKNPPHSKKSTTIRFTAPQEAPLVVSGFAVHEPTKLKKIEQQTAPKQSARKIEVSYPDETRKTERKRTETK
PFSPEDTHRQKKKQKMMDVDPVLASQVRPFRPKYNPLHNFSDAKFREMMRWVRDPRNDNTTRPSTTWNVQSGYSRRFFINILNPKEKVEDPEAAVILYFIMRKLS
SRPHLCVHKFSVLDPLQMQVLTAAGGPYTRIKGKVVQDTTNAWDEYKECMDAVLGQVEDFIPSWVDVDVVYSPLCIKDHWVLVAIDMTQSEIFVYDSLPGHISTL
KLLTDMRLLSHTIPSLLYACGLMDTADCKMKRTPWRVYRPTTDTRQKGSIDCGIFACKFLEYLVSGNSLETLVHAEVSHIRRQMKIGKSLFIRFGGTWEDNTNSY
VGGELKGIIVPLTITYKELKNRLYRLMKVDQNGYDRVIRVLYHLACDSPPMFVTDDDDLQFALVQEQVSKVPLFVSTVPRESIDIQSSRLNQDGAQSIPNGGSVP
EEGSTGVDEWGLNDVYMQDMYSSDCIYTQDSELVAPTTRMPPVMHVSSDEINNTAQVQHASPRTVHVFPFDELNNTSQVRHASPLTEHVFPSVEPSSSDPRNVGD
HQIPVPQSPSTVTIEQARRYNLLPGGSELHVGKIFVSKQDLRMVLSNAAMRSNREYKVSRSTKSKFTVPCIDNTCNWRVAAHSVGKSSIFCISKYVDAHTCTIDS
VNHDHKQASRWVVANLINDKVAGTGRIYKIKHIKEDVRKEYGVNISYDKAHRARELAYAIVRADRKTPTYISMRTDAAYAYRKSQFTYYWNQILSIGSGSLAKYL
QEIGIERDGSTKEGLTGPPKTPLIQTMQKSDLQYSLRSLVATQSNQSTGACFMLRTVAWDGTVDLNARTCTCMEFQYMGIPCSHAIAAARHKNINCHTLIDPCYS
VDSLISAYAEPILPVGHMSEWKRPADYQPIPVQPPRLVKRAGRRRTQWIASTGERRVVNKCSRCEANMSTSYESDGSSSDGATSPISNHPSSFAENGVPINGPLQ
QAREENDQLRRELRRTQHELNDTRYKLARVEETQDLLEGLLKEEKEERLRLEDRMDQLLARLRRYRNNNNDL
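
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The Python sketch below illustrates that relationship on the first and last codons of the CDS. It assumes the standard genetic code (NCBI translation table 1), which this page does not explicitly name, and the helper `translate` is a hypothetical name.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing anything other than A, C, G or T raise KeyError.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases and last 21 bases of the CDS listed above.
print(translate("ATGGACCATCAGTTGAGGATTAAGGAGAATGACCGC"))
print(translate("AATAATAATAATGATTTGTAA"))
```

Passing the full CDS, with its line breaks removed, to `translate` would allow the result to be compared against the protein sequence listed here.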