; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g18530 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g18530
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionprotein FAR-RED ELONGATED HYPOCOTYL 3-like
Genome locationchr7:13236449..13241709
RNA-Seq ExpressionMoc07g18530
SyntenyMoc07g18530
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154997.1 uncharacterized protein LOC111022140 [Momordica charantia]3.3e-10546.92Show/hide
Query:  MDVVLRQVEDFIPFWVDVDIVYSPLCIKDHWVLVAIDMTQSEIFLYDSLPGHISTSKLMTDVRPLSHTIPSLLYACGLMDTADCKLRKTPWRVYHPTTDT
        MDVVL QVEDFIP WVDVD+VYS L I+DHWVLVAIDMTQSEIF+YDSLPGHISTSKLM D+RPLSHTIPSLLYACGLMDTADCKLRKTPW VY PTTDT
Subjt:  MDVVLRQVEDFIPFWVDVDIVYSPLCIKDHWVLVAIDMTQSEIFLYDSLPGHISTSKLMTDVRPLSHTIPSLLYACGLMDTADCKLRKTPWRVYHPTTDT

Query:  SLK-----------------CRTLEG-----------------------------------------------------------------ASSWVVANL
          K                 C  +                                                                   ASS +VANL
Subjt:  SLK-----------------CRTLEG-----------------------------------------------------------------ASSWVVANL

Query:  IKDRVAGTGRIYKIKYIKEDVRKEFGVNISYDKAHRARELAYAIVRGRPEDSYMHLHAV-----------------------------------------
        IKDRVAG GRIY IK+IKEDVRKEFGVN SYDKAHRARELAYAIVRGRPEDSYMHLH                                           
Subjt:  IKDRVAGTGRIYKIKYIKEDVRKEFGVNISYDKAHRARELAYAIVRGRPEDSYMHLHAV-----------------------------------------

Query:  -CVSKVT-VHV------------------QLEP---------DTISWIGFTCQI---------------------------LTRNWGR--TVGPS-----
         CV  V   H+                  Q+ P            SW  F   +                            T  W +  +VG       
Subjt:  -CVSKVT-VHV------------------QLEP---------DTISWIGFTCQI---------------------------LTRNWGR--TVGPS-----

Query:  ------------YQVGRRYENMTTNSAESVNALIREARKLPITKTVEFICDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVYWCMFH
                    YQVGRRYENMTTNS +S                             ERRTHWSTQNTSHSDYA+ERLALQFEKSRRYTVKPV WCMFH
Subjt:  ------------YQVGRRYENMTTNSAESVNALIREARKLPITKTVEFICDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVYWCMFH

Query:  VEDGGLDGTVDLNARTCTCMEFQYMGIPCSHAIVA
        VED GLDGTVDLNA TCTCMEFQYMGIPCSHAI A
Subjt:  VEDGGLDGTVDLNARTCTCMEFQYMGIPCSHAIVA

XP_022155155.1 uncharacterized protein LOC111022298 [Momordica charantia]5.1e-9880.75Show/hide
Query:  MDHQLRIKENDRFPAQATSMSHLSNVNRLIKDKLTANQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREV-----------------------------
        MDHQLRIKEND FPAQAT MSHLSNVNRLIKDKLTA+QLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREV                             
Subjt:  MDHQLRIKENDRFPAQATSMSHLSNVNRLIKDKLTANQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREV-----------------------------

Query:  --------VQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFKDDEDGVKMTLILYTELVMMGKRKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV
                VQKKIGKN LRRKYFNDEASMLLEEFVEVYKQTDF+DDED VK+TLILYTELVMMGK KSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV
Subjt:  --------VQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFKDDEDGVKMTLILYTELVMMGKRKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV

Query:  KGLKRAMNGKVALYKNKVRTNKKYLVKYSLPGFPLAFQV
         GLKRAMNGKVALYKNKVRTNKKYLVKYSLPGFPLAFQV
Subjt:  KGLKRAMNGKVALYKNKVRTNKKYLVKYSLPGFPLAFQV

XP_022156658.1 uncharacterized protein LOC111023507 [Momordica charantia]1.5e-89100Show/hide
Query:  MTTNSAESVNALIREARKLPITKTVEFICDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVYWCMFHVEDGGLDGTVDLNARTCTCME
        MTTNSAESVNALIREARKLPITKTVEFICDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVYWCMFHVEDGGLDGTVDLNARTCTCME
Subjt:  MTTNSAESVNALIREARKLPITKTVEFICDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVYWCMFHVEDGGLDGTVDLNARTCTCME

Query:  FQYMGIPCSHAIVAVRHKNINCHTLIDPYYSVDSLIGTYAEPILPVGHMSEWKRPADY
        FQYMGIPCSHAIVAVRHKNINCHTLIDPYYSVDSLIGTYAEPILPVGHMSEWKRPADY
Subjt:  FQYMGIPCSHAIVAVRHKNINCHTLIDPYYSVDSLIGTYAEPILPVGHMSEWKRPADY

XP_022157199.1 uncharacterized protein LOC111023969 [Momordica charantia]7.9e-9151.45Show/hide
Query:  MDHQLRIKENDRFPAQATSMSHLSNVNRLIKDKLTANQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVV----------------------------
        M+H L++ E DRFPAQ TS+SHLS  N++I  KLT  QLDMFR+RTIFGRFVDL+MMFCS +VH+FL REVV                            
Subjt:  MDHQLRIKENDRFPAQATSMSHLSNVNRLIKDKLTANQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVV----------------------------

Query:  ---------QKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFKDDEDGVKMTLILYTELVMMGKRKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV
                 QKK+ KNRLRR+YF D   + LEEF + YK+  F +D+D VK++LI YTE+VMMGK K KS VD DLY QV+DLDYFN++DWG+ +W RT+
Subjt:  ---------QKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFKDDEDGVKMTLILYTELVMMGKRKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV

Query:  KGLKRAMNGKVALYKNKVRTNKKYLVKYSLPGFPLAFQVWIYEVVPSLITPGVNRLSKTAIPRIIRYSCSRVVGTKDMGKEVLGSAVLVISYPLVETELD
        KGL+ AM  KV  YKNKV TNKK+ V+YSL GFP+AFQVW YE++PSL+  GVNRLS TA+PRI RYSCS+ + +K + ++V  S+ L I++PLVE+E +
Subjt:  KGLKRAMNGKVALYKNKVRTNKKYLVKYSLPGFPLAFQVWIYEVVPSLITPGVNRLSKTAIPRIIRYSCSRVVGTKDMGKEVLGSAVLVISYPLVETELD

Query:  KDYQRCPLDEREVVDLNEPWCSTSDNDDGQNPSPITDNIGVEDD
        + Y+  P D R  ++ N      SD+DD Q      D+ G ++D
Subjt:  KDYQRCPLDEREVVDLNEPWCSTSDNDDGQNPSPITDNIGVEDD

XP_022159086.1 uncharacterized protein LOC111025530 [Momordica charantia]4.5e-11062.61Show/hide
Query:  ASSWVVANLIKDRVAGTGRIYKIKYIKEDVRKEFGVNISYDKAHRARELAYAIVRGRPEDSYMHLHA------VCVSKVTVHVQLEPDT-----------
        ASSWVVANLIKDRVA T RIYKIK+IKEDVR+EF VNISYDKAHRARELAYAIVRGR EDSYMHLHA      +       H++ E +            
Subjt:  ASSWVVANLIKDRVAGTGRIYKIKYIKEDVRKEFGVNISYDKAHRARELAYAIVRGRPEDSYMHLHA------VCVSKVTVHVQLEPDT-----------

Query:  ------------------ISWIGFTCQI---------------------------LTRNWGR--TVGPS-----------------YQVGRRYENMTTNS
                           SW  F   +                            T  W +  +VG                   YQVGRRYENMTTNS
Subjt:  ------------------ISWIGFTCQI---------------------------LTRNWGR--TVGPS-----------------YQVGRRYENMTTNS

Query:  AESVNALIREARKLPITKTVEFICDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVYWCMFHVEDGGLDGTVDLNARTCTCMEFQYMG
        AESVN L+REA +LPITK VEFI DLLQRWFH RRTHWSTQNTSHSDYAEE+LALQFEKSRRYTVKPV WCMFHVEDGGLDGTVDLNARTCTCMEFQYMG
Subjt:  AESVNALIREARKLPITKTVEFICDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVYWCMFHVEDGGLDGTVDLNARTCTCMEFQYMG

Query:  IPCSHAIVAVRHKNINCHTLIDPYYSVDSLIGTYAEPILPVGHMSEWKRPADY
        IPCSHAI A RHKNINCHTLIDP YSVDSLI  YAEPILP+GHMSEWKRPA+Y
Subjt:  IPCSHAIVAVRHKNINCHTLIDPYYSVDSLIGTYAEPILPVGHMSEWKRPADY

TrEMBL top hitse value%identityAlignment
A0A6J1DLM5 uncharacterized protein LOC1110222982.5e-9880.75Show/hide
Query:  MDHQLRIKENDRFPAQATSMSHLSNVNRLIKDKLTANQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREV-----------------------------
        MDHQLRIKEND FPAQAT MSHLSNVNRLIKDKLTA+QLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREV                             
Subjt:  MDHQLRIKENDRFPAQATSMSHLSNVNRLIKDKLTANQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREV-----------------------------

Query:  --------VQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFKDDEDGVKMTLILYTELVMMGKRKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV
                VQKKIGKN LRRKYFNDEASMLLEEFVEVYKQTDF+DDED VK+TLILYTELVMMGK KSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV
Subjt:  --------VQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFKDDEDGVKMTLILYTELVMMGKRKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV

Query:  KGLKRAMNGKVALYKNKVRTNKKYLVKYSLPGFPLAFQV
         GLKRAMNGKVALYKNKVRTNKKYLVKYSLPGFPLAFQV
Subjt:  KGLKRAMNGKVALYKNKVRTNKKYLVKYSLPGFPLAFQV

A0A6J1DQ99 uncharacterized protein LOC1110232352.6e-8787.29Show/hide
Query:  QILTRNWGRTVGPSYQVGRRYENMTTNSAESVNALIREARKLPITKTVEFICDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVYWCM
        +I    W R     YQVGRRYENMTTNSAESVNAL+REAR+LPITK VEFI DLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPV WCM
Subjt:  QILTRNWGRTVGPSYQVGRRYENMTTNSAESVNALIREARKLPITKTVEFICDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVYWCM

Query:  FHVEDGGLDGTVDLNARTCTCMEFQYMGIPCSHAIVAVRHKNINCHTLIDPYYSVDSLIGTYAEPILPVGHMSEWKRPADY
        FHVED GLD TVDLNARTCTCMEFQYMGIPCSHAI A RHKNINCHTLIDP Y+VDSLIG YAEPILPVGHMSEWKRPADY
Subjt:  FHVEDGGLDGTVDLNARTCTCMEFQYMGIPCSHAIVAVRHKNINCHTLIDPYYSVDSLIGTYAEPILPVGHMSEWKRPADY

A0A6J1DR77 uncharacterized protein LOC1110235077.2e-90100Show/hide
Query:  MTTNSAESVNALIREARKLPITKTVEFICDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVYWCMFHVEDGGLDGTVDLNARTCTCME
        MTTNSAESVNALIREARKLPITKTVEFICDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVYWCMFHVEDGGLDGTVDLNARTCTCME
Subjt:  MTTNSAESVNALIREARKLPITKTVEFICDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVYWCMFHVEDGGLDGTVDLNARTCTCME

Query:  FQYMGIPCSHAIVAVRHKNINCHTLIDPYYSVDSLIGTYAEPILPVGHMSEWKRPADY
        FQYMGIPCSHAIVAVRHKNINCHTLIDPYYSVDSLIGTYAEPILPVGHMSEWKRPADY
Subjt:  FQYMGIPCSHAIVAVRHKNINCHTLIDPYYSVDSLIGTYAEPILPVGHMSEWKRPADY

A0A6J1DSS5 uncharacterized protein LOC1110239693.8e-9151.45Show/hide
Query:  MDHQLRIKENDRFPAQATSMSHLSNVNRLIKDKLTANQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVV----------------------------
        M+H L++ E DRFPAQ TS+SHLS  N++I  KLT  QLDMFR+RTIFGRFVDL+MMFCS +VH+FL REVV                            
Subjt:  MDHQLRIKENDRFPAQATSMSHLSNVNRLIKDKLTANQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVV----------------------------

Query:  ---------QKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFKDDEDGVKMTLILYTELVMMGKRKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV
                 QKK+ KNRLRR+YF D   + LEEF + YK+  F +D+D VK++LI YTE+VMMGK K KS VD DLY QV+DLDYFN++DWG+ +W RT+
Subjt:  ---------QKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFKDDEDGVKMTLILYTELVMMGKRKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV

Query:  KGLKRAMNGKVALYKNKVRTNKKYLVKYSLPGFPLAFQVWIYEVVPSLITPGVNRLSKTAIPRIIRYSCSRVVGTKDMGKEVLGSAVLVISYPLVETELD
        KGL+ AM  KV  YKNKV TNKK+ V+YSL GFP+AFQVW YE++PSL+  GVNRLS TA+PRI RYSCS+ + +K + ++V  S+ L I++PLVE+E +
Subjt:  KGLKRAMNGKVALYKNKVRTNKKYLVKYSLPGFPLAFQVWIYEVVPSLITPGVNRLSKTAIPRIIRYSCSRVVGTKDMGKEVLGSAVLVISYPLVETELD

Query:  KDYQRCPLDEREVVDLNEPWCSTSDNDDGQNPSPITDNIGVEDD
        + Y+  P D R  ++ N      SD+DD Q      D+ G ++D
Subjt:  KDYQRCPLDEREVVDLNEPWCSTSDNDDGQNPSPITDNIGVEDD

A0A6J1E2V3 uncharacterized protein LOC1110255302.2e-11062.61Show/hide
Query:  ASSWVVANLIKDRVAGTGRIYKIKYIKEDVRKEFGVNISYDKAHRARELAYAIVRGRPEDSYMHLHA------VCVSKVTVHVQLEPDT-----------
        ASSWVVANLIKDRVA T RIYKIK+IKEDVR+EF VNISYDKAHRARELAYAIVRGR EDSYMHLHA      +       H++ E +            
Subjt:  ASSWVVANLIKDRVAGTGRIYKIKYIKEDVRKEFGVNISYDKAHRARELAYAIVRGRPEDSYMHLHA------VCVSKVTVHVQLEPDT-----------

Query:  ------------------ISWIGFTCQI---------------------------LTRNWGR--TVGPS-----------------YQVGRRYENMTTNS
                           SW  F   +                            T  W +  +VG                   YQVGRRYENMTTNS
Subjt:  ------------------ISWIGFTCQI---------------------------LTRNWGR--TVGPS-----------------YQVGRRYENMTTNS

Query:  AESVNALIREARKLPITKTVEFICDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVYWCMFHVEDGGLDGTVDLNARTCTCMEFQYMG
        AESVN L+REA +LPITK VEFI DLLQRWFH RRTHWSTQNTSHSDYAEE+LALQFEKSRRYTVKPV WCMFHVEDGGLDGTVDLNARTCTCMEFQYMG
Subjt:  AESVNALIREARKLPITKTVEFICDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVYWCMFHVEDGGLDGTVDLNARTCTCMEFQYMG

Query:  IPCSHAIVAVRHKNINCHTLIDPYYSVDSLIGTYAEPILPVGHMSEWKRPADY
        IPCSHAI A RHKNINCHTLIDP YSVDSLI  YAEPILP+GHMSEWKRPA+Y
Subjt:  IPCSHAIVAVRHKNINCHTLIDPYYSVDSLIGTYAEPILPVGHMSEWKRPADY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49920.1 MuDR family transposase4.4e-0739.39Show/hide
Query:  GTVDLNARTCTCMEFQYMGIPCSHAIVAVRHKNINCHTLIDPYYSVDSLIGTYAEPILPVGHMSEW
        G V LN  TCTC EFQ    PC HA+       IN    +D  Y+V+    TY+    PV  +S W
Subjt:  GTVDLNARTCTCMEFQYMGIPCSHAIVAVRHKNINCHTLIDPYYSVDSLIGTYAEPILPVGHMSEW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCATCAGTTGAGGATTAAGGAGAATGACCGCTTTCCGGCTCAAGCTACCAGCATGTCTCACTTGAGCAATGTCAACAGGCTTATCAAGGATAAGCTCACAGCAAA
CCAACTTGATATGTTTCGTAGAAGAACAATATTTGGTCGATTTGTCGACTTGGAGATGATGTTCTGCAGTGGTGTAGTTCATCACTTTCTGTCAAGGGAGGTGGTCCAGA
AAAAGATTGGAAAGAATAGGTTGCGGAGGAAGTACTTCAACGATGAAGCCTCCATGCTGCTCGAAGAGTTTGTGGAGGTTTACAAGCAGACTGATTTCAAGGACGATGAG
GACGGCGTTAAAATGACATTAATTTTGTATACGGAGCTTGTGATGATGGGAAAGAGAAAGAGCAAGTCGAAGGTTGACATCGACTTGTACAACCAAGTCGATGACTTGGA
CTACTTCAACCATTTGGACTGGGGTTCTGATGTTTGGAGTAGAACAGTTAAAGGCCTGAAGCGTGCGATGAATGGAAAAGTTGCGCTATACAAGAATAAAGTAAGAACGA
ACAAAAAGTATCTAGTAAAGTATAGCCTACCGGGATTTCCGCTTGCGTTTCAGGTGTGGATATATGAGGTTGTCCCATCTCTCATCACTCCCGGTGTCAATCGTTTGAGC
AAGACCGCCATTCCCCGGATAATTCGGTATTCGTGCAGTAGAGTCGTCGGTACAAAAGATATGGGGAAGGAGGTCCTTGGTTCAGCGGTGCTGGTCATATCTTATCCACT
CGTGGAGACGGAGTTGGATAAGGACTACCAGAGATGTCCATTGGACGAAAGAGAGGTGGTTGATTTAAATGAGCCTTGGTGTTCCACCTCCGACAACGATGATGGACAAA
ATCCTTCCCCCATCACCGACAATATTGGCGTCGAAGACGATCTCCCGCTCAACGATGCGCATTCGTTGGAAACGAATGTACAGGCAGTGCCGGATGAGTCTTCGGATATG
CCACGTACAGAGGCTGAATCCGAAGGTGGGCAACGGACACCAGTCGAAGTACTTCGACCAAGTACTTCTATTCGGTCGGATGTGGGGCAAAGCACGCGGCAATCACCAAG
GCTGGATCTTTTAGCTTCGGACATGGCGGAGGTGAAGACGAATTCGGCGGAGGTCAAGTCGGACTTAAGTGAAATGAAACTCATGCTTCAACGATTGTGCCAGATCGATA
GGCGCGAGGTGAATATTGGTGTCTCTCCGTCAGATACAGTCCACGTGTCACATCCATTGGTATTCAATGTTATCCCCGAGCATGATGGGGATGTTGATGACCATCAACCT
GGAGGTTCCGATGCTGGAAAGGATGATGATGTGGTTCATGTAGAAGCGTCGTTGCATGAAAAGGCAACAGATGTTGTAGAGATGACCATACACCCATCCGATTTTGAAGA
TGCTGAACTAGCCAACCCCCTGCGTATCATTGATTCGGTGGAGTTGGACGTTGCAGTGGGGACACCCATTCTTTCGACGAAGCTGGTGGAACTCGAAATGACACCTCCAA
TAGTACATGATCCACAAGCAGAGACAACGTCTGATCCAACCTTCGAGCCTCCTGCCTCAAGCAACATTGATGGTTCGTGTGGCACGATCCATGCGCCTCGTCAAGCCGAT
CATATTGAGTTGGCCCTTACATCAGCGGATACGACCCCCACTACTCAACCTATTCCCACCTCTACACCAGCATGTACGACTGCCACCCCTCATCCCGTGGGTTCTACTAA
CCTCGACGGTTCTGCTGTTAAAGAATCCACTAAGTCGAAGAAAACCAAACAAAAAACTTCTCCTAAGCAGTCGGCCACGAAAACCGAGGTTTCATATCCCGACGAAACAA
GAAGGACGGAGCGTAAGCGGACGGAAACGAAACCATTCAGTCCGAAGGACACGCATCGGCAGAAGAAGAAGCAAAAGATGGTGGATGTGGAGCCCGTACCTGCTAGCCAG
GTTAGGCCATTTCGTCCCAAATACAACCCATTGCATAACTTTCCGGATGCGAAGTTTAGGGAGATGATGCGTTGGGTACGAGACCCTGGGAATGACAAAATAACGCGACT
GTCTACAACATGGAATGTGCAGAGCGGATATTCTAGGAAATTCTTCTTTAACATCCTCAATCCTAAGGAGAAGGTCGAAGACCCGGAAGCTGCTGCCATTACATATTTTA
TTATGAGGAAGCTCGGTAGTCGGCCGCACCTATGCCAAGTTATTGCCGCTGCAACTGGTCCTTACTCACAAATCAAGGGGAAGGTCGTCCAGGACATGATCAATGCTTGG
GACGAGTATAAGGAGTGCATGGATGTCGTGCTACGTCAGGTGGAAGATTTCATTCCATTCTGGGTTGACGTCGACATAGTGTACAGCCCGCTCTGTATCAAGGATCATTG
GGTCCTGGTTGCGATAGATATGACCCAGTCCGAAATTTTTTTGTACGACTCATTGCCAGGCCACATTTCCACGTCGAAGTTGATGACAGACGTGCGGCCGTTGAGTCATA
CAATCCCATCGCTTTTGTACGCATGTGGGCTGATGGATACAGCCGATTGCAAGCTGAGGAAGACTCCGTGGCGTGTATACCATCCTACGACCGACACGAGTCTGAAGTGT
CGCACATTAGAAGGAGCGAGCAGCTGGGTCGTAGCGAATCTTATTAAGGATAGAGTTGCTGGAACCGGTCGTATTTACAAGATAAAGTATATTAAGGAGGATGTCCGTAA
AGAGTTCGGGGTGAACATAAGTTATGACAAGGCCCATCGAGCAAGGGAGTTAGCATACGCTATTGTTAGGGGCCGACCGGAGGACTCCTACATGCATCTCCATGCCGTAT
GCGTATCGAAAGTCACAGTTCACGTACAACTGGAACCAGATACTATCAGTTGGATCGGGTTCACTTGCCAAATACTTACAAGAAATTGGGGTAGAACGGTGGGCCCGAGC
TACCAAGTTGGTAGAAGATATGAAAACATGACGACAAACAGCGCTGAGTCGGTAAATGCCCTCATTCGAGAGGCTAGAAAGTTACCTATCACTAAGACTGTCGAGTTCAT
CTGCGACTTGCTACAAAGATGGTTCCACGAAAGAAGGACCCACTGGTCCACCCAAAACACCTCTCATTCAGACTATGCAGAAGAGCGACTTGCACTTCAGTTTGAGAAGT
CTCGTCGCTACACAGTCAAACCAGTTTACTGGTGCATGTTTCATGTTGAGGACGGTGGCTTGGATGGGACGGTTGATTTGAATGCCCGTACATGTACATGCATGGAGTTC
CAGTACATGGGCATTCCATGTTCGCACGCAATTGTAGCAGTGAGGCACAAGAATATAAATTGCCACACGTTGATCGATCCATACTACAGTGTGGACTCCCTAATTGGTAC
CTACGCCGAACCAATCTTACCTGTAGGCCACATGTCGGAATGGAAAAGGCCAGCTGATTACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCATCAGTTGAGGATTAAGGAGAATGACCGCTTTCCGGCTCAAGCTACCAGCATGTCTCACTTGAGCAATGTCAACAGGCTTATCAAGGATAAGCTCACAGCAAA
CCAACTTGATATGTTTCGTAGAAGAACAATATTTGGTCGATTTGTCGACTTGGAGATGATGTTCTGCAGTGGTGTAGTTCATCACTTTCTGTCAAGGGAGGTGGTCCAGA
AAAAGATTGGAAAGAATAGGTTGCGGAGGAAGTACTTCAACGATGAAGCCTCCATGCTGCTCGAAGAGTTTGTGGAGGTTTACAAGCAGACTGATTTCAAGGACGATGAG
GACGGCGTTAAAATGACATTAATTTTGTATACGGAGCTTGTGATGATGGGAAAGAGAAAGAGCAAGTCGAAGGTTGACATCGACTTGTACAACCAAGTCGATGACTTGGA
CTACTTCAACCATTTGGACTGGGGTTCTGATGTTTGGAGTAGAACAGTTAAAGGCCTGAAGCGTGCGATGAATGGAAAAGTTGCGCTATACAAGAATAAAGTAAGAACGA
ACAAAAAGTATCTAGTAAAGTATAGCCTACCGGGATTTCCGCTTGCGTTTCAGGTGTGGATATATGAGGTTGTCCCATCTCTCATCACTCCCGGTGTCAATCGTTTGAGC
AAGACCGCCATTCCCCGGATAATTCGGTATTCGTGCAGTAGAGTCGTCGGTACAAAAGATATGGGGAAGGAGGTCCTTGGTTCAGCGGTGCTGGTCATATCTTATCCACT
CGTGGAGACGGAGTTGGATAAGGACTACCAGAGATGTCCATTGGACGAAAGAGAGGTGGTTGATTTAAATGAGCCTTGGTGTTCCACCTCCGACAACGATGATGGACAAA
ATCCTTCCCCCATCACCGACAATATTGGCGTCGAAGACGATCTCCCGCTCAACGATGCGCATTCGTTGGAAACGAATGTACAGGCAGTGCCGGATGAGTCTTCGGATATG
CCACGTACAGAGGCTGAATCCGAAGGTGGGCAACGGACACCAGTCGAAGTACTTCGACCAAGTACTTCTATTCGGTCGGATGTGGGGCAAAGCACGCGGCAATCACCAAG
GCTGGATCTTTTAGCTTCGGACATGGCGGAGGTGAAGACGAATTCGGCGGAGGTCAAGTCGGACTTAAGTGAAATGAAACTCATGCTTCAACGATTGTGCCAGATCGATA
GGCGCGAGGTGAATATTGGTGTCTCTCCGTCAGATACAGTCCACGTGTCACATCCATTGGTATTCAATGTTATCCCCGAGCATGATGGGGATGTTGATGACCATCAACCT
GGAGGTTCCGATGCTGGAAAGGATGATGATGTGGTTCATGTAGAAGCGTCGTTGCATGAAAAGGCAACAGATGTTGTAGAGATGACCATACACCCATCCGATTTTGAAGA
TGCTGAACTAGCCAACCCCCTGCGTATCATTGATTCGGTGGAGTTGGACGTTGCAGTGGGGACACCCATTCTTTCGACGAAGCTGGTGGAACTCGAAATGACACCTCCAA
TAGTACATGATCCACAAGCAGAGACAACGTCTGATCCAACCTTCGAGCCTCCTGCCTCAAGCAACATTGATGGTTCGTGTGGCACGATCCATGCGCCTCGTCAAGCCGAT
CATATTGAGTTGGCCCTTACATCAGCGGATACGACCCCCACTACTCAACCTATTCCCACCTCTACACCAGCATGTACGACTGCCACCCCTCATCCCGTGGGTTCTACTAA
CCTCGACGGTTCTGCTGTTAAAGAATCCACTAAGTCGAAGAAAACCAAACAAAAAACTTCTCCTAAGCAGTCGGCCACGAAAACCGAGGTTTCATATCCCGACGAAACAA
GAAGGACGGAGCGTAAGCGGACGGAAACGAAACCATTCAGTCCGAAGGACACGCATCGGCAGAAGAAGAAGCAAAAGATGGTGGATGTGGAGCCCGTACCTGCTAGCCAG
GTTAGGCCATTTCGTCCCAAATACAACCCATTGCATAACTTTCCGGATGCGAAGTTTAGGGAGATGATGCGTTGGGTACGAGACCCTGGGAATGACAAAATAACGCGACT
GTCTACAACATGGAATGTGCAGAGCGGATATTCTAGGAAATTCTTCTTTAACATCCTCAATCCTAAGGAGAAGGTCGAAGACCCGGAAGCTGCTGCCATTACATATTTTA
TTATGAGGAAGCTCGGTAGTCGGCCGCACCTATGCCAAGTTATTGCCGCTGCAACTGGTCCTTACTCACAAATCAAGGGGAAGGTCGTCCAGGACATGATCAATGCTTGG
GACGAGTATAAGGAGTGCATGGATGTCGTGCTACGTCAGGTGGAAGATTTCATTCCATTCTGGGTTGACGTCGACATAGTGTACAGCCCGCTCTGTATCAAGGATCATTG
GGTCCTGGTTGCGATAGATATGACCCAGTCCGAAATTTTTTTGTACGACTCATTGCCAGGCCACATTTCCACGTCGAAGTTGATGACAGACGTGCGGCCGTTGAGTCATA
CAATCCCATCGCTTTTGTACGCATGTGGGCTGATGGATACAGCCGATTGCAAGCTGAGGAAGACTCCGTGGCGTGTATACCATCCTACGACCGACACGAGTCTGAAGTGT
CGCACATTAGAAGGAGCGAGCAGCTGGGTCGTAGCGAATCTTATTAAGGATAGAGTTGCTGGAACCGGTCGTATTTACAAGATAAAGTATATTAAGGAGGATGTCCGTAA
AGAGTTCGGGGTGAACATAAGTTATGACAAGGCCCATCGAGCAAGGGAGTTAGCATACGCTATTGTTAGGGGCCGACCGGAGGACTCCTACATGCATCTCCATGCCGTAT
GCGTATCGAAAGTCACAGTTCACGTACAACTGGAACCAGATACTATCAGTTGGATCGGGTTCACTTGCCAAATACTTACAAGAAATTGGGGTAGAACGGTGGGCCCGAGC
TACCAAGTTGGTAGAAGATATGAAAACATGACGACAAACAGCGCTGAGTCGGTAAATGCCCTCATTCGAGAGGCTAGAAAGTTACCTATCACTAAGACTGTCGAGTTCAT
CTGCGACTTGCTACAAAGATGGTTCCACGAAAGAAGGACCCACTGGTCCACCCAAAACACCTCTCATTCAGACTATGCAGAAGAGCGACTTGCACTTCAGTTTGAGAAGT
CTCGTCGCTACACAGTCAAACCAGTTTACTGGTGCATGTTTCATGTTGAGGACGGTGGCTTGGATGGGACGGTTGATTTGAATGCCCGTACATGTACATGCATGGAGTTC
CAGTACATGGGCATTCCATGTTCGCACGCAATTGTAGCAGTGAGGCACAAGAATATAAATTGCCACACGTTGATCGATCCATACTACAGTGTGGACTCCCTAATTGGTAC
CTACGCCGAACCAATCTTACCTGTAGGCCACATGTCGGAATGGAAAAGGCCAGCTGATTACTAG
Protein sequenceShow/hide protein sequence
MDHQLRIKENDRFPAQATSMSHLSNVNRLIKDKLTANQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFKDDE
DGVKMTLILYTELVMMGKRKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTVKGLKRAMNGKVALYKNKVRTNKKYLVKYSLPGFPLAFQVWIYEVVPSLITPGVNRLS
KTAIPRIIRYSCSRVVGTKDMGKEVLGSAVLVISYPLVETELDKDYQRCPLDEREVVDLNEPWCSTSDNDDGQNPSPITDNIGVEDDLPLNDAHSLETNVQAVPDESSDM
PRTEAESEGGQRTPVEVLRPSTSIRSDVGQSTRQSPRLDLLASDMAEVKTNSAEVKSDLSEMKLMLQRLCQIDRREVNIGVSPSDTVHVSHPLVFNVIPEHDGDVDDHQP
GGSDAGKDDDVVHVEASLHEKATDVVEMTIHPSDFEDAELANPLRIIDSVELDVAVGTPILSTKLVELEMTPPIVHDPQAETTSDPTFEPPASSNIDGSCGTIHAPRQAD
HIELALTSADTTPTTQPIPTSTPACTTATPHPVGSTNLDGSAVKESTKSKKTKQKTSPKQSATKTEVSYPDETRRTERKRTETKPFSPKDTHRQKKKQKMVDVEPVPASQ
VRPFRPKYNPLHNFPDAKFREMMRWVRDPGNDKITRLSTTWNVQSGYSRKFFFNILNPKEKVEDPEAAAITYFIMRKLGSRPHLCQVIAAATGPYSQIKGKVVQDMINAW
DEYKECMDVVLRQVEDFIPFWVDVDIVYSPLCIKDHWVLVAIDMTQSEIFLYDSLPGHISTSKLMTDVRPLSHTIPSLLYACGLMDTADCKLRKTPWRVYHPTTDTSLKC
RTLEGASSWVVANLIKDRVAGTGRIYKIKYIKEDVRKEFGVNISYDKAHRARELAYAIVRGRPEDSYMHLHAVCVSKVTVHVQLEPDTISWIGFTCQILTRNWGRTVGPS
YQVGRRYENMTTNSAESVNALIREARKLPITKTVEFICDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVYWCMFHVEDGGLDGTVDLNARTCTCMEF
QYMGIPCSHAIVAVRHKNINCHTLIDPYYSVDSLIGTYAEPILPVGHMSEWKRPADY