; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g33030 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g33030
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr8:23982897..23985985
RNA-Seq ExpressionMoc08g33030
SyntenyMoc08g33030
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia]2.2e-4644.44Show/hide
Query:  DLCSRRFTTGDIVLANFLRPTYGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTDYQTRWLDLDVVYLPYDIA---------------------
        +LC R+FTTGD++++NFLR T G+Y  M +PN + +RVA++YDW G+  ++LSY+DGTH+D  TRW+D+D VYLPY+I                      
Subjt:  DLCSRRFTTGDIVLANFLRPTYGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTDYQTRWLDLDVVYLPYDIA---------------------

Query:  ---------LESELRPMAVVLLALMHKSGVQVLRPTLPNTPWHIRQVTSGPQQSGSGDCGMFCVKYFEYDVTGSNMTSLTQDNISFFREKLAIEMWAN
                 LE EL+PM  ++  L+ + GV + +P +P TPW IR+V+S PQQ   GDCG+FC+ +FEYDVT  +  +LTQ  +SFFR + A+++WAN
Subjt:  ---------LESELRPMAVVLLALMHKSGVQVLRPTLPNTPWHIRQVTSGPQQSGSGDCGMFCVKYFEYDVTGSNMTSLTQDNISFFREKLAIEMWAN

XP_022154561.1 uncharacterized protein LOC111021802 [Momordica charantia]7.4e-7449.01Show/hide
Query:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISELKYDRSPVRKDTSPHKLRALYFNDSNDVLLSEFEKIYLAAW
        MFRKT F HLLD+DLVFNG LIHNILLRE+E+STP+TISFNLF  ++SF R +F +IS LKY R+PVR++T PH+L  LYFND  D++LS+FEK+Y AA 
Subjt:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISELKYDRSPVRKDTSPHKLRALYFNDSNDVLLSEFEKIYLAAW

Query:  FEDDFDAVK---------------SEARPDEE----VEGWPV-----------QEIVQSLR--------------------FPLGVPVWAYETISSLSGR
        FEDD+D VK                  + D      V+ W V           ++ + SL+                    FP    VWAY+TISSLS R
Subjt:  FEDDFDAVK---------------SEARPDEE----VEGWPV-----------QEIVQSLR--------------------FPLGVPVWAYETISSLSGR

Query:  VANKVNPDAVPRILRWRCSHSTAWHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPELEDEDEVPRENDAAETSSARAGPEKDDGRHGGNVDEGVRE
        VANKV  D VP I +WR  HSTAWHVLDR+IF S+ GRTR+++ TD ET+FL+R+F+PP  +D+D +    D A  S+ R G + DD   G +V E V +
Subjt:  VANKVNPDAVPRILRWRCSHSTAWHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPELEDEDEVPRENDAAETSSARAGPEKDDGRHGGNVDEGVRE

Query:  DVHKEAEEENGRGKKVRMSSVRLRKVEKRMKRMDKRMDDGFEGIKAELKSIRKFL
        D   E EE  G+  KV +S+ RL++VEK +K MDKRMD+    I+AELKSI+KFL
Subjt:  DVHKEAEEENGRGKKVRMSSVRLRKVEKRMKRMDKRMDDGFEGIKAELKSIRKFL

XP_022156465.1 uncharacterized protein LOC111023353 [Momordica charantia]2.5e-4548.25Show/hide
Query:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISELKYDRSPVRKDTSPHKLRALYFNDSNDVLLSEFEKIYLAAW
        MFRKT+FGHLLD+DLVFNGPLIHNILLRE+EDSTP+TISFNLFG +VSFGRREFD+IS L YDRSPVRK T  HKLR LYFND  + +LS+F K+Y+AA 
Subjt:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISELKYDRSPVRKDTSPHKLRALYFNDSNDVLLSEFEKIYLAAW

Query:  FEDDFDAVKSEARPDEEVEGWPVQEIVQSLRFPLG-VPVWAYETISSLSGRVANKV------NPDAVPRILRWRCSHSTAWHVLDREIFRSSTGRTRSIE
        F+DDFD +K       E+     +  ++  +  LG V  W       L+    +K        P  + +    R S+S        +++ +   RTR +E
Subjt:  FEDDFDAVKSEARPDEEVEGWPVQEIVQSLRFPLG-VPVWAYETISSLSGRVANKV------NPDAVPRILRWRCSHSTAWHVLDREIFRSSTGRTRSIE

Query:  ATDAETTFLDRTFEPPELEDEDEVPRENDAAETSSARAGPEKDDGRHGGNVDEGVRE
        ATDAET F+ RTFEPPE ED+D   R+ DA  ++        D GR        VRE
Subjt:  ATDAETTFLDRTFEPPELEDEDEVPRENDAAETSSARAGPEKDDGRHGGNVDEGVRE

XP_022159061.1 uncharacterized protein LOC111025501 [Momordica charantia]3.7e-4994.5Show/hide
Query:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISELKYDRSPVRKDTSPHKLRALYFNDSNDVLLSEFEKIYLAAW
        MFRKTVFGHLLDLDLVFNGPLIHNILLREIE STPDTISFNLFGNK SFGR EFDIIS LKYDRSPVRKDTSPH+LRALYFNDSNDVLLSEFEKIYLAA 
Subjt:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISELKYDRSPVRKDTSPHKLRALYFNDSNDVLLSEFEKIYLAAW

Query:  FEDDFDAVK
        FEDDFDAVK
Subjt:  FEDDFDAVK

XP_022159362.1 uncharacterized protein LOC111025779 [Momordica charantia]1.2e-4469.44Show/hide
Query:  MDGTHTDYQTRWLDLDVVYLPYDI------------------------------ALESELRPMAVVLLALMHKSGVQVLRPTLPNTPWHIRQVTSGPQQS
        MDGTHTDYQTRWLDLDVVYLPY+I                               LESELR M VVL ALMH++GVQVLR TLP  PW I QVTS PQQS
Subjt:  MDGTHTDYQTRWLDLDVVYLPYDI------------------------------ALESELRPMAVVLLALMHKSGVQVLRPTLPNTPWHIRQVTSGPQQS

Query:  GSGDCGMFCVKYFEYDVTGSNMTSLTQDNISFFREKLAIEMWAN
        GSGDCGMFCVKYFEYDVT SNMTSLTQDNISFFREKLAIEMWAN
Subjt:  GSGDCGMFCVKYFEYDVTGSNMTSLTQDNISFFREKLAIEMWAN

TrEMBL top hitse value%identityAlignment
A0A6J1DLV0 uncharacterized protein LOC1110216461.1e-4644.44Show/hide
Query:  DLCSRRFTTGDIVLANFLRPTYGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTDYQTRWLDLDVVYLPYDIA---------------------
        +LC R+FTTGD++++NFLR T G+Y  M +PN + +RVA++YDW G+  ++LSY+DGTH+D  TRW+D+D VYLPY+I                      
Subjt:  DLCSRRFTTGDIVLANFLRPTYGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTDYQTRWLDLDVVYLPYDIA---------------------

Query:  ---------LESELRPMAVVLLALMHKSGVQVLRPTLPNTPWHIRQVTSGPQQSGSGDCGMFCVKYFEYDVTGSNMTSLTQDNISFFREKLAIEMWAN
                 LE EL+PM  ++  L+ + GV + +P +P TPW IR+V+S PQQ   GDCG+FC+ +FEYDVT  +  +LTQ  +SFFR + A+++WAN
Subjt:  ---------LESELRPMAVVLLALMHKSGVQVLRPTLPNTPWHIRQVTSGPQQSGSGDCGMFCVKYFEYDVTGSNMTSLTQDNISFFREKLAIEMWAN

A0A6J1DP34 uncharacterized protein LOC1110218023.6e-7449.01Show/hide
Query:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISELKYDRSPVRKDTSPHKLRALYFNDSNDVLLSEFEKIYLAAW
        MFRKT F HLLD+DLVFNG LIHNILLRE+E+STP+TISFNLF  ++SF R +F +IS LKY R+PVR++T PH+L  LYFND  D++LS+FEK+Y AA 
Subjt:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISELKYDRSPVRKDTSPHKLRALYFNDSNDVLLSEFEKIYLAAW

Query:  FEDDFDAVK---------------SEARPDEE----VEGWPV-----------QEIVQSLR--------------------FPLGVPVWAYETISSLSGR
        FEDD+D VK                  + D      V+ W V           ++ + SL+                    FP    VWAY+TISSLS R
Subjt:  FEDDFDAVK---------------SEARPDEE----VEGWPV-----------QEIVQSLR--------------------FPLGVPVWAYETISSLSGR

Query:  VANKVNPDAVPRILRWRCSHSTAWHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPELEDEDEVPRENDAAETSSARAGPEKDDGRHGGNVDEGVRE
        VANKV  D VP I +WR  HSTAWHVLDR+IF S+ GRTR+++ TD ET+FL+R+F+PP  +D+D +    D A  S+ R G + DD   G +V E V +
Subjt:  VANKVNPDAVPRILRWRCSHSTAWHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPELEDEDEVPRENDAAETSSARAGPEKDDGRHGGNVDEGVRE

Query:  DVHKEAEEENGRGKKVRMSSVRLRKVEKRMKRMDKRMDDGFEGIKAELKSIRKFL
        D   E EE  G+  KV +S+ RL++VEK +K MDKRMD+    I+AELKSI+KFL
Subjt:  DVHKEAEEENGRGKKVRMSSVRLRKVEKRMKRMDKRMDDGFEGIKAELKSIRKFL

A0A6J1DQC8 uncharacterized protein LOC1110233531.2e-4548.25Show/hide
Query:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISELKYDRSPVRKDTSPHKLRALYFNDSNDVLLSEFEKIYLAAW
        MFRKT+FGHLLD+DLVFNGPLIHNILLRE+EDSTP+TISFNLFG +VSFGRREFD+IS L YDRSPVRK T  HKLR LYFND  + +LS+F K+Y+AA 
Subjt:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISELKYDRSPVRKDTSPHKLRALYFNDSNDVLLSEFEKIYLAAW

Query:  FEDDFDAVKSEARPDEEVEGWPVQEIVQSLRFPLG-VPVWAYETISSLSGRVANKV------NPDAVPRILRWRCSHSTAWHVLDREIFRSSTGRTRSIE
        F+DDFD +K       E+     +  ++  +  LG V  W       L+    +K        P  + +    R S+S        +++ +   RTR +E
Subjt:  FEDDFDAVKSEARPDEEVEGWPVQEIVQSLRFPLG-VPVWAYETISSLSGRVANKV------NPDAVPRILRWRCSHSTAWHVLDREIFRSSTGRTRSIE

Query:  ATDAETTFLDRTFEPPELEDEDEVPRENDAAETSSARAGPEKDDGRHGGNVDEGVRE
        ATDAET F+ RTFEPPE ED+D   R+ DA  ++        D GR        VRE
Subjt:  ATDAETTFLDRTFEPPELEDEDEVPRENDAAETSSARAGPEKDDGRHGGNVDEGVRE

A0A6J1DZM8 uncharacterized protein LOC1110257796.0e-4569.44Show/hide
Query:  MDGTHTDYQTRWLDLDVVYLPYDI------------------------------ALESELRPMAVVLLALMHKSGVQVLRPTLPNTPWHIRQVTSGPQQS
        MDGTHTDYQTRWLDLDVVYLPY+I                               LESELR M VVL ALMH++GVQVLR TLP  PW I QVTS PQQS
Subjt:  MDGTHTDYQTRWLDLDVVYLPYDI------------------------------ALESELRPMAVVLLALMHKSGVQVLRPTLPNTPWHIRQVTSGPQQS

Query:  GSGDCGMFCVKYFEYDVTGSNMTSLTQDNISFFREKLAIEMWAN
        GSGDCGMFCVKYFEYDVT SNMTSLTQDNISFFREKLAIEMWAN
Subjt:  GSGDCGMFCVKYFEYDVTGSNMTSLTQDNISFFREKLAIEMWAN

A0A6J1E2S4 uncharacterized protein LOC1110255011.8e-4994.5Show/hide
Query:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISELKYDRSPVRKDTSPHKLRALYFNDSNDVLLSEFEKIYLAAW
        MFRKTVFGHLLDLDLVFNGPLIHNILLREIE STPDTISFNLFGNK SFGR EFDIIS LKYDRSPVRKDTSPH+LRALYFNDSNDVLLSEFEKIYLAA 
Subjt:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISELKYDRSPVRKDTSPHKLRALYFNDSNDVLLSEFEKIYLAAW

Query:  FEDDFDAVK
        FEDDFDAVK
Subjt:  FEDDFDAVK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTAGGAAAACTGTATTTGGTCATCTCCTTGACTTGGATCTCGTATTTAACGGGCCATTGATACACAACATCCTACTTAGGGAGATCGAGGATAGTACACCTGACAC
CATTAGCTTCAACCTATTTGGTAATAAGGTGTCATTTGGGCGGAGGGAGTTTGACATTATTTCTGAGCTTAAGTATGATAGGAGTCCAGTTAGAAAAGACACATCTCCCC
ACAAACTCAGGGCTCTTTACTTTAATGATAGCAACGACGTTCTCTTGAGTGAATTTGAGAAGATTTATTTAGCCGCATGGTTCGAGGACGACTTCGACGCGGTCAAGTCT
GAAGCGCGGCCCGACGAAGAGGTCGAAGGATGGCCGGTTCAGGAAATCGTACAGTCTTTACGGTTTCCCTTGGGTGTTCCAGTGTGGGCCTACGAGACTATATCTTCCTT
ATCTGGGCGTGTGGCCAATAAGGTAAATCCGGATGCCGTGCCACGGATTCTTCGGTGGAGGTGCAGCCATTCAACTGCATGGCATGTACTGGACAGAGAGATTTTCCGAT
CTAGCACAGGAAGAACTCGATCAATAGAAGCAACGGATGCTGAGACGACCTTCCTAGATAGGACGTTCGAACCACCGGAGCTCGAAGATGAGGACGAAGTCCCACGCGAG
AATGATGCTGCTGAAACATCAAGTGCACGTGCAGGACCAGAGAAGGATGACGGGAGACATGGAGGGAACGTCGACGAAGGTGTCAGAGAAGACGTCCATAAAGAGGCTGA
GGAGGAGAATGGACGTGGTAAGAAGGTACGCATGTCTAGTGTGCGTCTGAGGAAGGTTGAGAAACGGATGAAGCGCATGGACAAGCGCATGGATGACGGGTTCGAAGGTA
TTAAGGCTGAATTAAAATCCATCCGAAAGTTCTTGCGAAGAATCACTAAGGGTTTACCTGTCGACCCGAATGACATGAGAAGAGGCAGCAGCGGTGATGGTACTAAGCAA
GGAAATGGTCCGAGTGATGGTTCTGGGCCAAGAGATGGTCCGAGTGATGCACCCAGTGGGGGACGTAGTGATCGAGGTCATGGTCCTAAGGATGGCGATGGTTCTACACC
ATCTCTTAGGGACGTGGACGACACAGATGACATGATCATCGATCCCCCCCATGTGATCGCGCACGAGACAAAGGAACATCAATCCGGCCAAACGACCGGGGACCAGGTTG
AACTAGAACGCGAGTATTCACCTGCCTTGACGCGCAAGGAAGACATGGGTACAGAGGACGTGTTCAAGGAAGATATCGGTCTACATGCGCGTTGTGAGGCTGCCCCTCTC
GAGCAGACTCCTGTTCAGAGCAGACAGGTGGACCATATTACGATTGATTCGCATCTAATACGCCGAGGACTTACGGACTCTGATGCGGAAGGGCCGGGAGCAACGTCGCA
ACCAGACCTGGATGAGGTATGTGTGCTATTGCAGCCCGTTGAACGTCAGAACCCTCGGTGGGGGTCTCGGAAGAGGAAGCTCTCATGGAAGCTTCAGGGGTCGTTTAATG
TAATGGTGGACGGGAAGCGGAAGAAGGTAATGCGATATGACCCACTAGTCCACGTCCCCTCTGAACAAGTCCAGAAGTTTCATGCTTGGCTGGCGAACCCTAACACTGAC
CACGCCACTCGCAAATCATGCTACGGTGATCGAGGAAAGACATGGTTTCGTGACCTTATCAACTCGGGCAAGTGGATGACGAGTGAGGACTTGTGTTCTCGAAGATTCAC
CACCGGTGACATTGTCCTTGCGAACTTTCTTCGACCAACATACGGACTATATCAACGCATGACAGCCCCGAACGCTGTACCTGCGAGAGTTGCAGCGGAATACGATTGGG
CTGGGAAGTGTAAGACCATCCTGAGCTATATGGACGGGACGCACACCGACTATCAGACACGATGGCTTGATCTGGATGTTGTTTACCTGCCATACGACATCGCTTTGGAG
TCCGAGTTGAGGCCGATGGCTGTTGTCCTACTGGCGTTGATGCACAAGTCCGGTGTTCAGGTACTGAGGCCGACACTACCGAATACGCCATGGCACATTCGTCAAGTAAC
GTCTGGGCCCCAGCAAAGCGGGTCTGGTGACTGTGGGATGTTTTGCGTTAAATATTTCGAGTATGATGTAACAGGGTCAAATATGACCAGCTTAACTCAGGACAACATTA
GTTTCTTTAGGGAGAAATTAGCCATAGAAATGTGGGCAAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTAGGAAAACTGTATTTGGTCATCTCCTTGACTTGGATCTCGTATTTAACGGGCCATTGATACACAACATCCTACTTAGGGAGATCGAGGATAGTACACCTGACAC
CATTAGCTTCAACCTATTTGGTAATAAGGTGTCATTTGGGCGGAGGGAGTTTGACATTATTTCTGAGCTTAAGTATGATAGGAGTCCAGTTAGAAAAGACACATCTCCCC
ACAAACTCAGGGCTCTTTACTTTAATGATAGCAACGACGTTCTCTTGAGTGAATTTGAGAAGATTTATTTAGCCGCATGGTTCGAGGACGACTTCGACGCGGTCAAGTCT
GAAGCGCGGCCCGACGAAGAGGTCGAAGGATGGCCGGTTCAGGAAATCGTACAGTCTTTACGGTTTCCCTTGGGTGTTCCAGTGTGGGCCTACGAGACTATATCTTCCTT
ATCTGGGCGTGTGGCCAATAAGGTAAATCCGGATGCCGTGCCACGGATTCTTCGGTGGAGGTGCAGCCATTCAACTGCATGGCATGTACTGGACAGAGAGATTTTCCGAT
CTAGCACAGGAAGAACTCGATCAATAGAAGCAACGGATGCTGAGACGACCTTCCTAGATAGGACGTTCGAACCACCGGAGCTCGAAGATGAGGACGAAGTCCCACGCGAG
AATGATGCTGCTGAAACATCAAGTGCACGTGCAGGACCAGAGAAGGATGACGGGAGACATGGAGGGAACGTCGACGAAGGTGTCAGAGAAGACGTCCATAAAGAGGCTGA
GGAGGAGAATGGACGTGGTAAGAAGGTACGCATGTCTAGTGTGCGTCTGAGGAAGGTTGAGAAACGGATGAAGCGCATGGACAAGCGCATGGATGACGGGTTCGAAGGTA
TTAAGGCTGAATTAAAATCCATCCGAAAGTTCTTGCGAAGAATCACTAAGGGTTTACCTGTCGACCCGAATGACATGAGAAGAGGCAGCAGCGGTGATGGTACTAAGCAA
GGAAATGGTCCGAGTGATGGTTCTGGGCCAAGAGATGGTCCGAGTGATGCACCCAGTGGGGGACGTAGTGATCGAGGTCATGGTCCTAAGGATGGCGATGGTTCTACACC
ATCTCTTAGGGACGTGGACGACACAGATGACATGATCATCGATCCCCCCCATGTGATCGCGCACGAGACAAAGGAACATCAATCCGGCCAAACGACCGGGGACCAGGTTG
AACTAGAACGCGAGTATTCACCTGCCTTGACGCGCAAGGAAGACATGGGTACAGAGGACGTGTTCAAGGAAGATATCGGTCTACATGCGCGTTGTGAGGCTGCCCCTCTC
GAGCAGACTCCTGTTCAGAGCAGACAGGTGGACCATATTACGATTGATTCGCATCTAATACGCCGAGGACTTACGGACTCTGATGCGGAAGGGCCGGGAGCAACGTCGCA
ACCAGACCTGGATGAGGTATGTGTGCTATTGCAGCCCGTTGAACGTCAGAACCCTCGGTGGGGGTCTCGGAAGAGGAAGCTCTCATGGAAGCTTCAGGGGTCGTTTAATG
TAATGGTGGACGGGAAGCGGAAGAAGGTAATGCGATATGACCCACTAGTCCACGTCCCCTCTGAACAAGTCCAGAAGTTTCATGCTTGGCTGGCGAACCCTAACACTGAC
CACGCCACTCGCAAATCATGCTACGGTGATCGAGGAAAGACATGGTTTCGTGACCTTATCAACTCGGGCAAGTGGATGACGAGTGAGGACTTGTGTTCTCGAAGATTCAC
CACCGGTGACATTGTCCTTGCGAACTTTCTTCGACCAACATACGGACTATATCAACGCATGACAGCCCCGAACGCTGTACCTGCGAGAGTTGCAGCGGAATACGATTGGG
CTGGGAAGTGTAAGACCATCCTGAGCTATATGGACGGGACGCACACCGACTATCAGACACGATGGCTTGATCTGGATGTTGTTTACCTGCCATACGACATCGCTTTGGAG
TCCGAGTTGAGGCCGATGGCTGTTGTCCTACTGGCGTTGATGCACAAGTCCGGTGTTCAGGTACTGAGGCCGACACTACCGAATACGCCATGGCACATTCGTCAAGTAAC
GTCTGGGCCCCAGCAAAGCGGGTCTGGTGACTGTGGGATGTTTTGCGTTAAATATTTCGAGTATGATGTAACAGGGTCAAATATGACCAGCTTAACTCAGGACAACATTA
GTTTCTTTAGGGAGAAATTAGCCATAGAAATGTGGGCAAACTGA
Protein sequenceShow/hide protein sequence
MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISELKYDRSPVRKDTSPHKLRALYFNDSNDVLLSEFEKIYLAAWFEDDFDAVKS
EARPDEEVEGWPVQEIVQSLRFPLGVPVWAYETISSLSGRVANKVNPDAVPRILRWRCSHSTAWHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPELEDEDEVPRE
NDAAETSSARAGPEKDDGRHGGNVDEGVREDVHKEAEEENGRGKKVRMSSVRLRKVEKRMKRMDKRMDDGFEGIKAELKSIRKFLRRITKGLPVDPNDMRRGSSGDGTKQ
GNGPSDGSGPRDGPSDAPSGGRSDRGHGPKDGDGSTPSLRDVDDTDDMIIDPPHVIAHETKEHQSGQTTGDQVELEREYSPALTRKEDMGTEDVFKEDIGLHARCEAAPL
EQTPVQSRQVDHITIDSHLIRRGLTDSDAEGPGATSQPDLDEVCVLLQPVERQNPRWGSRKRKLSWKLQGSFNVMVDGKRKKVMRYDPLVHVPSEQVQKFHAWLANPNTD
HATRKSCYGDRGKTWFRDLINSGKWMTSEDLCSRRFTTGDIVLANFLRPTYGLYQRMTAPNAVPARVAAEYDWAGKCKTILSYMDGTHTDYQTRWLDLDVVYLPYDIALE
SELRPMAVVLLALMHKSGVQVLRPTLPNTPWHIRQVTSGPQQSGSGDCGMFCVKYFEYDVTGSNMTSLTQDNISFFREKLAIEMWAN