CuGenDBv2

Spg001601 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg001601
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Ulp1-like peptidase
Genome location: scaffold10:3980799..3985582
RNA-Seq Expression: Spg001601
Synteny: Spg001601
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
IPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR015410 - Domain of unknown function DUF1985
IPR038765 - Papain-like cysteine peptidase superfamily
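
The genome location listed above follows a `scaffold:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the span can be parsed and its length computed as shown here. `Location` and `parse_location` are hypothetical names used for illustration, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    scaffold: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'scaffold:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("scaffold10:3980799..3985582")
print(loc.scaffold, loc.length)  # scaffold10 4784
```

Under that assumption the gene spans 4784 bp on scaffold10.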


Homology
GenBank top hits | e value | % identity | Alignment
XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia] | e value: 2.3e-38 | % identity: 43.5
Query:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDGVKMSIFY
        ++F G L+H++LL EV E R DVISF L  K+VSFGK EFDLIT L + +     H  G RLR  Y  +S+ ++C +LE ++    F  +ED VK+ I Y
Subjt:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDGVKMSIFY

Query:  FIELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYKAKGSDSKKVANRVSDTTMPC--IRRWSCLHSPSYTRLKIEVF
        FIEL MMG+E++Q IDT  + V+D W AFCN DWS+MIF +TI SLK  LK K  +Y+ K +         S    P   +RR   L S        EVF
Subjt:  FIELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYKAKGSDSKKVANRVSDTTMPC--IRRWSCLHSPSYTRLKIEVF

Query:  ALMAVVVTIHLIPTDEEREFMSRTLEAPHVE--PDLPPLP--AAVP
              V  HL+ TD E + M R +  P V   PD P +P  A VP
Subjt:  ALMAVVVTIHLIPTDEEREFMSRTLEAPHVE--PDLPPLP--AAVP

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia] | e value: 4.4e-45 | % identity: 37.76
Query:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDGVKMSIFY
        ++F G L+H++LLREV E R DVISF L GK+VSFGK EFDLIT L + +     H  G RLR  Y  + + ++C +LE ++    F  +ED VK+ I Y
Subjt:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDGVKMSIFY

Query:  FIELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYKAKGSD--------------------SKKVANRVSDTTMPCIR
        FIEL MMG+E++Q IDT+LL V+D W  FCN DWS+MIF +TI SLK ALK K   Y+ K +                     + +  + +SD  +P + 
Subjt:  FIELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYKAKGSD--------------------SKKVANRVSDTTMPCIR

Query:  RWSCLHSPSYTRLKIEVFALMAVVVTIHLIPTDEEREFMSRTLEAPHVE--PDLPPLP--AAVPQVEGGAGLDDMKLDPLEVGDYLDVEEGSFGSTHFIP
        RWSC++S  +  L  EVF      V  HL+ TD + + M R +  P V   PD P +P  A VP          +   P       DVE G         
Subjt:  RWSCLHSPSYTRLKIEVFALMAVVVTIHLIPTDEEREFMSRTLEAPHVE--PDLPPLP--AAVPQVEGGAGLDDMKLDPLEVGDYLDVEEGSFGSTHFIP

Query:  QEAETTKGKANDEMEIVKEKDIKGEKGKEKV
           +  +  AND   +  EK +K  K K+++
Subjt:  QEAETTKGKANDEMEIVKEKDIKGEKGKEKV

XP_022154995.1 uncharacterized protein LOC111022139 [Momordica charantia] | e value: 1.8e-30 | % identity: 47.89
Query:  IFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDGVKMSIFYF
        +F   L+H++LLREV E R D+ISF L G +VSFGK EFDLIT LR+ +          RLR  Y  +  +++C +LE ++    F+ +ED VK++I YF
Subjt:  IFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDGVKMSIFYF

Query:  IELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTI
        IEL MMG+E++Q +DTSLL ++D W  FCN DWS+MI   T+
Subjt:  IELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTI

XP_022155158.1 uncharacterized protein LOC111022300 [Momordica charantia] | e value: 5.7e-37 | % identity: 48.73
Query:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDGVKMSIFY
        ++F G L+H++LLREV E R D+ISF L GK+VSFGK EFDLIT L Y +        G RLR  Y  +S+ ++C +LE ++    F  +ED VK+ I Y
Subjt:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDGVKMSIFY

Query:  FIELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYK
        F+EL MMG+E++Q ID +LL V+D W  FCN DWS++IF +T+ SLK A+  K  +Y+
Subjt:  FIELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYK

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia] | e value: 5.9e-42 | % identity: 40.57
Query:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDGVKMSIFY
        ++F G L+H++LLREV E + D+ISF L G +VSFGK EFDLIT LR+ +          RLR  Y  +  +++C +LE ++    F+ +ED VK++I Y
Subjt:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDGVKMSIFY

Query:  FIELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYKAK----------------------------GSDSKKVANRVS
        FIEL MMG+E++  +DTSLL ++D W  FCN DWS+MIF +T+ SLK ALK K + YK K                             + S +VA R++
Subjt:  FIELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYKAK----------------------------GSDSKKVANRVS

Query:  DTTMPCIRRWSCLHSPSYTRLKIEVFALMAVVVTIHLIPTDEER
        D  +P + RWSC +S ++  L+ EVF  +   V + L  TD ER
Subjt:  DTTMPCIRRWSCLHSPSYTRLKIEVFALMAVVVTIHLIPTDEER

TrEMBL top hits | e value | % identity | Alignment
A0A5A7UGY3 Ulp1-like peptidase | e value: 1.3e-23 | % identity: 23.6
Query:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDG-VKMSIF
        ++F G L+HY+LLREV +   D ISF L     +FG+ EF++IT L        +    +RL E +  +   +   DLE ++  LE++ ++D  VK+++ 
Subjt:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDG-VKMSIF

Query:  YFIELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYKAKGSDSKKV------------------------ANRVSDTT
        YFIE+ ++G+++R  +D     + DDW +F N DW  ++F +T+ +LK+AL  +    K K + +KK                          ++V+D  
Subjt:  YFIELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYKAKGSDSKKV------------------------ANRVSDTT

Query:  MPCIRRWSCLHSP-SYTRLKIEVFALMAVVVTIHLIPTDEEREFMS---------------------RTLEAPHVEPDLPPLPAAVPQVEGGAGLDDMKL
        +P + RW C  SP S T  ++    +  +   I + P +E+ +  S                     R  E  + E D         +++       MK 
Subjt:  MPCIRRWSCLHSP-SYTRLKIEVFALMAVVVTIHLIPTDEEREFMS---------------------RTLEAPHVEPDLPPLPAAVPQVEGGAGLDDMKL

Query:  DPLEVGDYLDVEEGSFGSTHFIPQEAETTKGKANDEMEIVKEKDIKGEKGKEKVVEEEVIEQSKNKRKKKKEKEKEVETEKVKEKDIKGEKDKEKVVDEE
            + D + V EG   S   I  + +  KG  +  ++ +  +  KG++G  K V E +++ +         + K+V+  K ++ D  G  +  ++  E 
Subjt:  DPLEVGDYLDVEEGSFGSTHFIPQEAETTKGKANDEMEIVKEKDIKGEKGKEKVVEEEVIEQSKNKRKKKKEKEKEVETEKVKEKDIKGEKDKEKVVDEE

Query:  VIEQEKNKKKKGKEKELETEKVKEKDIEGDKGKEKAVDEQVIEGEEKKKKKKNKKKKKKKK------------KKQSCECTEILLRMEAELHG-MRRLLR
          ++    KKKG E  LE    K+ DI  ++  +   DE V E E    ++ + +  ++K+             K+S   T      EA ++  M ++L 
Subjt:  VIEQEKNKKKKGKEKELETEKVKEKDIEGDKGKEKAVDEQVIEGEEKKKKKKNKKKKKKKK------------KKQSCECTEILLRMEAELHG-MRRLLR

Query:  KLAKDKGVDSIKYIGPDNEAGNGGLSTKKHDDEGSGGPSTKNHDDKGNERDVEDSVPGSGKGVGDDDEGPSTEKHDDATGERDTDDDIGVSGKVDDVVVP
            D  +D ++            ++ K+ +DE                  + ++  G        D         D                       
Subjt:  KLAKDKGVDSIKYIGPDNEAGNGGLSTKKHDDEGSGGPSTKNHDDKGNERDVEDSVPGSGKGVGDDDEGPSTEKHDDATGERDTDDDIGVSGKVDDVVVP

Query:  KPEVIDSLFMFICKKMDTRLDLCHWRFITGDLVVTEFL-RRGDVYEELVEGNPECFKWSRFKSVLKYVRGEHTDYNVPWSTVDVVYMPFNLGREHWALLC
          E +D+ F+FIC K+          F T D +    L  +  +Y+E ++ N   F W     ++ YV G   D+  PW++VD VY PFN+   HW LLC
Subjt:  KPEVIDSLFMFICKKMDTRLDLCHWRFITGDLVVTEFL-RRGDVYEELVEGNPECFKWSRFKSVLKYVRGEHTDYNVPWSTVDVVYMPFNLGREHWALLC

Query:  ADLKVGEVAVTDSLVAFASDAELEKEMKIVCTILPRLLEAGGVIE--VKSSLPRTTWTFKRRIEVPQQVDSGDCGIFDAKFLEY
         DL   +V V DSL +  +  ++   +  +  ++P+LL++ G  +   +SS  +  W       +P Q ++ DCG+F  K+ EY
Subjt:  ADLKVGEVAVTDSLVAFASDAELEKEMKIVCTILPRLLEAGGVIE--VKSSLPRTTWTFKRRIEVPQQVDSGDCGIFDAKFLEY

A0A5D3DYH3 Ulp1-like peptidase | e value: 1.9e-22 | % identity: 23.7
Query:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDG-VKMSIF
        ++F G L+HY+LLREV +   D ISF L G   +FG+ EF+++T L        +    +RL E +  +   +   DLE ++  LE++ ++D  VK+++ 
Subjt:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDG-VKMSIF

Query:  YFIELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYKAKGSDSKKV------------------------ANRVSDTT
        YFIE+ ++G+++R  +D     + DDW +F N DW  ++F +T+ +LK+AL  +    K K + +KK                          ++V+D  
Subjt:  YFIELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYKAKGSDSKKV------------------------ANRVSDTT

Query:  MPCIRRWSCLHSP-SYTRLKIEVFALMAVVVTIHLIPTDEEREFMSRTLEAPHVEPDLPPLPAAVPQVEGGAGLDDMKLDPLEVGDYLDVEEGSFGSTHF
        +P + RW C  SP S T  ++    +  +   I + P +E+ +  S                                      G+  +    +F S+  
Subjt:  MPCIRRWSCLHSP-SYTRLKIEVFALMAVVVTIHLIPTDEEREFMSRTLEAPHVEPDLPPLPAAVPQVEGGAGLDDMKLDPLEVGDYLDVEEGSFGSTHF

Query:  IPQE---AETTKGKANDEMEIVKEKDIKGEKGKEKVVEE-----EVIEQSKNKRKKKKEKEKEVETEKVKEKDIKGEKDK-EKVVDEEVIEQEKNKKKKG
        I  +   ++  +   NDE E+ K K  K +   +K +        V+E   N  K   ++ K + +  +K   ++ + D+ +  V E + ++    KKKG
Subjt:  IPQE---AETTKGKANDEMEIVKEKDIKGEKGKEKVVEE-----EVIEQSKNKRKKKKEKEKEVETEKVKEKDIKGEKDK-EKVVDEEVIEQEKNKKKKG

Query:  KEKELETEKVKEKDIEGDKGKEKAVDEQVIEGEEKKKKKKNKKKKKKKK------------KKQSCECTEILLRMEAELHG-MRRLLRKLAKDKGVDSIK
         E  LE    K+ DI  ++  +   DE VIE E    ++ + +  ++K+             K+S   T      EA ++  M ++L     D  +D ++
Subjt:  KEKELETEKVKEKDIEGDKGKEKAVDEQVIEGEEKKKKKKNKKKKKKKK------------KKQSCECTEILLRMEAELHG-MRRLLRKLAKDKGVDSIK

Query:  YIGPDNEAGNGGLSTKKHDDEGSGGPSTKNHDDKGNERDVEDSVPGSGKGVGDDDEGPSTEKHDDATGERDTDDDIGVSGKVDDVVVPKPEVIDSLFMFI
                    ++ K+ DDE                  + ++  G        D         D                         E +D+LF+FI
Subjt:  YIGPDNEAGNGGLSTKKHDDEGSGGPSTKNHDDKGNERDVEDSVPGSGKGVGDDDEGPSTEKHDDATGERDTDDDIGVSGKVDDVVVPKPEVIDSLFMFI

Query:  CKKMDTRLDLCHWRFITGDLVVTEFL-RRGDVYEELVEGNPECFKWSRFKSVLKYVRGEHTDYNVPWSTVDVVYMPFNLGREHWALLCADLKVGEVAVTD
          K+          F T D +    L  +  +Y+E ++ N   F W     ++ YV G   D+  PW++VD VY PFN+   HW LLC DL   +V V D
Subjt:  CKKMDTRLDLCHWRFITGDLVVTEFL-RRGDVYEELVEGNPECFKWSRFKSVLKYVRGEHTDYNVPWSTVDVVYMPFNLGREHWALLCADLKVGEVAVTD

Query:  SLVAFASDAELEKEMKIVCTILPRLLEAGGVIE--VKSSLPRTTWTFKRRIEVPQQVDSGDCGIFDAKFLEY
        SL +  +  E+   +  +  ++P+LL++ G  +   +SS  +  W       +P Q ++ DCG+F  K+ EY
Subjt:  SLVAFASDAELEKEMKIVCTILPRLLEAGGVIE--VKSSLPRTTWTFKRRIEVPQQVDSGDCGIFDAKFLEY

A0A6J1DJX9 uncharacterized protein LOC111020757 | e value: 2.1e-45 | % identity: 37.76
Query:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDGVKMSIFY
        ++F G L+H++LLREV E R DVISF L GK+VSFGK EFDLIT L + +     H  G RLR  Y  + + ++C +LE ++    F  +ED VK+ I Y
Subjt:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDGVKMSIFY

Query:  FIELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYKAKGSD--------------------SKKVANRVSDTTMPCIR
        FIEL MMG+E++Q IDT+LL V+D W  FCN DWS+MIF +TI SLK ALK K   Y+ K +                     + +  + +SD  +P + 
Subjt:  FIELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYKAKGSD--------------------SKKVANRVSDTTMPCIR

Query:  RWSCLHSPSYTRLKIEVFALMAVVVTIHLIPTDEEREFMSRTLEAPHVE--PDLPPLP--AAVPQVEGGAGLDDMKLDPLEVGDYLDVEEGSFGSTHFIP
        RWSC++S  +  L  EVF      V  HL+ TD + + M R +  P V   PD P +P  A VP          +   P       DVE G         
Subjt:  RWSCLHSPSYTRLKIEVFALMAVVVTIHLIPTDEEREFMSRTLEAPHVE--PDLPPLP--AAVPQVEGGAGLDDMKLDPLEVGDYLDVEEGSFGSTHFIP

Query:  QEAETTKGKANDEMEIVKEKDIKGEKGKEKV
           +  +  AND   +  EK +K  K K+++
Subjt:  QEAETTKGKANDEMEIVKEKDIKGEKGKEKV

A0A6J1DRZ7 uncharacterized protein LOC111023847 | e value: 2.9e-42 | % identity: 40.57
Query:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDGVKMSIFY
        ++F G L+H++LLREV E + D+ISF L G +VSFGK EFDLIT LR+ +          RLR  Y  +  +++C +LE ++    F+ +ED VK++I Y
Subjt:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDGVKMSIFY

Query:  FIELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYKAK----------------------------GSDSKKVANRVS
        FIEL MMG+E++  +DTSLL ++D W  FCN DWS+MIF +T+ SLK ALK K + YK K                             + S +VA R++
Subjt:  FIELVMMGREKRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYKAK----------------------------GSDSKKVANRVS

Query:  DTTMPCIRRWSCLHSPSYTRLKIEVFALMAVVVTIHLIPTDEER
        D  +P + RWSC +S ++  L+ EVF  +   V + L  TD ER
Subjt:  DTTMPCIRRWSCLHSPSYTRLKIEVFALMAVVVTIHLIPTDEER

A0A6J1DSS5 uncharacterized protein LOC111023969 | e value: 1.8e-28 | % identity: 34.96
Query:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDL-RYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDGVKMSIF
        M+F   LVHY LLREV +TR DV+ F +LG  V+F K+EF L+T L R +  + ++  + NRLR  Y  + + +R E+ E  Y  + F  ++D VK+S+ 
Subjt:  MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDL-RYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDGVKMSIF

Query:  YFIELVMMGREK-RQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYKAKGSDSKKV----------------------------ANR
        Y+ E+VMMG+ K +  +D  L   ++D   F N DW   I+ +T+K L+ A+K K  +YK K + +KK                              NR
Subjt:  YFIELVMMGREK-RQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYKAKGSDSKKV----------------------------ANR

Query:  VSDTTMPCIRRWSCLHSPSYTRLKIEVFALMAVVVTIHLIPTDEER
        +SDT MP I R+SC  S +   L+ +VF    + +T  L+ ++ ER
Subjt:  VSDTTMPCIRRWSCLHSPSYTRLKIEVFALMAVVVTIHLIPTDEER

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGATTTTTACTGGGCAACTTGTTCATTACATTCTACTTAGAGAAGTTAATGAGACTAGGGCAGATGTAATTAGTTTTAAGTTGTTGGGGAAGAAAGTCTCATTTGGTAA
GAGTGAGTTTGACCTAATCACCGATCTTAGATATGCAATTACACTGACTAGGAGACACTCAGCGGGTAATAGGCTTAGAGAAACTTACTTAAATAATAGCATAACCATGA
GATGTGAGGACTTAGAAACTTTATACCCTAATTTAGAGTTCCAAACTGAGGAGGATGGAGTGAAGATGTCCATATTTTACTTTATTGAGCTCGTGATGATGGGGAGGGAG
AAAAGACAGTTGATTGACACATCCCTGTTGAATGTCATCGACGATTGGGTTGCTTTCTGTAATGAGGATTGGAGCAACATGATATTCCCAAAGACTATAAAGAGCCTCAA
GAAAGCATTGAAAGGAAAGACAAAGTCGTACAAGGCAAAAGGATCGGATTCAAAGAAGGTTGCAAATCGCGTGAGTGACACGACCATGCCGTGCATTCGAAGATGGTCAT
GCTTACACTCTCCTTCGTACACCCGTCTTAAAATTGAGGTGTTTGCATTGATGGCAGTTGTTGTCACGATACATCTTATTCCCACTGACGAAGAGAGAGAGTTTATGTCT
CGAACGCTAGAGGCTCCACATGTAGAACCTGACCTTCCCCCACTCCCTGCCGCTGTCCCTCAGGTGGAGGGGGGTGCAGGGTTGGATGATATGAAGTTGGATCCACTTGA
AGTGGGGGATTACTTAGATGTGGAAGAAGGAAGCTTTGGATCCACTCACTTTATCCCTCAGGAGGCCGAGACGACGAAAGGGAAAGCAAATGATGAGATGGAGATAGTGA
AAGAGAAAGATATTAAAGGAGAAAAGGGTAAAGAGAAAGTAGTAGAGGAAGAAGTCATCGAACAATCAAAGAATAAGAGGAAGAAGAAGAAAGAGAAAGAGAAGGAGGTG
GAGACGGAGAAAGTGAAAGAAAAAGATATTAAAGGAGAAAAGGATAAAGAGAAAGTAGTGGATGAAGAAGTGATTGAACAAGAAAAGAATAAGAAGAAAAAAGGGAAAGA
GAAGGAGTTGGAGACGGAGAAAGTAAAAGAGAAAGATATTGAAGGAGATAAAGGCAAAGAGAAAGCAGTGGATGAACAAGTGATCGAAGGAGAAGAGAAGAAGAAGAAGA
AGAAGAATAAGAAGAAGAAGAAGAAGAAGAAAAAGAAGCAGAGTTGCGAATGTACGGAGATTCTATTAAGGATGGAGGCGGAGTTACACGGCATGCGTAGATTGTTACGG
AAGCTTGCTAAGGATAAAGGTGTAGACTCGATTAAGTACATTGGACCTGACAATGAAGCGGGTAATGGGGGTCTATCCACCAAAAAACATGATGACGAGGGTAGTGGGGG
TCCATCCACCAAAAATCATGATGACAAGGGTAATGAGCGTGATGTCGAGGACAGCGTACCTGGTAGTGGGAAGGGGGTTGGTGATGATGACGAGGGTCCATCCACCGAAA
AACATGATGACGCGACCGGGGAACGTGACACCGATGACGACATAGGAGTCAGTGGGAAGGTGGATGACGTCGTGGTACCTAAACCGGAGGTAATTGACTCTCTTTTCATG
TTCATCTGTAAGAAGATGGACACTCGTCTCGACTTATGTCATTGGAGGTTCATTACCGGTGACCTAGTTGTTACGGAGTTCTTAAGGAGAGGGGACGTGTATGAAGAACT
TGTAGAAGGCAACCCCGAGTGCTTCAAATGGAGTAGGTTCAAGTCCGTCCTCAAATACGTTCGAGGCGAGCACACGGATTATAATGTTCCATGGAGTACGGTAGATGTCG
TGTACATGCCTTTCAACCTAGGAAGAGAACACTGGGCTTTGTTATGTGCGGACCTGAAGGTAGGTGAGGTGGCCGTCACAGATTCACTAGTGGCTTTTGCGTCCGACGCC
GAGCTGGAAAAGGAGATGAAAATAGTATGCACCATCCTTCCACGGCTTCTAGAAGCAGGTGGTGTCATAGAGGTGAAATCGTCACTACCACGCACTACATGGACATTCAA
GAGGAGGATTGAAGTTCCCCAGCAGGTAGATAGTGGAGATTGTGGGATATTCGACGCAAAATTCCTCGAGTATGATAATGTATTGCTGAGCGACTTGAGGGAGCAAATTC
TGTGCTGCAGCAAAGCAAGGAGCAAAACTGCCACGTCACAGCTCGTTAGCCAATTTAATGAACTGAATTCAGTTAAGCTATTCTCAGATTTAAGGAGCAAAGAGAGCCAT
CCACGTGTCTAG
mRNA sequence
ATGATTTTTACTGGGCAACTTGTTCATTACATTCTACTTAGAGAAGTTAATGAGACTAGGGCAGATGTAATTAGTTTTAAGTTGTTGGGGAAGAAAGTCTCATTTGGTAA
GAGTGAGTTTGACCTAATCACCGATCTTAGATATGCAATTACACTGACTAGGAGACACTCAGCGGGTAATAGGCTTAGAGAAACTTACTTAAATAATAGCATAACCATGA
GATGTGAGGACTTAGAAACTTTATACCCTAATTTAGAGTTCCAAACTGAGGAGGATGGAGTGAAGATGTCCATATTTTACTTTATTGAGCTCGTGATGATGGGGAGGGAG
AAAAGACAGTTGATTGACACATCCCTGTTGAATGTCATCGACGATTGGGTTGCTTTCTGTAATGAGGATTGGAGCAACATGATATTCCCAAAGACTATAAAGAGCCTCAA
GAAAGCATTGAAAGGAAAGACAAAGTCGTACAAGGCAAAAGGATCGGATTCAAAGAAGGTTGCAAATCGCGTGAGTGACACGACCATGCCGTGCATTCGAAGATGGTCAT
GCTTACACTCTCCTTCGTACACCCGTCTTAAAATTGAGGTGTTTGCATTGATGGCAGTTGTTGTCACGATACATCTTATTCCCACTGACGAAGAGAGAGAGTTTATGTCT
CGAACGCTAGAGGCTCCACATGTAGAACCTGACCTTCCCCCACTCCCTGCCGCTGTCCCTCAGGTGGAGGGGGGTGCAGGGTTGGATGATATGAAGTTGGATCCACTTGA
AGTGGGGGATTACTTAGATGTGGAAGAAGGAAGCTTTGGATCCACTCACTTTATCCCTCAGGAGGCCGAGACGACGAAAGGGAAAGCAAATGATGAGATGGAGATAGTGA
AAGAGAAAGATATTAAAGGAGAAAAGGGTAAAGAGAAAGTAGTAGAGGAAGAAGTCATCGAACAATCAAAGAATAAGAGGAAGAAGAAGAAAGAGAAAGAGAAGGAGGTG
GAGACGGAGAAAGTGAAAGAAAAAGATATTAAAGGAGAAAAGGATAAAGAGAAAGTAGTGGATGAAGAAGTGATTGAACAAGAAAAGAATAAGAAGAAAAAAGGGAAAGA
GAAGGAGTTGGAGACGGAGAAAGTAAAAGAGAAAGATATTGAAGGAGATAAAGGCAAAGAGAAAGCAGTGGATGAACAAGTGATCGAAGGAGAAGAGAAGAAGAAGAAGA
AGAAGAATAAGAAGAAGAAGAAGAAGAAGAAAAAGAAGCAGAGTTGCGAATGTACGGAGATTCTATTAAGGATGGAGGCGGAGTTACACGGCATGCGTAGATTGTTACGG
AAGCTTGCTAAGGATAAAGGTGTAGACTCGATTAAGTACATTGGACCTGACAATGAAGCGGGTAATGGGGGTCTATCCACCAAAAAACATGATGACGAGGGTAGTGGGGG
TCCATCCACCAAAAATCATGATGACAAGGGTAATGAGCGTGATGTCGAGGACAGCGTACCTGGTAGTGGGAAGGGGGTTGGTGATGATGACGAGGGTCCATCCACCGAAA
AACATGATGACGCGACCGGGGAACGTGACACCGATGACGACATAGGAGTCAGTGGGAAGGTGGATGACGTCGTGGTACCTAAACCGGAGGTAATTGACTCTCTTTTCATG
TTCATCTGTAAGAAGATGGACACTCGTCTCGACTTATGTCATTGGAGGTTCATTACCGGTGACCTAGTTGTTACGGAGTTCTTAAGGAGAGGGGACGTGTATGAAGAACT
TGTAGAAGGCAACCCCGAGTGCTTCAAATGGAGTAGGTTCAAGTCCGTCCTCAAATACGTTCGAGGCGAGCACACGGATTATAATGTTCCATGGAGTACGGTAGATGTCG
TGTACATGCCTTTCAACCTAGGAAGAGAACACTGGGCTTTGTTATGTGCGGACCTGAAGGTAGGTGAGGTGGCCGTCACAGATTCACTAGTGGCTTTTGCGTCCGACGCC
GAGCTGGAAAAGGAGATGAAAATAGTATGCACCATCCTTCCACGGCTTCTAGAAGCAGGTGGTGTCATAGAGGTGAAATCGTCACTACCACGCACTACATGGACATTCAA
GAGGAGGATTGAAGTTCCCCAGCAGGTAGATAGTGGAGATTGTGGGATATTCGACGCAAAATTCCTCGAGTATGATAATGTATTGCTGAGCGACTTGAGGGAGCAAATTC
TGTGCTGCAGCAAAGCAAGGAGCAAAACTGCCACGTCACAGCTCGTTAGCCAATTTAATGAACTGAATTCAGTTAAGCTATTCTCAGATTTAAGGAGCAAAGAGAGCCAT
CCACGTGTCTAG
Protein sequence
MIFTGQLVHYILLREVNETRADVISFKLLGKKVSFGKSEFDLITDLRYAITLTRRHSAGNRLRETYLNNSITMRCEDLETLYPNLEFQTEEDGVKMSIFYFIELVMMGRE
KRQLIDTSLLNVIDDWVAFCNEDWSNMIFPKTIKSLKKALKGKTKSYKAKGSDSKKVANRVSDTTMPCIRRWSCLHSPSYTRLKIEVFALMAVVVTIHLIPTDEEREFMS
RTLEAPHVEPDLPPLPAAVPQVEGGAGLDDMKLDPLEVGDYLDVEEGSFGSTHFIPQEAETTKGKANDEMEIVKEKDIKGEKGKEKVVEEEVIEQSKNKRKKKKEKEKEV
ETEKVKEKDIKGEKDKEKVVDEEVIEQEKNKKKKGKEKELETEKVKEKDIEGDKGKEKAVDEQVIEGEEKKKKKKNKKKKKKKKKKQSCECTEILLRMEAELHGMRRLLR
KLAKDKGVDSIKYIGPDNEAGNGGLSTKKHDDEGSGGPSTKNHDDKGNERDVEDSVPGSGKGVGDDDEGPSTEKHDDATGERDTDDDIGVSGKVDDVVVPKPEVIDSLFM
FICKKMDTRLDLCHWRFITGDLVVTEFLRRGDVYEELVEGNPECFKWSRFKSVLKYVRGEHTDYNVPWSTVDVVYMPFNLGREHWALLCADLKVGEVAVTDSLVAFASDA
ELEKEMKIVCTILPRLLEAGGVIEVKSSLPRTTWTFKRRIEVPQQVDSGDCGIFDAKFLEYDNVLLSDLREQILCCSKARSKTATSQLVSQFNELNSVKLFSDLRSKESH
PRV
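
The protein sequence above is expected to be the CDS read codon by codon. A minimal sketch of that check follows, assuming the standard nuclear genetic code applies to this gene model (the record does not name a translation table). `translate` is a hypothetical helper written for illustration, and only the first 27 and last 12 nucleotides of the CDS listed above are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGATTTTTACTGGGCAACTTGTTCAT"))  # MIFTGQLVH
print(translate("CCACGTGTCTAG"))                 # PRV
```

Both fragments agree with the start (`MIFTGQLVH`) and end (`PRV`) of the protein sequence listed above.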