CuGenDBv2

Lag0023167 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0023167
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ulp1-like peptidase
Genome location: chr7:45282619..45286392
RNA-Seq Expression: Lag0023167
Synteny: Lag0023167
Gene Ontology terms: NA
InterPro domains: IPR015410 - Domain of unknown function DUF1985


Homology
GenBank top hits (e value, % identity, alignment)
XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia] (e value: 3.4e-51; % identity: 43.93)
Query:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ
        MFRQT FGPI+D  ++FNG LIHHLL  EVE+PRQDVISF++F  RVSFGK+EFDLITG  H     + +  G RLR  YFKDSV+   +++EK FLE  
Subjt:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ

Query:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKAL--RAATQRDDVVGETSRVERYNLYGFTHAFQVWAYE
        F  DED VKV + YFIELAM G+ERKQ  +   +G++D WE FCN DWS +IF+ TI SLK  L  + +  +     + + VE Y+LYGF +        
Subjt:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKAL--RAATQRDDVVGETSRVERYNLYGFTHAFQVWAYE

Query:  TISSLKNRVANKLNQDAIPRFSRWSCSHSPSYTQLSSEIFGLTEAKVIVQLVPSDAELEHMCRIVGAPKAPVFSPQSEAP
             ++RV                         L+SE+F  T +KV   L+ +DAE +HM R++  P+  V       P
Subjt:  TISSLKNRVANKLNQDAIPRFSRWSCSHSPSYTQLSSEIFGLTEAKVIVQLVPSDAELEHMCRIVGAPKAPVFSPQSEAP

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia] (e value: 9.6e-78; % identity: 44.1)
Query:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ
        MFRQT FGPI+D +++FNG LIHHLL REVE+PRQDVISF++FG RVSFGK+EFDLITG  H     D +  G RLR  YFKD V+   +++EK FLE  
Subjt:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ

Query:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKAL--RAATQRDDVVGETSRVERYNLYGFTHAFQVWAYE
        F  DED VKV + YFIELAM G+ERKQ  +  LLG++D WE+FCNYDWS +IF+ TI SLK AL  + +  +     + S VE Y+LYGF +AFQVWAYE
Subjt:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKAL--RAATQRDDVVGETSRVERYNLYGFTHAFQVWAYE

Query:  TISSLKNRVANKLNQDAIPRFSRWSCSHSPSYTQLSSEIFGLTEAKVIVQLVPSDAELEHMCRIVGAPKA------PVFSPQSEAPFFPPQPELVNNANV
        TIS+        L+ DAIPR  RWSC +S  +  L+SE+F  T +KV   L+ +DA+ +HM R++  P+       P    ++  P  P  PE       
Subjt:  TISSLKNRVANKLNQDAIPRFSRWSCSHSPSYTQLSSEIFGLTEAKVIVQLVPSDAELEHMCRIVGAPKA------PVFSPQSEAPFFPPQPELVNNANV

Query:  DRVVSDRGSDEASWDRGYPIKEVDMVGLDEELTHESLFEGVGKTCRCDCKHSYESLDRQMKEMEFEVKEIKNDLKGIKTDLKSIKKYLRRLSKVMNSDKG
           V D  +D        P+  VD   +DE     S  +G G   R       + + R++K ++  V  I++ L      LK I+ YL++L+K     K 
Subjt:  DRVVSDRGSDEASWDRGYPIKEVDMVGLDEELTHESLFEGVGKTCRCDCKHSYESLDRQMKEMEFEVKEIKNDLKGIKTDLKSIKKYLRRLSKVMNSDKG

Query:  KDRVKEEGYGGVSED
         D  K  G GG  +D
Subjt:  KDRVKEEGYGGVSED

XP_022154561.1 uncharacterized protein LOC111021802 [Momordica charantia] (e value: 9.1e-52; % identity: 36.58)
Query:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ
        MFR+T F  ++D +++FNG LIH++L REVE+   + ISFN+F  R+SF + +F LI+G ++ R     N    RL  LYF D     ++D EK +   +
Subjt:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ

Query:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKALRAATQRDDVVGETSRVERYNLYGFTHAFQVWAYETI
        FE D D VKV + Y + + + GRER  KF+  LLGI+DDWE+ CNY+W+ + FE TI SL++     ++      +    + Y+LYGF   FQVWAY+TI
Subjt:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKALRAATQRDDVVGETSRVERYNLYGFTHAFQVWAYETI

Query:  SSLKNRVANKLNQDAIPRFSRWSCSHSPSYTQLSSEIFGLTEAKVIVQLVPSDAELEHMCRIVGAPKAPVFSPQSEAPFFPPQPELVNNANVDRVVSDRG
        SSL  RVANK+  D +P   +W   HS ++  L  +IF  T+ +    L  +D E   + R    P     S   +        E  +NA    V     
Subjt:  SSLKNRVANKLNQDAIPRFSRWSCSHSPSYTQLSSEIFGLTEAKVIVQLVPSDAELEHMCRIVGAPKAPVFSPQSEAPFFPPQPELVNNANVDRVVSDRG

Query:  SDEASWDRGYPIKEVDMVGLDEELTHESLFEGVGKTCRCDCKHSYESLDRQMKEMEFEVKEIKNDLKGIKTDLKSIKKYL
         D+ S  RG    +V+MV  D EL +E   E  GK   C      + +++ +K M+  + E   D   I+ +LKSIKK+L
Subjt:  SDEASWDRGYPIKEVDMVGLDEELTHESLFEGVGKTCRCDCKHSYESLDRQMKEMEFEVKEIKNDLKGIKTDLKSIKKYL

XP_022155158.1 uncharacterized protein LOC111022300 [Momordica charantia] (e value: 4.2e-49; % identity: 60.37)
Query:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ
        MFRQT FGPI+D +++FNG LIHHLL REVE+PRQD+ISF++FG RVSFGK+EFDLITG  +     D +  G RLR  YFKDSV+   +++EK F+E  
Subjt:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ

Query:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKAL
        F  DEDAVKV + YF+ELAM G+ERKQ  +  LLG++D WE+FCN+DWS +IFE T+ SLK A+
Subjt:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKAL

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia] (e value: 1.4e-76; % identity: 56.81)
Query:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ
        MF QT FGPI+  N++FNG L+HHLL REVE+P+ D+ISFN+FGNRVSFGK+EFDLITG RH     D +    RLR LYF+D      +++EK FLE  
Subjt:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ

Query:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKALRAATQ--RDDVVGETSRVERYNLYGFTHAFQVWAYE
        FE DEDAVK+A+ YFIELAM G+ERK K +  LLGI+D WE+FCNYDWS +IFE T+ SLK AL+   +  +  V  ++S VE Y+LY F +AFQVWAYE
Subjt:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKALRAATQ--RDDVVGETSRVERYNLYGFTHAFQVWAYE

Query:  TISSLKNRVANKLNQDAIPRFSRWSCSHSPSYTQLSSEIFGLTEAKVIVQLVPSDAE
        TIS+L  RVA +LN DAIPR  RWSC++S ++  L  E+F   ++KV+V+L  +D E
Subjt:  TISSLKNRVANKLNQDAIPRFSRWSCSHSPSYTQLSSEIFGLTEAKVIVQLVPSDAE

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CZE8 uncharacterized protein LOC111015600 (e value: 1.7e-51; % identity: 43.93)
Query:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ
        MFRQT FGPI+D  ++FNG LIHHLL  EVE+PRQDVISF++F  RVSFGK+EFDLITG  H     + +  G RLR  YFKDSV+   +++EK FLE  
Subjt:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ

Query:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKAL--RAATQRDDVVGETSRVERYNLYGFTHAFQVWAYE
        F  DED VKV + YFIELAM G+ERKQ  +   +G++D WE FCN DWS +IF+ TI SLK  L  + +  +     + + VE Y+LYGF +        
Subjt:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKAL--RAATQRDDVVGETSRVERYNLYGFTHAFQVWAYE

Query:  TISSLKNRVANKLNQDAIPRFSRWSCSHSPSYTQLSSEIFGLTEAKVIVQLVPSDAELEHMCRIVGAPKAPVFSPQSEAP
             ++RV                         L+SE+F  T +KV   L+ +DAE +HM R++  P+  V       P
Subjt:  TISSLKNRVANKLNQDAIPRFSRWSCSHSPSYTQLSSEIFGLTEAKVIVQLVPSDAELEHMCRIVGAPKAPVFSPQSEAP

A0A6J1DJX9 uncharacterized protein LOC111020757 (e value: 4.7e-78; % identity: 44.1)
Query:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ
        MFRQT FGPI+D +++FNG LIHHLL REVE+PRQDVISF++FG RVSFGK+EFDLITG  H     D +  G RLR  YFKD V+   +++EK FLE  
Subjt:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ

Query:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKAL--RAATQRDDVVGETSRVERYNLYGFTHAFQVWAYE
        F  DED VKV + YFIELAM G+ERKQ  +  LLG++D WE+FCNYDWS +IF+ TI SLK AL  + +  +     + S VE Y+LYGF +AFQVWAYE
Subjt:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKAL--RAATQRDDVVGETSRVERYNLYGFTHAFQVWAYE

Query:  TISSLKNRVANKLNQDAIPRFSRWSCSHSPSYTQLSSEIFGLTEAKVIVQLVPSDAELEHMCRIVGAPKA------PVFSPQSEAPFFPPQPELVNNANV
        TIS+        L+ DAIPR  RWSC +S  +  L+SE+F  T +KV   L+ +DA+ +HM R++  P+       P    ++  P  P  PE       
Subjt:  TISSLKNRVANKLNQDAIPRFSRWSCSHSPSYTQLSSEIFGLTEAKVIVQLVPSDAELEHMCRIVGAPKA------PVFSPQSEAPFFPPQPELVNNANV

Query:  DRVVSDRGSDEASWDRGYPIKEVDMVGLDEELTHESLFEGVGKTCRCDCKHSYESLDRQMKEMEFEVKEIKNDLKGIKTDLKSIKKYLRRLSKVMNSDKG
           V D  +D        P+  VD   +DE     S  +G G   R       + + R++K ++  V  I++ L      LK I+ YL++L+K     K 
Subjt:  DRVVSDRGSDEASWDRGYPIKEVDMVGLDEELTHESLFEGVGKTCRCDCKHSYESLDRQMKEMEFEVKEIKNDLKGIKTDLKSIKKYLRRLSKVMNSDKG

Query:  KDRVKEEGYGGVSED
         D  K  G GG  +D
Subjt:  KDRVKEEGYGGVSED

A0A6J1DM82 uncharacterized protein LOC111022300 (e value: 2.0e-49; % identity: 60.37)
Query:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ
        MFRQT FGPI+D +++FNG LIHHLL REVE+PRQD+ISF++FG RVSFGK+EFDLITG  +     D +  G RLR  YFKDSV+   +++EK F+E  
Subjt:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ

Query:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKAL
        F  DEDAVKV + YF+ELAM G+ERKQ  +  LLG++D WE+FCN+DWS +IFE T+ SLK A+
Subjt:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKAL

A0A6J1DP34 uncharacterized protein LOC111021802 (e value: 4.4e-52; % identity: 36.58)
Query:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ
        MFR+T F  ++D +++FNG LIH++L REVE+   + ISFN+F  R+SF + +F LI+G ++ R     N    RL  LYF D     ++D EK +   +
Subjt:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ

Query:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKALRAATQRDDVVGETSRVERYNLYGFTHAFQVWAYETI
        FE D D VKV + Y + + + GRER  KF+  LLGI+DDWE+ CNY+W+ + FE TI SL++     ++      +    + Y+LYGF   FQVWAY+TI
Subjt:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKALRAATQRDDVVGETSRVERYNLYGFTHAFQVWAYETI

Query:  SSLKNRVANKLNQDAIPRFSRWSCSHSPSYTQLSSEIFGLTEAKVIVQLVPSDAELEHMCRIVGAPKAPVFSPQSEAPFFPPQPELVNNANVDRVVSDRG
        SSL  RVANK+  D +P   +W   HS ++  L  +IF  T+ +    L  +D E   + R    P     S   +        E  +NA    V     
Subjt:  SSLKNRVANKLNQDAIPRFSRWSCSHSPSYTQLSSEIFGLTEAKVIVQLVPSDAELEHMCRIVGAPKAPVFSPQSEAPFFPPQPELVNNANVDRVVSDRG

Query:  SDEASWDRGYPIKEVDMVGLDEELTHESLFEGVGKTCRCDCKHSYESLDRQMKEMEFEVKEIKNDLKGIKTDLKSIKKYL
         D+ S  RG    +V+MV  D EL +E   E  GK   C      + +++ +K M+  + E   D   I+ +LKSIKK+L
Subjt:  SDEASWDRGYPIKEVDMVGLDEELTHESLFEGVGKTCRCDCKHSYESLDRQMKEMEFEVKEIKNDLKGIKTDLKSIKKYL

A0A6J1DRZ7 uncharacterized protein LOC111023847 (e value: 6.7e-77; % identity: 56.81)
Query:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ
        MF QT FGPI+  N++FNG L+HHLL REVE+P+ D+ISFN+FGNRVSFGK+EFDLITG RH     D +    RLR LYF+D      +++EK FLE  
Subjt:  MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQ

Query:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKALRAATQ--RDDVVGETSRVERYNLYGFTHAFQVWAYE
        FE DEDAVK+A+ YFIELAM G+ERK K +  LLGI+D WE+FCNYDWS +IFE T+ SLK AL+   +  +  V  ++S VE Y+LY F +AFQVWAYE
Subjt:  FETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKALRAATQ--RDDVVGETSRVERYNLYGFTHAFQVWAYE

Query:  TISSLKNRVANKLNQDAIPRFSRWSCSHSPSYTQLSSEIFGLTEAKVIVQLVPSDAE
        TIS+L  RVA +LN DAIPR  RWSC++S ++  L  E+F   ++KV+V+L  +D E
Subjt:  TISSLKNRVANKLNQDAIPRFSRWSCSHSPSYTQLSSEIFGLTEAKVIVQLVPSDAE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G07240.1 cysteine-type peptidases (e value: 3.3e-07; % identity: 30.98)
Query:  KYLRRLSKVMNSDKGKDRVKEEGYGGVSEDAMVE--DRDMH---KVIDSLFMFVRKKLQQRSDLHRWKFVIADIVVTEFMRRHDHISEEFKKVQDPSLIT
        +Y R LSK+    KGK  +   G   +S   + +  +R  H   KV+D L  F R  L  R+D    + +  D++ ++F+ +   +  +F K   P    
Subjt:  KYLRRLSKVMNSDKGKDRVKEEGYGGVSEDAMVE--DRDMH---KVIDSLFMFVRKKLQQRSDLHRWKFVIADIVVTEFMRRHDHISEEFKKVQDPSLIT

Query:  FDWSTTKTVMDYVMGR-HSDHDAHWSTVDAIYNHSTSEGNHWVMVCVDLQVGKLTVLDSFIALASDATL-KELSTL-AMLMLLF
         D+     ++D ++G   S+    ++  D +Y     +  HWV +CVDL+  K+T+LDS I L  DA L  EL  L AML  LF
Subjt:  FDWSTTKTVMDYVMGR-HSDHDAHWSTVDAIYNHSTSEGNHWVMVCVDLQVGKLTVLDSFIALASDATL-KELSTL-AMLMLLF


Sequences

CDS sequence
ATGTTTAGGCAAACCATATTTGGGCCTATAGTGGATAGTAACATCATATTTAATGGTCAGTTAATCCACCATCTATTGCGTAGGGAGGTTGAGGATCCTAGACAGGATGT
CATTAGTTTCAATATATTTGGAAATAGGGTGTCCTTTGGCAAGCAAGAATTTGACCTAATCACAGGATTTAGACACCATAGAAGGATATTTGATAGAAATAAGTCAGGGG
TTAGATTGAGGCGTCTGTACTTTAAAGATAGTGTCAAAGATACAGTAGCAGATGTTGAAAAAAGGTTCTTAGAAATACAATTTGAGACTGATGAAGATGCGGTGAAGGTA
GCTCTCGCATATTTCATTGAGCTAGCAATGTTTGGGCGGGAGAGGAAGCAGAAGTTCAATTGGTTTCTATTGGGTATTATGGATGATTGGGAGATATTCTGCAACTATGA
TTGGAGCAAGGTAATTTTTGAGATGACTATCAGGAGCTTGAAAAAAGCACTTAGGGCTGCCACCCAAAGAGACGACGTGGTTGGAGAGACTAGTCGAGTGGAAAGATATA
ATCTTTACGGCTTTACACATGCTTTTCAGGTATGGGCGTATGAGACTATATCATCTCTGAAAAACCGTGTTGCGAACAAACTGAACCAGGATGCGATCCCACGCTTTTCT
CGGTGGTCATGCTCCCATTCTCCTTCGTACACCCAACTTAGCAGTGAGATATTTGGCTTGACGGAGGCAAAGGTGATAGTACAATTGGTGCCGAGCGATGCAGAACTCGA
ACACATGTGTCGCATCGTTGGTGCACCAAAGGCCCCTGTTTTTTCGCCACAATCAGAGGCCCCTTTTTTTCCGCCACAACCAGAACTAGTGAATAATGCAAACGTAGATC
GTGTCGTGAGTGATAGAGGGTCAGATGAGGCTAGTTGGGATAGGGGTTATCCAATAAAAGAGGTAGATATGGTTGGGCTCGATGAAGAATTGACACATGAGAGTCTATTT
GAAGGCGTGGGCAAGACTTGTCGGTGTGACTGCAAGCATTCATACGAGTCACTAGACCGACAGATGAAGGAGATGGAATTTGAAGTGAAGGAAATAAAAAACGATTTAAA
AGGGATAAAAACTGATCTAAAGTCAATTAAGAAGTACTTGCGTCGATTATCGAAGGTGATGAACTCTGATAAAGGAAAGGATCGTGTCAAGGAGGAGGGGTATGGTGGGG
TTTCAGAAGATGCGATGGTAGAGGACCGTGATATGCATAAGGTCATTGACTCGCTTTTTATGTTCGTTCGGAAGAAACTGCAACAACGGTCGGACTTGCATCGTTGGAAA
TTTGTCATTGCAGATATTGTTGTTACCGAGTTTATGAGACGTCACGACCATATATCTGAAGAGTTCAAGAAGGTGCAAGATCCTTCATTGATTACGTTCGACTGGAGTAC
GACTAAGACTGTGATGGATTACGTTATGGGTCGACACTCGGACCATGATGCACATTGGAGTACAGTTGACGCGATCTACAACCATTCAACCTCGGAGGGAAACCATTGGG
TTATGGTATGTGTTGATCTCCAGGTGGGCAAGTTGACCGTCCTTGATTCATTCATAGCACTGGCATCTGATGCAACCTTGAAGGAGTTGAGCACTTTAGCCATGCTAATG
CTACTGTTCAATGTTGTAGAGCTTATGATGATAAATTTGAGAGTGTATTTTTTCTGTTTTTCAGAAACGAAGGGGGAAAACCCCATTCGATTCCGGCGCCGGAGAGGGAG
GAGGGAGGAACGAGGATGGAGGAGAACGAAGAGGGAGGAGGGAGGAACGAGGAGGGAGGAGGGTCCTCCCTCCTCTGTCTCGTCGCCGGAGACGGAGGAGGGTCCTCCCT
CCTCTGTCTCGTCGCCGGAGACGGAGGAGGGTTCCCTCTCCGGCGCCGGAATCGAAGGGGGTTTTCCCCTTTCGTTTCTGTAG

mRNA sequence
ATGTTTAGGCAAACCATATTTGGGCCTATAGTGGATAGTAACATCATATTTAATGGTCAGTTAATCCACCATCTATTGCGTAGGGAGGTTGAGGATCCTAGACAGGATGT
CATTAGTTTCAATATATTTGGAAATAGGGTGTCCTTTGGCAAGCAAGAATTTGACCTAATCACAGGATTTAGACACCATAGAAGGATATTTGATAGAAATAAGTCAGGGG
TTAGATTGAGGCGTCTGTACTTTAAAGATAGTGTCAAAGATACAGTAGCAGATGTTGAAAAAAGGTTCTTAGAAATACAATTTGAGACTGATGAAGATGCGGTGAAGGTA
GCTCTCGCATATTTCATTGAGCTAGCAATGTTTGGGCGGGAGAGGAAGCAGAAGTTCAATTGGTTTCTATTGGGTATTATGGATGATTGGGAGATATTCTGCAACTATGA
TTGGAGCAAGGTAATTTTTGAGATGACTATCAGGAGCTTGAAAAAAGCACTTAGGGCTGCCACCCAAAGAGACGACGTGGTTGGAGAGACTAGTCGAGTGGAAAGATATA
ATCTTTACGGCTTTACACATGCTTTTCAGGTATGGGCGTATGAGACTATATCATCTCTGAAAAACCGTGTTGCGAACAAACTGAACCAGGATGCGATCCCACGCTTTTCT
CGGTGGTCATGCTCCCATTCTCCTTCGTACACCCAACTTAGCAGTGAGATATTTGGCTTGACGGAGGCAAAGGTGATAGTACAATTGGTGCCGAGCGATGCAGAACTCGA
ACACATGTGTCGCATCGTTGGTGCACCAAAGGCCCCTGTTTTTTCGCCACAATCAGAGGCCCCTTTTTTTCCGCCACAACCAGAACTAGTGAATAATGCAAACGTAGATC
GTGTCGTGAGTGATAGAGGGTCAGATGAGGCTAGTTGGGATAGGGGTTATCCAATAAAAGAGGTAGATATGGTTGGGCTCGATGAAGAATTGACACATGAGAGTCTATTT
GAAGGCGTGGGCAAGACTTGTCGGTGTGACTGCAAGCATTCATACGAGTCACTAGACCGACAGATGAAGGAGATGGAATTTGAAGTGAAGGAAATAAAAAACGATTTAAA
AGGGATAAAAACTGATCTAAAGTCAATTAAGAAGTACTTGCGTCGATTATCGAAGGTGATGAACTCTGATAAAGGAAAGGATCGTGTCAAGGAGGAGGGGTATGGTGGGG
TTTCAGAAGATGCGATGGTAGAGGACCGTGATATGCATAAGGTCATTGACTCGCTTTTTATGTTCGTTCGGAAGAAACTGCAACAACGGTCGGACTTGCATCGTTGGAAA
TTTGTCATTGCAGATATTGTTGTTACCGAGTTTATGAGACGTCACGACCATATATCTGAAGAGTTCAAGAAGGTGCAAGATCCTTCATTGATTACGTTCGACTGGAGTAC
GACTAAGACTGTGATGGATTACGTTATGGGTCGACACTCGGACCATGATGCACATTGGAGTACAGTTGACGCGATCTACAACCATTCAACCTCGGAGGGAAACCATTGGG
TTATGGTATGTGTTGATCTCCAGGTGGGCAAGTTGACCGTCCTTGATTCATTCATAGCACTGGCATCTGATGCAACCTTGAAGGAGTTGAGCACTTTAGCCATGCTAATG
CTACTGTTCAATGTTGTAGAGCTTATGATGATAAATTTGAGAGTGTATTTTTTCTGTTTTTCAGAAACGAAGGGGGAAAACCCCATTCGATTCCGGCGCCGGAGAGGGAG
GAGGGAGGAACGAGGATGGAGGAGAACGAAGAGGGAGGAGGGAGGAACGAGGAGGGAGGAGGGTCCTCCCTCCTCTGTCTCGTCGCCGGAGACGGAGGAGGGTCCTCCCT
CCTCTGTCTCGTCGCCGGAGACGGAGGAGGGTTCCCTCTCCGGCGCCGGAATCGAAGGGGGTTTTCCCCTTTCGTTTCTGTAG

Protein sequence
MFRQTIFGPIVDSNIIFNGQLIHHLLRREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRIFDRNKSGVRLRRLYFKDSVKDTVADVEKRFLEIQFETDEDAVKV
ALAYFIELAMFGRERKQKFNWFLLGIMDDWEIFCNYDWSKVIFEMTIRSLKKALRAATQRDDVVGETSRVERYNLYGFTHAFQVWAYETISSLKNRVANKLNQDAIPRFS
RWSCSHSPSYTQLSSEIFGLTEAKVIVQLVPSDAELEHMCRIVGAPKAPVFSPQSEAPFFPPQPELVNNANVDRVVSDRGSDEASWDRGYPIKEVDMVGLDEELTHESLF
EGVGKTCRCDCKHSYESLDRQMKEMEFEVKEIKNDLKGIKTDLKSIKKYLRRLSKVMNSDKGKDRVKEEGYGGVSEDAMVEDRDMHKVIDSLFMFVRKKLQQRSDLHRWK
FVIADIVVTEFMRRHDHISEEFKKVQDPSLITFDWSTTKTVMDYVMGRHSDHDAHWSTVDAIYNHSTSEGNHWVMVCVDLQVGKLTVLDSFIALASDATLKELSTLAMLM
LLFNVVELMMINLRVYFFCFSETKGENPIRFRRRRGRREERGWRRTKREEGGTRREEGPPSSVSSPETEEGPPSSVSSPETEEGSLSGAGIEGGFPLSFL