CuGenDBv2

Lag0038636 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038636
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ulp1-like peptidase
Genome location: chr2:22077981..22079709
RNA-Seq Expression: Lag0038636
Synteny: Lag0038636
Gene Ontology terms: NA
InterPro domains: IPR015410 - Domain of unknown function DUF1985


Homology
GenBank top hits: e value, % identity, alignment
XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia], e value 2.1e-58, identity 42.81%
Query:  DHFPAALTCCSHLNKTISNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGFRHSFRPMRRD
        D FPA LT  +H++KT + IK+ L+PTQL++FRQTCFGP++D+ V+FNG L+H++LL EVEE R +V+SF L  ++VSFGKREFDLITG  H    +   
Subjt:  DHFPAALTCCSHLNKTISNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGFRHSFRPMRRD

Query:  IEGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTIKSLKKALKGKA
        I  P  RL   YF+++V +K  EL+K F    F +DED VK+   YF ELA+MG+ERKQ +D  T+ ++D W AFCN DWS+++FD+TI SLK  LK K 
Subjt:  IEGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTIKSLKKALKGKA

Query:  ESYKRKG-GGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRISETAIPRIRRWSCSHSPSYTIIADEVFGSRATRVTLSLVSSAEESQFMNRVMEPP
         +Y++K    P   ETYSLYGFP+                           R+RR          ++A EVF +  ++V   L+++  E Q M RV+ PP
Subjt:  ESYKRKG-GGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRISETAIPRIRRWSCSHSPSYTIIADEVFGSRATRVTLSLVSSAEESQFMNRVMEPP

Query:  RAPEPVPEPILEPEQE--PDQETERQP
             +P+P   P++   PD+     P
Subjt:  RAPEPVPEPILEPEQE--PDQETERQP

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia], e value 2.0e-77, identity 44.87%
Query:  MEITEKIPITDHFPAALTCCSHLNKTISNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGF
        M++   I   D FPA LT  +H++KT + IK+ L+PTQL++FRQTCFGP++D++V+FNG L+H++LLREVEE R +V+SF L G++VSFGKREFDLITG 
Subjt:  MEITEKIPITDHFPAALTCCSHLNKTISNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGF

Query:  RHSFRPMRRDIEGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTIK
         H  R  R D   P  RL   YF++ V +K  EL+K F    F +DED VK+   YF ELA+MG+ERKQ +D + L ++D W  FCN DWS+++FD+TI 
Subjt:  RHSFRPMRRDIEGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTIK

Query:  SLKKALKGKAESYKRKG-GGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRISETAIPRIRRWSCSHSPSYTIIADEVFGSRATRVTLSLVSSAEES
        SLK ALK K   Y++K    P   ETYSLYGFP+AFQ+WAYET+S+L        S+ AIPR+ RWSC +S  + ++  EVF +  ++V   L+++  + 
Subjt:  SLKKALKGKAESYKRKG-GGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRISETAIPRIRRWSCSHSPSYTIIADEVFGSRATRVTLSLVSSAEES

Query:  QFMNRVMEPPRA---PEPVPEP----ILEPEQEPDQETERQPTVSEAILPVVEEATVADTEMLDVTEASPEVSNKRGREQD-DKNKGKRK
        Q M RV+ PP     P+P   P    + +P   P++     P     + P+  E  V D   +D  EA P  ++  G E+   KNK K++
Subjt:  QFMNRVMEPPRA---PEPVPEP----ILEPEQEPDQETERQPTVSEAILPVVEEATVADTEMLDVTEASPEVSNKRGREQD-DKNKGKRK

XP_022155158.1 uncharacterized protein LOC111022300 [Momordica charantia], e value 8.4e-55, identity 51.96%
Query:  DHFPAALTCCSHLNKTISNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGFRHSFRPMRRD
        D FP  LT  +H +KT S +K  L+PTQ+++FRQTCFGP++D++V+FNG L+H++LLREVEE R +++SF L G++VSFGKREFDLITG   S+R +R D
Subjt:  DHFPAALTCCSHLNKTISNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGFRHSFRPMRRD

Query:  IEGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTIKSLKKALKGKA
         + P  RL   YF+++V +K  EL+K F    F +DEDAVK+   YF ELA+MG+ERKQ +DA+ L ++D W  FCN DWS+++F++T+ SLK A+  K 
Subjt:  IEGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTIKSLKKALKGKA

Query:  ESYK
         +Y+
Subjt:  ESYK

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia], e value 3.4e-80, identity 51.33%
Query:  MEITEKIPITDHFPAALTCCSHLNKTISNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGF
        M +T KI   D FPAAL+  +H+ KT S +K+ L+P+QL++F QTCFGP++ +NV+FNG L+H++LLREVEE + +++SF L G +VSFGKREFDLITG 
Subjt:  MEITEKIPITDHFPAALTCCSHLNKTISNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGF

Query:  RHSFRPMRRDIEGPPNRLLR-LYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTI
        RH+   +  D+    NR LR LYF++   +K  EL+K F    FENDEDAVKIA  YF ELA+MG+ERK ++D S L ++D W  FCN DWS+++F++T+
Subjt:  RHSFRPMRRDIEGPPNRLLR-LYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTI

Query:  KSLKKALKGKAESYKRKGGGPKKQ-ETYSLYGFPFAFQIWAYETVSSLTGRVANRISETAIPRIRRWSCSHSPSYTIIADEVFGSRATRVTLSLVSSAEE
         SLK ALK K E YK+K        ETYSLY FP+AFQ+WAYET+S+L+ RVA R+++ AIPR+ RWSC++S ++ ++  EVF +  ++V + L ++  E
Subjt:  KSLKKALKGKAESYKRKGGGPKKQ-ETYSLYGFPFAFQIWAYETVSSLTGRVANRISETAIPRIRRWSCSHSPSYTIIADEVFGSRATRVTLSLVSSAEE

XP_022157199.1 uncharacterized protein LOC111023969 [Momordica charantia], e value 1.7e-55, identity 42.38%
Query:  MEITEKIPITDHFPAALTCCSHLNKTISNIKSTLSPTQLNLFR-QTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITG
        ME T K+   D FPA +T  SHL+ T   I   L+PTQL++FR +T FG  +D++++F   LVHY LLREV + R +VM F +LG  V+F K EF L+TG
Subjt:  MEITEKIPITDHFPAALTCCSHLNKTISNIKSTLSPTQLNLFR-QTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITG

Query:  -FRHSFRPMRRDIEGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRER-KQQVDASTLNLMDNWVAFCNEDWSTIVFDK
         +R S R +++ +    NRL R YF++ V +++EE ++ +  + F ND+DAVK++  Y+ E+ +MG+ + K  VD      +++   F N DW T ++ +
Subjt:  -FRHSFRPMRRDIEGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRER-KQQVDASTLNLMDNWVAFCNEDWSTIVFDK

Query:  TIKSLKKALKGKAESYKRKGGGPKK-QETYSLYGFPFAFQIWAYETVSSLTGRVANRISETAIPRIRRWSCSHSPSYTIIADEVFGSRATRVTLSLVSSA
        T+K L+ A+K K  +YK K    KK Q  YSL GFP AFQ+WAYE + SL     NR+S+TA+PRI R+SCS S +  ++  +VF S    +T  LV S 
Subjt:  TIKSLKKALKGKAESYKRKGGGPKK-QETYSLYGFPFAFQIWAYETVSSLTGRVANRISETAIPRIRRWSCSHSPSYTIIADEVFGSRATRVTLSLVSSA

Query:  EE
         E
Subjt:  EE

TrEMBL top hits: e value, % identity, alignment
A0A6J1CZE8 uncharacterized protein LOC111015600, e value 1.0e-58, identity 42.81%
Query:  DHFPAALTCCSHLNKTISNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGFRHSFRPMRRD
        D FPA LT  +H++KT + IK+ L+PTQL++FRQTCFGP++D+ V+FNG L+H++LL EVEE R +V+SF L  ++VSFGKREFDLITG  H    +   
Subjt:  DHFPAALTCCSHLNKTISNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGFRHSFRPMRRD

Query:  IEGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTIKSLKKALKGKA
        I  P  RL   YF+++V +K  EL+K F    F +DED VK+   YF ELA+MG+ERKQ +D  T+ ++D W AFCN DWS+++FD+TI SLK  LK K 
Subjt:  IEGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTIKSLKKALKGKA

Query:  ESYKRKG-GGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRISETAIPRIRRWSCSHSPSYTIIADEVFGSRATRVTLSLVSSAEESQFMNRVMEPP
         +Y++K    P   ETYSLYGFP+                           R+RR          ++A EVF +  ++V   L+++  E Q M RV+ PP
Subjt:  ESYKRKG-GGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRISETAIPRIRRWSCSHSPSYTIIADEVFGSRATRVTLSLVSSAEESQFMNRVMEPP

Query:  RAPEPVPEPILEPEQE--PDQETERQP
             +P+P   P++   PD+     P
Subjt:  RAPEPVPEPILEPEQE--PDQETERQP

A0A6J1DJX9 uncharacterized protein LOC111020757, e value 9.9e-78, identity 44.87%
Query:  MEITEKIPITDHFPAALTCCSHLNKTISNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGF
        M++   I   D FPA LT  +H++KT + IK+ L+PTQL++FRQTCFGP++D++V+FNG L+H++LLREVEE R +V+SF L G++VSFGKREFDLITG 
Subjt:  MEITEKIPITDHFPAALTCCSHLNKTISNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGF

Query:  RHSFRPMRRDIEGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTIK
         H  R  R D   P  RL   YF++ V +K  EL+K F    F +DED VK+   YF ELA+MG+ERKQ +D + L ++D W  FCN DWS+++FD+TI 
Subjt:  RHSFRPMRRDIEGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTIK

Query:  SLKKALKGKAESYKRKG-GGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRISETAIPRIRRWSCSHSPSYTIIADEVFGSRATRVTLSLVSSAEES
        SLK ALK K   Y++K    P   ETYSLYGFP+AFQ+WAYET+S+L        S+ AIPR+ RWSC +S  + ++  EVF +  ++V   L+++  + 
Subjt:  SLKKALKGKAESYKRKG-GGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRISETAIPRIRRWSCSHSPSYTIIADEVFGSRATRVTLSLVSSAEES

Query:  QFMNRVMEPPRA---PEPVPEP----ILEPEQEPDQETERQPTVSEAILPVVEEATVADTEMLDVTEASPEVSNKRGREQD-DKNKGKRK
        Q M RV+ PP     P+P   P    + +P   P++     P     + P+  E  V D   +D  EA P  ++  G E+   KNK K++
Subjt:  QFMNRVMEPPRA---PEPVPEP----ILEPEQEPDQETERQPTVSEAILPVVEEATVADTEMLDVTEASPEVSNKRGREQD-DKNKGKRK

A0A6J1DM82 uncharacterized protein LOC111022300, e value 4.0e-55, identity 51.96%
Query:  DHFPAALTCCSHLNKTISNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGFRHSFRPMRRD
        D FP  LT  +H +KT S +K  L+PTQ+++FRQTCFGP++D++V+FNG L+H++LLREVEE R +++SF L G++VSFGKREFDLITG   S+R +R D
Subjt:  DHFPAALTCCSHLNKTISNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGFRHSFRPMRRD

Query:  IEGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTIKSLKKALKGKA
         + P  RL   YF+++V +K  EL+K F    F +DEDAVK+   YF ELA+MG+ERKQ +DA+ L ++D W  FCN DWS+++F++T+ SLK A+  K 
Subjt:  IEGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTIKSLKKALKGKA

Query:  ESYK
         +Y+
Subjt:  ESYK

A0A6J1DRZ7 uncharacterized protein LOC111023847, e value 1.6e-80, identity 51.33%
Query:  MEITEKIPITDHFPAALTCCSHLNKTISNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGF
        M +T KI   D FPAAL+  +H+ KT S +K+ L+P+QL++F QTCFGP++ +NV+FNG L+H++LLREVEE + +++SF L G +VSFGKREFDLITG 
Subjt:  MEITEKIPITDHFPAALTCCSHLNKTISNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGF

Query:  RHSFRPMRRDIEGPPNRLLR-LYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTI
        RH+   +  D+    NR LR LYF++   +K  EL+K F    FENDEDAVKIA  YF ELA+MG+ERK ++D S L ++D W  FCN DWS+++F++T+
Subjt:  RHSFRPMRRDIEGPPNRLLR-LYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTI

Query:  KSLKKALKGKAESYKRKGGGPKKQ-ETYSLYGFPFAFQIWAYETVSSLTGRVANRISETAIPRIRRWSCSHSPSYTIIADEVFGSRATRVTLSLVSSAEE
         SLK ALK K E YK+K        ETYSLY FP+AFQ+WAYET+S+L+ RVA R+++ AIPR+ RWSC++S ++ ++  EVF +  ++V + L ++  E
Subjt:  KSLKKALKGKAESYKRKGGGPKKQ-ETYSLYGFPFAFQIWAYETVSSLTGRVANRISETAIPRIRRWSCSHSPSYTIIADEVFGSRATRVTLSLVSSAEE

A0A6J1DSS5 uncharacterized protein LOC111023969, e value 8.2e-56, identity 42.38%
Query:  MEITEKIPITDHFPAALTCCSHLNKTISNIKSTLSPTQLNLFR-QTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITG
        ME T K+   D FPA +T  SHL+ T   I   L+PTQL++FR +T FG  +D++++F   LVHY LLREV + R +VM F +LG  V+F K EF L+TG
Subjt:  MEITEKIPITDHFPAALTCCSHLNKTISNIKSTLSPTQLNLFR-QTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITG

Query:  -FRHSFRPMRRDIEGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRER-KQQVDASTLNLMDNWVAFCNEDWSTIVFDK
         +R S R +++ +    NRL R YF++ V +++EE ++ +  + F ND+DAVK++  Y+ E+ +MG+ + K  VD      +++   F N DW T ++ +
Subjt:  -FRHSFRPMRRDIEGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRER-KQQVDASTLNLMDNWVAFCNEDWSTIVFDK

Query:  TIKSLKKALKGKAESYKRKGGGPKK-QETYSLYGFPFAFQIWAYETVSSLTGRVANRISETAIPRIRRWSCSHSPSYTIIADEVFGSRATRVTLSLVSSA
        T+K L+ A+K K  +YK K    KK Q  YSL GFP AFQ+WAYE + SL     NR+S+TA+PRI R+SCS S +  ++  +VF S    +T  LV S 
Subjt:  TIKSLKKALKGKAESYKRKGGGPKK-QETYSLYGFPFAFQIWAYETVSSLTGRVANRISETAIPRIRRWSCSHSPSYTIIADEVFGSRATRVTLSLVSSA

Query:  EE
         E
Subjt:  EE

SwissProt top hits: e value, % identity, alignment
No hits found
Arabidopsis top hits: e value, % identity, alignment
AT1G31150.1 Domain of unknown function (DUF1985), e value 3.2e-04, identity 23.56%
Query:  FGPLIDVNVI---FNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGFRHSFRPMRRDIEGPP--------NRLLRLYFRENVGMKVEELD
        FG L +  V     +G+L+H +L R+V   + + + F   G  + F  REF ++TG R    P   +++           NRL        +G  +E L 
Subjt:  FGPLIDVNVI---FNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGFRHSFRPMRRDIEGPP--------NRLLRLYFRENVGMKVEELD

Query:  KSFPTLQFENDEDAVKIASFYFFELAL-------MGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTIKSL---KKALKGKAESYKRKGGGPKKQET
        K              K++S+    LAL       +    +  V    + ++++   F    W    F  TI+     K A     +  KR      KQ+T
Subjt:  KSFPTLQFENDEDAVKIASFYFFELAL-------MGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTIKSL---KKALKGKAESYKRKGGGPKKQET

Query:  YSLYGFPFAFQIWAYETVSSLTGRV
         + YGFP A Q+  +E++  +  R+
Subjt:  YSLYGFPFAFQIWAYETVSSLTGRV


Sequences
CDS sequence
ATGAATCCCGACCGGGATTCATCCTCTGTAATTTCTCGGTCAACCGAGAAATATGGCAGACTCGAAGCCTTTCACGGTTGCCGAGAAATTACAGAGGATGAATCCCGGGC
TCGATTTCGACCGGACTTTCTCGGTCACCGGAAATATGGCAGACATCGAAGCCTTTCTCGGTTGGCCGAGAAAGTACACGTACCAATCCCAACCCTCGATTTAACCGAGA
ATTTCTCGGTCGACCGGGAAATATGGCAGAATCGAAGCCTTTCTCGGTCTAGTATGGAAATCACCGAGAAAATCCCCATCACTGACCATTTTCCTGCTGCGTTGACATGT
TGCTCACACCTAAACAAAACCATTAGCAATATTAAGTCAACCCTAAGTCCTACCCAATTAAACCTCTTTAGGCAAACATGTTTCGGGCCTTTAATAGACGTGAATGTTAT
TTTTAATGGCCAATTAGTACACTACATCCTCCTTAGGGAAGTAGAGGAGAATAGGGCAAATGTGATGAGTTTTAAATTGTTAGGTCAGAAGGTCTCATTTGGTAAGAGAG
AGTTTGACCTCATAACCGGCTTTCGTCATTCATTTAGACCAATGAGGAGAGATATAGAGGGCCCTCCCAATAGACTCCTAAGATTATATTTTAGGGAGAATGTAGGTATG
AAGGTGGAGGAGTTAGATAAGTCGTTTCCGACCCTTCAGTTTGAGAACGACGAAGATGCAGTTAAGATCGCATCGTTTTATTTTTTTGAGTTGGCTTTGATGGGGAGGGA
ACGCAAACAACAAGTAGATGCCAGCACTCTAAACTTGATGGATAACTGGGTTGCATTCTGCAATGAGGACTGGAGTACCATCGTGTTTGACAAGACAATAAAAAGTCTAA
AAAAGGCACTGAAGGGGAAGGCTGAGTCGTACAAGCGCAAGGGCGGTGGTCCAAAGAAACAGGAGACATACAGTCTGTACGGTTTCCCGTTTGCTTTTCAGATATGGGCT
TACGAGACCGTTTCATCTCTCACCGGACGTGTTGCCAATCGTATAAGTGAGACGGCCATCCCACGCATTCGTCGATGGTCTTGCTCCCACTCCCCATCATACACAATCAT
TGCTGATGAGGTTTTTGGATCCCGAGCGACAAGAGTTACGTTGAGTCTTGTTTCTTCTGCAGAGGAGAGTCAATTCATGAATCGAGTGATGGAGCCTCCACGTGCACCGG
AGCCGGTGCCTGAGCCAATATTAGAGCCGGAGCAAGAACCAGATCAAGAAACAGAGAGACAACCTACTGTATCTGAGGCTATACTCCCTGTTGTAGAGGAGGCTACTGTA
GCAGATACTGAGATGTTGGATGTCACTGAAGCTTCTCCAGAAGTTTCAAATAAAAGAGGAAGGGAACAAGATGACAAAAACAAAGGAAAGAGAAAGAGAATGAAGTAG
mRNA sequence
ATGAATCCCGACCGGGATTCATCCTCTGTAATTTCTCGGTCAACCGAGAAATATGGCAGACTCGAAGCCTTTCACGGTTGCCGAGAAATTACAGAGGATGAATCCCGGGC
TCGATTTCGACCGGACTTTCTCGGTCACCGGAAATATGGCAGACATCGAAGCCTTTCTCGGTTGGCCGAGAAAGTACACGTACCAATCCCAACCCTCGATTTAACCGAGA
ATTTCTCGGTCGACCGGGAAATATGGCAGAATCGAAGCCTTTCTCGGTCTAGTATGGAAATCACCGAGAAAATCCCCATCACTGACCATTTTCCTGCTGCGTTGACATGT
TGCTCACACCTAAACAAAACCATTAGCAATATTAAGTCAACCCTAAGTCCTACCCAATTAAACCTCTTTAGGCAAACATGTTTCGGGCCTTTAATAGACGTGAATGTTAT
TTTTAATGGCCAATTAGTACACTACATCCTCCTTAGGGAAGTAGAGGAGAATAGGGCAAATGTGATGAGTTTTAAATTGTTAGGTCAGAAGGTCTCATTTGGTAAGAGAG
AGTTTGACCTCATAACCGGCTTTCGTCATTCATTTAGACCAATGAGGAGAGATATAGAGGGCCCTCCCAATAGACTCCTAAGATTATATTTTAGGGAGAATGTAGGTATG
AAGGTGGAGGAGTTAGATAAGTCGTTTCCGACCCTTCAGTTTGAGAACGACGAAGATGCAGTTAAGATCGCATCGTTTTATTTTTTTGAGTTGGCTTTGATGGGGAGGGA
ACGCAAACAACAAGTAGATGCCAGCACTCTAAACTTGATGGATAACTGGGTTGCATTCTGCAATGAGGACTGGAGTACCATCGTGTTTGACAAGACAATAAAAAGTCTAA
AAAAGGCACTGAAGGGGAAGGCTGAGTCGTACAAGCGCAAGGGCGGTGGTCCAAAGAAACAGGAGACATACAGTCTGTACGGTTTCCCGTTTGCTTTTCAGATATGGGCT
TACGAGACCGTTTCATCTCTCACCGGACGTGTTGCCAATCGTATAAGTGAGACGGCCATCCCACGCATTCGTCGATGGTCTTGCTCCCACTCCCCATCATACACAATCAT
TGCTGATGAGGTTTTTGGATCCCGAGCGACAAGAGTTACGTTGAGTCTTGTTTCTTCTGCAGAGGAGAGTCAATTCATGAATCGAGTGATGGAGCCTCCACGTGCACCGG
AGCCGGTGCCTGAGCCAATATTAGAGCCGGAGCAAGAACCAGATCAAGAAACAGAGAGACAACCTACTGTATCTGAGGCTATACTCCCTGTTGTAGAGGAGGCTACTGTA
GCAGATACTGAGATGTTGGATGTCACTGAAGCTTCTCCAGAAGTTTCAAATAAAAGAGGAAGGGAACAAGATGACAAAAACAAAGGAAAGAGAAAGAGAATGAAGTAG
Protein sequence
MNPDRDSSSVISRSTEKYGRLEAFHGCREITEDESRARFRPDFLGHRKYGRHRSLSRLAEKVHVPIPTLDLTENFSVDREIWQNRSLSRSSMEITEKIPITDHFPAALTC
CSHLNKTISNIKSTLSPTQLNLFRQTCFGPLIDVNVIFNGQLVHYILLREVEENRANVMSFKLLGQKVSFGKREFDLITGFRHSFRPMRRDIEGPPNRLLRLYFRENVGM
KVEELDKSFPTLQFENDEDAVKIASFYFFELALMGRERKQQVDASTLNLMDNWVAFCNEDWSTIVFDKTIKSLKKALKGKAESYKRKGGGPKKQETYSLYGFPFAFQIWA
YETVSSLTGRVANRISETAIPRIRRWSCSHSPSYTIIADEVFGSRATRVTLSLVSSAEESQFMNRVMEPPRAPEPVPEPILEPEQEPDQETERQPTVSEAILPVVEEATV
ADTEMLDVTEASPEVSNKRGREQDDKNKGKRKRMK