; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g26460 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g26460
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr11:19505320..19529232
RNA-Seq ExpressionMoc11g26460
SyntenyMoc11g26460
Gene Ontology termsNA
InterPro domainsIPR025312 - Domain of unknown function DUF4216
IPR029480 - Transposase-associated domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAD1842444.1 unnamed protein product [Ananas comosus var. bracteatus]5.7e-5935.36Show/hide
Query:  ISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTQWTMHGEESLSYGLGENDNDSGEEDIFEILEDHFGVFNTNNWTEKGESSKHGYNEEPNEEASKFYRL
        I CPCK+CNN + K R++VE D+++ G+V  YT+W +HGEE  +  +G +D DS  E + +ILE HFG  N   W  +   +    +EEPNE A+KF++L
Subjt:  ISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTQWTMHGEESLSYGLGENDNDSGEEDIFEILEDHFGVFNTNNWTEKGESSKHGYNEEPNEEASKFYRL

Query:  LNDAEKELYPGFGHPMRPGKQSMRLTTKEWRQAHLYILQNCDDVLPYIGEHIQALQHVDARNAQRKHDKEFIEWFES-----------------------
        L D  ++L      P+  GK+  +LT +EW  A L++L+N ++V P++ E    +     +N + K DK+F EWFE+                       
Subjt:  LNDAEKELYPGFGHPMRPGKQSMRLTTKEWRQAHLYILQNCDDVLPYIGEHIQALQHVDARNAQRKHDKEFIEWFES-----------------------

Query:  ----------------------HDVDDLRKTQNSGVVVRGTDNR---EYFGVVHEIIELQYMVGNNVVLFKCKWWDINSRSKGIRVDDHGLTSVNMTYT-
                              H++   +K+QNSGVVV+G       +++G++  I+EL+YM  + VVLFKC WWD+++ ++GI+ D++G  ++N + T 
Subjt:  ----------------------HDVDDLRKTQNSGVVVRGTDNR---EYFGVVHEIIELQYMVGNNVVLFKCKWWDINSRSKGIRVDDHGLTSVNMTYT-

Query:  -FATNEPFVLACQSEQVIYLEDRRNPTWYFVLKVDPRDYYKIPSE
         +  ++PFVLAC SEQV Y+ D R   W+ V++ +PRD+Y +P E
Subjt:  -FATNEPFVLACQSEQVIYLEDRRNPTWYFVLKVDPRDYYKIPSE

XP_016180369.1 uncharacterized protein LOC107622839 isoform X2 [Arachis ipaensis]2.3e-4439.17Show/hide
Query:  GHPMRPGKQSMRLTTKEWRQAHLYILQNCDDVLPYIGEHIQALQHVDARNAQRKHDKEFIEWFESH----------------------------------
        G P R  K + RL+ +E +Q+HLYIL+NCD V P+I +H + L+  + RN Q++HD+EF +WFESH                                  
Subjt:  GHPMRPGKQSMRLTTKEWRQAHLYILQNCDDVLPYIGEHIQALQHVDARNAQRKHDKEFIEWFESH----------------------------------

Query:  ---------DVDDLRKTQNSGVVVRGTDNREYFGVVHEIIELQYMVGNNVVLFKCKWWDINSRSKGIRVDDHGLTSVNMTYTFATNEPFVLACQSEQVIY
                 D ++ RKTQ+SGV+V+    +EY+GV+ +I+EL YM  N VV+FKC WWD+++  +G++VD++G+T VN   T  T E FV+ACQ EQV Y
Subjt:  ---------DVDDLRKTQNSGVVVRGTDNREYFGVVHEIIELQYMVGNNVVLFKCKWWDINSRSKGIRVDDHGLTSVNMTYTFATNEPFVLACQSEQVIY

Query:  LEDRRNPTWYFVLKVDPRDYYKIPSELKSYASQVIDGTLD
        +ED  N  W  V+KV PRDY+ +P E +    +V +   D
Subjt:  LEDRRNPTWYFVLKVDPRDYYKIPSELKSYASQVIDGTLD

XP_020968830.1 uncharacterized protein LOC107622839 isoform X1 [Arachis ipaensis]2.3e-4439.17Show/hide
Query:  GHPMRPGKQSMRLTTKEWRQAHLYILQNCDDVLPYIGEHIQALQHVDARNAQRKHDKEFIEWFESH----------------------------------
        G P R  K + RL+ +E +Q+HLYIL+NCD V P+I +H + L+  + RN Q++HD+EF +WFESH                                  
Subjt:  GHPMRPGKQSMRLTTKEWRQAHLYILQNCDDVLPYIGEHIQALQHVDARNAQRKHDKEFIEWFESH----------------------------------

Query:  ---------DVDDLRKTQNSGVVVRGTDNREYFGVVHEIIELQYMVGNNVVLFKCKWWDINSRSKGIRVDDHGLTSVNMTYTFATNEPFVLACQSEQVIY
                 D ++ RKTQ+SGV+V+    +EY+GV+ +I+EL YM  N VV+FKC WWD+++  +G++VD++G+T VN   T  T E FV+ACQ EQV Y
Subjt:  ---------DVDDLRKTQNSGVVVRGTDNREYFGVVHEIIELQYMVGNNVVLFKCKWWDINSRSKGIRVDDHGLTSVNMTYTFATNEPFVLACQSEQVIY

Query:  LEDRRNPTWYFVLKVDPRDYYKIPSELKSYASQVIDGTLD
        +ED  N  W  V+KV PRDY+ +P E +    +V +   D
Subjt:  LEDRRNPTWYFVLKVDPRDYYKIPSELKSYASQVIDGTLD

XP_025646460.1 uncharacterized protein LOC112741639 [Arachis hypogaea]8.8e-4437.17Show/hide
Query:  HGYNEEPNEEASKFYRLLNDAEKELYPGFGHPMRPGKQSMRLTTKEWRQAHLYILQNCDDVLPYIGEHIQALQHVDARNAQRKHDKEFIEWFESH-----
        HGY  E  ++    ++             G P R  K + RL+ +E +QAHLYIL+ CD V P+I +H + L+  + RN Q++HD+EF +WFESH     
Subjt:  HGYNEEPNEEASKFYRLLNDAEKELYPGFGHPMRPGKQSMRLTTKEWRQAHLYILQNCDDVLPYIGEHIQALQHVDARNAQRKHDKEFIEWFESH-----

Query:  --------------------------------------DVDDLRKTQNSGVVVRGTDNREYFGVVHEIIELQYMVGNNVVLFKCKWWDINSRSKGIRVDD
                                              D ++ RKTQ+SGV+V+    +EY+GV+ +I+EL YM  N VV+FKC WWD+++  +G++VD+
Subjt:  --------------------------------------DVDDLRKTQNSGVVVRGTDNREYFGVVHEIIELQYMVGNNVVLFKCKWWDINSRSKGIRVDD

Query:  HGLTSVNMTYTFATNEPFVLACQSEQVIYLEDRRNPTWYFVLKVDPRDYYKIPSELKSYASQVIDGTLD
        +G+T VN   T  T E FV+ACQSEQV Y+ED  N  W  V+KV PRDY+ +P E +    +V D   D
Subjt:  HGLTSVNMTYTFATNEPFVLACQSEQVIYLEDRRNPTWYFVLKVDPRDYYKIPSELKSYASQVIDGTLD

XP_025646460.1 uncharacterized protein LOC112741639 [Arachis hypogaea]5.0e-1542.52Show/hide
Query:  EISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTQWTMHG-----EESLSYGLGENDNDSGEEDIFEILEDHFGVFNTNNWTEK----------GESSKH
        +I CPC +CNN + K+R+ VE DLL  GIV +YT W  HG     E   S     +D+   ++D+  +L DHFGV++     E+          GE  + 
Subjt:  EISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTQWTMHG-----EESLSYGLGENDNDSGEEDIFEILEDHFGVFNTNNWTEK----------GESSKH

Query:  GYNEEPNEEASKFYRLLNDAEKELYPG
         + EEPNE+A+KFY+LL+D+EKELYPG
Subjt:  GYNEEPNEEASKFYRLLNDAEKELYPG

XP_025646460.1 uncharacterized protein LOC112741639 [Arachis hypogaea]1.4e-4173.21Show/hide
Query:  EISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTQWTMHGEESLSYGLGENDNDSGEEDIFEILEDHFGVFNTNNWTEKGESSKHGYNEEPNEEASKFYR
        +ISCPCKRCNN +LKTRD+VE DLLMFGIVPSY +WTMHGEES +Y +GENDND+ +ED+FEILEDHFG  +T+NW  K ES+KHGY+EEPNE AS+FY 
Subjt:  EISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTQWTMHGEESLSYGLGENDNDSGEEDIFEILEDHFGVFNTNNWTEKGESSKHGYNEEPNEEASKFYR

Query:  LLNDAEKELYPG
        +LN AEKELY G
Subjt:  LLNDAEKELYPG

TrEMBL top hitse value%identityAlignment
A0A2N9GUI6 Uncharacterized protein3.0e-4542.6Show/hide
Query:  RPGKQSMRLTTKEWRQAHLYILQNCDDVLPYIGEHIQALQHVDARNAQRKHDKEFIEWFESH--------------------------------------
        R G  S +L+ +EW QAH Y+L+NCD+V  +I EH   ++   ARN + +H K+FIEWFESH                                      
Subjt:  RPGKQSMRLTTKEWRQAHLYILQNCDDVLPYIGEHIQALQHVDARNAQRKHDKEFIEWFESH--------------------------------------

Query:  -----DVDDLRKTQNSGVVVRG---TDNREYFGVVHEIIELQYMVGNNVVLFKCKWWDINSRSKGIRVDDHGLTSVNMTYTFATNEPFVLACQSEQVIYL
             + D  RKTQ+ GV+V+G   T N +Y+GV+ +IIEL+YM GN +V+FKC+WWD+N+  +GI VD++G T VN+T    +NEPFVLACQ EQV Y+
Subjt:  -----DVDDLRKTQNSGVVVRG---TDNREYFGVVHEIIELQYMVGNNVVLFKCKWWDINSRSKGIRVDDHGLTSVNMTYTFATNEPFVLACQSEQVIYL

Query:  EDRRNPTWYFVLKVDPRDYYKIP
        +D +NP W FV+K +PR+YY +P
Subjt:  EDRRNPTWYFVLKVDPRDYYKIP

A0A2N9GUI6 Uncharacterized protein2.1e-2251.3Show/hide
Query:  ELEISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTQWTMHGEESLSYGLGE--NDNDSGEEDIFEILEDHFGVFNTNNWTEKGESSKHGYNEEPNEEAS
        E +I CPCKRCN+   K+RDDVEADL+  GIVP+YT+W  HGEE+ S       +D++S   D+ E++ED+FG  N  +W   GE S +G  EEPN++A+
Subjt:  ELEISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTQWTMHGEESLSYGLGE--NDNDSGEEDIFEILEDHFGVFNTNNWTEKGESSKHGYNEEPNEEAS

Query:  KFYRLLNDAEKELYP
        KF+RLL D E++LYP
Subjt:  KFYRLLNDAEKELYP

A0A2N9GUI6 Uncharacterized protein2.9e-4029.43Show/hide
Query:  ISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTQWTMHGE--ESLSYGLGENDN---DSGEEDIFEILEDHFG-VFNTNNW-TEKGESSKHGYNEEPNEE
        I CPC +C N   K+R +V+ DL   GI+ SYT+W  HGE  +  S    E+ N   D G  ++  +L+D FG V N++N  T  GE +           
Subjt:  ISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTQWTMHGE--ESLSYGLGENDN---DSGEEDIFEILEDHFG-VFNTNNW-TEKGESSKHGYNEEPNEE

Query:  ASKFYRLLNDAEKELYPGFGHPMRPGKQSMRLTTKEWRQAHLYILQNCDDVLPYIGEHIQALQHVDARNAQRKHDKEFIEWFE-----------------
                                   +  +L   +W +A + +L+NC++V PY+ EHI+ +Q     + + +H+++F+ WF+                 
Subjt:  ASKFYRLLNDAEKELYPGFGHPMRPGKQSMRLTTKEWRQAHLYILQNCDDVLPYIGEHIQALQHVDARNAQRKHDKEFIEWFE-----------------

Query:  --------------------------SHDVDDLRKTQNSGVVVRGTD---NREYFGVVHEIIELQYMVGNNVVLFKCKWWDINSRSKGIRVDDHGLTSVN
                                  + + +   KTQNSGV VRG D   ++EY+G++ +IIELQY     + LFKC WWD+ +  +G + D++G   VN
Subjt:  --------------------------SHDVDDLRKTQNSGVVVRGTD---NREYFGVVHEIIELQYMVGNNVVLFKCKWWDINSRSKGIRVDDHGLTSVN

Query:  MTYTFATN--EPFVLACQSEQVIYLEDRRNPTWYFVLKVDPRDYYKIPS-ELKSYASQVIDGTLDMNNDEI--FVQVFGPEKHG
        +    ATN  +P++LA Q++QV Y +D R+P W  V+K  PRD Y +P+ E  +   +    T    N+ I  F ++ G + HG
Subjt:  MTYTFATN--EPFVLACQSEQVIYLEDRRNPTWYFVLKVDPRDYYKIPS-ELKSYASQVIDGTLDMNNDEI--FVQVFGPEKHG

A0A443Q533 Uncharacterized protein1.4e-3938.15Show/hide
Query:  LYPGFGHPMRPGKQSMR-LTTKEWRQAHLYILQNCDDVLPYIGEHIQALQ-----HVDARNAQRKHDKEFIEWFESH-----------------------
        ++ G G  +  GK ++R L+T EW QAHLY+L NCDDV P++  H Q ++      V  R+  RKH KEF +WFE H                       
Subjt:  LYPGFGHPMRPGKQSMR-LTTKEWRQAHLYILQNCDDVLPYIGEHIQALQ-----HVDARNAQRKHDKEFIEWFESH-----------------------

Query:  ----------------DVDDLRKTQNSGVVVRG--------------TDNREYFGVVHEIIELQYMVGNNVVLFKCKWWDI-NSRSKGIRVDDHGLTSVN
                        D +  RKTQNSGV +                T N  Y+GV+ ++IELQY+ GN +VLFKC WWD+ N   +GI+ D++G T VN
Subjt:  ----------------DVDDLRKTQNSGVVVRG--------------TDNREYFGVVHEIIELQYMVGNNVVLFKCKWWDI-NSRSKGIRVDDHGLTSVN

Query:  MTYTFATNEPFVLACQSEQVIYLEDRRNPTWYFVLKVDPRDYYKIPSEL
         T T  TNEPF+LA Q++QV Y++D   P W+  +K+ PRD Y + SE+
Subjt:  MTYTFATNEPFVLACQSEQVIYLEDRRNPTWYFVLKVDPRDYYKIPSEL

A0A443Q533 Uncharacterized protein1.1e-1548.25Show/hide
Query:  EISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTQWTMHGEESLSYGLGENDNDSGEE--DIFEILEDHFGVFNTNNWTEKGESSKHGYNEEPNEEASKF
        +I CPC +C N   K R+DV  DLL  GI+  Y  WT HGE+      GE  +D  EE  D+ E+L+D FG+ N +   E  ESS+    +EPN EA KF
Subjt:  EISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTQWTMHGEESLSYGLGENDNDSGEE--DIFEILEDHFGVFNTNNWTEKGESSKHGYNEEPNEEASKF

Query:  YRLLNDAEKELYPG
        Y+LL DAE ELYPG
Subjt:  YRLLNDAEKELYPG

A0A6P5MAM6 uncharacterized protein LOC1074582643.7e-4035.32Show/hide
Query:  HGYNEEPNEEASKFYRLLNDAEKELYPGFGHPMRPGKQSMRLTTKEWRQAHLYILQNCDDVLPYIGEHIQALQHVDARNAQRKHDKEFIEWFESH-----
        HGY  E  ++    ++             G P R  K + RL+ +E +Q+HLYIL+NCD V P+I +H + L+  + RN Q+ HD+EF  WFESH     
Subjt:  HGYNEEPNEEASKFYRLLNDAEKELYPGFGHPMRPGKQSMRLTTKEWRQAHLYILQNCDDVLPYIGEHIQALQHVDARNAQRKHDKEFIEWFESH-----

Query:  --------------------------------------DVDDLRKTQNSGVVVRGTDNREYFGVVHEIIELQYMVGNNVVLFKCKWWDINSRSKGIRVDD
                                              D ++ +KTQ+SGV+V     +E++GV+ +I+EL YM  N VV+FKC WWD+++   G++VD+
Subjt:  --------------------------------------DVDDLRKTQNSGVVVRGTDNREYFGVVHEIIELQYMVGNNVVLFKCKWWDINSRSKGIRVDD

Query:  HGLTSVNMTYTFATNEPFVLACQSEQVIYLEDRRNPTWYFVLKVDPRDYYKIPSELKSYASQVIDGTLD
        +G+T VN  +T    E FV+A Q EQV Y+ED  N  W  V+KV PRDY+ +P E +    +V D   D
Subjt:  HGLTSVNMTYTFATNEPFVLACQSEQVIYLEDRRNPTWYFVLKVDPRDYYKIPSELKSYASQVIDGTLD

A0A6V7QH08 Uncharacterized protein2.8e-5935.36Show/hide
Query:  ISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTQWTMHGEESLSYGLGENDNDSGEEDIFEILEDHFGVFNTNNWTEKGESSKHGYNEEPNEEASKFYRL
        I CPCK+CNN + K R++VE D+++ G+V  YT+W +HGEE  +  +G +D DS  E + +ILE HFG  N   W  +   +    +EEPNE A+KF++L
Subjt:  ISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTQWTMHGEESLSYGLGENDNDSGEEDIFEILEDHFGVFNTNNWTEKGESSKHGYNEEPNEEASKFYRL

Query:  LNDAEKELYPGFGHPMRPGKQSMRLTTKEWRQAHLYILQNCDDVLPYIGEHIQALQHVDARNAQRKHDKEFIEWFES-----------------------
        L D  ++L      P+  GK+  +LT +EW  A L++L+N ++V P++ E    +     +N + K DK+F EWFE+                       
Subjt:  LNDAEKELYPGFGHPMRPGKQSMRLTTKEWRQAHLYILQNCDDVLPYIGEHIQALQHVDARNAQRKHDKEFIEWFES-----------------------

Query:  ----------------------HDVDDLRKTQNSGVVVRGTDNR---EYFGVVHEIIELQYMVGNNVVLFKCKWWDINSRSKGIRVDDHGLTSVNMTYT-
                              H++   +K+QNSGVVV+G       +++G++  I+EL+YM  + VVLFKC WWD+++ ++GI+ D++G  ++N + T 
Subjt:  ----------------------HDVDDLRKTQNSGVVVRGTDNR---EYFGVVHEIIELQYMVGNNVVLFKCKWWDINSRSKGIRVDDHGLTSVNMTYT-

Query:  -FATNEPFVLACQSEQVIYLEDRRNPTWYFVLKVDPRDYYKIPSE
         +  ++PFVLAC SEQV Y+ D R   W+ V++ +PRD+Y +P E
Subjt:  -FATNEPFVLACQSEQVIYLEDRRNPTWYFVLKVDPRDYYKIPSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGTTGATAGCAGAGAATGACTTGTTCGTCGATACTTGGATTGATAGTGTGAATATTTTGCCTCCGGAAACTGTTTTAGATCAAGGCGACCTCCACACTTCTGACTC
TCATACCTGTACAGTTGGTGCATTTAGTCAATTTGCTAGTGGAATTGGAGAAAATGATAATGATTTTGGTGAAGAAGATATCTTTGAAATATTAGAGGATCACTTTGGTG
TTCTTAACACCAATAATTGGACCAAGAAAGGAGGATCAAGTAAACATGGTTATAATGAAGAACCAAATGAGGAAGCTTCTAAGTTTTATAGATTGTTAAATGATGTAGAA
AATGAACTTTATCCTAGGTGGCCAAGAAAAGTGCGTGATCCCACACGCAATCTTAAACTAGTTAGATTAACTCATGGGATTAGATTTGAAGTATCATGGAGGAACAAAAG
ACCTGTTGGAGATAATGCTGATATTTTCAAAAGCCAATGCACTATTTTGACTCGACAAAAGCGATTTAAAGTTGAAGGTCATGAGGCTTCTATTTTACGTCAACTTAATC
GATCTTATAATAATTGGAGAGATAGCTTGAAGAAAAAATGGTTGTACAAATATGATACAGTTGCAGAAGCTTTGGCTAATATTCCACCAAAAATTACAAGAGAATATGCA
GAATATCTTGCAAACTTGTGGAAGTCAACTGAATATCAGGAGATGCGTGAAAGGAATAAAGTGGACCGAGAACTAGAGATTTCTTGTCCATGTAAGAGATGCAATAATGC
AATACTTAAAACTCGAGATGATGTTGAAGCAGATTTGTTAATGTTTGGTATAGTTCCAAGTTACACTCAATGGACAATGCATGGTGAAGAAAGTTTATCATATGGATTAG
GAGAAAATGATAATGACTCTGGTGAAGAAGATATTTTTGAAATATTAGAGGATCACTTTGGTGTTTTTAACACCAATAATTGGACCGAGAAAGGAGAATCAAGTAAACAT
GGTTATAATGAAGAACCAAATGAGGAAGCTTCTAAGTTTTATAGATTATTAAATGATGCAGAAAAAGAACTTTATCCTGGGTTTGGTCATCCTATGCGACCAGGTAAACA
ATCAATGAGGCTCACGACAAAAGAGTGGAGACAAGCACATCTTTATATTTTACAGAATTGTGATGATGTCCTACCGTATATTGGTGAACACATACAAGCATTACAACATG
TTGATGCCAGAAATGCACAGAGAAAGCACGATAAAGAGTTTATCGAATGGTTTGAAAGTCATGATGTAGATGATTTAAGAAAAACACAAAATAGTGGAGTAGTAGTGAGA
GGAACTGACAATCGAGAGTACTTTGGTGTGGTGCATGAAATTATTGAATTGCAGTATATGGTAGGAAATAATGTTGTTTTGTTTAAATGTAAATGGTGGGATATCAATAG
TCGTAGTAAAGGAATTAGAGTTGATGATCATGGACTGACTAGTGTGAATATGACCTACACATTTGCTACAAATGAGCCATTTGTATTGGCATGTCAATCTGAACAAGTTA
TTTACCTTGAAGATAGAAGAAATCCAACTTGGTATTTTGTGTTGAAGGTTGATCCAAGAGATTATTACAAGATCCCTTCGGAACTCAAATCATATGCCTCTCAAGTGATT
GATGGTACACTTGACATGAACAACGACGAGATTTTTGTGCAAGTCTTTGGACCAGAGAAACATGGGCGTGTTCGAGGTTATGGAGCCGGTGTTACTCCTTCTGAGTTGTT
TGGATCATCTTCCAAAGTCCGTGATCTTGAGCGACGCCTTAACGAGTCAGAACAACGTCTTCAAGAATCTGAACGGCAAAGAAAAATTGAGTGGACGAGATTTTTGTGCA
AGTCTTTGGACCATAGAAACATGGGCGTATTCGAGGTTATGGAACCGGTGTTACTCCTTCTGAGTTGTTTGGATCATCTTCCAAAGTTCGTGATCTTGAGCGACGCCTTA
AGTAGTCAGAACAACGTCTTCAAGAATCTGAACGACAAAGAAAAGTTGAGGGGTGGGGTAGTAGGATACGACCTACCCAAAAGGGTAACTGTGCGGATTGTCAAGGGCTT
TGTTGCCTTGGGAGGTGTTTTTGGCGTATTAGGCAATGCCTTGGGTCTGCGGTTGGGGCTAGTGTCCCTTGGGGAGTGGACTGAGACATTTCTATCTTCAATAAGTTGGG
TCTTCTTCATAATTCATTCGAGGCACACAGGTTCCCATAGTAGGAGTTCCACTCTAATATTTGAGTCCAGCCCATTGATGAATGTGCTCTCCACAATGTCGTCGGCTATA
TTGGGTAATGACGTTGACCATGCTTCAAACGTCTGGCAGAACTCAGTAACCGTACCTTCCTATCGAATGGACAGAAAGCGGGAAAGGAGGTTTCTATCCCTGGACTCGCT
AAACCGCTCAAAAATCCTAGTTTTTAGGTCGTTCCAATCTACAAACTGCTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGGTTGATAGCAGAGAATGACTTGTTCGTCGATACTTGGATTGATAGTGTGAATATTTTGCCTCCGGAAACTGTTTTAGATCAAGGCGACCTCCACACTTCTGACTC
TCATACCTGTACAGTTGGTGCATTTAGTCAATTTGCTAGTGGAATTGGAGAAAATGATAATGATTTTGGTGAAGAAGATATCTTTGAAATATTAGAGGATCACTTTGGTG
TTCTTAACACCAATAATTGGACCAAGAAAGGAGGATCAAGTAAACATGGTTATAATGAAGAACCAAATGAGGAAGCTTCTAAGTTTTATAGATTGTTAAATGATGTAGAA
AATGAACTTTATCCTAGGTGGCCAAGAAAAGTGCGTGATCCCACACGCAATCTTAAACTAGTTAGATTAACTCATGGGATTAGATTTGAAGTATCATGGAGGAACAAAAG
ACCTGTTGGAGATAATGCTGATATTTTCAAAAGCCAATGCACTATTTTGACTCGACAAAAGCGATTTAAAGTTGAAGGTCATGAGGCTTCTATTTTACGTCAACTTAATC
GATCTTATAATAATTGGAGAGATAGCTTGAAGAAAAAATGGTTGTACAAATATGATACAGTTGCAGAAGCTTTGGCTAATATTCCACCAAAAATTACAAGAGAATATGCA
GAATATCTTGCAAACTTGTGGAAGTCAACTGAATATCAGGAGATGCGTGAAAGGAATAAAGTGGACCGAGAACTAGAGATTTCTTGTCCATGTAAGAGATGCAATAATGC
AATACTTAAAACTCGAGATGATGTTGAAGCAGATTTGTTAATGTTTGGTATAGTTCCAAGTTACACTCAATGGACAATGCATGGTGAAGAAAGTTTATCATATGGATTAG
GAGAAAATGATAATGACTCTGGTGAAGAAGATATTTTTGAAATATTAGAGGATCACTTTGGTGTTTTTAACACCAATAATTGGACCGAGAAAGGAGAATCAAGTAAACAT
GGTTATAATGAAGAACCAAATGAGGAAGCTTCTAAGTTTTATAGATTATTAAATGATGCAGAAAAAGAACTTTATCCTGGGTTTGGTCATCCTATGCGACCAGGTAAACA
ATCAATGAGGCTCACGACAAAAGAGTGGAGACAAGCACATCTTTATATTTTACAGAATTGTGATGATGTCCTACCGTATATTGGTGAACACATACAAGCATTACAACATG
TTGATGCCAGAAATGCACAGAGAAAGCACGATAAAGAGTTTATCGAATGGTTTGAAAGTCATGATGTAGATGATTTAAGAAAAACACAAAATAGTGGAGTAGTAGTGAGA
GGAACTGACAATCGAGAGTACTTTGGTGTGGTGCATGAAATTATTGAATTGCAGTATATGGTAGGAAATAATGTTGTTTTGTTTAAATGTAAATGGTGGGATATCAATAG
TCGTAGTAAAGGAATTAGAGTTGATGATCATGGACTGACTAGTGTGAATATGACCTACACATTTGCTACAAATGAGCCATTTGTATTGGCATGTCAATCTGAACAAGTTA
TTTACCTTGAAGATAGAAGAAATCCAACTTGGTATTTTGTGTTGAAGGTTGATCCAAGAGATTATTACAAGATCCCTTCGGAACTCAAATCATATGCCTCTCAAGTGATT
GATGGTACACTTGACATGAACAACGACGAGATTTTTGTGCAAGTCTTTGGACCAGAGAAACATGGGCGTGTTCGAGGTTATGGAGCCGGTGTTACTCCTTCTGAGTTGTT
TGGATCATCTTCCAAAGTCCGTGATCTTGAGCGACGCCTTAACGAGTCAGAACAACGTCTTCAAGAATCTGAACGGCAAAGAAAAATTGAGTGGACGAGATTTTTGTGCA
AGTCTTTGGACCATAGAAACATGGGCGTATTCGAGGTTATGGAACCGGTGTTACTCCTTCTGAGTTGTTTGGATCATCTTCCAAAGTTCGTGATCTTGAGCGACGCCTTA
AGTAGTCAGAACAACGTCTTCAAGAATCTGAACGACAAAGAAAAGTTGAGGGGTGGGGTAGTAGGATACGACCTACCCAAAAGGGTAACTGTGCGGATTGTCAAGGGCTT
TGTTGCCTTGGGAGGTGTTTTTGGCGTATTAGGCAATGCCTTGGGTCTGCGGTTGGGGCTAGTGTCCCTTGGGGAGTGGACTGAGACATTTCTATCTTCAATAAGTTGGG
TCTTCTTCATAATTCATTCGAGGCACACAGGTTCCCATAGTAGGAGTTCCACTCTAATATTTGAGTCCAGCCCATTGATGAATGTGCTCTCCACAATGTCGTCGGCTATA
TTGGGTAATGACGTTGACCATGCTTCAAACGTCTGGCAGAACTCAGTAACCGTACCTTCCTATCGAATGGACAGAAAGCGGGAAAGGAGGTTTCTATCCCTGGACTCGCT
AAACCGCTCAAAAATCCTAGTTTTTAGGTCGTTCCAATCTACAAACTGCTGCTGA
Protein sequenceShow/hide protein sequence
MRLIAENDLFVDTWIDSVNILPPETVLDQGDLHTSDSHTCTVGAFSQFASGIGENDNDFGEEDIFEILEDHFGVLNTNNWTKKGGSSKHGYNEEPNEEASKFYRLLNDVE
NELYPRWPRKVRDPTRNLKLVRLTHGIRFEVSWRNKRPVGDNADIFKSQCTILTRQKRFKVEGHEASILRQLNRSYNNWRDSLKKKWLYKYDTVAEALANIPPKITREYA
EYLANLWKSTEYQEMRERNKVDRELEISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTQWTMHGEESLSYGLGENDNDSGEEDIFEILEDHFGVFNTNNWTEKGESSKH
GYNEEPNEEASKFYRLLNDAEKELYPGFGHPMRPGKQSMRLTTKEWRQAHLYILQNCDDVLPYIGEHIQALQHVDARNAQRKHDKEFIEWFESHDVDDLRKTQNSGVVVR
GTDNREYFGVVHEIIELQYMVGNNVVLFKCKWWDINSRSKGIRVDDHGLTSVNMTYTFATNEPFVLACQSEQVIYLEDRRNPTWYFVLKVDPRDYYKIPSELKSYASQVI
DGTLDMNNDEIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSKVRDLERRLNESEQRLQESERQRKIEWTRFLCKSLDHRNMGVFEVMEPVLLLLSCLDHLPKFVILSDAL
SSQNNVFKNLNDKEKLRGGVVGYDLPKRVTVRIVKGFVALGGVFGVLGNALGLRLGLVSLGEWTETFLSSISWVFFIIHSRHTGSHSRSSTLIFESSPLMNVLSTMSSAI
LGNDVDHASNVWQNSVTVPSYRMDRKRERRFLSLDSLNRSKILVFRSFQSTNCC