; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g09970 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g09970
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Genome locationchr9:8446010..8449464
RNA-Seq ExpressionMoc09g09970
SyntenyMoc09g09970
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058889.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa]6.6e-4040.34Show/hide
Query:  VEQERLRLDLRNQGNANRGGQL---QERPRNRSPTPYRARNMEGGGIRTDFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQS
        VEQ  + L   N  +++    L   QE  R  S  PY   + E         MK+D+PTFNG+M+VE FLDW++NVENFFDY    E KKV+LVA +LQ 
Subjt:  VEQERLRLDLRNQGNANRGGQL---QERPRNRSPTPYRARNMEGGGIRTDFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQS

Query:  GASAWWEQLEINRQHIGKRPIHSWPR----------------------------IKSIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQ
        GASAWW+QL+ NR+  GK+P+ SWP+                            ++SI+DYTEEF+RLGA +NL E E  +I+R + GL+E+I+D +N  
Subjt:  GASAWWEQLEINRQHIGKRPIHSWPR----------------------------IKSIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQ

Query:  PIEPIGLLTEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSKTSGMDTNKSFTLDGPSTST---TTKTDDAPKALTLPKPTDSTTKKVD
         +  +  L +AIS+A KIE   E K+ K   R+  WDKQ  + T+     ++F   G S+ST   T K ++ PK  T      +T KKVD
Subjt:  PIEPIGLLTEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSKTSGMDTNKSFTLDGPSTST---TTKTDDAPKALTLPKPTDSTTKKVD

XP_022138327.1 uncharacterized protein LOC111009540 isoform X1 [Momordica charantia]3.6e-8647.4Show/hide
Query:  GKNTEISRKGLAGDPSAEQVLSLKTTTNRLLLVEDSLGEVRANVRSIHELLENVGRQLELITVEQERLRLDLRNQGNAN--------------------R
        GK+T  + KG A D S  Q+LS K+TTNRLLL+EDSLGEVR+NV+ IH LLEN+GRQLELI  EQERL+ DL++   AN                    R
Subjt:  GKNTEISRKGLAGDPSAEQVLSLKTTTNRLLLVEDSLGEVRANVRSIHELLENVGRQLELITVEQERLRLDLRNQGNAN--------------------R

Query:  GGQLQERPRNRSPT----------PYRARN---MEGGGIRTDFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQSGASAWWEQ
          +LQER ++   +          P  AR    +E GG+RTDF MKID+PTFNGKM+VE FLD V+NVENFFDYT TPEDKKVKLVAF++QSGASAWW+Q
Subjt:  GGQLQERPRNRSPT----------PYRARN---MEGGGIRTDFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQSGASAWWEQ

Query:  LEINRQHIGKRPIHSWPRI----------------------------KSIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQPIEPIGLL
        LEIN + +GK+PI SWPR+                            K+I DYTE FHRLGA TN+AE EDYKIARF+DGLREDIQDQM+   I+PI LL
Subjt:  LEINRQHIGKRPIHSWPRI----------------------------KSIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQPIEPIGLL

Query:  TEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSKTSGMDTNKSFTLDGPSTSTTTKTDDAPKALTLPKPTDSTTKK-----------------------
        T+AI MA KIE   +KKR++   RR  WDK  +SKT+  DT K   +   S STT   DD  K+     P D+++K+                       
Subjt:  TEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSKTSGMDTNKSFTLDGPSTSTTTKTDDAPKALTLPKPTDSTTKK-----------------------

Query:  -----------VDEDDQDDQEPELLDEEDTAYIEPDDGEPINC
                   VD+DD  + + +L  E+D  Y+EPD+G+ ++C
Subjt:  -----------VDEDDQDDQEPELLDEEDTAYIEPDDGEPINC

XP_022138328.1 uncharacterized protein LOC111009540 isoform X2 [Momordica charantia]3.6e-8647.4Show/hide
Query:  GKNTEISRKGLAGDPSAEQVLSLKTTTNRLLLVEDSLGEVRANVRSIHELLENVGRQLELITVEQERLRLDLRNQGNAN--------------------R
        GK+T  + KG A D S  Q+LS K+TTNRLLL+EDSLGEVR+NV+ IH LLEN+GRQLELI  EQERL+ DL++   AN                    R
Subjt:  GKNTEISRKGLAGDPSAEQVLSLKTTTNRLLLVEDSLGEVRANVRSIHELLENVGRQLELITVEQERLRLDLRNQGNAN--------------------R

Query:  GGQLQERPRNRSPT----------PYRARN---MEGGGIRTDFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQSGASAWWEQ
          +LQER ++   +          P  AR    +E GG+RTDF MKID+PTFNGKM+VE FLD V+NVENFFDYT TPEDKKVKLVAF++QSGASAWW+Q
Subjt:  GGQLQERPRNRSPT----------PYRARN---MEGGGIRTDFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQSGASAWWEQ

Query:  LEINRQHIGKRPIHSWPRI----------------------------KSIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQPIEPIGLL
        LEIN + +GK+PI SWPR+                            K+I DYTE FHRLGA TN+AE EDYKIARF+DGLREDIQDQM+   I+PI LL
Subjt:  LEINRQHIGKRPIHSWPRI----------------------------KSIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQPIEPIGLL

Query:  TEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSKTSGMDTNKSFTLDGPSTSTTTKTDDAPKALTLPKPTDSTTKK-----------------------
        T+AI MA KIE   +KKR++   RR  WDK  +SKT+  DT K   +   S STT   DD  K+     P D+++K+                       
Subjt:  TEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSKTSGMDTNKSFTLDGPSTSTTTKTDDAPKALTLPKPTDSTTKK-----------------------

Query:  -----------VDEDDQDDQEPELLDEEDTAYIEPDDGEPINC
                   VD+DD  + + +L  E+D  Y+EPD+G+ ++C
Subjt:  -----------VDEDDQDDQEPELLDEEDTAYIEPDDGEPINC

XP_031741035.1 uncharacterized protein LOC116403692 [Cucumis sativus]2.0e-4435.87Show/hide
Query:  DFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQSGASAWWEQLEINRQHIGKRPIHSWPRIK---------------------
        D+ MKID+P ++GK N+EAFLDW+++ ENFF+Y  TPE KKV LVA +L++GASAWW+QLEINRQ  GK+P+ SW ++K                     
Subjt:  DFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQSGASAWWEQLEINRQHIGKRPIHSWPRIK---------------------

Query:  -------SIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQPIEPIGLLTEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSKTSGMDT
               S+ +Y EEFHRL A TNL+ENE +++ARF+ GLR DI++++  QP      L+EAIS A  +E ++   R K L RR AW+          ++
Subjt:  -------SIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQPIEPIGLLTEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSKTSGMDT

Query:  NKSFTLDGPSTSTTTKTDDAP-KALTLPKPTDSTTK----------------------------------KVDEDDQDDQEPELLDEEDTAYIEPDDGEP
         KS T D PSTST  K  +   + + + +  + T K                                   + E+     E  +  EE+T  IE DDGE 
Subjt:  NKSFTLDGPSTSTTTKTDDAP-KALTLPKPTDSTTK----------------------------------KVDEDDQDDQEPELLDEEDTAYIEPDDGEP

Query:  INC--------PSKSSPCSK-------------------DSGSCENMVPQRLVTALNLKVDPHPNHYK
        ++C        P +     +                   DSGS EN V ++LVT LNLK + HPN YK
Subjt:  INC--------PSKSSPCSK-------------------DSGSCENMVPQRLVTALNLKVDPHPNHYK

XP_031743026.1 uncharacterized protein LOC116404533 [Cucumis sativus]4.6e-4134.99Show/hide
Query:  DFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQSGASAWWEQLEINRQHIGKRPIHSWPRIK---------------------
        D+ MKID+P + GK N+EAFLDW+++ ENFF Y  TPE KKV LVA +L++GASAWW+QLEINRQ  GK+P+ SW ++K                     
Subjt:  DFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQSGASAWWEQLEINRQHIGKRPIHSWPRIK---------------------

Query:  -------SIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQPIEPIGLLTEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSKTSGMDT
               ++ +Y EEFHRL A TNL+ENE +++ARF+ GLR DI++++  QP      L+EAIS A  +E ++   R K L RR AW+          ++
Subjt:  -------SIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQPIEPIGLLTEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSKTSGMDT

Query:  NKSFTLDGPSTSTTTKTDDAP-KALTLPKPTDSTTK----------------------------------KVDEDDQDDQEPELLDEEDTAYIEPDDGEP
         KS T D PSTST  K  +   + + + +  + T K                                   + E+     E  +  EE+T  IE DDGE 
Subjt:  NKSFTLDGPSTSTTTKTDDAP-KALTLPKPTDSTTK----------------------------------KVDEDDQDDQEPELLDEEDTAYIEPDDGEP

Query:  INC--------PSKSSPCSK-------------------DSGSCENMVPQRLVTALNLKVDPH
        ++C        P +     +                   DSGS EN V ++LVT LNLK + H
Subjt:  INC--------PSKSSPCSK-------------------DSGSCENMVPQRLVTALNLKVDPH

TrEMBL top hitse value%identityAlignment
A0A5A7U7S6 E3 ubiquitin-protein ligase SHPRH isoform X45.3e-3545.09Show/hide
Query:  IRTDFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQSGASAWWEQLEINRQHIGKRPIHSWPRIK------------------
        +  D+ MKID+PT+NGK + ++FLDWV++ ENFF+Y  TP+ KKV LV  +L+ GASAWW+QLEINRQ  GK PI  W ++K                  
Subjt:  IRTDFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQSGASAWWEQLEINRQHIGKRPIHSWPRIK------------------

Query:  ----------SIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQPIEPIGLLTEAISMAPKIE
                  S+ +Y EEFHRL   TNL+ENE ++IARF+ GLR DI++++  QP      L+EAIS+A  ++
Subjt:  ----------SIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQPIEPIGLLTEAISMAPKIE

A0A5D3DGR0 Reverse transcriptase2.6e-3431.94Show/hide
Query:  TDFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQSGASAWWEQLEINRQHIGKRPIHSWPRIKSIV-----------------
        +++ MKID+P+++GK N+E FLDW++N ENFF Y  T ++KKV LVA +L+ GASAWW+Q+ +NRQ  GK PI SW ++K ++                 
Subjt:  TDFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQSGASAWWEQLEINRQHIGKRPIHSWPRIKSIV-----------------

Query:  -----------DYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQPIEPIGLLTEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSK-TSGM
                   +Y EEFHRLG  TNL E E + I+ F+ GLR D+++++  QP +    L+EAI+ A  +E  +E +      R+  W+     K T+G 
Subjt:  -----------DYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQPIEPIGLLTEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSK-TSGM

Query:  DTNKSFTLDGP----STSTTTKTDDAPKALTLP----------------KPTDSTTKK----VDEDDQDDQEPELLD-EEDTAYIEPDDGEPINC-----
           K+ T + P     +S   +  +  K    P                 P++   ++    V +D+ D     L + +E+T  IE D+G+ ++C     
Subjt:  DTNKSFTLDGP----STSTTTKTDDAPKALTLP----------------KPTDSTTKK----VDEDDQDDQEPELLD-EEDTAYIEPDDGEPINC-----

Query:  ---PSKSSPCSK-------------------DSGSCENMVPQRLVTALNLKVDPHPNHYK
           P + +   +                   DSGS EN V ++LVTALNLK  PH   YK
Subjt:  ---PSKSSPCSK-------------------DSGSCENMVPQRLVTALNLKVDPHPNHYK

A0A5D3DJC1 Transposon Ty3-G Gag-Pol polyprotein3.2e-4040.34Show/hide
Query:  VEQERLRLDLRNQGNANRGGQL---QERPRNRSPTPYRARNMEGGGIRTDFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQS
        VEQ  + L   N  +++    L   QE  R  S  PY   + E         MK+D+PTFNG+M+VE FLDW++NVENFFDY    E KKV+LVA +LQ 
Subjt:  VEQERLRLDLRNQGNANRGGQL---QERPRNRSPTPYRARNMEGGGIRTDFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQS

Query:  GASAWWEQLEINRQHIGKRPIHSWPR----------------------------IKSIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQ
        GASAWW+QL+ NR+  GK+P+ SWP+                            ++SI+DYTEEF+RLGA +NL E E  +I+R + GL+E+I+D +N  
Subjt:  GASAWWEQLEINRQHIGKRPIHSWPR----------------------------IKSIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQ

Query:  PIEPIGLLTEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSKTSGMDTNKSFTLDGPSTST---TTKTDDAPKALTLPKPTDSTTKKVD
         +  +  L +AIS+A KIE   E K+ K   R+  WDKQ  + T+     ++F   G S+ST   T K ++ PK  T      +T KKVD
Subjt:  PIEPIGLLTEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSKTSGMDTNKSFTLDGPSTST---TTKTDDAPKALTLPKPTDSTTKKVD

A0A6J1CAS9 uncharacterized protein LOC111009540 isoform X11.7e-8647.4Show/hide
Query:  GKNTEISRKGLAGDPSAEQVLSLKTTTNRLLLVEDSLGEVRANVRSIHELLENVGRQLELITVEQERLRLDLRNQGNAN--------------------R
        GK+T  + KG A D S  Q+LS K+TTNRLLL+EDSLGEVR+NV+ IH LLEN+GRQLELI  EQERL+ DL++   AN                    R
Subjt:  GKNTEISRKGLAGDPSAEQVLSLKTTTNRLLLVEDSLGEVRANVRSIHELLENVGRQLELITVEQERLRLDLRNQGNAN--------------------R

Query:  GGQLQERPRNRSPT----------PYRARN---MEGGGIRTDFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQSGASAWWEQ
          +LQER ++   +          P  AR    +E GG+RTDF MKID+PTFNGKM+VE FLD V+NVENFFDYT TPEDKKVKLVAF++QSGASAWW+Q
Subjt:  GGQLQERPRNRSPT----------PYRARN---MEGGGIRTDFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQSGASAWWEQ

Query:  LEINRQHIGKRPIHSWPRI----------------------------KSIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQPIEPIGLL
        LEIN + +GK+PI SWPR+                            K+I DYTE FHRLGA TN+AE EDYKIARF+DGLREDIQDQM+   I+PI LL
Subjt:  LEINRQHIGKRPIHSWPRI----------------------------KSIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQPIEPIGLL

Query:  TEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSKTSGMDTNKSFTLDGPSTSTTTKTDDAPKALTLPKPTDSTTKK-----------------------
        T+AI MA KIE   +KKR++   RR  WDK  +SKT+  DT K   +   S STT   DD  K+     P D+++K+                       
Subjt:  TEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSKTSGMDTNKSFTLDGPSTSTTTKTDDAPKALTLPKPTDSTTKK-----------------------

Query:  -----------VDEDDQDDQEPELLDEEDTAYIEPDDGEPINC
                   VD+DD  + + +L  E+D  Y+EPD+G+ ++C
Subjt:  -----------VDEDDQDDQEPELLDEEDTAYIEPDDGEPINC

A0A6J1CCQ8 uncharacterized protein LOC111009540 isoform X21.7e-8647.4Show/hide
Query:  GKNTEISRKGLAGDPSAEQVLSLKTTTNRLLLVEDSLGEVRANVRSIHELLENVGRQLELITVEQERLRLDLRNQGNAN--------------------R
        GK+T  + KG A D S  Q+LS K+TTNRLLL+EDSLGEVR+NV+ IH LLEN+GRQLELI  EQERL+ DL++   AN                    R
Subjt:  GKNTEISRKGLAGDPSAEQVLSLKTTTNRLLLVEDSLGEVRANVRSIHELLENVGRQLELITVEQERLRLDLRNQGNAN--------------------R

Query:  GGQLQERPRNRSPT----------PYRARN---MEGGGIRTDFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQSGASAWWEQ
          +LQER ++   +          P  AR    +E GG+RTDF MKID+PTFNGKM+VE FLD V+NVENFFDYT TPEDKKVKLVAF++QSGASAWW+Q
Subjt:  GGQLQERPRNRSPT----------PYRARN---MEGGGIRTDFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQSGASAWWEQ

Query:  LEINRQHIGKRPIHSWPRI----------------------------KSIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQPIEPIGLL
        LEIN + +GK+PI SWPR+                            K+I DYTE FHRLGA TN+AE EDYKIARF+DGLREDIQDQM+   I+PI LL
Subjt:  LEINRQHIGKRPIHSWPRI----------------------------KSIVDYTEEFHRLGACTNLAENEDYKIARFLDGLREDIQDQMNTQPIEPIGLL

Query:  TEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSKTSGMDTNKSFTLDGPSTSTTTKTDDAPKALTLPKPTDSTTKK-----------------------
        T+AI MA KIE   +KKR++   RR  WDK  +SKT+  DT K   +   S STT   DD  K+     P D+++K+                       
Subjt:  TEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSKTSGMDTNKSFTLDGPSTSTTTKTDDAPKALTLPKPTDSTTKK-----------------------

Query:  -----------VDEDDQDDQEPELLDEEDTAYIEPDDGEPINC
                   VD+DD  + + +L  E+D  Y+EPD+G+ ++C
Subjt:  -----------VDEDDQDDQEPELLDEEDTAYIEPDDGEPINC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGAACACAGAGATTTCCAGGAAGGGACTCGCGGGAGATCCTTCCGCAGAGCAAGTTTTATCTCTCAAGACAACCACCAATCGCTTGCTGTTGGTGGAA
GATTCATTGGGAGAAGTAAGGGCTAACGTGCGGAGCATCCATGAACTTCTGGAAAATGTCGGACGACAACTGGAATTGATCACCGTGGAGCAAGAACGTCTGCGG
TTAGACTTAAGGAATCAGGGTAACGCAAATCGGGGAGGACAACTCCAAGAAAGGCCAAGAAACAGATCACCAACTCCTTATCGTGCAAGAAATATGGAGGGGGGT
GGTATCCGAACTGATTTCCATATGAAAATTGACATGCCTACGTTCAATGGTAAGATGAATGTAGAGGCATTCTTGGACTGGGTCGAGAATGTTGAGAATTTTTTT
GATTACACGAGAACACCAGAGGACAAGAAGGTTAAGCTGGTTGCTTTTCGATTGCAATCGGGGGCGTCAGCTTGGTGGGAGCAACTTGAGATCAACCGCCAGCAC
ATTGGAAAGCGCCCCATTCATAGCTGGCCTCGAATTAAATCCATCGTTGACTATACGGAGGAGTTCCATCGTTTGGGCGCTTGCACAAACTTGGCTGAAAATGAA
GATTATAAGATTGCACGCTTTCTTGATGGTTTGCGCGAGGATATCCAAGATCAGATGAACACCCAACCAATTGAACCAATTGGATTACTCACCGAGGCTATTTCC
ATGGCCCCAAAAATTGAAGCAAAGCTGGAAAAGAAACGTGTAAAACCACTTTTTCGACGAGTTGCTTGGGACAAACAGCCTCTCAGTAAGACATCAGGTATGGAT
ACTAACAAATCTTTTACTTTGGATGGTCCTTCAACCTCTACTACTACAAAGACGGATGATGCCCCTAAAGCTTTGACTCTTCCCAAACCTACTGATTCTACAACC
AAAAAAGTCGATGAAGACGATCAAGATGATCAAGAACCCGAACTTCTTGATGAAGAAGATACTGCCTATATTGAACCAGATGATGGCGAACCTATTAATTGTCCT
TCAAAAAGTTCTCCTTGCTCCAAAGACAGTGGAAGTTGTGAGAATATGGTCCCCCAAAGGTTGGTGACCGCTTTAAATCTCAAGGTCGATCCCCATCCTAACCAC
TATAAGGCGCAAGGCACACCTCAAGGCGATAAGCCTCCTTGTCGCCTTGAGGCGAGGGGCGACAAAAAGGCGTTGCCCGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAGAACACAGAGATTTCCAGGAAGGGACTCGCGGGAGATCCTTCCGCAGAGCAAGTTTTATCTCTCAAGACAACCACCAATCGCTTGCTGTTGGTGGAA
GATTCATTGGGAGAAGTAAGGGCTAACGTGCGGAGCATCCATGAACTTCTGGAAAATGTCGGACGACAACTGGAATTGATCACCGTGGAGCAAGAACGTCTGCGG
TTAGACTTAAGGAATCAGGGTAACGCAAATCGGGGAGGACAACTCCAAGAAAGGCCAAGAAACAGATCACCAACTCCTTATCGTGCAAGAAATATGGAGGGGGGT
GGTATCCGAACTGATTTCCATATGAAAATTGACATGCCTACGTTCAATGGTAAGATGAATGTAGAGGCATTCTTGGACTGGGTCGAGAATGTTGAGAATTTTTTT
GATTACACGAGAACACCAGAGGACAAGAAGGTTAAGCTGGTTGCTTTTCGATTGCAATCGGGGGCGTCAGCTTGGTGGGAGCAACTTGAGATCAACCGCCAGCAC
ATTGGAAAGCGCCCCATTCATAGCTGGCCTCGAATTAAATCCATCGTTGACTATACGGAGGAGTTCCATCGTTTGGGCGCTTGCACAAACTTGGCTGAAAATGAA
GATTATAAGATTGCACGCTTTCTTGATGGTTTGCGCGAGGATATCCAAGATCAGATGAACACCCAACCAATTGAACCAATTGGATTACTCACCGAGGCTATTTCC
ATGGCCCCAAAAATTGAAGCAAAGCTGGAAAAGAAACGTGTAAAACCACTTTTTCGACGAGTTGCTTGGGACAAACAGCCTCTCAGTAAGACATCAGGTATGGAT
ACTAACAAATCTTTTACTTTGGATGGTCCTTCAACCTCTACTACTACAAAGACGGATGATGCCCCTAAAGCTTTGACTCTTCCCAAACCTACTGATTCTACAACC
AAAAAAGTCGATGAAGACGATCAAGATGATCAAGAACCCGAACTTCTTGATGAAGAAGATACTGCCTATATTGAACCAGATGATGGCGAACCTATTAATTGTCCT
TCAAAAAGTTCTCCTTGCTCCAAAGACAGTGGAAGTTGTGAGAATATGGTCCCCCAAAGGTTGGTGACCGCTTTAAATCTCAAGGTCGATCCCCATCCTAACCAC
TATAAGGCGCAAGGCACACCTCAAGGCGATAAGCCTCCTTGTCGCCTTGAGGCGAGGGGCGACAAAAAGGCGTTGCCCGAGTGA
Protein sequenceShow/hide protein sequence
MGKNTEISRKGLAGDPSAEQVLSLKTTTNRLLLVEDSLGEVRANVRSIHELLENVGRQLELITVEQERLRLDLRNQGNANRGGQLQERPRNRSPTPYRARNMEGG
GIRTDFHMKIDMPTFNGKMNVEAFLDWVENVENFFDYTRTPEDKKVKLVAFRLQSGASAWWEQLEINRQHIGKRPIHSWPRIKSIVDYTEEFHRLGACTNLAENE
DYKIARFLDGLREDIQDQMNTQPIEPIGLLTEAISMAPKIEAKLEKKRVKPLFRRVAWDKQPLSKTSGMDTNKSFTLDGPSTSTTTKTDDAPKALTLPKPTDSTT
KKVDEDDQDDQEPELLDEEDTAYIEPDDGEPINCPSKSSPCSKDSGSCENMVPQRLVTALNLKVDPHPNHYKAQGTPQGDKPPCRLEARGDKKALPE