; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g16210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g16210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDNA-directed DNA polymerase
Genome locationchr6:12745953..12752895
RNA-Seq ExpressionMoc06g16210
SyntenyMoc06g16210
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia]2.6e-3534.28Show/hide
Query:  GVASSSAEEPDQQYKQNYTPLGFPTQPAF--LPQQYNQQRAQNTTQQGGSNTS-FEAMMKKFMTRSEATTKEFMTRT-----------------DVAIRN
        G  SSS    +QQYKQ YTP  FP  PAF   PQQYNQQ+  N  Q    N S  E +MK+F+T+++AT KE MTRT                 DV +RN
Subjt:  GVASSSAEEPDQQYKQNYTPLGFPTQPAF--LPQQYNQQRAQNTTQQGGSNTS-FEAMMKKFMTRSEATTKEFMTRT-----------------DVAIRN

Query:  LEMQVQQIANDQKSRPQGTLLGHTENPKR---DREGNEHCKAVITRSGLSYE---GPSLPVEGTDVVTPVPTSTFNPQQE--------------------
        LEMQ+ Q+AN+ ++RPQG+L   TE P+R        E    V+    +  E     +  V       P P       Q+                    
Subjt:  LEMQVQQIANDQKSRPQGTLLGHTENPKR---DREGNEHCKAVITRSGLSYE---GPSLPVEGTDVVTPVPTSTFNPQQE--------------------

Query:  EKAESVSTEEK------GKKANKGKQEVPSTT---------------------------------------------------LQLDIGEACPTTVTLQL
        E  E + T  K       +K   G+ E  + T                                                    +L+IG+A PTTVTL L
Subjt:  EKAESVSTEEK------GKKANKGKQEVPSTT---------------------------------------------------LQLDIGEACPTTVTLQL

Query:  ADRSIKKPEGKIEDVLVKVDKFIFPADFIILDCEAD---------------------------LEVNDEQVTFNVFDVVRLPDEVEDCSTI----GAIME
        ADRSI KPEGKIEDVLVKVDKFIFPADFIILDCEAD                           + V+D++VTFN+ D ++ PD+ E+C  I    G    
Subjt:  ADRSIKKPEGKIEDVLVKVDKFIFPADFIILDCEAD---------------------------LEVNDEQVTFNVFDVVRLPDEVEDCSTI----GAIME

Query:  ELQEMIVEDLEVDLEAAEKEAIL
        EL +++  ++E +LE AEKE I+
Subjt:  ELQEMIVEDLEVDLEAAEKEAIL

XP_022143639.1 uncharacterized protein LOC111013500 [Momordica charantia]3.8e-3468.84Show/hide
Query:  QLDIGEACPTTVTLQLADRSIKKPEGKIEDVLVKVDKFIFPADFIILDCEADLE---------------------------VNDEQVTFNVFDVVRLPDE
        +L+IGE  PT+VT QLADRSIKKPE KIE VLVKVDKFIFPADFIILD EADLE                           VNDEQ+TFNV DV RLPDE
Subjt:  QLDIGEACPTTVTLQLADRSIKKPEGKIEDVLVKVDKFIFPADFIILDCEADLE---------------------------VNDEQVTFNVFDVVRLPDE

Query:  VEDCSTIGAIMEELQEMIVEDLEVDLEAAEKEA--ILP
        VEDCS IGAIMEELQEMIVEDLE DLEAAEKEA  ILP
Subjt:  VEDCSTIGAIMEELQEMIVEDLEVDLEAAEKEA--ILP

XP_022157836.1 uncharacterized protein LOC111024449 [Momordica charantia]2.7e-4035.19Show/hide
Query:  QGVASSSAEEPDQQYKQNYTPLGFPTQPAFLPQQYNQQRAQN-------TTQQGGSNTSFEAMMKKFMTRSEATTKEFMTRTDVAI-----------RNL
        QG ++++ +  +QQYK    P      P   PQQ+NQQ+  +       +  +        +MMK+F  R++   +EF TR D AI           RNL
Subjt:  QGVASSSAEEPDQQYKQNYTPLGFPTQPAFLPQQYNQQRAQN-------TTQQGGSNTSFEAMMKKFMTRSEATTKEFMTRTDVAI-----------RNL

Query:  EMQVQQIANDQKSRPQGTLLGHTENPKRDREGNEHCKAVITRSGLSYEGPSLPVEGTDVVTPVPTSTFNPQQEEKAE---------SVSTEEK-------
        E Q+ Q+A++ K+RP+GTL   TE PK   EG EHCK + TRSGL+YE P +P EG+   T    +   P +  + E          + T +K       
Subjt:  EMQVQQIANDQKSRPQGTLLGHTENPKRDREGNEHCKAVITRSGLSYEGPSLPVEGTDVVTPVPTSTFNPQQEEKAE---------SVSTEEK-------

Query:  ----------------GKKANKGKQEVP------------------------STTLQLDIGEACPTTVTLQLADRSIKKPEGKIEDVLVKVDKFIFPADF
                         K  + G   +P                        S   +L+IG+A PTTVTLQLADRSI KPEGKIEDVLVKVDKFIFPADF
Subjt:  ----------------GKKANKGKQEVP------------------------STTLQLDIGEACPTTVTLQLADRSIKKPEGKIEDVLVKVDKFIFPADF

Query:  IILDCEAD---------------------------LEVNDEQVTFNVFDVVRLPDEVEDCSTI----GAIMEELQEMIVEDLEVDLEAAEKEAIL
        IIL+CEAD                           + V+D++VTFN+ D ++ PD++E+C+TI    G    EL +++  ++E  LE AEKE I+
Subjt:  IILDCEAD---------------------------LEVNDEQVTFNVFDVVRLPDEVEDCSTI----GAIMEELQEMIVEDLEVDLEAAEKEAIL

XP_022158314.1 uncharacterized protein LOC111024824 [Momordica charantia]1.0e-3983.78Show/hide
Query:  SNQGVASSSAEEPDQQYKQNYTPLGFPTQPAFLPQQYNQQRAQNTTQQGGSNTSFEAMMKKFMTRSEATTKEFMTRTDVAIRNLEMQVQQIANDQKSRPQ
        SNQGVASSSA+ P QQYKQNYTP  FPTQPA  PQQYNQQRAQNTTQQGGSN S EAM K+FMTRSEATTKEFMTRTD  IR LEMQV QIAND+KSRPQ
Subjt:  SNQGVASSSAEEPDQQYKQNYTPLGFPTQPAFLPQQYNQQRAQNTTQQGGSNTSFEAMMKKFMTRSEATTKEFMTRTDVAIRNLEMQVQQIANDQKSRPQ

Query:  GTLLGHTENPK
        GTL G+TENPK
Subjt:  GTLLGHTENPK

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]2.1e-4033.41Show/hide
Query:  GVASSSAEEPDQQYKQNYTPLGFPTQPAF--LPQQYNQQRAQNTTQQGGSNTS-FEAMMKKFMTRSEATTKEFMTRT-----------------DVAIRN
        G  SS+    +QQYK+ YTP GFP  PAF   P QYNQQ+  N  Q    N S  E +MK+ +T+++AT KE MTRT                 DV +R 
Subjt:  GVASSSAEEPDQQYKQNYTPLGFPTQPAF--LPQQYNQQRAQNTTQQGGSNTS-FEAMMKKFMTRSEATTKEFMTRT-----------------DVAIRN

Query:  LEMQVQQIANDQKSRPQGTLLGHTENPKRDREGNEHCKAVITRSGLSYEGPSLPVEGT--------------DVVTPVPTSTFNPQQE------------
        LEMQ+ Q+ N+ ++RPQG+L   TE P+  R G EHC ++ TRSGL YEGP +P E +               +V P  +    PQ              
Subjt:  LEMQVQQIANDQKSRPQGTLLGHTENPKRDREGNEHCKAVITRSGLSYEGPSLPVEGT--------------DVVTPVPTSTFNPQQE------------

Query:  --------------------------EKAESVSTEEKGKK---------------------ANKGKQEVP------------------------------
                                  E  E + T  K  K                     +N  K ++P                              
Subjt:  --------------------------EKAESVSTEEKGKK---------------------ANKGKQEVP------------------------------

Query:  ------STTLQLDIGEACPTTVTLQLADRSIKKPEGKIEDVLVKVDKFIFPADFIILDCEAD---------------------------LEVNDEQVTFN
              S   + +IG+A PTTVTLQLADRSI KPEGKIEDVLVKVDKFIFP DFIILDCEAD                           + V+D++VTFN
Subjt:  ------STTLQLDIGEACPTTVTLQLADRSIKKPEGKIEDVLVKVDKFIFPADFIILDCEAD---------------------------LEVNDEQVTFN

Query:  VFDVVRLPDEVEDCSTIG----AIMEELQEMIVEDLEVDLEAAEKEAIL
        + D ++  D++E+C+ I         EL +++  ++E +LE AEKE I+
Subjt:  VFDVVRLPDEVEDCSTIG----AIMEELQEMIVEDLEVDLEAAEKEAIL

TrEMBL top hitse value%identityAlignment
A0A6J1CPJ3 uncharacterized protein LOC1110129471.3e-3534.28Show/hide
Query:  GVASSSAEEPDQQYKQNYTPLGFPTQPAF--LPQQYNQQRAQNTTQQGGSNTS-FEAMMKKFMTRSEATTKEFMTRT-----------------DVAIRN
        G  SSS    +QQYKQ YTP  FP  PAF   PQQYNQQ+  N  Q    N S  E +MK+F+T+++AT KE MTRT                 DV +RN
Subjt:  GVASSSAEEPDQQYKQNYTPLGFPTQPAF--LPQQYNQQRAQNTTQQGGSNTS-FEAMMKKFMTRSEATTKEFMTRT-----------------DVAIRN

Query:  LEMQVQQIANDQKSRPQGTLLGHTENPKR---DREGNEHCKAVITRSGLSYE---GPSLPVEGTDVVTPVPTSTFNPQQE--------------------
        LEMQ+ Q+AN+ ++RPQG+L   TE P+R        E    V+    +  E     +  V       P P       Q+                    
Subjt:  LEMQVQQIANDQKSRPQGTLLGHTENPKR---DREGNEHCKAVITRSGLSYE---GPSLPVEGTDVVTPVPTSTFNPQQE--------------------

Query:  EKAESVSTEEK------GKKANKGKQEVPSTT---------------------------------------------------LQLDIGEACPTTVTLQL
        E  E + T  K       +K   G+ E  + T                                                    +L+IG+A PTTVTL L
Subjt:  EKAESVSTEEK------GKKANKGKQEVPSTT---------------------------------------------------LQLDIGEACPTTVTLQL

Query:  ADRSIKKPEGKIEDVLVKVDKFIFPADFIILDCEAD---------------------------LEVNDEQVTFNVFDVVRLPDEVEDCSTI----GAIME
        ADRSI KPEGKIEDVLVKVDKFIFPADFIILDCEAD                           + V+D++VTFN+ D ++ PD+ E+C  I    G    
Subjt:  ADRSIKKPEGKIEDVLVKVDKFIFPADFIILDCEAD---------------------------LEVNDEQVTFNVFDVVRLPDEVEDCSTI----GAIME

Query:  ELQEMIVEDLEVDLEAAEKEAIL
        EL +++  ++E +LE AEKE I+
Subjt:  ELQEMIVEDLEVDLEAAEKEAIL

A0A6J1CPX1 uncharacterized protein LOC1110135001.8e-3468.84Show/hide
Query:  QLDIGEACPTTVTLQLADRSIKKPEGKIEDVLVKVDKFIFPADFIILDCEADLE---------------------------VNDEQVTFNVFDVVRLPDE
        +L+IGE  PT+VT QLADRSIKKPE KIE VLVKVDKFIFPADFIILD EADLE                           VNDEQ+TFNV DV RLPDE
Subjt:  QLDIGEACPTTVTLQLADRSIKKPEGKIEDVLVKVDKFIFPADFIILDCEADLE---------------------------VNDEQVTFNVFDVVRLPDE

Query:  VEDCSTIGAIMEELQEMIVEDLEVDLEAAEKEA--ILP
        VEDCS IGAIMEELQEMIVEDLE DLEAAEKEA  ILP
Subjt:  VEDCSTIGAIMEELQEMIVEDLEVDLEAAEKEA--ILP

A0A6J1DY39 uncharacterized protein LOC1110256531.0e-4033.41Show/hide
Query:  GVASSSAEEPDQQYKQNYTPLGFPTQPAF--LPQQYNQQRAQNTTQQGGSNTS-FEAMMKKFMTRSEATTKEFMTRT-----------------DVAIRN
        G  SS+    +QQYK+ YTP GFP  PAF   P QYNQQ+  N  Q    N S  E +MK+ +T+++AT KE MTRT                 DV +R 
Subjt:  GVASSSAEEPDQQYKQNYTPLGFPTQPAF--LPQQYNQQRAQNTTQQGGSNTS-FEAMMKKFMTRSEATTKEFMTRT-----------------DVAIRN

Query:  LEMQVQQIANDQKSRPQGTLLGHTENPKRDREGNEHCKAVITRSGLSYEGPSLPVEGT--------------DVVTPVPTSTFNPQQE------------
        LEMQ+ Q+ N+ ++RPQG+L   TE P+  R G EHC ++ TRSGL YEGP +P E +               +V P  +    PQ              
Subjt:  LEMQVQQIANDQKSRPQGTLLGHTENPKRDREGNEHCKAVITRSGLSYEGPSLPVEGT--------------DVVTPVPTSTFNPQQE------------

Query:  --------------------------EKAESVSTEEKGKK---------------------ANKGKQEVP------------------------------
                                  E  E + T  K  K                     +N  K ++P                              
Subjt:  --------------------------EKAESVSTEEKGKK---------------------ANKGKQEVP------------------------------

Query:  ------STTLQLDIGEACPTTVTLQLADRSIKKPEGKIEDVLVKVDKFIFPADFIILDCEAD---------------------------LEVNDEQVTFN
              S   + +IG+A PTTVTLQLADRSI KPEGKIEDVLVKVDKFIFP DFIILDCEAD                           + V+D++VTFN
Subjt:  ------STTLQLDIGEACPTTVTLQLADRSIKKPEGKIEDVLVKVDKFIFPADFIILDCEAD---------------------------LEVNDEQVTFN

Query:  VFDVVRLPDEVEDCSTIG----AIMEELQEMIVEDLEVDLEAAEKEAIL
        + D ++  D++E+C+ I         EL +++  ++E +LE AEKE I+
Subjt:  VFDVVRLPDEVEDCSTIG----AIMEELQEMIVEDLEVDLEAAEKEAIL

A0A6J1DZ19 uncharacterized protein LOC1110248245.0e-4083.78Show/hide
Query:  SNQGVASSSAEEPDQQYKQNYTPLGFPTQPAFLPQQYNQQRAQNTTQQGGSNTSFEAMMKKFMTRSEATTKEFMTRTDVAIRNLEMQVQQIANDQKSRPQ
        SNQGVASSSA+ P QQYKQNYTP  FPTQPA  PQQYNQQRAQNTTQQGGSN S EAM K+FMTRSEATTKEFMTRTD  IR LEMQV QIAND+KSRPQ
Subjt:  SNQGVASSSAEEPDQQYKQNYTPLGFPTQPAFLPQQYNQQRAQNTTQQGGSNTSFEAMMKKFMTRSEATTKEFMTRTDVAIRNLEMQVQQIANDQKSRPQ

Query:  GTLLGHTENPK
        GTL G+TENPK
Subjt:  GTLLGHTENPK

A0A6J1DZC3 uncharacterized protein LOC1110244491.3e-4035.19Show/hide
Query:  QGVASSSAEEPDQQYKQNYTPLGFPTQPAFLPQQYNQQRAQN-------TTQQGGSNTSFEAMMKKFMTRSEATTKEFMTRTDVAI-----------RNL
        QG ++++ +  +QQYK    P      P   PQQ+NQQ+  +       +  +        +MMK+F  R++   +EF TR D AI           RNL
Subjt:  QGVASSSAEEPDQQYKQNYTPLGFPTQPAFLPQQYNQQRAQN-------TTQQGGSNTSFEAMMKKFMTRSEATTKEFMTRTDVAI-----------RNL

Query:  EMQVQQIANDQKSRPQGTLLGHTENPKRDREGNEHCKAVITRSGLSYEGPSLPVEGTDVVTPVPTSTFNPQQEEKAE---------SVSTEEK-------
        E Q+ Q+A++ K+RP+GTL   TE PK   EG EHCK + TRSGL+YE P +P EG+   T    +   P +  + E          + T +K       
Subjt:  EMQVQQIANDQKSRPQGTLLGHTENPKRDREGNEHCKAVITRSGLSYEGPSLPVEGTDVVTPVPTSTFNPQQEEKAE---------SVSTEEK-------

Query:  ----------------GKKANKGKQEVP------------------------STTLQLDIGEACPTTVTLQLADRSIKKPEGKIEDVLVKVDKFIFPADF
                         K  + G   +P                        S   +L+IG+A PTTVTLQLADRSI KPEGKIEDVLVKVDKFIFPADF
Subjt:  ----------------GKKANKGKQEVP------------------------STTLQLDIGEACPTTVTLQLADRSIKKPEGKIEDVLVKVDKFIFPADF

Query:  IILDCEAD---------------------------LEVNDEQVTFNVFDVVRLPDEVEDCSTI----GAIMEELQEMIVEDLEVDLEAAEKEAIL
        IIL+CEAD                           + V+D++VTFN+ D ++ PD++E+C+TI    G    EL +++  ++E  LE AEKE I+
Subjt:  IILDCEAD---------------------------LEVNDEQVTFNVFDVVRLPDEVEDCSTI----GAIMEELQEMIVEDLEVDLEAAEKEAIL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGACCTGGGTTACCCTAGTCTCTGCAGCAACCAAGAGAGGTAGTGGAAGAGTAGAACTCAGTTTCTCAAGAAAAGCCAGAATTGCACTTGGTGCATTTTCCCAACA
AAGTGTTTTTTCCATGTTTTGCATCAAAACTAAGCAACAAGTTCCTGAGGATTCGACCTTGGATTACTACCAGGTATTACTTGTGACCGTGTGCGCTTGGTCTGGTTGGT
TGACCTTTGATCCAGAGATTGAAAGAACCCTTAAGAGGATAAGGCGTGAGCTTTGGTTGAGAAAAGAGAAAGAAACTCAAAAAGAGGAAGAAGTTGAAGAAGTTGAAGAA
GAAGAGACCATCGAGATGAATTGGAATCCACAAGATCTTCCACCTCCACAAAATCCACCTGTGAACGGAGATATGGCAGGGATAAATAATCCTTTACCCCAAGCCGCAAA
GTTCAAGCTCAAGCCAGTCATGTTCCAGATGGATGGTGCAAGGACTTGGCTAAATGCGCTAAAACCAAATTCTGTCAACACATGGGCAGAAAAGATGGAGAAATTTTTGG
AAAAGTACCACACTTTGACCAGGAACGCAGACCTTCGAGAATATGTTAGTAACCAAGGAGTAGCTAGTAGCAGTGCGGAAGAACCCGATCAACAATACAAGCAAAACTAC
ACTCCTCTTGGTTTTCCTACTCAACCTGCGTTTCTGCCACAACAATACAATCAACAACGAGCTCAAAATACTACTCAGCAAGGTGGTAGCAACACGAGTTTTGAGGCCAT
GATGAAAAAATTCATGACAAGAAGTGAAGCTACAACAAAAGAGTTCATGACAAGAACTGATGTTGCGATAAGAAACTTGGAGATGCAAGTGCAGCAGATAGCAAATGACC
AAAAATCTAGACCTCAAGGTACATTGCTTGGACACACAGAGAACCCGAAGCGAGACCGTGAGGGAAATGAACACTGTAAGGCGGTTATCACGAGAAGCGGACTAAGCTAT
GAAGGACCCTCACTTCCAGTTGAAGGAACTGATGTAGTTACACCTGTTCCTACATCCACCTTCAATCCACAACAAGAAGAAAAAGCAGAATCTGTAAGTACAGAAGAAAA
AGGTAAGAAGGCGAATAAAGGTAAGCAAGAAGTGCCCAGCACTACCCTACAGTTAGATATAGGAGAAGCTTGTCCCACTACTGTCACTTTACAACTAGCTGATAGGTCCA
TAAAGAAACCAGAAGGAAAAATAGAAGATGTGCTTGTTAAAGTCGATAAATTTATTTTTCCCGCCGATTTCATAATTTTGGATTGTGAAGCAGATCTTGAGGTCAATGAT
GAGCAGGTAACCTTCAATGTCTTTGATGTGGTGCGGCTCCCAGATGAAGTCGAAGACTGCTCTACAATAGGGGCAATCATGGAGGAACTCCAGGAAATGATTGTGGAAGA
CTTAGAAGTTGATTTGGAGGCCGCAGAAAAAGAAGCCATTTTGCCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGGACCTGGGTTACCCTAGTCTCTGCAGCAACCAAGAGAGGTAGTGGAAGAGTAGAACTCAGTTTCTCAAGAAAAGCCAGAATTGCACTTGGTGCATTTTCCCAACA
AAGTGTTTTTTCCATGTTTTGCATCAAAACTAAGCAACAAGTTCCTGAGGATTCGACCTTGGATTACTACCAGGTATTACTTGTGACCGTGTGCGCTTGGTCTGGTTGGT
TGACCTTTGATCCAGAGATTGAAAGAACCCTTAAGAGGATAAGGCGTGAGCTTTGGTTGAGAAAAGAGAAAGAAACTCAAAAAGAGGAAGAAGTTGAAGAAGTTGAAGAA
GAAGAGACCATCGAGATGAATTGGAATCCACAAGATCTTCCACCTCCACAAAATCCACCTGTGAACGGAGATATGGCAGGGATAAATAATCCTTTACCCCAAGCCGCAAA
GTTCAAGCTCAAGCCAGTCATGTTCCAGATGGATGGTGCAAGGACTTGGCTAAATGCGCTAAAACCAAATTCTGTCAACACATGGGCAGAAAAGATGGAGAAATTTTTGG
AAAAGTACCACACTTTGACCAGGAACGCAGACCTTCGAGAATATGTTAGTAACCAAGGAGTAGCTAGTAGCAGTGCGGAAGAACCCGATCAACAATACAAGCAAAACTAC
ACTCCTCTTGGTTTTCCTACTCAACCTGCGTTTCTGCCACAACAATACAATCAACAACGAGCTCAAAATACTACTCAGCAAGGTGGTAGCAACACGAGTTTTGAGGCCAT
GATGAAAAAATTCATGACAAGAAGTGAAGCTACAACAAAAGAGTTCATGACAAGAACTGATGTTGCGATAAGAAACTTGGAGATGCAAGTGCAGCAGATAGCAAATGACC
AAAAATCTAGACCTCAAGGTACATTGCTTGGACACACAGAGAACCCGAAGCGAGACCGTGAGGGAAATGAACACTGTAAGGCGGTTATCACGAGAAGCGGACTAAGCTAT
GAAGGACCCTCACTTCCAGTTGAAGGAACTGATGTAGTTACACCTGTTCCTACATCCACCTTCAATCCACAACAAGAAGAAAAAGCAGAATCTGTAAGTACAGAAGAAAA
AGGTAAGAAGGCGAATAAAGGTAAGCAAGAAGTGCCCAGCACTACCCTACAGTTAGATATAGGAGAAGCTTGTCCCACTACTGTCACTTTACAACTAGCTGATAGGTCCA
TAAAGAAACCAGAAGGAAAAATAGAAGATGTGCTTGTTAAAGTCGATAAATTTATTTTTCCCGCCGATTTCATAATTTTGGATTGTGAAGCAGATCTTGAGGTCAATGAT
GAGCAGGTAACCTTCAATGTCTTTGATGTGGTGCGGCTCCCAGATGAAGTCGAAGACTGCTCTACAATAGGGGCAATCATGGAGGAACTCCAGGAAATGATTGTGGAAGA
CTTAGAAGTTGATTTGGAGGCCGCAGAAAAAGAAGCCATTTTGCCCTAA
Protein sequenceShow/hide protein sequence
MWTWVTLVSAATKRGSGRVELSFSRKARIALGAFSQQSVFSMFCIKTKQQVPEDSTLDYYQVLLVTVCAWSGWLTFDPEIERTLKRIRRELWLRKEKETQKEEEVEEVEE
EETIEMNWNPQDLPPPQNPPVNGDMAGINNPLPQAAKFKLKPVMFQMDGARTWLNALKPNSVNTWAEKMEKFLEKYHTLTRNADLREYVSNQGVASSSAEEPDQQYKQNY
TPLGFPTQPAFLPQQYNQQRAQNTTQQGGSNTSFEAMMKKFMTRSEATTKEFMTRTDVAIRNLEMQVQQIANDQKSRPQGTLLGHTENPKRDREGNEHCKAVITRSGLSY
EGPSLPVEGTDVVTPVPTSTFNPQQEEKAESVSTEEKGKKANKGKQEVPSTTLQLDIGEACPTTVTLQLADRSIKKPEGKIEDVLVKVDKFIFPADFIILDCEADLEVND
EQVTFNVFDVVRLPDEVEDCSTIGAIMEELQEMIVEDLEVDLEAAEKEAILP