CuGenDBv2

Moc05g16640 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g16640
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr5:12498643..12509817
RNA-Seq Expression: Moc05g16640
Synteny: Moc05g16640
Gene Ontology terms: NA
InterPro domains: NA
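
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say so), the string can be parsed and the locus span computed as follows. `parse_location` is a hypothetical helper written for illustration, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr5:12498643..12509817")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 11175 bp; with 0-based half-open coordinates the span would be one base shorter.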


Homology
GenBank top hits (e-value, % identity, alignment)
CAN78195.1 hypothetical protein VITISV_008799 [Vitis vinifera] | e-value: 4.5e-25 | identity: 30.69%
Query:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM
        M  ++W  +D Q +  IR++LS  V   VVKE    +L++AL D YEK SAN K+ L  K FN+ M E  SV  ++NE   I N+L  + I   +E++++
Subjt:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM

Query:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARR-KLGKMFASTSGADNRVESTLVAQNEGKGKINYTGQH---SIDTT------WVVGVSE
         +L SLP+SWE M++A+SNS+    LK++ I D+ L+EE RR   G+   S S  +   +     +N  +G+ N    +   S  T+      W  G + 
Subjt:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARR-KLGKMFASTSGADNRVESTLVAQNEGKGKINYTGQH---SIDTT------WVVGVSE

Query:  NYNSQNQEKFHEVEKEKYDRSSLCLMAHSDNGSDLESDDN-----EIIHKPSSYDEVFDAFESMQWKLDDDCYNNEFVKGFWKLKRESTVVATGYKRSSV
        ++  Q +      +K+  D S+  +         L ++ +     ++I + S Y      F   + +LDD+ +   FV G WK+ + S V+  G K S++
Subjt:  NYNSQNQEKFHEVEKEKYDRSSLCLMAHSDNGSDLESDDN-----EIIHKPSSYDEVFDAFESMQWKLDDDCYNNEFVKGFWKLKRESTVVATGYKRSSV

Query:  YVS
        Y++
Subjt:  YVS

KAG7593230.1 Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa] | e-value: 5.9e-25 | identity: 31.74%
Query:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM
        M  ++W+ +D Q +  IR++LS  V   V KE T + L++ L D YEKPSAN K+ L  K F++ MEE   V +++NE   I+N+L  + I+ ++EV+ +
Subjt:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM

Query:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARRKLGKMFASTSGA---DNR-VESTLVAQNEGKGKI-NYTGQHSID---TTWVVGVSENY
         LL SLP+SWE M+ A+SNS+ +  LKF  + D  L EE RR +     STS A   +NR  +     QN G+ K  N  GQ         W  G + ++
Subjt:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARRKLGKMFASTSGA---DNR-VESTLVAQNEGKGKI-NYTGQHSID---TTWVVGVSENY

Query:  NSQNQEKFHEVEKEKYDRSSLCLMAHSDNGSDLESDDNEII--HKPSSYDEVFDA-----------------FESMQWK-------------------LD
         + N  K        Y  S +         S   + D  I+  +   +Y +V+ A                  + + WK                   LD
Subjt:  NSQNQEKFHEVEKEKYDRSSLCLMAHSDNGSDLESDDNEII--HKPSSYDEVFDA-----------------FESMQWK-------------------LD

Query:  DDCYNNEFVKGFWKLKRESTVVATGYKRSSVYVS
        D  +N  F  G WK+K+ S VVA G+KR S+Y++
Subjt:  DDCYNNEFVKGFWKLKRESTVVATGYKRSSVYVS

RVW17147.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e-value: 7.7e-25 | identity: 32.53%
Query:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM
        M  ++W  +D Q +  IR++LS  V   VVKE T  +L++ L   YEKPSAN K+ L  K FN+ M E  SV  ++NE   I N+L  + I  ++E++ +
Subjt:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM

Query:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARRKLGKMFASTSGADNRVESTLVAQNEGKGKINYTGQ-HSIDTTWVVGVSENYNSQNQEK
         +L SLP+SWE M++A+SNS     LK++ I D+ L+EE RR+       TSG+     S L  +  G+G    + Q  S  TT    + +NY + +  K
Subjt:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARRKLGKMFASTSGADNRVESTLVAQNEGKGKINYTGQ-HSIDTTWVVGVSENYNSQNQEK

Query:  FHEVEKEKYDRSSLCLMAHSDNGSDLESDDNEIIHKPSSYDEVFDAFESMQWKLDDDCYNNEFVKGFWKLKRESTVVATGYKRSSVYVS
         +       D S+L ++   D    L +    ++ K     ++     S+  +LDD+ +   FV G WK+ + + V+A G K  ++Y++
Subjt:  FHEVEKEKYDRSSLCLMAHSDNGSDLESDDNEIIHKPSSYDEVFDAFESMQWKLDDDCYNNEFVKGFWKLKRESTVVATGYKRSSVYVS

RZB51390.1 Nucleolar pre-ribosomal-associated protein 1 isoform B [Glycine soja] | e-value: 1.6e-25 | identity: 29.59%
Query:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM
        M  ++WN +D QA+  IR++L+  V   +V E T   L++AL D YEKPSA  K+ L  + FN+ M E  SV  +INE   IL +LE + IK  +EVK +
Subjt:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM

Query:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEAR-RKLGKMFASTSGADNRVE----STLVAQN-EGKGKINYTGQHSID---TTWVVG----
         LL+SLPDSW     A+S+S  +N LK S I D+ LSE+ R R  G+  +  S +   +E    +T   QN  G+ K    GQ       T W       
Subjt:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEAR-RKLGKMFASTSGADNRVE----STLVAQN-EGKGKINYTGQHSID---TTWVVG----

Query:  VSENYNSQNQEKFHEVEKEKYDRSS-----------LCLMAHSDNGSDLESDDNEIIHKPSSYDEVFDAFESMQ-----------WKLDDDCYNNEFVKG
         S  Y +  + K H+ +K   D S+           +C +    +G  L+      I+  +S   ++                   +L D+ ++  F  G
Subjt:  VSENYNSQNQEKFHEVEKEKYDRSS-----------LCLMAHSDNGSDLESDDNEIIHKPSSYDEVFDAFESMQ-----------WKLDDDCYNNEFVKG

Query:  FWKLKRESTVVATGYKRSSVYVSEFEVARRSKRQRMQRAINCSGRDLKESATMTVRTDKENLPSIQVQQLG--SRKRERRTVQCGKEEPFKGVKMMGHCR
         WK+ + + +VA G KR S+Y+   E                          M   T+  N  ++  Q+LG  S K  +     GK    K V  +G C 
Subjt:  FWKLKRESTVVATGYKRSSVYVSEFEVARRSKRQRMQRAINCSGRDLKESATMTVRTDKENLPSIQVQQLG--SRKRERRTVQCGKEEPFKGVKMMGHCR

Query:  YS--GKQRIVALSPSGRSL
        +   GKQ+ V+ S +G++L
Subjt:  YS--GKQRIVALSPSGRSL

RZC08730.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja] | e-value: 2.3e-24 | identity: 41.01%
Query:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM
        M  ++WN +D QA+  IR++LS  V   +V E T   L++AL D YEKPSA  K+ L  + FN+ M E  SV ++INE   IL +LE + IK  +EV  +
Subjt:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM

Query:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARRKLGKMFASTSGADNRVESTLVAQNEGKGKINYTGQH
         LL+SLPDSW  + IA+S+S  +N LK S I D+ LSE+ R++        SG  +   S      EG+G+    GQ+
Subjt:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARRKLGKMFASTSGADNRVESTLVAQNEGKGKINYTGQH

TrEMBL top hits (e-value, % identity, alignment)
A0A0D3AEM1 CCHC-type domain-containing protein | e-value: 2.4e-24 | identity: 38.12%
Query:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM
        M   +W  +D Q +  IR++LS  V   VVKE T + L++ L D YEKPSAN+K+ L  K F++ MEE   V ++INE   I+N+L  + I+  +EV+ +
Subjt:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM

Query:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARRKLGKMFASTSGADNRVESTLVAQNEGKGKINYTGQHSIDTTWVVGVSENYNSQNQEKF
         LL SLP+SWE+M++A+SNS+    LKF+ + D  L+EE RR +    ASTS A N        +N G+        +        G S++ N + Q KF
Subjt:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARRKLGKMFASTSGADNRVESTLVAQNEGKGKINYTGQHSIDTTWVVGVSENYNSQNQEKF

Query:  HE
         +
Subjt:  HE

A0A2N9EGP1 Uncharacterized protein | e-value: 2.2e-25 | identity: 33.11%
Query:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM
        MTD +W  +D Q +  IR++LS  +   VVKE T  EL+ AL D YEKPSAN K+ L  K FN+ M E  +V  ++NE   I N+L  + I  ++EV+ +
Subjt:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM

Query:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARRK-LGKMFASTSGADNRVESTLVAQ--NEGKGKINYTGQHSIDTTWVVGVSENYNSQNQ
         +L SLP++WE M++A+SNS   + LK+  I  + LS+E RR+ +G+   STSG+   +E+    Q  N  +G+     + S   +    V  N      
Subjt:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARRK-LGKMFASTSGADNRVESTLVAQ--NEGKGKINYTGQHSIDTTWVVGVSENYNSQNQ

Query:  EKFHEVEKEKYDRSSLCLMAHSDNGSDLESDDNEII--HKPSSYDEVF---DAFESMQWKLDDDCYNNEFVKGFWKLKRESTVVATGYKRSSVYVS
         K +  E +K   +    +              EII  H   ++D+V+   D    +  +LD++ ++  FV G WK+ +   VVA G K S++Y++
Subjt:  EKFHEVEKEKYDRSSLCLMAHSDNGSDLESDDNEII--HKPSSYDEVF---DAFESMQWKLDDDCYNNEFVKGFWKLKRESTVVATGYKRSSVYVS

A0A445FR60 Nucleolar pre-ribosomal-associated protein 1 isoform B | e-value: 7.6e-26 | identity: 29.59%
Query:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM
        M  ++WN +D QA+  IR++L+  V   +V E T   L++AL D YEKPSA  K+ L  + FN+ M E  SV  +INE   IL +LE + IK  +EVK +
Subjt:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM

Query:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEAR-RKLGKMFASTSGADNRVE----STLVAQN-EGKGKINYTGQHSID---TTWVVG----
         LL+SLPDSW     A+S+S  +N LK S I D+ LSE+ R R  G+  +  S +   +E    +T   QN  G+ K    GQ       T W       
Subjt:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEAR-RKLGKMFASTSGADNRVE----STLVAQN-EGKGKINYTGQHSID---TTWVVG----

Query:  VSENYNSQNQEKFHEVEKEKYDRSS-----------LCLMAHSDNGSDLESDDNEIIHKPSSYDEVFDAFESMQ-----------WKLDDDCYNNEFVKG
         S  Y +  + K H+ +K   D S+           +C +    +G  L+      I+  +S   ++                   +L D+ ++  F  G
Subjt:  VSENYNSQNQEKFHEVEKEKYDRSS-----------LCLMAHSDNGSDLESDDNEIIHKPSSYDEVFDAFESMQ-----------WKLDDDCYNNEFVKG

Query:  FWKLKRESTVVATGYKRSSVYVSEFEVARRSKRQRMQRAINCSGRDLKESATMTVRTDKENLPSIQVQQLG--SRKRERRTVQCGKEEPFKGVKMMGHCR
         WK+ + + +VA G KR S+Y+   E                          M   T+  N  ++  Q+LG  S K  +     GK    K V  +G C 
Subjt:  FWKLKRESTVVATGYKRSSVYVSEFEVARRSKRQRMQRAINCSGRDLKESATMTVRTDKENLPSIQVQQLG--SRKRERRTVQCGKEEPFKGVKMMGHCR

Query:  YS--GKQRIVALSPSGRSL
        +   GKQ+ V+ S +G++L
Subjt:  YS--GKQRIVALSPSGRSL

A0A445KDD6 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 1.1e-24 | identity: 41.01%
Query:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM
        M  ++WN +D QA+  IR++LS  V   +V E T   L++AL D YEKPSA  K+ L  + FN+ M E  SV ++INE   IL +LE + IK  +EV  +
Subjt:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM

Query:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARRKLGKMFASTSGADNRVESTLVAQNEGKGKINYTGQH
         LL+SLPDSW  + IA+S+S  +N LK S I D+ LSE+ R++        SG  +   S      EG+G+    GQ+
Subjt:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARRKLGKMFASTSGADNRVESTLVAQNEGKGKINYTGQH

A5AQS3 CCHC-type domain-containing protein | e-value: 2.2e-25 | identity: 30.69%
Query:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM
        M  ++W  +D Q +  IR++LS  V   VVKE    +L++AL D YEK SAN K+ L  K FN+ M E  SV  ++NE   I N+L  + I   +E++++
Subjt:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM

Query:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARR-KLGKMFASTSGADNRVESTLVAQNEGKGKINYTGQH---SIDTT------WVVGVSE
         +L SLP+SWE M++A+SNS+    LK++ I D+ L+EE RR   G+   S S  +   +     +N  +G+ N    +   S  T+      W  G + 
Subjt:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARR-KLGKMFASTSGADNRVESTLVAQNEGKGKINYTGQH---SIDTT------WVVGVSE

Query:  NYNSQNQEKFHEVEKEKYDRSSLCLMAHSDNGSDLESDDN-----EIIHKPSSYDEVFDAFESMQWKLDDDCYNNEFVKGFWKLKRESTVVATGYKRSSV
        ++  Q +      +K+  D S+  +         L ++ +     ++I + S Y      F   + +LDD+ +   FV G WK+ + S V+  G K S++
Subjt:  NYNSQNQEKFHEVEKEKYDRSSLCLMAHSDNGSDLESDDN-----EIIHKPSSYDEVFDAFESMQWKLDDDCYNNEFVKGFWKLKRESTVVATGYKRSSV

Query:  YVS
        Y++
Subjt:  YVS

SwissProt top hits (e-value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 2.3e-11 | identity: 24.02%
Query:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM
        M  +DW ++DE+A + IR+ LS  V + ++ E TA+ +   L   Y   +   K+ L  + + +HM E  +  S++N   G++ +L  +G+KI EE K +
Subjt:  MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIM

Query:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARRKLGKMFASTSGADNRVESTLVAQNEGKGKINYTGQHSIDTTWVVGVSEN---------
         LL SLP S++ +   + +      LK  T   + L+E+ R+K           +N+ ++ +    EG+G+      ++   +   G S+N         
Subjt:  RLLTSLPDSWETMKIAMSNSLVDNNLKFSTICDVPLSEEARRKLGKMFASTSGADNRVESTLVAQNEGKGKINYTGQHSIDTTWVVGVSEN---------

Query:  YNSQNQEKFH-----------EVEKEKYDRSSLCLMAHSDNGSDLESDDNEIIH
        YN      F            E   +K D ++  ++ ++DN     +++ E +H
Subjt:  YNSQNQEKFH-----------EVEKEKYDRSSLCLMAHSDNGSDLESDDNEIIH

Arabidopsis top hits (e-value, % identity, alignment)
No hits found
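
Each alignment block in the Homology section pairs a query line with a midline. As a rough sketch, assuming the usual BLAST-style midline conventions (a letter for an identical residue, `+` for a conservative substitution, a space for a mismatch or gap), identities and positives for a single segment can be counted as below. `midline_stats` is a hypothetical helper; the % identity values listed in this record come from the database's own search and may be computed over a different alignment length.

```python
def midline_stats(query, midline):
    """Count identities and positives in one alignment segment.

    Assumes BLAST-style midline conventions: a letter marks an identical
    residue, '+' a conservative substitution, a space a mismatch or gap.
    """
    # Pad the midline, since trailing spaces are often lost in copied text.
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# Last segment of the first GenBank hit: query 'YVS', midline 'Y++'.
identities, positives, length = midline_stats("YVS", "Y++")
print(identities, positives, length)
```

Summing these counts over all segments of a hit and dividing identities by the total aligned length gives a percent identity for that hit under the stated assumptions.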

Sequences
CDS sequence
ATGACTGACAAGGATTGGAATGAGATGGATGAGCAGGCCATCGCGAACATCAGAATGTCTTTATCGATAGGTGTATGCAGTCTCGTGGTGAAAGAGATGACTGCGAAAGA
ATTGTTGCAGGCCTTGCATGATAGGTATGAAAAACCTTCTGCCAATACAAAAATACTTCTATGGACGAAGTATTTCAACATCCACATGGAGGAGCGACCATCGGTGAATT
CCAACATTAATGAGCTCACCGGCATCTTGAATAAATTAGAAGGTATGGGTATCAAGATCAATGAGGAGGTAAAGATTATGAGGTTGTTGACATCTTTGCCTGATAGTTGG
GAGACGATGAAGATCGCGATGTCGAATTCGTTAGTGGATAATAACTTGAAATTTTCAACTATTTGTGATGTCCCCTTATCTGAGGAAGCCCGAAGGAAATTAGGGAAAAT
GTTTGCATCTACTTCAGGGGCAGACAACAGGGTTGAATCAACCTTGGTAGCTCAGAACGAAGGGAAAGGCAAGATAAACTACACGGGGCAGCACAGCATAGATACAACAT
GGGTAGTGGGAGTTTCAGAGAATTATAATTCTCAAAATCAAGAAAAATTTCATGAAGTGGAAAAAGAAAAATATGATCGATCCTCTTTGTGTTTGATGGCTCATTCAGAC
AATGGGAGCGATCTTGAAAGTGATGACAATGAGATAATTCATAAACCCTCTTCATATGATGAAGTGTTTGATGCATTTGAAAGCATGCAATGGAAGCTAGATGATGATTG
CTACAACAATGAGTTTGTTAAGGGTTTCTGGAAGCTCAAGAGGGAATCTACGGTGGTGGCGACAGGCTACAAGAGATCTTCTGTTTATGTGTCTGAGTTTGAGGTTGCCA
GGAGATCTAAGAGACAGAGGATGCAAAGGGCTATAAATTGTTCAGGGCGAGACTTGAAAGAATCTGCAACAATGACAGTCAGGACAGATAAGGAGAATCTACCATCAATT
CAAGTACAACAGCTGGGAAGTAGAAAAAGGGAAAGAAGAACAGTTCAGTGTGGGAAAGAAGAACCATTCAAGGGTGTCAAGATGATGGGACACTGTCGATACAGTGGGAA
GCAGAGAATTGTCGCTTTGTCTCCAAGTGGAAGATCGTTGGGATTGGTGAAGCCAAAACAATGTCGGAACGAGCTAAGGATGAGTCGGGATCAAGCCAGGACGCATCAGG
ATCGAAATGGGGATGAAAAACAGGGAAGTGGAGCACTTGCAGTTCGCATTTTAGGGTTTTGTCCAACTGATTTTGAGCCATTTTCAGAGAATCTTATGGTAGTTTCAAGG
GAGAGGCTCAGGATATTGGCAGAGGCATTGGGATTGATCAAGATCGACATTATCGGGTCATTCCCTCTTCTCTCCCTCTCTTTTCCCTCTTCTTCCTTGTCGGTCTGGTT
CTCTCTTCCCTCCATCTTCGGATCTCCCCATGAGGAGCCCCAAAATGGGGTGAACGCCCTCTCAGCCCATATCACGAATCCGCTGTTGCCCGGGAAGGCGATGGAGAGCT
CCGCCGTGATCAGCACCTCCAGGACGGTGGCCAGCGACCTCGAAGTAGATAAGGAAGATTAA
mRNA sequence
ATGACTGACAAGGATTGGAATGAGATGGATGAGCAGGCCATCGCGAACATCAGAATGTCTTTATCGATAGGTGTATGCAGTCTCGTGGTGAAAGAGATGACTGCGAAAGA
ATTGTTGCAGGCCTTGCATGATAGGTATGAAAAACCTTCTGCCAATACAAAAATACTTCTATGGACGAAGTATTTCAACATCCACATGGAGGAGCGACCATCGGTGAATT
CCAACATTAATGAGCTCACCGGCATCTTGAATAAATTAGAAGGTATGGGTATCAAGATCAATGAGGAGGTAAAGATTATGAGGTTGTTGACATCTTTGCCTGATAGTTGG
GAGACGATGAAGATCGCGATGTCGAATTCGTTAGTGGATAATAACTTGAAATTTTCAACTATTTGTGATGTCCCCTTATCTGAGGAAGCCCGAAGGAAATTAGGGAAAAT
GTTTGCATCTACTTCAGGGGCAGACAACAGGGTTGAATCAACCTTGGTAGCTCAGAACGAAGGGAAAGGCAAGATAAACTACACGGGGCAGCACAGCATAGATACAACAT
GGGTAGTGGGAGTTTCAGAGAATTATAATTCTCAAAATCAAGAAAAATTTCATGAAGTGGAAAAAGAAAAATATGATCGATCCTCTTTGTGTTTGATGGCTCATTCAGAC
AATGGGAGCGATCTTGAAAGTGATGACAATGAGATAATTCATAAACCCTCTTCATATGATGAAGTGTTTGATGCATTTGAAAGCATGCAATGGAAGCTAGATGATGATTG
CTACAACAATGAGTTTGTTAAGGGTTTCTGGAAGCTCAAGAGGGAATCTACGGTGGTGGCGACAGGCTACAAGAGATCTTCTGTTTATGTGTCTGAGTTTGAGGTTGCCA
GGAGATCTAAGAGACAGAGGATGCAAAGGGCTATAAATTGTTCAGGGCGAGACTTGAAAGAATCTGCAACAATGACAGTCAGGACAGATAAGGAGAATCTACCATCAATT
CAAGTACAACAGCTGGGAAGTAGAAAAAGGGAAAGAAGAACAGTTCAGTGTGGGAAAGAAGAACCATTCAAGGGTGTCAAGATGATGGGACACTGTCGATACAGTGGGAA
GCAGAGAATTGTCGCTTTGTCTCCAAGTGGAAGATCGTTGGGATTGGTGAAGCCAAAACAATGTCGGAACGAGCTAAGGATGAGTCGGGATCAAGCCAGGACGCATCAGG
ATCGAAATGGGGATGAAAAACAGGGAAGTGGAGCACTTGCAGTTCGCATTTTAGGGTTTTGTCCAACTGATTTTGAGCCATTTTCAGAGAATCTTATGGTAGTTTCAAGG
GAGAGGCTCAGGATATTGGCAGAGGCATTGGGATTGATCAAGATCGACATTATCGGGTCATTCCCTCTTCTCTCCCTCTCTTTTCCCTCTTCTTCCTTGTCGGTCTGGTT
CTCTCTTCCCTCCATCTTCGGATCTCCCCATGAGGAGCCCCAAAATGGGGTGAACGCCCTCTCAGCCCATATCACGAATCCGCTGTTGCCCGGGAAGGCGATGGAGAGCT
CCGCCGTGATCAGCACCTCCAGGACGGTGGCCAGCGACCTCGAAGTAGATAAGGAAGATTAA
Protein sequence
MTDKDWNEMDEQAIANIRMSLSIGVCSLVVKEMTAKELLQALHDRYEKPSANTKILLWTKYFNIHMEERPSVNSNINELTGILNKLEGMGIKINEEVKIMRLLTSLPDSW
ETMKIAMSNSLVDNNLKFSTICDVPLSEEARRKLGKMFASTSGADNRVESTLVAQNEGKGKINYTGQHSIDTTWVVGVSENYNSQNQEKFHEVEKEKYDRSSLCLMAHSD
NGSDLESDDNEIIHKPSSYDEVFDAFESMQWKLDDDCYNNEFVKGFWKLKRESTVVATGYKRSSVYVSEFEVARRSKRQRMQRAINCSGRDLKESATMTVRTDKENLPSI
QVQQLGSRKRERRTVQCGKEEPFKGVKMMGHCRYSGKQRIVALSPSGRSLGLVKPKQCRNELRMSRDQARTHQDRNGDEKQGSGALAVRILGFCPTDFEPFSENLMVVSR
ERLRILAEALGLIKIDIIGSFPLLSLSFPSSSLSVWFSLPSIFGSPHEEPQNGVNALSAHITNPLLPGKAMESSAVISTSRTVASDLEVDKED
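
The protein sequence above should correspond to a translation of the CDS sequence. The following is a minimal sketch of that check, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and only the first 51 bases of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 51 bases of the CDS sequence listed in this record.
cds_start = "ATGACTGACAAGGATTGGAATGAGATGGATGAGCAGGCCATCGCGAACATC"
print(translate(cds_start))
```

The result can be compared with the first 17 residues of the protein sequence; passing the full CDS, with its line breaks, to `translate` extends the same comparison to the whole record.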