; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g20130 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g20130
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr3:13591911..13593675
RNA-Seq ExpressionMoc03g20130
SyntenyMoc03g20130
Gene Ontology termsGO:0006950 - response to stress (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0043229 - intracellular organelle (cellular component)
GO:0003824 - catalytic activity (molecular function)
GO:0032555 - purine ribonucleotide binding (molecular function)
GO:0043168 - anion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8691424.1 Callose synthase 12 [Hibiscus syriacus]7.6e-3234.81Show/hide
Query:  RFDRENFGLWKMQVKDLLTYKKIHKTLKERQ-AGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRYFNIHM
        +FD  +FG WKMQ++D L  K +++ L  +Q  GM D+DWA +D QA+ +IRL L  NVA  +  +KT   LM AL+  YEK  A++KV+L+ R FN+ M
Subjt:  RFDRENFGLWKMQVKDLLTYKKIHKTLKERQ-AGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRYFNIHM

Query:  EEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKELAALTVKTDQENLPS
         EG SV  H+NE+  +  QL S+EI F +EV+A+ LLSSLPDSW    TAVS+S                   G+ K    D+++L              
Subjt:  EEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKELAALTVKTDQENLPS

Query:  VQVQQLGSRENGKWNNSVRCSIDCQFRTPIVRRISELMKSRRRKGALRKTTVGPEVDGGVSGLGWECQVISEKAFLQESLGASEEGSVGDHLV
        V  +++  RE+G+ + S     + + RT          KSRR K    K        G       +C+ + +    QES   +EE   GD ++
Subjt:  VQVQQLGSRENGKWNNSVRCSIDCQFRTPIVRRISELMKSRRRKGALRKTTVGPEVDGGVSGLGWECQVISEKAFLQESLGASEEGSVGDHLV

KAE8714488.1 hypothetical protein F3Y22_tig00110195pilonHSYRG00090 [Hibiscus syriacus]2.2e-3134.81Show/hide
Query:  RFDRENFGLWKMQVKDLLTYKKIHKTLKERQ-AGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRYFNIHM
        +FD  +FG WKMQ++D L  K +++ L  +Q  GM D+DWA +D QA+ +IRL L  NVA  +  +KT   LM AL+  YEK  A++KV+L+ R FN+ M
Subjt:  RFDRENFGLWKMQVKDLLTYKKIHKTLKERQ-AGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRYFNIHM

Query:  EEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKELAALTVKTDQENLPS
         EG SV  H+NE+  +  QL S+EI F +EV+A+ LLSSLPDSW    TAVS+S                   G+ K    D+++L              
Subjt:  EEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKELAALTVKTDQENLPS

Query:  VQVQQLGSRENGKWNNSVRCSIDCQFRTPIVRRISELMKSRRRKGALRKTTVGPEVDGGVSGLGWECQVISEKAFLQESLGASEEGSVGDHLV
        V  +++  RE+G+ + S     + + RT          KSRR K    K        G       +C+ + +    QES   +EE   GD ++
Subjt:  VQVQQLGSRENGKWNNSVRCSIDCQFRTPIVRRISELMKSRRRKGALRKTTVGPEVDGGVSGLGWECQVISEKAFLQESLGASEEGSVGDHLV

KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]4.4e-3236.82Show/hide
Query:  MGSSEKSHHYFEGVLRFDRENFGLWKMQVKDLLTYKKIHKTLKERQAGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYAN
        MG  + S H   G+ +FD  +F  W+MQ++D L  KK+H+ L  +   M  ++W  +D Q + +IRL L  NVA  V  +KT   LMK L+D YEK  AN
Subjt:  MGSSEKSHHYFEGVLRFDRENFGLWKMQVKDLLTYKKIHKTLKERQAGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYAN

Query:  SKVYLITRYFNIHMEEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKEL
        +KV+L+ + F++ MEEG  V +H+NE   ++NQL S+EI F +EV+A+ L++SLP+SWE M+ AVSNS+ ++K ++  +R   ++    R+   G+    
Subjt:  SKVYLITRYFNIHMEEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKEL

Query:  AALTVKT-DQENLPSVQVQQLGSRENGKWNNSVRCSIDC
        +A  V+   ++     Q +      NGK  +  R  ++C
Subjt:  AALTVKT-DQENLPSVQVQQLGSRENGKWNNSVRCSIDC

KAG7584790.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]3.4e-3237.24Show/hide
Query:  MGSSEKSHHYFEGVLRFDRENFGLWKMQVKDLLTYKKIHKTLKERQAGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYAN
        MG  + S H   G+ +FD  +F  W+MQ++D L  KK+H+ L  +   M  ++W  +D Q + +IRL L  NVA  V  +KT   LMK L+D YEK  AN
Subjt:  MGSSEKSHHYFEGVLRFDRENFGLWKMQVKDLLTYKKIHKTLKERQAGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYAN

Query:  SKVYLITRYFNIHMEEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKEL
        +KV+L+ + F++ MEEG  V +H+NE   ++NQL S+EI F +EV+A+ LL+SLP+SWE M+ AVSNS+ ++K ++  +R   ++    R+   G+    
Subjt:  SKVYLITRYFNIHMEEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKEL

Query:  AALTVKT-DQENLPSVQVQQLGSRENGKWNNSVRCSIDC
        +A  V+   ++     Q +      NGK  +  R  ++C
Subjt:  AALTVKT-DQENLPSVQVQQLGSRENGKWNNSVRCSIDC

KAG7593230.1 Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa]2.6e-3237.24Show/hide
Query:  MGSSEKSHHYFEGVLRFDRENFGLWKMQVKDLLTYKKIHKTLKERQAGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYAN
        MG  + S H   G+ +FD  +F  W+MQ++D L  KK+H+ L  +   M  ++W  +D Q + +IRL L  NVA  V  +KT   LMK L+D YEK  AN
Subjt:  MGSSEKSHHYFEGVLRFDRENFGLWKMQVKDLLTYKKIHKTLKERQAGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYAN

Query:  SKVYLITRYFNIHMEEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKEL
        +KV+L+ + F++ MEEG  V +H+NE   ++NQL S+EI F +EV+A+ LL+SLP+SWE M+ AVSNS+ ++K ++  +R   ++    R+   G+    
Subjt:  SKVYLITRYFNIHMEEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKEL

Query:  AALTVKT-DQENLPSVQVQQLGSRENGKWNNSVRCSIDC
        +A  V+   ++     Q +      NGK  +  R  ++C
Subjt:  AALTVKT-DQENLPSVQVQQLGSRENGKWNNSVRCSIDC

TrEMBL top hitse value%identityAlignment
A0A0B2SMV2 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment)6.9e-3150.32Show/hide
Query:  RFDRENFGLWKMQVKDLLTYKKIHKTLK-ERQAGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRYFNIHM
        +FD  +F  WKMQ++D L  KK+++ L   +   M  ++W  +D QA+ +IRL L  NVA  + N+KT   LMKAL+D YEKS A +KVYL+ R FN+ M
Subjt:  RFDRENFGLWKMQVKDLLTYKKIHKTLK-ERQAGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRYFNIHM

Query:  EEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRD
         EG SV  HINE   ++ QLES++I F +EVKA+ LLSSLPDSW    TAVS+S R+
Subjt:  EEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRD

A0A0D3AEM1 CCHC-type domain-containing protein1.8e-3137.5Show/hide
Query:  RFDRENFGLWKMQVKDLLTYKKIHKTLKERQAGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRYFNIHME
        +FD  ++  W+MQ++D L  KK+H+ L ++   M   +W  +D Q + +IRL L  NVA  V  +KT   LMK L+D YEK  ANSKV+L+ + F++ ME
Subjt:  RFDRENFGLWKMQVKDLLTYKKIHKTLKERQAGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRYFNIHME

Query:  EGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKELAALTVKT-----DQE
        EG  V +HINE   ++NQL S+EI F +EV+A+ LL+SLP+SWE+M+ AVSNS+  +K ++  +R   ++    R+   G+    +A  V+      D+ 
Subjt:  EGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKELAALTVKT-----DQE

Query:  NLPSVQVQQLGSRENGKWNNSVRC
        N  + + +    R   K+     C
Subjt:  NLPSVQVQQLGSRENGKWNNSVRC

A0A6A2YZ69 Actin-depolymerizing factor 13.1e-3135.23Show/hide
Query:  RFDRENFGLWKMQVKDLLTYKKIHKTLKERQ-AGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRYFNIHM
        +FD  +FG WKMQ++D L  K +++ L  +Q  GM D+DWA +D QA+ +IRL L  NVA  +  +KT   LM AL+  YEK  A++KV+L+ R FN+ M
Subjt:  RFDRENFGLWKMQVKDLLTYKKIHKTLKERQ-AGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRYFNIHM

Query:  EEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKELAALTVKTDQENLPS
         EG SV  H+NE+  +  QL S+EI F +EV+A+ LLSSLPDSW    TAVS+S                   G+ K    D ++L              
Subjt:  EEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKELAALTVKTDQENLPS

Query:  VQVQQLGSRENGKWNNSVRCSIDCQFRTPIVRRISELMKSRRRKGALRKTTVGPE----VDGGVSG-LGWECQVISEKAFLQESLGASEEGSVGDHLV
        V  +++  RE+G+ + S     + + RT    RIS   +S+ R+G   K+  G +     + G  G    +C+   +    QES   +EE  +GD ++
Subjt:  VQVQQLGSRENGKWNNSVRCSIDCQFRTPIVRRISELMKSRRRKGALRKTTVGPE----VDGGVSG-LGWECQVISEKAFLQESLGASEEGSVGDHLV

A0A6A2ZHE4 Callose synthase 123.7e-3234.81Show/hide
Query:  RFDRENFGLWKMQVKDLLTYKKIHKTLKERQ-AGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRYFNIHM
        +FD  +FG WKMQ++D L  K +++ L  +Q  GM D+DWA +D QA+ +IRL L  NVA  +  +KT   LM AL+  YEK  A++KV+L+ R FN+ M
Subjt:  RFDRENFGLWKMQVKDLLTYKKIHKTLKERQ-AGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRYFNIHM

Query:  EEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKELAALTVKTDQENLPS
         EG SV  H+NE+  +  QL S+EI F +EV+A+ LLSSLPDSW    TAVS+S                   G+ K    D+++L              
Subjt:  EEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKELAALTVKTDQENLPS

Query:  VQVQQLGSRENGKWNNSVRCSIDCQFRTPIVRRISELMKSRRRKGALRKTTVGPEVDGGVSGLGWECQVISEKAFLQESLGASEEGSVGDHLV
        V  +++  RE+G+ + S     + + RT          KSRR K    K        G       +C+ + +    QES   +EE   GD ++
Subjt:  VQVQQLGSRENGKWNNSVRCSIDCQFRTPIVRRISELMKSRRRKGALRKTTVGPEVDGGVSGLGWECQVISEKAFLQESLGASEEGSVGDHLV

A0A6A3BGE7 Uncharacterized protein1.1e-3134.81Show/hide
Query:  RFDRENFGLWKMQVKDLLTYKKIHKTLKERQ-AGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRYFNIHM
        +FD  +FG WKMQ++D L  K +++ L  +Q  GM D+DWA +D QA+ +IRL L  NVA  +  +KT   LM AL+  YEK  A++KV+L+ R FN+ M
Subjt:  RFDRENFGLWKMQVKDLLTYKKIHKTLKERQ-AGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRYFNIHM

Query:  EEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKELAALTVKTDQENLPS
         EG SV  H+NE+  +  QL S+EI F +EV+A+ LLSSLPDSW    TAVS+S                   G+ K    D+++L              
Subjt:  EEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKELAALTVKTDQENLPS

Query:  VQVQQLGSRENGKWNNSVRCSIDCQFRTPIVRRISELMKSRRRKGALRKTTVGPEVDGGVSGLGWECQVISEKAFLQESLGASEEGSVGDHLV
        V  +++  RE+G+ + S     + + RT          KSRR K    K        G       +C+ + +    QES   +EE   GD ++
Subjt:  VQVQQLGSRENGKWNNSVRCSIDCQFRTPIVRRISELMKSRRRKGALRKTTVGPEVDGGVSGLGWECQVISEKAFLQESLGASEEGSVGDHLV

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.5e-1733.97Show/hide
Query:  VLRFDREN-FGLWKMQVKDLLTYKKIHKTL---KERQAGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRY
        V +F+ +N F  W+ +++DLL  + +HK L    ++   M  +DWA++DE+A + IRL L  +V + + ++ TA  +   L   Y      +K+YL  + 
Subjt:  VLRFDREN-FGLWKMQVKDLLTYKKIHKTL---KERQAGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRY

Query:  FNIHMEEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAV
        + +HM EGT+  SH+N    L+ QL ++ +   EE KAI LL+SLP S++ + T +
Subjt:  FNIHMEEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAV

Arabidopsis top hitse value%identityAlignment
AT3G29785.1 unknown protein4.0e-0731.37Show/hide
Query:  RFDRENFGLWKMQVKDLLTYKKIHKTLKERQAGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRYFNIHME
        + D  ++   +M+++D L  KK+H+ L ++   M+  DW  +  Q + +IRL +  N+A  V  +K+   LMK L+D Y+K   N+ V  I+    I +E
Subjt:  RFDRENFGLWKMQVKDLLTYKKIHKTLKERQAGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRYFNIHME

Query:  EG
        +G
Subjt:  EG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCCAGTGAAAAGTCACATCACTACTTCGAGGGAGTTTTAAGATTTGACAGGGAAAATTTCGGGTTGTGGAAAATGCAAGTGAAAGATCTTCTTACATACAAGAA
GATACACAAAACCTTGAAGGAACGACAGGCTGGGATGACAGATAAAGATTGGGCGGAGATGGATGAACAGGCCGTAGCGATCATCAGGTTGTGCTTGTTAATGAATGTGG
CAAGTCTCGTGGAGAATCAGAAAACTGCAATGAGATTGATGAAGGCGCTGACAGACAGATATGAAAAATCTTATGCCAATAGCAAGGTGTATCTCATTACGAGATATTTT
AACATTCACATGGAGGAAGGTACGTCGGTGAACTCCCACATCAATGAGGTCACTCAACTGATGAACCAGTTAGAGTCGATGGAGATCACTTTCTCAGAGGAGGTGAAAGC
TATAAAGTTGTTGTCTTCTTTGCCTGACAGTTGGGAAACGATGAAGACGGCAGTGTCGAATTCGTTGAGGGACAAAAAGAACAGATGGAAACTTATGAGGGGATTCGAGG
TAGTGGTTGTTGGCCACAGAAAAGCTTTAGTGGGGGACTTGAAAGAACTAGCAGCATTGACAGTCAAGACAGATCAGGAGAATCTGCCATCAGTTCAAGTACAACAGCTG
GGAAGTAGAGAAAATGGAAAGTGGAACAACTCAGTGAGGTGTTCAATAGACTGTCAGTTTCGAACCCCAATTGTCAGACGGATTAGCGAGCTGATGAAGTCGCGTAGGCG
AAAAGGTGCATTAAGGAAGACTACAGTTGGTCCTGAGGTCGATGGTGGTGTCTCTGGACTTGGGTGGGAGTGCCAAGTCATCAGTGAAAAAGCTTTCCTTCAAGAGTCGT
TGGGTGCGAGTGAAGAAGGAAGCGTCGGAGATCACTTAGTTCGAGTAGGAGCACGTGGCTGTGTCTCTAACGTCTGGGAGCAAGACCACGTAGGATTACTTAGTCTCAAG
GCAATCACAAATGGATCCTTGGTTGGTGCCAAGAAGATTTTAGAAACTATTGGTGCAGTAGGAGTTGAGCCTTGGTTGGTGTCAAGAGGATGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGATCCAGTGAAAAGTCACATCACTACTTCGAGGGAGTTTTAAGATTTGACAGGGAAAATTTCGGGTTGTGGAAAATGCAAGTGAAAGATCTTCTTACATACAAGAA
GATACACAAAACCTTGAAGGAACGACAGGCTGGGATGACAGATAAAGATTGGGCGGAGATGGATGAACAGGCCGTAGCGATCATCAGGTTGTGCTTGTTAATGAATGTGG
CAAGTCTCGTGGAGAATCAGAAAACTGCAATGAGATTGATGAAGGCGCTGACAGACAGATATGAAAAATCTTATGCCAATAGCAAGGTGTATCTCATTACGAGATATTTT
AACATTCACATGGAGGAAGGTACGTCGGTGAACTCCCACATCAATGAGGTCACTCAACTGATGAACCAGTTAGAGTCGATGGAGATCACTTTCTCAGAGGAGGTGAAAGC
TATAAAGTTGTTGTCTTCTTTGCCTGACAGTTGGGAAACGATGAAGACGGCAGTGTCGAATTCGTTGAGGGACAAAAAGAACAGATGGAAACTTATGAGGGGATTCGAGG
TAGTGGTTGTTGGCCACAGAAAAGCTTTAGTGGGGGACTTGAAAGAACTAGCAGCATTGACAGTCAAGACAGATCAGGAGAATCTGCCATCAGTTCAAGTACAACAGCTG
GGAAGTAGAGAAAATGGAAAGTGGAACAACTCAGTGAGGTGTTCAATAGACTGTCAGTTTCGAACCCCAATTGTCAGACGGATTAGCGAGCTGATGAAGTCGCGTAGGCG
AAAAGGTGCATTAAGGAAGACTACAGTTGGTCCTGAGGTCGATGGTGGTGTCTCTGGACTTGGGTGGGAGTGCCAAGTCATCAGTGAAAAAGCTTTCCTTCAAGAGTCGT
TGGGTGCGAGTGAAGAAGGAAGCGTCGGAGATCACTTAGTTCGAGTAGGAGCACGTGGCTGTGTCTCTAACGTCTGGGAGCAAGACCACGTAGGATTACTTAGTCTCAAG
GCAATCACAAATGGATCCTTGGTTGGTGCCAAGAAGATTTTAGAAACTATTGGTGCAGTAGGAGTTGAGCCTTGGTTGGTGTCAAGAGGATGGTGA
Protein sequenceShow/hide protein sequence
MGSSEKSHHYFEGVLRFDRENFGLWKMQVKDLLTYKKIHKTLKERQAGMTDKDWAEMDEQAVAIIRLCLLMNVASLVENQKTAMRLMKALTDRYEKSYANSKVYLITRYF
NIHMEEGTSVNSHINEVTQLMNQLESMEITFSEEVKAIKLLSSLPDSWETMKTAVSNSLRDKKNRWKLMRGFEVVVVGHRKALVGDLKELAALTVKTDQENLPSVQVQQL
GSRENGKWNNSVRCSIDCQFRTPIVRRISELMKSRRRKGALRKTTVGPEVDGGVSGLGWECQVISEKAFLQESLGASEEGSVGDHLVRVGARGCVSNVWEQDHVGLLSLK
AITNGSLVGAKKILETIGAVGVEPWLVSRGW