; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g22460 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g22460
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr4:16284904..16293858
RNA-Seq ExpressionMoc04g22460
SyntenyMoc04g22460
Gene Ontology termsGO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8658400.1 60S ribosomal protein L38 [Hibiscus syriacus]5.9e-3144.16Show/hide
Query:  SRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLL-------TPPVKSTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSP
        SR +   +      S     A+ L  FSK QLEQLY+L+       TP   S  +SS +AQ+G +  A  +   S+ WI+DSGATDHMT    +F+ Y P
Subjt:  SRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLL-------TPPVKSTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSP

Query:  NSIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNKQ
         S    VK+ADGS   I G GS+I+SP++TL +VLHVPKL CNLIS  ++ HD KC A  T +   FQD  +G  IGNA   +GLY+    +  NKQ
Subjt:  NSIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNKQ

RVW98618.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.6e-3144.57Show/hide
Query:  YQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVKSTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNSIQTHVKLADG
        +Q   + S    + +  P F+K QL  LY+L   P  S PS S   Q     AAL+S + +    WI+DSGATDHMT    +F+ Y P +    +K+ADG
Subjt:  YQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVKSTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNSIQTHVKLADG

Query:  SSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNK
        S + I G GSV +SP++TLH+VLHVP L CNL+S  K+T D +CQA F  S C FQ+  +G TIGNA    GLY+F   S   K
Subjt:  SSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNK

XP_004486931.2 uncharacterized protein LOC101513206 [Cicer arietinum]1.4e-3247.5Show/hide
Query:  RTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVKSTPSSSFVAQRGIF-SAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNSIQTHV
        R      Q G S    Q  ++S  PF+K QL+QLY+LL    +S  SS  +AQRG F + AL S   S  WI+DSGATDHMT    +F+ YSP +    +
Subjt:  RTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVKSTPSSSFVAQRGIF-SAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNSIQTHV

Query:  KLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNKQGKTEPITSSL
        K+ADGS + I G GSVILSP +TL  VLHVP L CNL+S  KLT D  CQA F  S C F+D  TG  IGNA    GLYY         Q KT     S+
Subjt:  KLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNKQGKTEPITSSL

XP_022159153.1 uncharacterized protein LOC111025577 [Momordica charantia]1.7e-7889.27Show/hide
Query:  SQDLATSLPPFSK---AQLEQLYRLLTPPVKSTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNSIQTHVKLADGSSAIIKGF
        S  LA   PP SK   AQLEQLYRLLT PV+STPSSSFVAQRGI SAALT QQHSDQWILDSGATDHMTAFHDMFTMYSPN IQTHVKLADGSSAIIKGF
Subjt:  SQDLATSLPPFSK---AQLEQLYRLLTPPVKSTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNSIQTHVKLADGSSAIIKGF

Query:  GSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNKQ
        GSVILSPNITLHSVL VPKLCCNLIS QKLTHD KCQALFTDSKCLFQD ITGTTIG+ADGFEGLYYFRGPSLRNKQ
Subjt:  GSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNKQ

XP_022159153.1 uncharacterized protein LOC111025577 [Momordica charantia]1.6e-0494.12Show/hide
Query:  MVKTAMMTDVCKDESFDGSNTTSISLPKSTSTTP
        MVKTAMMTDV KDES DGSNTTSISLPKSTSTTP
Subjt:  MVKTAMMTDVCKDESFDGSNTTSISLPKSTSTTP

XP_022159153.1 uncharacterized protein LOC111025577 [Momordica charantia]1.5e-4550.83Show/hide
Query:  MYSPNSIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNK
        MYSPNSIQTHVKLADGSSAII GFG VILS +I L  +L+VPKLC NLI  QKLTHD KCQA+FT SKCLFQD I GTTIG+AD FEGLYYFR PSLRNK
Subjt:  MYSPNSIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNK

Query:  QGKTEPITSSLDRNFWEIEDLNTRIESPQSKIPEIDGLNTESPQPKIPIPIIPITQIEESVPIIFCNNEDDQVNPNQSDKQPETLVYSRRQTVQRGVEPP
        Q                                                                                       +RQ VQRGVEPP
Subjt:  QGKTEPITSSLDRNFWEIEDLNTRIESPQSKIPEIDGLNTESPQPKIPIPIIPITQIEESVPIIFCNNEDDQVNPNQSDKQPETLVYSRRQTVQRGVEPP

Query:  QPQQQSHESISSLGTEQSTLVLQDNTNVLDLPIALRKVVE
        QPQQQSH+SISSLG +Q TLV QDN N+LDLPIALRK V+
Subjt:  QPQQQSHESISSLGTEQSTLVLQDNTNVLDLPIALRKVVE

TrEMBL top hitse value%identityAlignment
A0A1S2XBU5 uncharacterized protein LOC1015132066.8e-3347.5Show/hide
Query:  RTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVKSTPSSSFVAQRGIF-SAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNSIQTHV
        R      Q G S    Q  ++S  PF+K QL+QLY+LL    +S  SS  +AQRG F + AL S   S  WI+DSGATDHMT    +F+ YSP +    +
Subjt:  RTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVKSTPSSSFVAQRGIF-SAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNSIQTHV

Query:  KLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNKQGKTEPITSSL
        K+ADGS + I G GSVILSP +TL  VLHVP L CNL+S  KLT D  CQA F  S C F+D  TG  IGNA    GLYY         Q KT     S+
Subjt:  KLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNKQGKTEPITSSL

A0A438IPH4 Retrovirus-related Pol polyprotein from transposon RE17.6e-3244.57Show/hide
Query:  YQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVKSTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNSIQTHVKLADG
        +Q   + S    + +  P F+K QL  LY+L   P  S PS S   Q     AAL+S + +    WI+DSGATDHMT    +F+ Y P +    +K+ADG
Subjt:  YQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVKSTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMTAFHDMFTMYSPNSIQTHVKLADG

Query:  SSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNK
        S + I G GSV +SP++TLH+VLHVP L CNL+S  K+T D +CQA F  S C FQ+  +G TIGNA    GLY+F   S   K
Subjt:  SSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNK

A0A6A2WU09 60S ribosomal protein L382.9e-3144.16Show/hide
Query:  SRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLL-------TPPVKSTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSP
        SR +   +      S     A+ L  FSK QLEQLY+L+       TP   S  +SS +AQ+G +  A  +   S+ WI+DSGATDHMT    +F+ Y P
Subjt:  SRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLL-------TPPVKSTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSP

Query:  NSIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNKQ
         S    VK+ADGS   I G GS+I+SP++TL +VLHVPKL CNLIS  ++ HD KC A  T +   FQD  +G  IGNA   +GLY+    +  NKQ
Subjt:  NSIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNKQ

A0A6J1DY12 uncharacterized protein LOC1110255778.3e-7989.27Show/hide
Query:  SQDLATSLPPFSK---AQLEQLYRLLTPPVKSTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNSIQTHVKLADGSSAIIKGF
        S  LA   PP SK   AQLEQLYRLLT PV+STPSSSFVAQRGI SAALT QQHSDQWILDSGATDHMTAFHDMFTMYSPN IQTHVKLADGSSAIIKGF
Subjt:  SQDLATSLPPFSK---AQLEQLYRLLTPPVKSTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNSIQTHVKLADGSSAIIKGF

Query:  GSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNKQ
        GSVILSPNITLHSVL VPKLCCNLIS QKLTHD KCQALFTDSKCLFQD ITGTTIG+ADGFEGLYYFRGPSLRNKQ
Subjt:  GSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNKQ

A0A6J1DY12 uncharacterized protein LOC1110255777.9e-0594.12Show/hide
Query:  MVKTAMMTDVCKDESFDGSNTTSISLPKSTSTTP
        MVKTAMMTDV KDES DGSNTTSISLPKSTSTTP
Subjt:  MVKTAMMTDVCKDESFDGSNTTSISLPKSTSTTP

A0A6J1DY12 uncharacterized protein LOC1110255777.1e-4650.83Show/hide
Query:  MYSPNSIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNK
        MYSPNSIQTHVKLADGSSAII GFG VILS +I L  +L+VPKLC NLI  QKLTHD KCQA+FT SKCLFQD I GTTIG+AD FEGLYYFR PSLRNK
Subjt:  MYSPNSIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLYYFRGPSLRNK

Query:  QGKTEPITSSLDRNFWEIEDLNTRIESPQSKIPEIDGLNTESPQPKIPIPIIPITQIEESVPIIFCNNEDDQVNPNQSDKQPETLVYSRRQTVQRGVEPP
        Q                                                                                       +RQ VQRGVEPP
Subjt:  QGKTEPITSSLDRNFWEIEDLNTRIESPQSKIPEIDGLNTESPQPKIPIPIIPITQIEESVPIIFCNNEDDQVNPNQSDKQPETLVYSRRQTVQRGVEPP

Query:  QPQQQSHESISSLGTEQSTLVLQDNTNVLDLPIALRKVVE
        QPQQQSH+SISSLG +Q TLV QDN N+LDLPIALRK V+
Subjt:  QPQQQSHESISSLGTEQSTLVLQDNTNVLDLPIALRKVVE

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-0727.36Show/hide
Query:  KDESFDGSNTTSISLPKSTSTTPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVKSTPSSSFVAQRGIFSAALTSQQHSDQ
        ++  +D  N  + S P   S+    TN  P+   S  Y     +   Q         S  +  QL   L+      P S F   +   + AL S   S+ 
Subjt:  KDESFDGSNTTSISLPKSTSTTPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVKSTPSSSFVAQRGIFSAALTSQQHSDQ

Query:  WILDSGATDHMTAFHDMFTMYSPNSIQTHVKLADGSSAIIKGFGSVILSPN---ITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGT
        W+LDSGAT H+T+  +  +++ P +    V +ADGS+  I   GS  LS     + LH++L+VP +  NLIS  +L + +     F  +    +D  TG 
Subjt:  WILDSGATDHMTAFHDMFTMYSPNSIQTHVKLADGSSAIIKGFGSVILSPN---ITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGT

Query:  TIGNADGFEGLY
         +      + LY
Subjt:  TIGNADGFEGLY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.1e-0727.5Show/hide
Query:  SLPPFSKAQLEQLYRLLTPPVKSTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNSIQTHVKLADGSSAIIKGFGSVIL---S
        S+   S  +  QL++  +   +   +S F   +   + A+ S  +++ W+LDSGAT H+T+  +  + + P +    V +ADGS+  I   GS  L   S
Subjt:  SLPPFSKAQLEQLYRLLTPPVKSTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNSIQTHVKLADGSSAIIKGFGSVIL---S

Query:  PNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLY
         ++ L+ VL+VP +  NLIS  +L + ++    F  +    +D  TG  +      + LY
Subjt:  PNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTIGNADGFEGLY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTAAAACAGCCATGATGACTGATGTGTGCAAGGATGAGAGTTTCGACGGATCGAATACGACCTCAATTTCTCTCCCTAAAAGCACATCCACGACACCTCCA
CCAACAAATACTCCACCTTCTCGAACTAGCTCCTCCGGCTATCAAGTGGGCCCTAGTGTGTCAAACTCCCAAGATTTGGCCACCTCTCTCCCTCCATTTTCGAAG
GCACAACTTGAACAGCTCTATCGCCTCTTAACACCGCCGGTTAAGTCTACTCCTTCGTCAAGTTTTGTGGCACAACGAGGTATTTTTAGTGCAGCTTTAACAAGT
CAGCAGCATTCCGATCAGTGGATCTTAGATTCGGGTGCAACTGATCATATGACTGCTTTTCATGATATGTTTACCATGTACTCACCCAACTCGATTCAGACACAT
GTCAAGCTTGCAGATGGGTCATCGGCCATTATTAAGGGCTTTGGTTCTGTTATTCTTAGCCCAAACATTACATTGCACTCAGTGCTCCATGTGCCTAAATTATGT
TGCAATCTAATTTCTTTTCAGAAGTTGACTCATGATTCAAAGTGTCAAGCCCTGTTCACTGACTCTAAGTGTTTGTTTCAGGACTCGATAACGGGAACGACGATT
GGCAATGCTGATGGCTTTGAAGGGCTCTACTACTTCAGAGGACCAAGTCTAAGAAATAAACAAGGGAAGACTGAGCCTATTACAAGTAGTCTTGATAGAAATTTT
TGGGAGATTGAGGATCTAAATACCAGAATTGAGTCCCCTCAGTCAAAAATACCTGAGATTGATGGTCTAAATACCGAGTCACCTCAGCCAAAAATACCTATCCCG
ATCATCCCAATTACTCAGATAGAAGAGTCAGTTCCTATTATTTTCTGTAATAATGAAGATGATCAAGTCAACCCAAATCAAAGTGACAAGCAACCTGAGACTCTT
GTTTATTCTCGGCGACAAACGGTTCAAAGAGGAGTGGAGCCACCACAGCCTCAACAGCAAAGTCATGAATCCATCTCGTCCTTAGGTACTGAACAATCTACCCTT
GTGCTTCAAGACAATACTAATGTTCTTGATCTTCCTATTGCACTTAGGAAGGTAGTAGAAGGGTTAGATGCACCTGGTAAGGTGTCTAAGGGTGGTGGTAAAGGA
TTTGATATAGGAAATGTGAGGGCTGAGAGGTACTTCGAAATCCATAAGTTGACGGATGTAATGAAGATGGTGATTGCGGTAATAAGCTTCAATGGGGTCGCTCTA
GCGTGGCATCGATCGACAGACAACAGAGAGAAGTTTACATATTGGGAAAACTTGAAAACACGTTTGCTAGGTTGTTTCAGCTTTGAAGCTCTATCAGCCCCATTA
CCTCAGCTTTCGGAGGAAGTTCTTAAGAGTGCTTTTCTGAATGAGCTGGATCCTGTGGTGCAAGCAAGATGTGAGGATAAGTACACAGTGGGACACCAGTGCCAT
AACCAAGAGCTAAGAGTGTTTGTTGTGCACGACGAAGAGCTAATGATCGTCGAAGAAGAGGCCATCGATGTAGGAACAGATGAGAATGAGGCTGTAATCGGAAAA
ATCATAGTGTTGTTTGAACATGATTGTGGGGCTGTCAAGACCAGGGACGATAAAGATCAAGGGAACGGCACAAAGAAGGGAATTCGTGGTACTCCTAGACTGCGG
AGCCCCTCACAACTTCATCTCACAGAAGTTAGTAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTAAAACAGCCATGATGACTGATGTGTGCAAGGATGAGAGTTTCGACGGATCGAATACGACCTCAATTTCTCTCCCTAAAAGCACATCCACGACACCTCCA
CCAACAAATACTCCACCTTCTCGAACTAGCTCCTCCGGCTATCAAGTGGGCCCTAGTGTGTCAAACTCCCAAGATTTGGCCACCTCTCTCCCTCCATTTTCGAAG
GCACAACTTGAACAGCTCTATCGCCTCTTAACACCGCCGGTTAAGTCTACTCCTTCGTCAAGTTTTGTGGCACAACGAGGTATTTTTAGTGCAGCTTTAACAAGT
CAGCAGCATTCCGATCAGTGGATCTTAGATTCGGGTGCAACTGATCATATGACTGCTTTTCATGATATGTTTACCATGTACTCACCCAACTCGATTCAGACACAT
GTCAAGCTTGCAGATGGGTCATCGGCCATTATTAAGGGCTTTGGTTCTGTTATTCTTAGCCCAAACATTACATTGCACTCAGTGCTCCATGTGCCTAAATTATGT
TGCAATCTAATTTCTTTTCAGAAGTTGACTCATGATTCAAAGTGTCAAGCCCTGTTCACTGACTCTAAGTGTTTGTTTCAGGACTCGATAACGGGAACGACGATT
GGCAATGCTGATGGCTTTGAAGGGCTCTACTACTTCAGAGGACCAAGTCTAAGAAATAAACAAGGGAAGACTGAGCCTATTACAAGTAGTCTTGATAGAAATTTT
TGGGAGATTGAGGATCTAAATACCAGAATTGAGTCCCCTCAGTCAAAAATACCTGAGATTGATGGTCTAAATACCGAGTCACCTCAGCCAAAAATACCTATCCCG
ATCATCCCAATTACTCAGATAGAAGAGTCAGTTCCTATTATTTTCTGTAATAATGAAGATGATCAAGTCAACCCAAATCAAAGTGACAAGCAACCTGAGACTCTT
GTTTATTCTCGGCGACAAACGGTTCAAAGAGGAGTGGAGCCACCACAGCCTCAACAGCAAAGTCATGAATCCATCTCGTCCTTAGGTACTGAACAATCTACCCTT
GTGCTTCAAGACAATACTAATGTTCTTGATCTTCCTATTGCACTTAGGAAGGTAGTAGAAGGGTTAGATGCACCTGGTAAGGTGTCTAAGGGTGGTGGTAAAGGA
TTTGATATAGGAAATGTGAGGGCTGAGAGGTACTTCGAAATCCATAAGTTGACGGATGTAATGAAGATGGTGATTGCGGTAATAAGCTTCAATGGGGTCGCTCTA
GCGTGGCATCGATCGACAGACAACAGAGAGAAGTTTACATATTGGGAAAACTTGAAAACACGTTTGCTAGGTTGTTTCAGCTTTGAAGCTCTATCAGCCCCATTA
CCTCAGCTTTCGGAGGAAGTTCTTAAGAGTGCTTTTCTGAATGAGCTGGATCCTGTGGTGCAAGCAAGATGTGAGGATAAGTACACAGTGGGACACCAGTGCCAT
AACCAAGAGCTAAGAGTGTTTGTTGTGCACGACGAAGAGCTAATGATCGTCGAAGAAGAGGCCATCGATGTAGGAACAGATGAGAATGAGGCTGTAATCGGAAAA
ATCATAGTGTTGTTTGAACATGATTGTGGGGCTGTCAAGACCAGGGACGATAAAGATCAAGGGAACGGCACAAAGAAGGGAATTCGTGGTACTCCTAGACTGCGG
AGCCCCTCACAACTTCATCTCACAGAAGTTAGTAAATGA
Protein sequenceShow/hide protein sequence
MVKTAMMTDVCKDESFDGSNTTSISLPKSTSTTPPPTNTPPSRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVKSTPSSSFVAQRGIFSAALTS
QQHSDQWILDSGATDHMTAFHDMFTMYSPNSIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISFQKLTHDSKCQALFTDSKCLFQDSITGTTI
GNADGFEGLYYFRGPSLRNKQGKTEPITSSLDRNFWEIEDLNTRIESPQSKIPEIDGLNTESPQPKIPIPIIPITQIEESVPIIFCNNEDDQVNPNQSDKQPETL
VYSRRQTVQRGVEPPQPQQQSHESISSLGTEQSTLVLQDNTNVLDLPIALRKVVEGLDAPGKVSKGGGKGFDIGNVRAERYFEIHKLTDVMKMVIAVISFNGVAL
AWHRSTDNREKFTYWENLKTRLLGCFSFEALSAPLPQLSEEVLKSAFLNELDPVVQARCEDKYTVGHQCHNQELRVFVVHDEELMIVEEEAIDVGTDENEAVIGK
IIVLFEHDCGAVKTRDDKDQGNGTKKGIRGTPRLRSPSQLHLTEVSK