; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g27010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g27010
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr9:20249317..20250462
RNA-Seq ExpressionMoc09g27010
SyntenyMoc09g27010
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022153526.1 uncharacterized protein LOC111021009 [Momordica charantia]5.4e-7146.93Show/hide
Query:  VMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDI----
        +M QML T+GQF GL +EDP+SHLKSF ++AN+F+LPG+S+DALRLK+FPFSL   A  WLNA  P+SIN+W  + +KFLAKY   T+NAD+RE+I    
Subjt:  VMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDI----

Query:  -------------------------------IEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQ--GEIGRSLPKKQVSVGIFELDTV
                                       IE FYRG D  + MMLNTAANG    K  NEIV IL+++T+ ND    E  R+ PK     G+F LD +
Subjt:  -------------------------------IEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQ--GEIGRSLPKKQVSVGIFELDTV

Query:  ASMQAQMAAMNQMLKQLTMEKETKT-VTSAIPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNLGWRHHPNFSWSNQGVA
        +SMQ+Q+  + QM+          T  +++    +P+  +++  C YCG++H  ENCP+ PAS+ YVGQG QRNF+PYSNTYN GWRHHPNFSWS QG +
Subjt:  ASMQAQMAAMNQMLKQLTMEKETKT-VTSAIPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNLGWRHHPNFSWSNQGVA

Query:  SSSAQAPAQ
        +++ Q   Q
Subjt:  SSSAQAPAQ

XP_022157400.1 uncharacterized protein LOC111024107 [Momordica charantia]6.4e-7245.48Show/hide
Query:  AGINDPLPQAAQFELKLVMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKY
        + + +PLP  AQFE K +M QML  + QFGGL +EDP SHLKSFI++AN  +LPG+S+DALRL +FPFSL   A  WLNA    +I TW+++ +KFL KY
Subjt:  AGINDPLPQAAQFELKLVMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKY

Query:  HTLTRNADLREDI-----------------------------------IEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQ--GEIGR
           TRNAD+RE+I                                   IE F+RG D  + MMLN AANG    K  NEIV IL+++++ NDQ   E  R
Subjt:  HTLTRNADLREDI-----------------------------------IEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQ--GEIGR

Query:  SLPKKQVSVGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNL
        +  K+    G+  LD + SMQ Q+  + QMLK +           A   PSP+ QI++ +C YCG+ H  ENCP+NP+S++YVGQ  Q+ FNPYSNTY+ 
Subjt:  SLPKKQVSVGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNL

Query:  GWRHHPNFSWSNQGVASSSAQ
        GW+ HPNFSWS QG +S + Q
Subjt:  GWRHHPNFSWSNQGVASSSAQ

XP_022158314.1 uncharacterized protein LOC111024824 [Momordica charantia]7.2e-9261.59Show/hide
Query:  MNRHAQDPPSPQNPPVNGDVAG---------------INDPLPQAAQFELKLVMFQMLQTM---GQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALR
        MN + QDPP P NPPV+GD AG               + D    A +  +      +   +   G      NEDPYSHLKSFIEIANAFQL GVSEDALR
Subjt:  MNRHAQDPPSPQNPPVNGDVAG---------------INDPLPQAAQFELKLVMFQMLQTM---GQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALR

Query:  LKMFPFSLRD--GARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIIEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQGE
        LKM      D    R   N         + EL  + L+  H L          IEQFYRGLDR S MMLNTAAN SL EK ++EI+ ILNKMTD NDQGE
Subjt:  LKMFPFSLRD--GARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIIEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQGE

Query:  IGRSLPKKQVSVGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNT
        IGRSLPKKQVS  +FELDTVASMQAQMA +NQMLKQLTMEKETKT TSA+ EPS  LQISDISCVYCG+N LYENCPANP S+FYVGQ AQRNFNPYSNT
Subjt:  IGRSLPKKQVSVGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNT

Query:  YNLGWRHHPNFSWSNQGVASSSAQAPAQ
        Y+  WR+HPNFSWSNQGVASSSAQ PAQ
Subjt:  YNLGWRHHPNFSWSNQGVASSSAQAPAQ

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia]2.4e-13570.87Show/hide
Query:  MNRHAQDPPSPQNPPVNGDVA--------------------------------------GINDPLPQAAQFELKLVMFQMLQTMGQFGGLTNEDPYSHLK
        MNR+AQDPP PQNPPVNGD+A                                      GIN+PLPQAAQFELK VMFQ+LQTMGQFGGLTNEDPYSHLK
Subjt:  MNRHAQDPPSPQNPPVNGDVA--------------------------------------GINDPLPQAAQFELKLVMFQMLQTMGQFGGLTNEDPYSHLK

Query:  SFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDI-----------------------------
        SFIEIANAFQLPG SEDALRLKMFPFSLRDGARTW+NALEPNSINTWAELT+KFLAKYHTLT+NADLREDI                             
Subjt:  SFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDI-----------------------------

Query:  ------IEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQGEIGRSLPKKQVSVGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVT
              IEQFYRGLDRSS MMLNT ANGSLLEK VNEIV +LNKMTDINDQGE+GRSLPKKQVS GIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVT
Subjt:  ------IEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQGEIGRSLPKKQVSVGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVT

Query:  SAIPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNLGWRHHPNFSWSNQGVASSSAQAPAQ
        SAIPE SPILQISDISCVYC                   GQGAQRNFNPYSNTYN GWRHHPNFSWSNQGVASSSAQAPAQ
Subjt:  SAIPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNLGWRHHPNFSWSNQGVASSSAQAPAQ

XP_022159127.1 uncharacterized protein LOC111025557 [Momordica charantia]1.2e-10270.63Show/hide
Query:  MGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDI------------
        M QFGG TNEDPYSHLKSFI+IANAFQLPGVSEDALRLKMFPFSLRDGA TW+N LE N I TWAELT+KFLAKYHTLTRNADL+EDI            
Subjt:  MGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDI------------

Query:  -----------------------IEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQGEIGRSLPKKQVSVGIFELDTVASMQAQMAAM
                               I+QFYRGLD    MM +TAAN SLLEK VNEI+ ILNKM DINDQ E+GRSLPKKQ S GIFELDTV S+QAQ++AM
Subjt:  -----------------------IEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQGEIGRSLPKKQVSVGIFELDTVASMQAQMAAM

Query:  NQMLKQLTMEKETKTVTSA-IPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNLGWRHHPNFSWSN
        +QMLKQLTM+K  K  TS  I EPS ILQISDISCVYC +NHLYENC ANPA IFYVGQG QRNFNPYSNTYN GWR HPNFS SN
Subjt:  NQMLKQLTMEKETKTVTSA-IPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNLGWRHHPNFSWSN

TrEMBL top hitse value%identityAlignment
A0A6J1DKX0 uncharacterized protein LOC1110210092.6e-7146.93Show/hide
Query:  VMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDI----
        +M QML T+GQF GL +EDP+SHLKSF ++AN+F+LPG+S+DALRLK+FPFSL   A  WLNA  P+SIN+W  + +KFLAKY   T+NAD+RE+I    
Subjt:  VMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDI----

Query:  -------------------------------IEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQ--GEIGRSLPKKQVSVGIFELDTV
                                       IE FYRG D  + MMLNTAANG    K  NEIV IL+++T+ ND    E  R+ PK     G+F LD +
Subjt:  -------------------------------IEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQ--GEIGRSLPKKQVSVGIFELDTV

Query:  ASMQAQMAAMNQMLKQLTMEKETKT-VTSAIPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNLGWRHHPNFSWSNQGVA
        +SMQ+Q+  + QM+          T  +++    +P+  +++  C YCG++H  ENCP+ PAS+ YVGQG QRNF+PYSNTYN GWRHHPNFSWS QG +
Subjt:  ASMQAQMAAMNQMLKQLTMEKETKT-VTSAIPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNLGWRHHPNFSWSNQGVA

Query:  SSSAQAPAQ
        +++ Q   Q
Subjt:  SSSAQAPAQ

A0A6J1DSZ5 uncharacterized protein LOC1110241073.1e-7245.48Show/hide
Query:  AGINDPLPQAAQFELKLVMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKY
        + + +PLP  AQFE K +M QML  + QFGGL +EDP SHLKSFI++AN  +LPG+S+DALRL +FPFSL   A  WLNA    +I TW+++ +KFL KY
Subjt:  AGINDPLPQAAQFELKLVMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKY

Query:  HTLTRNADLREDI-----------------------------------IEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQ--GEIGR
           TRNAD+RE+I                                   IE F+RG D  + MMLN AANG    K  NEIV IL+++++ NDQ   E  R
Subjt:  HTLTRNADLREDI-----------------------------------IEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQ--GEIGR

Query:  SLPKKQVSVGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNL
        +  K+    G+  LD + SMQ Q+  + QMLK +           A   PSP+ QI++ +C YCG+ H  ENCP+NP+S++YVGQ  Q+ FNPYSNTY+ 
Subjt:  SLPKKQVSVGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNL

Query:  GWRHHPNFSWSNQGVASSSAQ
        GW+ HPNFSWS QG +S + Q
Subjt:  GWRHHPNFSWSNQGVASSSAQ

A0A6J1DYY9 uncharacterized protein LOC1110255574.4e-10370.98Show/hide
Query:  MGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDI------------
        M QFGG TNEDPYSHLKSFI+IANAFQLPGVSEDALRLKMFPFSLRDGA TWLN LE N I TWAELT+KFLAKYHTLTRNADL+EDI            
Subjt:  MGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDI------------

Query:  -----------------------IEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQGEIGRSLPKKQVSVGIFELDTVASMQAQMAAM
                               I+QFYRGLD    MM +TAAN SLLEK VNEI+ ILNKM DINDQ E+GRSLPKKQ S GIFELDTV S+QAQ++AM
Subjt:  -----------------------IEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQGEIGRSLPKKQVSVGIFELDTVASMQAQMAAM

Query:  NQMLKQLTMEKETKTVTSA-IPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNLGWRHHPNFSWSN
        +QMLKQLTM+K  K  TS  I EPS ILQISDISCVYC +NHLYENC ANPA IFYVGQG QRNFNPYSNTYN GWR HPNFS SN
Subjt:  NQMLKQLTMEKETKTVTSA-IPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNLGWRHHPNFSWSN

A0A6J1DZ19 uncharacterized protein LOC1110248243.5e-9261.59Show/hide
Query:  MNRHAQDPPSPQNPPVNGDVAG---------------INDPLPQAAQFELKLVMFQMLQTM---GQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALR
        MN + QDPP P NPPV+GD AG               + D    A +  +      +   +   G      NEDPYSHLKSFIEIANAFQL GVSEDALR
Subjt:  MNRHAQDPPSPQNPPVNGDVAG---------------INDPLPQAAQFELKLVMFQMLQTM---GQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALR

Query:  LKMFPFSLRD--GARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIIEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQGE
        LKM      D    R   N         + EL  + L+  H L          IEQFYRGLDR S MMLNTAAN SL EK ++EI+ ILNKMTD NDQGE
Subjt:  LKMFPFSLRD--GARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIIEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQGE

Query:  IGRSLPKKQVSVGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNT
        IGRSLPKKQVS  +FELDTVASMQAQMA +NQMLKQLTMEKETKT TSA+ EPS  LQISDISCVYCG+N LYENCPANP S+FYVGQ AQRNFNPYSNT
Subjt:  IGRSLPKKQVSVGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNT

Query:  YNLGWRHHPNFSWSNQGVASSSAQAPAQ
        Y+  WR+HPNFSWSNQGVASSSAQ PAQ
Subjt:  YNLGWRHHPNFSWSNQGVASSSAQAPAQ

A0A6J1E251 uncharacterized protein LOC1110253021.2e-13570.87Show/hide
Query:  MNRHAQDPPSPQNPPVNGDVA--------------------------------------GINDPLPQAAQFELKLVMFQMLQTMGQFGGLTNEDPYSHLK
        MNR+AQDPP PQNPPVNGD+A                                      GIN+PLPQAAQFELK VMFQ+LQTMGQFGGLTNEDPYSHLK
Subjt:  MNRHAQDPPSPQNPPVNGDVA--------------------------------------GINDPLPQAAQFELKLVMFQMLQTMGQFGGLTNEDPYSHLK

Query:  SFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDI-----------------------------
        SFIEIANAFQLPG SEDALRLKMFPFSLRDGARTW+NALEPNSINTWAELT+KFLAKYHTLT+NADLREDI                             
Subjt:  SFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDI-----------------------------

Query:  ------IEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQGEIGRSLPKKQVSVGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVT
              IEQFYRGLDRSS MMLNT ANGSLLEK VNEIV +LNKMTDINDQGE+GRSLPKKQVS GIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVT
Subjt:  ------IEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQGEIGRSLPKKQVSVGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVT

Query:  SAIPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNLGWRHHPNFSWSNQGVASSSAQAPAQ
        SAIPE SPILQISDISCVYC                   GQGAQRNFNPYSNTYN GWRHHPNFSWSNQGVASSSAQAPAQ
Subjt:  SAIPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNLGWRHHPNFSWSNQGVASSSAQAPAQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAGACATGCACAAGATCCTCCATCGCCACAAAATCCACCTGTGAATGGAGATGTGGCAGGGATAAATGATCCTTTACCCCAAGCCGCACAATTCGAGCTC
AAGCTAGTCATGTTCCAGATGTTACAGACGATGGGCCAGTTCGGAGGATTGACTAACGAAGATCCTTACTCCCATCTCAAATCCTTTATTGAAATAGCTAATGCA
TTTCAACTTCCTGGTGTCTCTGAGGATGCACTAAGATTAAAAATGTTTCCTTTTTCTCTTAGGGATGGTGCAAGGACTTGGCTAAACGCGTTAGAACCAAATTCT
ATCAACACATGGGCAGAACTGACGGAGAAATTTTTGGCAAAGTACCACACTTTGACCAGGAACGCAGACCTTCGAGAGGACATTATTGAACAATTCTATAGAGGA
TTGGATCGTTCATCAATGATGATGTTGAACACTGCAGCCAATGGCTCGTTGTTAGAGAAGCCGGTAAATGAGATCGTTTATATCTTGAATAAGATGACAGATATT
AATGACCAAGGGGAAATAGGAAGGTCATTACCAAAGAAGCAAGTATCAGTTGGAATCTTTGAGTTAGACACAGTAGCTTCAATGCAAGCCCAAATGGCGGCTATG
AACCAAATGTTAAAGCAGTTGACAATGGAGAAGGAAACCAAAACCGTCACTTCGGCGATACCTGAACCCTCTCCTATTTTACAAATTTCAGATATATCTTGTGTC
TATTGTGGTAATAACCACTTGTATGAGAACTGTCCAGCTAATCCAGCGTCTATTTTCTATGTAGGTCAAGGTGCCCAGCGGAATTTCAACCCGTATTCAAACACT
TACAACCTTGGGTGGAGGCACCATCCAAACTTTTCCTGGAGTAACCAAGGAGTAGCTAGTAGCAGTGCACAAGCACCCGCTCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAATAGACATGCACAAGATCCTCCATCGCCACAAAATCCACCTGTGAATGGAGATGTGGCAGGGATAAATGATCCTTTACCCCAAGCCGCACAATTCGAGCTC
AAGCTAGTCATGTTCCAGATGTTACAGACGATGGGCCAGTTCGGAGGATTGACTAACGAAGATCCTTACTCCCATCTCAAATCCTTTATTGAAATAGCTAATGCA
TTTCAACTTCCTGGTGTCTCTGAGGATGCACTAAGATTAAAAATGTTTCCTTTTTCTCTTAGGGATGGTGCAAGGACTTGGCTAAACGCGTTAGAACCAAATTCT
ATCAACACATGGGCAGAACTGACGGAGAAATTTTTGGCAAAGTACCACACTTTGACCAGGAACGCAGACCTTCGAGAGGACATTATTGAACAATTCTATAGAGGA
TTGGATCGTTCATCAATGATGATGTTGAACACTGCAGCCAATGGCTCGTTGTTAGAGAAGCCGGTAAATGAGATCGTTTATATCTTGAATAAGATGACAGATATT
AATGACCAAGGGGAAATAGGAAGGTCATTACCAAAGAAGCAAGTATCAGTTGGAATCTTTGAGTTAGACACAGTAGCTTCAATGCAAGCCCAAATGGCGGCTATG
AACCAAATGTTAAAGCAGTTGACAATGGAGAAGGAAACCAAAACCGTCACTTCGGCGATACCTGAACCCTCTCCTATTTTACAAATTTCAGATATATCTTGTGTC
TATTGTGGTAATAACCACTTGTATGAGAACTGTCCAGCTAATCCAGCGTCTATTTTCTATGTAGGTCAAGGTGCCCAGCGGAATTTCAACCCGTATTCAAACACT
TACAACCTTGGGTGGAGGCACCATCCAAACTTTTCCTGGAGTAACCAAGGAGTAGCTAGTAGCAGTGCACAAGCACCCGCTCAATAA
Protein sequenceShow/hide protein sequence
MNRHAQDPPSPQNPPVNGDVAGINDPLPQAAQFELKLVMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNS
INTWAELTEKFLAKYHTLTRNADLREDIIEQFYRGLDRSSMMMLNTAANGSLLEKPVNEIVYILNKMTDINDQGEIGRSLPKKQVSVGIFELDTVASMQAQMAAM
NQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGNNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNLGWRHHPNFSWSNQGVASSSAQAPAQ