CuGenDBv2

Moc08g31210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g31210
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr8:22359068..22365609
RNA-Seq Expression: Moc08g31210
Synteny: Moc08g31210
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
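
As an illustration of how the genome location string can be read, the sketch below parses `chr8:22359068..22365609` and reports the length of the span. It assumes the coordinates are 1-based and inclusive, which this record does not state, and the function name `parse_location` is hypothetical.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into its parts and the span length.

    Assumption: coordinates are 1-based and inclusive, so the length
    is end - start + 1. The record itself does not state the convention.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("chr8:22359068..22365609")
print(chrom, start, end, length)  # chr8 22359068 22365609 6542
```

Under the inclusive-coordinate assumption, the gene region spans 6542 bp.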


Homology
GenBank top hits (e value, % identity, alignment)
XP_022157400.1 uncharacterized protein LOC111024107 [Momordica charantia] | e value: 5.9e-50 | % identity: 37.17
Query:  IPNPILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAVQFELKPGWCKDL--------------------------------------------------
        +PNPI +AD +D AMR+Y      +LNS + NPLP   QFE KP   + L                                                  
Subjt:  IPNPILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAVQFELKPGWCKDL--------------------------------------------------

Query:  ------AKPVGTKFYQHMVELTKKFLAKYHTLTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHGLPACVQIEQFYRGLDCSSRMMLNTAANGS
              A P GT       ++  KFL KY   TRNAD RE I+SFRQKENEAV  AWE FK+L+R C + G+PACVQIE F+RG D  ++MMLN AANG 
Subjt:  ------AKPVGTKFYQHMVELTKKFLAKYHTLTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHGLPACVQIEQFYRGLDCSSRMMLNTAANGS

Query:  LLEKSVNEIIDILNKMTDIND---------QGEIGSPNG----------------------GYEPKVKAVGNGEGNQNP---------------------
           KS NEI++IL+++++ ND         Q +   P G                        E    A  +     NP                     
Subjt:  LLEKSVNEIIDILNKMTDIND---------QGEIGSPNG----------------------GYEPKVKAVGNGEGNQNP---------------------

Query:  ----NPASIFYVGQGARRNFNPYSNTYNLGWRHHPNFSWNNQGVASSSAQASAQQYKQNYTPPSFPTQPA--SQPQQYNQQR
            NP+S++YVGQ  ++ FNPYSNTY+ GW+ HPNFSW+ QG  SSS     QQYKQ YTPP FP  PA    P QYNQQ+
Subjt:  ----NPASIFYVGQGARRNFNPYSNTYNLGWRHHPNFSWNNQGVASSSAQASAQQYKQNYTPPSFPTQPA--SQPQQYNQQR

XP_022158314.1 uncharacterized protein LOC111024824 [Momordica charantia] | e value: 4.3e-93 | % identity: 56.99
Query:  MNRNPQDHSPPQNPHVNRDMAGEGAANRAGEIPNPILLADNRDVAMRNYVTHAFHNLNSGI--NNPLPQAVQFELKPGWCKDLAKPVGTKFYQHM---VE
        MN NPQD   P NP V+ D AGEGAANRAGE+PNPILL DNRDVA+RNYVTHAFHNLNS +  + P+ +A           D   P     Y H+   +E
Subjt:  MNRNPQDHSPPQNPHVNRDMAGEGAANRAGEIPNPILLADNRDVAMRNYVTHAFHNLNSGI--NNPLPQAVQFELKPGWCKDLAKPVGTKFYQHM---VE

Query:  LTKKFLAKYHT-----LTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHGLPACVQIEQFYRGLDCSSRMMLNTAANGSLLEKSVNEIIDILNK
        +   F     +     L  NAD RE+IVSFRQKENEAVQE WERFKELLRRCLSHGLP CVQIEQFYRGLD  SRMMLNTAAN SL EKS++EIIDILNK
Subjt:  LTKKFLAKYHT-----LTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHGLPACVQIEQFYRGLDCSSRMMLNTAANGSLLEKSVNEIIDILNK

Query:  MTDINDQGEIG-------------------------------------------SPNGGYEPK-------VKAVGNGEG----NQNPNPASIFYVGQGAR
        MTD NDQGEIG                                           + +   EP        +  V  G+     N   NP S+FYVGQ A+
Subjt:  MTDINDQGEIG-------------------------------------------SPNGGYEPK-------VKAVGNGEG----NQNPNPASIFYVGQGAR

Query:  RNFNPYSNTYNLGWRHHPNFSWNNQGVASSSAQASAQQYKQNYTPPSFPTQPASQPQQYNQQRGQNTTQHSG
        RNFNPYSNTY+  WR+HPNFSW+NQGVASSSAQ  AQQYKQNYTPP FPTQPASQPQQYNQQR QNTTQ  G
Subjt:  RNFNPYSNTYNLGWRHHPNFSWNNQGVASSSAQASAQQYKQNYTPPSFPTQPASQPQQYNQQRGQNTTQHSG

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia] | e value: 9.3e-96 | % identity: 58.08
Query:  MNRNPQDHSPPQNPHVNRDMAGEGAANRAGEIPNPILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAVQFELKPGWCKDLAKP------VGTKFYQHM-
        MNRN QD  PPQNP VN DMAGE AANR GEIPN ILLADNRDVAMRNYVTHAFHNLNSGINNPLPQA QFELKP   + L              Y H+ 
Subjt:  MNRNPQDHSPPQNPHVNRDMAGEGAANRAGEIPNPILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAVQFELKPGWCKDLAKP------VGTKFYQHM-

Query:  -----------------------------------------------VELTKKFLAKYHTLTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHG
                                                        ELT KFLAKYHTLT+NAD RE+IVSFRQKENEAVQEAWERFKELLRRC SHG
Subjt:  -----------------------------------------------VELTKKFLAKYHTLTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHG

Query:  LPACVQIEQFYRGLDCSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQGEIGS-------PNGGY------------------------EPKVKAVG
        LP+CVQIEQFYRGLD SS+MMLNT ANGSLLEKSVNEI+D+LNKMTDINDQGE+G          G +                        E + K V 
Subjt:  LPACVQIEQFYRGLDCSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQGEIGS-------PNGGY------------------------EPKVKAVG

Query:  NGEGNQNP----NPASIFYVGQGARRNFNPYSNTYNLGWRHHPNFSWNNQGVASSSAQASAQQYK
        +     +P    +  S  Y GQGA+RNFNPYSNTYN GWRHHPNFSW+NQGVASSSAQA AQQYK
Subjt:  NGEGNQNP----NPASIFYVGQGARRNFNPYSNTYNLGWRHHPNFSWNNQGVASSSAQASAQQYK

XP_022159127.1 uncharacterized protein LOC111025557 [Momordica charantia] | e value: 1.3e-49 | % identity: 55.51
Query:  FYQHMVELTKKFLAKYHTLTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHGLPACVQIEQFYRGLDCSSRMMLNTAANGSLLEKSVNEIIDIL
        F     ELT KFLAKYHTLTRNAD +E+IVSFRQ+E+EAVQEAWERFKELL+RC SHGLP CVQI+QFYRGLD   RMM +TAAN SLLEKSVNEIIDIL
Subjt:  FYQHMVELTKKFLAKYHTLTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHGLPACVQIEQFYRGLDCSSRMMLNTAANGSLLEKSVNEIIDIL

Query:  NKMTDINDQGEIG------SPNGG--------------------------------------YEPK----------VKAVGNG-EGNQNPNPASIFYVGQ
        NKM DINDQ E+G        + G                                       EP           V  V N    N + NPA IFYVGQ
Subjt:  NKMTDINDQGEIG------SPNGG--------------------------------------YEPK----------VKAVGNG-EGNQNPNPASIFYVGQ

Query:  GARRNFNPYSNTYNLGWRHHPNFSWNN
        G +RNFNPYSNTYN GWR HPNFS +N
Subjt:  GARRNFNPYSNTYNLGWRHHPNFSWNN

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia] | e value: 3.2e-48 | % identity: 30.88
Query:  IPNPILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAVQFELKP---------------------GWCKDLAKPVGT-----------------------
        +PNPI +AD RD AMR+Y      +LNS + N  P   +FE KP                        K   K   T                       
Subjt:  IPNPILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAVQFELKP---------------------GWCKDLAKPVGT-----------------------

Query:  ------KFYQHMV----ELTKKFLAKYHTLTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHGLPACVQIEQFYRGLDCSSRMMLNTAANGSLL
               F    +    ++  KFL KY   TRNAD RE I+SFRQKENEAV  AWERFK+L+  C + G+PACVQIE F+RG D  ++MMLN AANG   
Subjt:  ------KFYQHMV----ELTKKFLAKYHTLTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHGLPACVQIEQFYRGLDCSSRMMLNTAANGSLL

Query:  EKSVNEIIDILNKMTDIN---------DQGEIGSPNG----------------------GYEPKVKAVGNGEGNQNP-----------------------
         KS NEI++IL+++++ N          Q +   P G                        E       +     NP                       
Subjt:  EKSVNEIIDILNKMTDIN---------DQGEIGSPNG----------------------GYEPKVKAVGNGEGNQNP-----------------------

Query:  --NPASIFYVGQGARRNFNPYSNTYNLGWRHHPNFSWNNQGVASSSAQASAQQYKQNYTPPSFPTQPA--SQPQQYNQQRG-------------------
          NP+S++YVGQ  ++ FNPYSNTYN GW+ HPNFSW+ QG  SS+     QQYK+ YTPP FP  PA    P QYNQQ+                    
Subjt:  --NPASIFYVGQGARRNFNPYSNTYNLGWRHHPNFSWNNQGVASSSAQASAQQYKQNYTPPSFPTQPA--SQPQQYNQQRG-------------------

Query:  ----------------------QNTTQHSGR-------------------------KFGGDDERGDREGKEHCKAVITRSGLSYNGPSPPNEGTNAVTPV
                              ++   + GR                               E   R GKEHC ++ TRSGL Y GP  P+E +++    
Subjt:  ----------------------QNTTQHSGR-------------------------KFGGDDERGDREGKEHCKAVITRSGLSYNGPSPPNEGTNAVTPV

Query:  CASTSNPQELVSSEEKGKKADKDKQVVPSTT----PQVGNLQCP
                    S EK  +A  DK V P+ +    PQV N + P
Subjt:  CASTSNPQELVSSEEKGKKADKDKQVVPSTT----PQVGNLQCP

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DSZ5 uncharacterized protein LOC111024107 | e value: 2.9e-50 | % identity: 37.17
Query:  IPNPILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAVQFELKPGWCKDL--------------------------------------------------
        +PNPI +AD +D AMR+Y      +LNS + NPLP   QFE KP   + L                                                  
Subjt:  IPNPILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAVQFELKPGWCKDL--------------------------------------------------

Query:  ------AKPVGTKFYQHMVELTKKFLAKYHTLTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHGLPACVQIEQFYRGLDCSSRMMLNTAANGS
              A P GT       ++  KFL KY   TRNAD RE I+SFRQKENEAV  AWE FK+L+R C + G+PACVQIE F+RG D  ++MMLN AANG 
Subjt:  ------AKPVGTKFYQHMVELTKKFLAKYHTLTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHGLPACVQIEQFYRGLDCSSRMMLNTAANGS

Query:  LLEKSVNEIIDILNKMTDIND---------QGEIGSPNG----------------------GYEPKVKAVGNGEGNQNP---------------------
           KS NEI++IL+++++ ND         Q +   P G                        E    A  +     NP                     
Subjt:  LLEKSVNEIIDILNKMTDIND---------QGEIGSPNG----------------------GYEPKVKAVGNGEGNQNP---------------------

Query:  ----NPASIFYVGQGARRNFNPYSNTYNLGWRHHPNFSWNNQGVASSSAQASAQQYKQNYTPPSFPTQPA--SQPQQYNQQR
            NP+S++YVGQ  ++ FNPYSNTY+ GW+ HPNFSW+ QG  SSS     QQYKQ YTPP FP  PA    P QYNQQ+
Subjt:  ----NPASIFYVGQGARRNFNPYSNTYNLGWRHHPNFSWNNQGVASSSAQASAQQYKQNYTPPSFPTQPA--SQPQQYNQQR

A0A6J1DY39 uncharacterized protein LOC111025653 | e value: 1.6e-48 | % identity: 30.88
Query:  IPNPILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAVQFELKP---------------------GWCKDLAKPVGT-----------------------
        +PNPI +AD RD AMR+Y      +LNS + N  P   +FE KP                        K   K   T                       
Subjt:  IPNPILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAVQFELKP---------------------GWCKDLAKPVGT-----------------------

Query:  ------KFYQHMV----ELTKKFLAKYHTLTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHGLPACVQIEQFYRGLDCSSRMMLNTAANGSLL
               F    +    ++  KFL KY   TRNAD RE I+SFRQKENEAV  AWERFK+L+  C + G+PACVQIE F+RG D  ++MMLN AANG   
Subjt:  ------KFYQHMV----ELTKKFLAKYHTLTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHGLPACVQIEQFYRGLDCSSRMMLNTAANGSLL

Query:  EKSVNEIIDILNKMTDIN---------DQGEIGSPNG----------------------GYEPKVKAVGNGEGNQNP-----------------------
         KS NEI++IL+++++ N          Q +   P G                        E       +     NP                       
Subjt:  EKSVNEIIDILNKMTDIN---------DQGEIGSPNG----------------------GYEPKVKAVGNGEGNQNP-----------------------

Query:  --NPASIFYVGQGARRNFNPYSNTYNLGWRHHPNFSWNNQGVASSSAQASAQQYKQNYTPPSFPTQPA--SQPQQYNQQRG-------------------
          NP+S++YVGQ  ++ FNPYSNTYN GW+ HPNFSW+ QG  SS+     QQYK+ YTPP FP  PA    P QYNQQ+                    
Subjt:  --NPASIFYVGQGARRNFNPYSNTYNLGWRHHPNFSWNNQGVASSSAQASAQQYKQNYTPPSFPTQPA--SQPQQYNQQRG-------------------

Query:  ----------------------QNTTQHSGR-------------------------KFGGDDERGDREGKEHCKAVITRSGLSYNGPSPPNEGTNAVTPV
                              ++   + GR                               E   R GKEHC ++ TRSGL Y GP  P+E +++    
Subjt:  ----------------------QNTTQHSGR-------------------------KFGGDDERGDREGKEHCKAVITRSGLSYNGPSPPNEGTNAVTPV

Query:  CASTSNPQELVSSEEKGKKADKDKQVVPSTT----PQVGNLQCP
                    S EK  +A  DK V P+ +    PQV N + P
Subjt:  CASTSNPQELVSSEEKGKKADKDKQVVPSTT----PQVGNLQCP

A0A6J1DYY9 uncharacterized protein LOC111025557 | e value: 6.4e-50 | % identity: 55.51
Query:  FYQHMVELTKKFLAKYHTLTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHGLPACVQIEQFYRGLDCSSRMMLNTAANGSLLEKSVNEIIDIL
        F     ELT KFLAKYHTLTRNAD +E+IVSFRQ+E+EAVQEAWERFKELL+RC SHGLP CVQI+QFYRGLD   RMM +TAAN SLLEKSVNEIIDIL
Subjt:  FYQHMVELTKKFLAKYHTLTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHGLPACVQIEQFYRGLDCSSRMMLNTAANGSLLEKSVNEIIDIL

Query:  NKMTDINDQGEIG------SPNGG--------------------------------------YEPK----------VKAVGNG-EGNQNPNPASIFYVGQ
        NKM DINDQ E+G        + G                                       EP           V  V N    N + NPA IFYVGQ
Subjt:  NKMTDINDQGEIG------SPNGG--------------------------------------YEPK----------VKAVGNG-EGNQNPNPASIFYVGQ

Query:  GARRNFNPYSNTYNLGWRHHPNFSWNN
        G +RNFNPYSNTYN GWR HPNFS +N
Subjt:  GARRNFNPYSNTYNLGWRHHPNFSWNN

A0A6J1DZ19 uncharacterized protein LOC111024824 | e value: 2.1e-93 | % identity: 56.99
Query:  MNRNPQDHSPPQNPHVNRDMAGEGAANRAGEIPNPILLADNRDVAMRNYVTHAFHNLNSGI--NNPLPQAVQFELKPGWCKDLAKPVGTKFYQHM---VE
        MN NPQD   P NP V+ D AGEGAANRAGE+PNPILL DNRDVA+RNYVTHAFHNLNS +  + P+ +A           D   P     Y H+   +E
Subjt:  MNRNPQDHSPPQNPHVNRDMAGEGAANRAGEIPNPILLADNRDVAMRNYVTHAFHNLNSGI--NNPLPQAVQFELKPGWCKDLAKPVGTKFYQHM---VE

Query:  LTKKFLAKYHT-----LTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHGLPACVQIEQFYRGLDCSSRMMLNTAANGSLLEKSVNEIIDILNK
        +   F     +     L  NAD RE+IVSFRQKENEAVQE WERFKELLRRCLSHGLP CVQIEQFYRGLD  SRMMLNTAAN SL EKS++EIIDILNK
Subjt:  LTKKFLAKYHT-----LTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHGLPACVQIEQFYRGLDCSSRMMLNTAANGSLLEKSVNEIIDILNK

Query:  MTDINDQGEIG-------------------------------------------SPNGGYEPK-------VKAVGNGEG----NQNPNPASIFYVGQGAR
        MTD NDQGEIG                                           + +   EP        +  V  G+     N   NP S+FYVGQ A+
Subjt:  MTDINDQGEIG-------------------------------------------SPNGGYEPK-------VKAVGNGEG----NQNPNPASIFYVGQGAR

Query:  RNFNPYSNTYNLGWRHHPNFSWNNQGVASSSAQASAQQYKQNYTPPSFPTQPASQPQQYNQQRGQNTTQHSG
        RNFNPYSNTY+  WR+HPNFSW+NQGVASSSAQ  AQQYKQNYTPP FPTQPASQPQQYNQQR QNTTQ  G
Subjt:  RNFNPYSNTYNLGWRHHPNFSWNNQGVASSSAQASAQQYKQNYTPPSFPTQPASQPQQYNQQRGQNTTQHSG

A0A6J1E251 uncharacterized protein LOC111025302 | e value: 4.5e-96 | % identity: 58.08
Query:  MNRNPQDHSPPQNPHVNRDMAGEGAANRAGEIPNPILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAVQFELKPGWCKDLAKP------VGTKFYQHM-
        MNRN QD  PPQNP VN DMAGE AANR GEIPN ILLADNRDVAMRNYVTHAFHNLNSGINNPLPQA QFELKP   + L              Y H+ 
Subjt:  MNRNPQDHSPPQNPHVNRDMAGEGAANRAGEIPNPILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAVQFELKPGWCKDLAKP------VGTKFYQHM-

Query:  -----------------------------------------------VELTKKFLAKYHTLTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHG
                                                        ELT KFLAKYHTLT+NAD RE+IVSFRQKENEAVQEAWERFKELLRRC SHG
Subjt:  -----------------------------------------------VELTKKFLAKYHTLTRNADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHG

Query:  LPACVQIEQFYRGLDCSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQGEIGS-------PNGGY------------------------EPKVKAVG
        LP+CVQIEQFYRGLD SS+MMLNT ANGSLLEKSVNEI+D+LNKMTDINDQGE+G          G +                        E + K V 
Subjt:  LPACVQIEQFYRGLDCSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQGEIGS-------PNGGY------------------------EPKVKAVG

Query:  NGEGNQNP----NPASIFYVGQGARRNFNPYSNTYNLGWRHHPNFSWNNQGVASSSAQASAQQYK
        +     +P    +  S  Y GQGA+RNFNPYSNTYN GWRHHPNFSW+NQGVASSSAQA AQQYK
Subjt:  NGEGNQNP----NPASIFYVGQGARRNFNPYSNTYNLGWRHHPNFSWNNQGVASSSAQASAQQYK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAATAGAAACCCACAAGATCATTCACCGCCACAAAATCCACATGTGAATAGAGATATGGCAGGTGAAGGAGCAGCAAACCGAGCAGGAGAAATTCCTAATCCGATCCT
TCTAGCAGATAATCGAGATGTAGCCATGCGGAATTATGTCACTCATGCGTTCCACAACCTAAATTCAGGGATAAATAATCCTTTACCCCAAGCCGTACAGTTCGAGCTCA
AGCCAGGATGGTGCAAGGACTTGGCTAAACCCGTTGGAACCAAATTCTATCAACACATGGTAGAACTGACGAAGAAATTTTTGGCAAAGTACCATACTTTGACCAGAAAC
GCAGACTTTCGAGAGAACATTGTGTCTTTTAGACAGAAAGAGAACGAAGCAGTTCAAGAAGCTTGGGAGCGTTTTAAGGAGTTACTTAGAAGATGCCTAAGCCATGGATT
GCCCGCATGTGTGCAGATTGAACAATTCTATAGAGGATTGGATTGTTCATCAAGAATGATGTTGAACACTGCAGCTAATGGCTCGTTGTTAGAGAAGTCGGTAAATGAGA
TCATTGATATTTTGAATAAGATGACAGACATTAATGACCAAGGCGAAATAGGAAGCCCAAATGGCGGCTATGAACCAAAAGTTAAAGCAGTTGGCAATGGAGAAGGAAAC
CAAAACCCTAATCCAGCGTCTATTTTCTATGTAGGTCAAGGTGCCCGGCGGAATTTCAACCCGTATTCAAACACTTACAACCTTGGATGGAGGCACCATCCAAACTTTTC
CTGGAATAACCAAGGAGTAGCTAGTAGTAGTGCACAAGCATCCGCTCAACAATACAAGCAAAACTACACTCCTCCTAGTTTTCCAACTCAACCGGCGTCGCAGCCTCAAC
AATACAATCAGCAAAGAGGTCAAAATACTACTCAGCATAGTGGTCGCAAGTTTGGAGGCGATGATGAAAGAGGAGATCGTGAGGGAAAGGAGCACTGTAAGGCAGTTATC
ACGAGAAGCGGATTAAGTTATAATGGACCCTCACCTCCAAACGAAGGAACTAATGCAGTTACACCTGTTTGTGCATCCACATCCAATCCACAAGAACTTGTAAGTTCAGA
AGAAAAAGGTAAGAAGGCGGATAAAGATAAGCAAGTAGTGCCCAGCACTACTCCACAGGTAGGTAACCTTCAATGTCCTCGATGCGATCGTCTCCCAGATGAAGTCGAGG
AGTGCTGTACAATAGGAGAAATCATGGAGGAACTCCAGCAAATGATGGTGGAAGACTTGGAAGCAGATTTAAAGGCTGCAGAAAAAGAATCCATTTTGCCCCAATTTGAG
CGTTTTGAGTTTTTGCAGCCGACAATAGCGGATTTGAAGGCCTTGCAACCTTCCATCATTGAACCTCCAGAATTGGAAAAGAAACCTCTACCTTCTCATTTAAAATATGC
TTATTTGGAGGATAAAGGTGCTATTTTAGATGAAGAGATAGCTAGACTTCAAGAGAGGGCGGAGATGTTCAGTAAAAATAACGAAATTAGGGACAAAGAGAATGAGAGGG
TTTATGCGAAAATTGAGGAATTAAACATAAAATGGCAAGAATTCATGGAAAATTCAAAGAAAGTGAGCGAGGAGATTCAACTTGAGTTAAATAGCATGAGTATACGTCGT
AGGATGAATCTTTCTCAAGATAACCCCGTTTCCAAGTCTTTAGAACTGTCTATCCCTCCTCCTCTTTCCACTACTGCTGCTGTACATGTTGAAGGTCAAGAACATGTTAG
TGGAGACTCATAA
mRNA sequence
ATGAATAGAAACCCACAAGATCATTCACCGCCACAAAATCCACATGTGAATAGAGATATGGCAGGTGAAGGAGCAGCAAACCGAGCAGGAGAAATTCCTAATCCGATCCT
TCTAGCAGATAATCGAGATGTAGCCATGCGGAATTATGTCACTCATGCGTTCCACAACCTAAATTCAGGGATAAATAATCCTTTACCCCAAGCCGTACAGTTCGAGCTCA
AGCCAGGATGGTGCAAGGACTTGGCTAAACCCGTTGGAACCAAATTCTATCAACACATGGTAGAACTGACGAAGAAATTTTTGGCAAAGTACCATACTTTGACCAGAAAC
GCAGACTTTCGAGAGAACATTGTGTCTTTTAGACAGAAAGAGAACGAAGCAGTTCAAGAAGCTTGGGAGCGTTTTAAGGAGTTACTTAGAAGATGCCTAAGCCATGGATT
GCCCGCATGTGTGCAGATTGAACAATTCTATAGAGGATTGGATTGTTCATCAAGAATGATGTTGAACACTGCAGCTAATGGCTCGTTGTTAGAGAAGTCGGTAAATGAGA
TCATTGATATTTTGAATAAGATGACAGACATTAATGACCAAGGCGAAATAGGAAGCCCAAATGGCGGCTATGAACCAAAAGTTAAAGCAGTTGGCAATGGAGAAGGAAAC
CAAAACCCTAATCCAGCGTCTATTTTCTATGTAGGTCAAGGTGCCCGGCGGAATTTCAACCCGTATTCAAACACTTACAACCTTGGATGGAGGCACCATCCAAACTTTTC
CTGGAATAACCAAGGAGTAGCTAGTAGTAGTGCACAAGCATCCGCTCAACAATACAAGCAAAACTACACTCCTCCTAGTTTTCCAACTCAACCGGCGTCGCAGCCTCAAC
AATACAATCAGCAAAGAGGTCAAAATACTACTCAGCATAGTGGTCGCAAGTTTGGAGGCGATGATGAAAGAGGAGATCGTGAGGGAAAGGAGCACTGTAAGGCAGTTATC
ACGAGAAGCGGATTAAGTTATAATGGACCCTCACCTCCAAACGAAGGAACTAATGCAGTTACACCTGTTTGTGCATCCACATCCAATCCACAAGAACTTGTAAGTTCAGA
AGAAAAAGGTAAGAAGGCGGATAAAGATAAGCAAGTAGTGCCCAGCACTACTCCACAGGTAGGTAACCTTCAATGTCCTCGATGCGATCGTCTCCCAGATGAAGTCGAGG
AGTGCTGTACAATAGGAGAAATCATGGAGGAACTCCAGCAAATGATGGTGGAAGACTTGGAAGCAGATTTAAAGGCTGCAGAAAAAGAATCCATTTTGCCCCAATTTGAG
CGTTTTGAGTTTTTGCAGCCGACAATAGCGGATTTGAAGGCCTTGCAACCTTCCATCATTGAACCTCCAGAATTGGAAAAGAAACCTCTACCTTCTCATTTAAAATATGC
TTATTTGGAGGATAAAGGTGCTATTTTAGATGAAGAGATAGCTAGACTTCAAGAGAGGGCGGAGATGTTCAGTAAAAATAACGAAATTAGGGACAAAGAGAATGAGAGGG
TTTATGCGAAAATTGAGGAATTAAACATAAAATGGCAAGAATTCATGGAAAATTCAAAGAAAGTGAGCGAGGAGATTCAACTTGAGTTAAATAGCATGAGTATACGTCGT
AGGATGAATCTTTCTCAAGATAACCCCGTTTCCAAGTCTTTAGAACTGTCTATCCCTCCTCCTCTTTCCACTACTGCTGCTGTACATGTTGAAGGTCAAGAACATGTTAG
TGGAGACTCATAA
Protein sequence
MNRNPQDHSPPQNPHVNRDMAGEGAANRAGEIPNPILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAVQFELKPGWCKDLAKPVGTKFYQHMVELTKKFLAKYHTLTRN
ADFRENIVSFRQKENEAVQEAWERFKELLRRCLSHGLPACVQIEQFYRGLDCSSRMMLNTAANGSLLEKSVNEIIDILNKMTDINDQGEIGSPNGGYEPKVKAVGNGEGN
QNPNPASIFYVGQGARRNFNPYSNTYNLGWRHHPNFSWNNQGVASSSAQASAQQYKQNYTPPSFPTQPASQPQQYNQQRGQNTTQHSGRKFGGDDERGDREGKEHCKAVI
TRSGLSYNGPSPPNEGTNAVTPVCASTSNPQELVSSEEKGKKADKDKQVVPSTTPQVGNLQCPRCDRLPDEVEECCTIGEIMEELQQMMVEDLEADLKAAEKESILPQFE
RFEFLQPTIADLKALQPSIIEPPELEKKPLPSHLKYAYLEDKGAILDEEIARLQERAEMFSKNNEIRDKENERVYAKIEELNIKWQEFMENSKKVSEEIQLELNSMSIRR
RMNLSQDNPVSKSLELSIPPPLSTTAAVHVEGQEHVSGDS