; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g28290 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g28290
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr6:21220781..21223131
RNA-Seq ExpressionMoc06g28290
SyntenyMoc06g28290
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia]1.2e-5228.94Show/hide
Query:  ISNPILLADNRDVAMQNYVTHAFHNLNSGINNHLSQAAQFELKPVMFQM-------------------------------------------------DG
        + NPI +AD RD AM++Y      +LNS + N     A FE KP+M QM                                                   
Subjt:  ISNPILLADNRDVAMQNYVTHAFHNLNSGINNHLSQAAQFELKPVMFQM-------------------------------------------------DG

Query:  ARTWLNALEPNSINTWAELTKKFLAKYHTLTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHGLPACVQIEQFYRGFDRSSRMMLNTAANGSLL
        A  WLNA   ++I T +++  KFL KY   TR+A++RE+I+SFRQKENEAV  AW+RFK+L+R CP  G+PACVQIE F+R  D  + MMLN AANG   
Subjt:  ARTWLNALEPNSINTWAELTKKFLAKYHTLTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHGLPACVQIEQFYRGFDRSSRMMLNTAANGSLL

Query:  EKSVNEIVDILKKMTDINDQ--GEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAISAIPEPSPV--------------------
         KS NEIV+IL ++++ NDQ   E  R+  K+   A    LD + SMQ Q+  + QMLK +           A   PSPV                    
Subjt:  EKSVNEIVDILKKMTDINDQ--GEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAISAIPEPSPV--------------------

Query:  ------------------------------------KVP--------------------------SRISIRIRTVTTPDGGTIKTFLGTLTSAEGTDA--
                                            + P                          S + I ++   T +  T+K  + T T A   D   
Subjt:  ------------------------------------KVP--------------------------SRISIRIRTVTTPDGGTIKTFLGTLTSAEGTDA--

Query:  --------------VALVLASTSNP-QQEEKAELVSSEEKGKKV-------DKGKQVVP----------STTPQV-------------------------
                      + + L   +N  +   +  L SS E+ +++       +K  QVVP          S  PQV                         
Subjt:  --------------VALVLASTSNP-QQEEKAELVSSEEKGKKV-------DKGKQVVP----------STTPQV-------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --GEARSTTVTLQLADRSIKKSEGKIEGVLVKVDKFIFPADFIILDCEVDLEVPIILGRPFLATEDTVFNVRKEEIIMKVNDEQVTFNVLDTMQLQDEVE
          G+A  TTVTL LADRSI K EGKIE VLVKVDKFIFPADFIILDCE D +VPIILGRPFLAT +T+ +V+K E+ M+V+D++VTFN+LD M+  D+ E
Subjt:  --GEARSTTVTLQLADRSIKKSEGKIEGVLVKVDKFIFPADFIILDCEVDLEVPIILGRPFLATEDTVFNVRKEEIIMKVNDEQVTFNVLDTMQLQDEVE

Query:  ECSTI
        EC  I
Subjt:  ECSTI

XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]9.2e-5657.54Show/hide
Query:  MNRNPQDPPRPQNPPVNGDMVG------VGEISNPILLADNRDVAMQNYVTHAFHNLNSGINNHLSQAAQFELKPVMFQMDGARTWLNALEPNSINTWAE
        MNRN QDPP PQNPPVNGDM G       GEI N ILLADNRDVAM+NYVT AFHNLNSGINN L QAAQ ELKPVMF M                    
Subjt:  MNRNPQDPPRPQNPPVNGDMVG------VGEISNPILLADNRDVAMQNYVTHAFHNLNSGINNHLSQAAQFELKPVMFQMDGARTWLNALEPNSINTWAE

Query:  LTKKFLAKYHTLTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHGLPACVQIEQFYRGFDRSSRMMLNTAANGSLLEKSVNEIVDILKKMTDIN
           + + ++  LT                NE      K F E+       G+       +   G DRSSRMMLNTAANGSLLEKSVNEIVDIL KM DIN
Subjt:  LTKKFLAKYHTLTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHGLPACVQIEQFYRGFDRSSRMMLNTAANGSLLEKSVNEIVDILKKMTDIN

Query:  DQGEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAIS
        DQGE GRSL KKQVSAG FELD VA MQ QMA MNQMLKQ TM+KETKT  S
Subjt:  DQGEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAIS

XP_022158314.1 uncharacterized protein LOC111024824 [Momordica charantia]1.4e-7562.88Show/hide
Query:  MNRNPQDPPRPQNPPVNGDMVG------VGEISNPILLADNRDVAMQNYVTHAFHNLNSGINNHLSQAAQFELKPVMFQMDGARTWLNALEPNS-INTWA
        MN NPQDPP P NPPV+GD  G       GE+ NPILL DNRDVA++NYVTHAFHNLNS + +                 DG     +  +P S + ++ 
Subjt:  MNRNPQDPPRPQNPPVNGDMVG------VGEISNPILLADNRDVAMQNYVTHAFHNLNSGINNHLSQAAQFELKPVMFQMDGARTWLNALEPNS-INTWA

Query:  ELTKKFLAKYHT-----LTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHGLPACVQIEQFYRGFDRSSRMMLNTAANGSLLEKSVNEIVDILK
        E+   F     +     L  +A+LREDIVSFRQKENEAVQE W+RFKELLRRC  HGLP CVQIEQFYRG DR SRMMLNTAAN SL EKS++EI+DIL 
Subjt:  ELTKKFLAKYHT-----LTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHGLPACVQIEQFYRGFDRSSRMMLNTAANGSLLEKSVNEIVDILK

Query:  KMTDINDQGEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAISAIPEPS
        KMTD NDQGEIGRSLPKKQVSA  FELD VASMQ QMA +NQMLKQLTM+KETKTA SA+ EPS
Subjt:  KMTDINDQGEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAISAIPEPS

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia]2.0e-11170.89Show/hide
Query:  MNRNPQDPPRPQNPPVNGDMVG------VGEISNPILLADNRDVAMQNYVTHAFHNLNSGINNHLSQAAQFELKPVMFQM--------------------
        MNRN QDPP PQNPPVNGDM G      VGEI N ILLADNRDVAM+NYVTHAFHNLNSGINN L QAAQFELKPVMFQ+                    
Subjt:  MNRNPQDPPRPQNPPVNGDMVG------VGEISNPILLADNRDVAMQNYVTHAFHNLNSGINNHLSQAAQFELKPVMFQM--------------------

Query:  -----------------------------DGARTWLNALEPNSINTWAELTKKFLAKYHTLTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHG
                                     DGARTW+NALEPNSINTWAELT KFLAKYHTLT++A+LREDIVSFRQKENEAVQEAW+RFKELLRRCP HG
Subjt:  -----------------------------DGARTWLNALEPNSINTWAELTKKFLAKYHTLTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHG

Query:  LPACVQIEQFYRGFDRSSRMMLNTAANGSLLEKSVNEIVDILKKMTDINDQGEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAI
        LP+CVQIEQFYRG DRSS+MMLNT ANGSLLEKSVNEIVD+L KMTDINDQGE+GRSLPKKQVS G FELD VASMQ QMA MNQMLKQLTM+KETKT  
Subjt:  LPACVQIEQFYRGFDRSSRMMLNTAANGSLLEKSVNEIVDILKKMTDINDQGEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAI

Query:  SAIPEPSPVKVPSRIS
        SAIPE SP+   S IS
Subjt:  SAIPEPSPVKVPSRIS

XP_022159127.1 uncharacterized protein LOC111025557 [Momordica charantia]1.1e-6471.81Show/hide
Query:  DGARTWLNALEPNSINTWAELTKKFLAKYHTLTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHGLPACVQIEQFYRGFDRSSRMMLNTAANGS
        DGA TW+N LE N I TWAELT KFLAKYHTLTR+A+L+EDIVSFRQ+E+EAVQEAW+RFKELL+RC  HGLP CVQI+QFYRG D   RMM +TAAN S
Subjt:  DGARTWLNALEPNSINTWAELTKKFLAKYHTLTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHGLPACVQIEQFYRGFDRSSRMMLNTAANGS

Query:  LLEKSVNEIVDILKKMTDINDQGEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAISA-IPEPSPVKVPSRIS
        LLEKSVNEI+DIL KM DINDQ E+GRSLPKKQ SAG FELD V S+Q Q++ M+QMLKQLTMKK  K A S  I EPS +   S IS
Subjt:  LLEKSVNEIVDILKKMTDINDQGEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAISA-IPEPSPVKVPSRIS

TrEMBL top hitse value%identityAlignment
A0A6J1CPJ3 uncharacterized protein LOC1110129471.3e-5228.94Show/hide
Query:  ISNPILLADNRDVAMQNYVTHAFHNLNSGINNHLSQAAQFELKPVMFQM-------------------------------------------------DG
        + NPI +AD RD AM++Y      +LNS + N     A FE KP+M QM                                                   
Subjt:  ISNPILLADNRDVAMQNYVTHAFHNLNSGINNHLSQAAQFELKPVMFQM-------------------------------------------------DG

Query:  ARTWLNALEPNSINTWAELTKKFLAKYHTLTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHGLPACVQIEQFYRGFDRSSRMMLNTAANGSLL
        A  WLNA   ++I T +++  KFL KY   TR+A++RE+I+SFRQKENEAV  AW+RFK+L+R CP  G+PACVQIE F+R  D  + MMLN AANG   
Subjt:  ARTWLNALEPNSINTWAELTKKFLAKYHTLTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHGLPACVQIEQFYRGFDRSSRMMLNTAANGSLL

Query:  EKSVNEIVDILKKMTDINDQ--GEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAISAIPEPSPV--------------------
         KS NEIV+IL ++++ NDQ   E  R+  K+   A    LD + SMQ Q+  + QMLK +           A   PSPV                    
Subjt:  EKSVNEIVDILKKMTDINDQ--GEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAISAIPEPSPV--------------------

Query:  ------------------------------------KVP--------------------------SRISIRIRTVTTPDGGTIKTFLGTLTSAEGTDA--
                                            + P                          S + I ++   T +  T+K  + T T A   D   
Subjt:  ------------------------------------KVP--------------------------SRISIRIRTVTTPDGGTIKTFLGTLTSAEGTDA--

Query:  --------------VALVLASTSNP-QQEEKAELVSSEEKGKKV-------DKGKQVVP----------STTPQV-------------------------
                      + + L   +N  +   +  L SS E+ +++       +K  QVVP          S  PQV                         
Subjt:  --------------VALVLASTSNP-QQEEKAELVSSEEKGKKV-------DKGKQVVP----------STTPQV-------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --GEARSTTVTLQLADRSIKKSEGKIEGVLVKVDKFIFPADFIILDCEVDLEVPIILGRPFLATEDTVFNVRKEEIIMKVNDEQVTFNVLDTMQLQDEVE
          G+A  TTVTL LADRSI K EGKIE VLVKVDKFIFPADFIILDCE D +VPIILGRPFLAT +T+ +V+K E+ M+V+D++VTFN+LD M+  D+ E
Subjt:  --GEARSTTVTLQLADRSIKKSEGKIEGVLVKVDKFIFPADFIILDCEVDLEVPIILGRPFLATEDTVFNVRKEEIIMKVNDEQVTFNVLDTMQLQDEVE

Query:  ECSTI
        EC  I
Subjt:  ECSTI

A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220074.5e-5657.54Show/hide
Query:  MNRNPQDPPRPQNPPVNGDMVG------VGEISNPILLADNRDVAMQNYVTHAFHNLNSGINNHLSQAAQFELKPVMFQMDGARTWLNALEPNSINTWAE
        MNRN QDPP PQNPPVNGDM G       GEI N ILLADNRDVAM+NYVT AFHNLNSGINN L QAAQ ELKPVMF M                    
Subjt:  MNRNPQDPPRPQNPPVNGDMVG------VGEISNPILLADNRDVAMQNYVTHAFHNLNSGINNHLSQAAQFELKPVMFQMDGARTWLNALEPNSINTWAE

Query:  LTKKFLAKYHTLTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHGLPACVQIEQFYRGFDRSSRMMLNTAANGSLLEKSVNEIVDILKKMTDIN
           + + ++  LT                NE      K F E+       G+       +   G DRSSRMMLNTAANGSLLEKSVNEIVDIL KM DIN
Subjt:  LTKKFLAKYHTLTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHGLPACVQIEQFYRGFDRSSRMMLNTAANGSLLEKSVNEIVDILKKMTDIN

Query:  DQGEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAIS
        DQGE GRSL KKQVSAG FELD VA MQ QMA MNQMLKQ TM+KETKT  S
Subjt:  DQGEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAIS

A0A6J1DYY9 uncharacterized protein LOC1110255574.0e-6572.34Show/hide
Query:  DGARTWLNALEPNSINTWAELTKKFLAKYHTLTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHGLPACVQIEQFYRGFDRSSRMMLNTAANGS
        DGA TWLN LE N I TWAELT KFLAKYHTLTR+A+L+EDIVSFRQ+E+EAVQEAW+RFKELL+RC  HGLP CVQI+QFYRG D   RMM +TAAN S
Subjt:  DGARTWLNALEPNSINTWAELTKKFLAKYHTLTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHGLPACVQIEQFYRGFDRSSRMMLNTAANGS

Query:  LLEKSVNEIVDILKKMTDINDQGEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAISA-IPEPSPVKVPSRIS
        LLEKSVNEI+DIL KM DINDQ E+GRSLPKKQ SAG FELD V S+Q Q++ M+QMLKQLTMKK  K A S  I EPS +   S IS
Subjt:  LLEKSVNEIVDILKKMTDINDQGEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAISA-IPEPSPVKVPSRIS

A0A6J1DZ19 uncharacterized protein LOC1110248246.6e-7662.88Show/hide
Query:  MNRNPQDPPRPQNPPVNGDMVG------VGEISNPILLADNRDVAMQNYVTHAFHNLNSGINNHLSQAAQFELKPVMFQMDGARTWLNALEPNS-INTWA
        MN NPQDPP P NPPV+GD  G       GE+ NPILL DNRDVA++NYVTHAFHNLNS + +                 DG     +  +P S + ++ 
Subjt:  MNRNPQDPPRPQNPPVNGDMVG------VGEISNPILLADNRDVAMQNYVTHAFHNLNSGINNHLSQAAQFELKPVMFQMDGARTWLNALEPNS-INTWA

Query:  ELTKKFLAKYHT-----LTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHGLPACVQIEQFYRGFDRSSRMMLNTAANGSLLEKSVNEIVDILK
        E+   F     +     L  +A+LREDIVSFRQKENEAVQE W+RFKELLRRC  HGLP CVQIEQFYRG DR SRMMLNTAAN SL EKS++EI+DIL 
Subjt:  ELTKKFLAKYHT-----LTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHGLPACVQIEQFYRGFDRSSRMMLNTAANGSLLEKSVNEIVDILK

Query:  KMTDINDQGEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAISAIPEPS
        KMTD NDQGEIGRSLPKKQVSA  FELD VASMQ QMA +NQMLKQLTM+KETKTA SA+ EPS
Subjt:  KMTDINDQGEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAISAIPEPS

A0A6J1E251 uncharacterized protein LOC1110253029.8e-11270.89Show/hide
Query:  MNRNPQDPPRPQNPPVNGDMVG------VGEISNPILLADNRDVAMQNYVTHAFHNLNSGINNHLSQAAQFELKPVMFQM--------------------
        MNRN QDPP PQNPPVNGDM G      VGEI N ILLADNRDVAM+NYVTHAFHNLNSGINN L QAAQFELKPVMFQ+                    
Subjt:  MNRNPQDPPRPQNPPVNGDMVG------VGEISNPILLADNRDVAMQNYVTHAFHNLNSGINNHLSQAAQFELKPVMFQM--------------------

Query:  -----------------------------DGARTWLNALEPNSINTWAELTKKFLAKYHTLTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHG
                                     DGARTW+NALEPNSINTWAELT KFLAKYHTLT++A+LREDIVSFRQKENEAVQEAW+RFKELLRRCP HG
Subjt:  -----------------------------DGARTWLNALEPNSINTWAELTKKFLAKYHTLTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHG

Query:  LPACVQIEQFYRGFDRSSRMMLNTAANGSLLEKSVNEIVDILKKMTDINDQGEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAI
        LP+CVQIEQFYRG DRSS+MMLNT ANGSLLEKSVNEIVD+L KMTDINDQGE+GRSLPKKQVS G FELD VASMQ QMA MNQMLKQLTM+KETKT  
Subjt:  LPACVQIEQFYRGFDRSSRMMLNTAANGSLLEKSVNEIVDILKKMTDINDQGEIGRSLPKKQVSAGAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAI

Query:  SAIPEPSPVKVPSRIS
        SAIPE SP+   S IS
Subjt:  SAIPEPSPVKVPSRIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAGAAATCCACAAGATCCTCCACGGCCACAAAATCCACCTGTAAACGGAGATATGGTGGGTGTAGGAGAAATTTCTAATCCGATCCTTCTAGCAGATAAC
CGAGATGTAGCCATGCAGAATTATGTCACTCATGCGTTCCACAACCTAAATTCAGGGATAAATAATCATTTATCCCAAGCCGCACAGTTCGAGCTCAAGCCAGTC
ATGTTCCAGATGGATGGTGCAAGGACTTGGCTAAACGCGTTAGAACCAAATTCTATCAACACATGGGCGGAACTGACGAAGAAATTTTTGGCAAAGTACCACACT
TTGACCAGGAGCGCAAACCTTCGAGAAGACATTGTGTCTTTTCGACAAAAGGAAAATGAAGCAGTTCAAGAAGCTTGGAAGCGTTTTAAGGAGTTACTGAGAAGA
TGCCCGAGACATGGATTGCCCGCATGTGTGCAAATTGAACAATTCTATAGAGGATTCGATCGTTCATCAAGGATGATGTTGAACACCGCAGCCAATGGCTCGTTG
TTAGAAAAGTCGGTTAATGAGATCGTTGATATCTTAAAGAAGATGACAGATATTAATGACCAAGGCGAAATAGGAAGGTCATTGCCAAAGAAGCAAGTATCAGCC
GGAGCCTTTGAGTTAGACATAGTAGCTTCAATGCAAGGCCAAATGGCAGTTATGAACCAGATGTTAAAGCAGCTGACAATGAAGAAGGAAACCAAAACCGCCATT
TCGGCGATACCTGAACCCTCTCCTGTCAAGGTGCCCAGCAGAATTTCAATTCGTATTCGAACAGTTACAACCCCGGATGGAGGCACCATCAAAACTTTTCTTGGA
ACCCTCACTTCAGCTGAAGGAACTGATGCAGTTGCACTTGTTCTTGCATCCACCTCCAATCCACAACAAGAAGAGAAAGCAGAACTCGTAAGTTCAGAAGAAAAA
GGTAAGAAGGTGGATAAAGGTAAGCAAGTAGTGCCCAGCACTACTCCACAGGTAGGAGAAGCTCGTTCCACTACTGTCACTTTACAACTAGCTGATAGGTCCATA
AAGAAATCAGAAGGAAAAATAGAAGGTGTGCTTGTTAAAGTCGATAAGTTTATTTTTCCCGCCGATTTCATAATTTTGGATTGTGAAGTAGATCTTGAGGTGCCG
ATCATTCTTGGGAGGCCATTTTTAGCAACTGAAGATACGGTATTCAATGTTAGGAAAGAAGAGATCATTATGAAGGTCAATGATGAGCAAGTCACCTTCAACGTC
CTTGATACGATGCAACTCCAGGATGAAGTCGAGGAGTGCTCTACAATAAGGGCAATCATGGAGGAACTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATAGAAATCCACAAGATCCTCCACGGCCACAAAATCCACCTGTAAACGGAGATATGGTGGGTGTAGGAGAAATTTCTAATCCGATCCTTCTAGCAGATAAC
CGAGATGTAGCCATGCAGAATTATGTCACTCATGCGTTCCACAACCTAAATTCAGGGATAAATAATCATTTATCCCAAGCCGCACAGTTCGAGCTCAAGCCAGTC
ATGTTCCAGATGGATGGTGCAAGGACTTGGCTAAACGCGTTAGAACCAAATTCTATCAACACATGGGCGGAACTGACGAAGAAATTTTTGGCAAAGTACCACACT
TTGACCAGGAGCGCAAACCTTCGAGAAGACATTGTGTCTTTTCGACAAAAGGAAAATGAAGCAGTTCAAGAAGCTTGGAAGCGTTTTAAGGAGTTACTGAGAAGA
TGCCCGAGACATGGATTGCCCGCATGTGTGCAAATTGAACAATTCTATAGAGGATTCGATCGTTCATCAAGGATGATGTTGAACACCGCAGCCAATGGCTCGTTG
TTAGAAAAGTCGGTTAATGAGATCGTTGATATCTTAAAGAAGATGACAGATATTAATGACCAAGGCGAAATAGGAAGGTCATTGCCAAAGAAGCAAGTATCAGCC
GGAGCCTTTGAGTTAGACATAGTAGCTTCAATGCAAGGCCAAATGGCAGTTATGAACCAGATGTTAAAGCAGCTGACAATGAAGAAGGAAACCAAAACCGCCATT
TCGGCGATACCTGAACCCTCTCCTGTCAAGGTGCCCAGCAGAATTTCAATTCGTATTCGAACAGTTACAACCCCGGATGGAGGCACCATCAAAACTTTTCTTGGA
ACCCTCACTTCAGCTGAAGGAACTGATGCAGTTGCACTTGTTCTTGCATCCACCTCCAATCCACAACAAGAAGAGAAAGCAGAACTCGTAAGTTCAGAAGAAAAA
GGTAAGAAGGTGGATAAAGGTAAGCAAGTAGTGCCCAGCACTACTCCACAGGTAGGAGAAGCTCGTTCCACTACTGTCACTTTACAACTAGCTGATAGGTCCATA
AAGAAATCAGAAGGAAAAATAGAAGGTGTGCTTGTTAAAGTCGATAAGTTTATTTTTCCCGCCGATTTCATAATTTTGGATTGTGAAGTAGATCTTGAGGTGCCG
ATCATTCTTGGGAGGCCATTTTTAGCAACTGAAGATACGGTATTCAATGTTAGGAAAGAAGAGATCATTATGAAGGTCAATGATGAGCAAGTCACCTTCAACGTC
CTTGATACGATGCAACTCCAGGATGAAGTCGAGGAGTGCTCTACAATAAGGGCAATCATGGAGGAACTCTAG
Protein sequenceShow/hide protein sequence
MNRNPQDPPRPQNPPVNGDMVGVGEISNPILLADNRDVAMQNYVTHAFHNLNSGINNHLSQAAQFELKPVMFQMDGARTWLNALEPNSINTWAELTKKFLAKYHT
LTRSANLREDIVSFRQKENEAVQEAWKRFKELLRRCPRHGLPACVQIEQFYRGFDRSSRMMLNTAANGSLLEKSVNEIVDILKKMTDINDQGEIGRSLPKKQVSA
GAFELDIVASMQGQMAVMNQMLKQLTMKKETKTAISAIPEPSPVKVPSRISIRIRTVTTPDGGTIKTFLGTLTSAEGTDAVALVLASTSNPQQEEKAELVSSEEK
GKKVDKGKQVVPSTTPQVGEARSTTVTLQLADRSIKKSEGKIEGVLVKVDKFIFPADFIILDCEVDLEVPIILGRPFLATEDTVFNVRKEEIIMKVNDEQVTFNV
LDTMQLQDEVEECSTIRAIMEEL