; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028241 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028241
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionEnzymatic polyprotein
Genome locationchr8:16141124..16153861
RNA-Seq ExpressionLag0028241
SyntenyLag0028241
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040829.1 polyprotein [Cucumis melo var. makuwa]8.3e-4366.21Show/hide
Query:  LLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPL-----FQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGG
        LL +  I PSKS WSC+AFYVNN  E+ERG+ R     KPL     + RLKKNPK W+DEHT+AVQ IK+L KSIP LSL++ +A LI++TDASDIGYGG
Subjt:  LLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPL-----FQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGG

Query:  ILKQCLDKEESIVRFHSGIWNDTQKNYSTVKKELLAIVLCVKKFQ
        ILKQ L+++ SIVR+HSGIWN  QKNYSTVKKE+LAIVL V+KFQ
Subjt:  ILKQCLDKEESIVRFHSGIWNDTQKNYSTVKKELLAIVLCVKKFQ

KAA0056776.1 Enzymatic polyprotein [Cucumis melo var. makuwa]3.1e-0550.7Show/hide
Query:  DLLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPLFQRLK--KNPKPWSDEHTKAVQKIKALVK
        +LLNK  I PSKSPWSC+AFYVNN  E+ERGVPR     KPL + LK  + P P   +  K +   K   K
Subjt:  DLLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPLFQRLK--KNPKPWSDEHTKAVQKIKALVK

KAA0059217.1 Enzymatic polyprotein [Cucumis melo var. makuwa]5.2e-3768.64Show/hide
Query:  VNNAGEQERGVPRPTCKPLFQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILKQCLDKEESIVRFHSGIWNDTQKNY
        VN  G+  R + R  C PL+ RLKKNPKPW+DEHT+AVQ IK+L KSIPCLSL++ +A LI++TDASDIGYGGILKQ L+ + S+VR+HSGIWN  QKNY
Subjt:  VNNAGEQERGVPRPTCKPLFQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILKQCLDKEESIVRFHSGIWNDTQKNY

Query:  STVKKELLAIVLCVKKFQ
        STVKKE+LAIVL V+KFQ
Subjt:  STVKKELLAIVLCVKKFQ

KAA0059217.1 Enzymatic polyprotein [Cucumis melo var. makuwa]3.1e-0550.7Show/hide
Query:  DLLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPLFQRLK--KNPKPWSDEHTKAVQKIKALVK
        +LLNK  I PSKSPWSC+AFYVNN  E+ERGVPR     KPL + LK  + P P   +  K +   K   K
Subjt:  DLLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPLFQRLK--KNPKPWSDEHTKAVQKIKALVK

KAA0059217.1 Enzymatic polyprotein [Cucumis melo var. makuwa]5.2e-3768.64Show/hide
Query:  VNNAGEQERGVPRPTCKPLFQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILKQCLDKEESIVRFHSGIWNDTQKNY
        VN  G+  R + R  C PL+ RLKKNPKPW+DEHT+AVQ IK+L KSIPCLSL++ +A LI++TDASDIGYGGILKQ L+ + S+VR+HSGIWN  QKNY
Subjt:  VNNAGEQERGVPRPTCKPLFQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILKQCLDKEESIVRFHSGIWNDTQKNY

Query:  STVKKELLAIVLCVKKFQ
        STVKKE+LAIVL V+KFQ
Subjt:  STVKKELLAIVLCVKKFQ

TYK17805.1 polyprotein [Cucumis melo var. makuwa]8.3e-4366.21Show/hide
Query:  LLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPL-----FQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGG
        LL +  I PSKS WSC+AFYVNN  E+ERG+ R     KPL     + RLKKNPK W+DEHT+AVQ IK+L KSIP LSL++ +A LI++TDASDIGYGG
Subjt:  LLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPL-----FQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGG

Query:  ILKQCLDKEESIVRFHSGIWNDTQKNYSTVKKELLAIVLCVKKFQ
        ILKQ L+++ SIVR+HSGIWN  QKNYSTVKKE+LAIVL V+KFQ
Subjt:  ILKQCLDKEESIVRFHSGIWNDTQKNYSTVKKELLAIVLCVKKFQ

XP_016899790.1 PREDICTED: uncharacterized protein LOC107990655 [Cucumis melo]3.6e-3859.86Show/hide
Query:  DLLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPL-----FQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYG
        +LL K  I PSKS WSC+  YVNN  + ERG+P+     KPL     + RLKKNPKPW  EHT+AVQ IK L KSIPCLS+I+ KA +IV+TDAS+IGY 
Subjt:  DLLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPL-----FQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYG

Query:  GILKQCLDKEESIVRFHSGIWNDTQKNYSTVKKELLAIVLCV
         +LKQ + ++  +V +HSG+WN  QKNYSTVKKE+LAIVLCV
Subjt:  GILKQCLDKEESIVRFHSGIWNDTQKNYSTVKKELLAIVLCV

TrEMBL top hitse value%identityAlignment
A0A1S4DVP6 uncharacterized protein LOC1079906551.7e-3859.86Show/hide
Query:  DLLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPL-----FQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYG
        +LL K  I PSKS WSC+  YVNN  + ERG+P+     KPL     + RLKKNPKPW  EHT+AVQ IK L KSIPCLS+I+ KA +IV+TDAS+IGY 
Subjt:  DLLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPL-----FQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYG

Query:  GILKQCLDKEESIVRFHSGIWNDTQKNYSTVKKELLAIVLCV
         +LKQ + ++  +V +HSG+WN  QKNYSTVKKE+LAIVLCV
Subjt:  GILKQCLDKEESIVRFHSGIWNDTQKNYSTVKKELLAIVLCV

A0A5A7THR3 Polyprotein4.0e-4366.21Show/hide
Query:  LLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPL-----FQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGG
        LL +  I PSKS WSC+AFYVNN  E+ERG+ R     KPL     + RLKKNPK W+DEHT+AVQ IK+L KSIP LSL++ +A LI++TDASDIGYGG
Subjt:  LLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPL-----FQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGG

Query:  ILKQCLDKEESIVRFHSGIWNDTQKNYSTVKKELLAIVLCVKKFQ
        ILKQ L+++ SIVR+HSGIWN  QKNYSTVKKE+LAIVL V+KFQ
Subjt:  ILKQCLDKEESIVRFHSGIWNDTQKNYSTVKKELLAIVLCVKKFQ

A0A5A7UR29 Enzymatic polyprotein1.5e-0550.7Show/hide
Query:  DLLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPLFQRLK--KNPKPWSDEHTKAVQKIKALVK
        +LLNK  I PSKSPWSC+AFYVNN  E+ERGVPR     KPL + LK  + P P   +  K +   K   K
Subjt:  DLLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPLFQRLK--KNPKPWSDEHTKAVQKIKALVK

A0A5A7UX67 Enzymatic polyprotein2.5e-3768.64Show/hide
Query:  VNNAGEQERGVPRPTCKPLFQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILKQCLDKEESIVRFHSGIWNDTQKNY
        VN  G+  R + R  C PL+ RLKKNPKPW+DEHT+AVQ IK+L KSIPCLSL++ +A LI++TDASDIGYGGILKQ L+ + S+VR+HSGIWN  QKNY
Subjt:  VNNAGEQERGVPRPTCKPLFQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILKQCLDKEESIVRFHSGIWNDTQKNY

Query:  STVKKELLAIVLCVKKFQ
        STVKKE+LAIVL V+KFQ
Subjt:  STVKKELLAIVLCVKKFQ

A0A5A7UX67 Enzymatic polyprotein1.5e-0550.7Show/hide
Query:  DLLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPLFQRLK--KNPKPWSDEHTKAVQKIKALVK
        +LLNK  I PSKSPWSC+AFYVNN  E+ERGVPR     KPL + LK  + P P   +  K +   K   K
Subjt:  DLLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPLFQRLK--KNPKPWSDEHTKAVQKIKALVK

A0A5A7UX67 Enzymatic polyprotein2.5e-3768.64Show/hide
Query:  VNNAGEQERGVPRPTCKPLFQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILKQCLDKEESIVRFHSGIWNDTQKNY
        VN  G+  R + R  C PL+ RLKKNPKPW+DEHT+AVQ IK+L KSIPCLSL++ +A LI++TDASDIGYGGILKQ L+ + S+VR+HSGIWN  QKNY
Subjt:  VNNAGEQERGVPRPTCKPLFQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILKQCLDKEESIVRFHSGIWNDTQKNY

Query:  STVKKELLAIVLCVKKFQ
        STVKKE+LAIVL V+KFQ
Subjt:  STVKKELLAIVLCVKKFQ

A0A5D3D268 Polyprotein4.0e-4366.21Show/hide
Query:  LLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPL-----FQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGG
        LL +  I PSKS WSC+AFYVNN  E+ERG+ R     KPL     + RLKKNPK W+DEHT+AVQ IK+L KSIP LSL++ +A LI++TDASDIGYGG
Subjt:  LLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTC--KPL-----FQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGG

Query:  ILKQCLDKEESIVRFHSGIWNDTQKNYSTVKKELLAIVLCVKKFQ
        ILKQ L+++ SIVR+HSGIWN  QKNYSTVKKE+LAIVL V+KFQ
Subjt:  ILKQCLDKEESIVRFHSGIWNDTQKNYSTVKKELLAIVLCVKKFQ

SwissProt top hitse value%identityAlignment
P03554 Enzymatic polyprotein2.3e-1141.51Show/hide
Query:  KPLFQRLKKN-PKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILKQCLDKE----ESIVRFHSGIWNDTQKNYSTVKKELLAIV
        KPL  +LK+N P  W+ E T  +QK+K  ++  P L    P+  LI+ETDASD  +GG+LK     E    E I R+ SG +   +KNY +  KE LA++
Subjt:  KPLFQRLKKN-PKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILKQCLDKE----ESIVRFHSGIWNDTQKNYSTVKKELLAIV

Query:  LCVKKF
          +KKF
Subjt:  LCVKKF

P03556 Enzymatic polyprotein1.7e-1141.51Show/hide
Query:  KPLFQRLKKN-PKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILKQCLDKE----ESIVRFHSGIWNDTQKNYSTVKKELLAIV
        KPL  +LK+N P  W+ E T  +QK+K  ++  P L    P+  LI+ETDASD  +GG+LK     E    E I R+ SG +   +KNY +  KE LA++
Subjt:  KPLFQRLKKN-PKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILKQCLDKE----ESIVRFHSGIWNDTQKNYSTVKKELLAIV

Query:  LCVKKF
          +KKF
Subjt:  LCVKKF

P05400 Enzymatic polyprotein1.6e-1240.2Show/hide
Query:  KPLFQRLKKNPK-PWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILKQCLDKEESIVRFHSGIWNDTQKNYSTVKKELLAIVLCVK
        KPL  +LK++    W+D  ++ + KIK  +KS P L    P   L++ETDAS+  +GGILK   +  E I R+ SG +   ++NY + +KELLA++  +K
Subjt:  KPLFQRLKKNPK-PWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILKQCLDKEESIVRFHSGIWNDTQKNYSTVKKELLAIVLCVK

Query:  KF
        KF
Subjt:  KF

P09523 Enzymatic polyprotein1.2e-1243.81Show/hide
Query:  KPLFQRLKKNPK-PWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILK-QCLDKEESIVRFHSGIWNDTQKNYSTVKKELLAIVLCV
        KPL  +LKK+    W+   +  V+KIK  + S P L L  P+ +LI+ETDASD  +GG+LK + LD  E I R+ SG +   +KNY +  KELLA+   +
Subjt:  KPLFQRLKKNPK-PWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILK-QCLDKEESIVRFHSGIWNDTQKNYSTVKKELLAIVLCV

Query:  KKFQA
         KF A
Subjt:  KKFQA

Q00962 Enzymatic polyprotein1.0e-1133.33Show/hide
Query:  EDLLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTCKPLFQRLKKN-PKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILKQ
        + L +KK+++      + A+ Y+ N  +          +PL  +LK+N P  W+ E T  +QK+K  ++  P L    P+  LI+ETDASD  +GG+LK 
Subjt:  EDLLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTCKPLFQRLKKN-PKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILKQ

Query:  CLDKE----ESIVRFHSGIWNDTQKNYSTVKKELLAIVLCVKKF
            E    E I R+ SG +   ++NY +  KE LA++  +KKF
Subjt:  CLDKE----ESIVRFHSGIWNDTQKNYSTVKKELLAIVLCVKKF

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGATTTACTCAATAAGAAGCGGATTAGACCATCCAAGAGTCCCTGGTCATGCGCAGCCTTCTATGTCAACAACGCAGGCGAACAGGAGCGTGGTGTGCCAAGACC
CACTTGCAAGCCGCTTTTTCAAAGGCTCAAGAAAAATCCAAAACCATGGAGCGATGAGCATACTAAAGCTGTTCAAAAGATAAAAGCTTTAGTCAAATCAATACCCTGTT
TGAGTCTTATCAACCCAAAAGCCAATCTGATAGTGGAAACAGATGCATCAGATATTGGATATGGGGGAATCCTTAAACAGTGTTTAGATAAGGAGGAATCAATTGTCCGA
TTTCATTCAGGAATTTGGAATGACACTCAAAAGAATTACTCCACAGTTAAAAAGGAGTTGCTTGCTATAGTACTCTGTGTTAAAAAATTCCAAGCACCTCCACCCATGAG
TGCTGATCAATATGCGATGGATTTAGGGTTTACTTCGGTAACTCGATCCAGGTCCAGGCAAGGTGATATACGCCCGATGCCTCCAATGGAGTCATCCACTCCACCTTCAA
GGCCTTCGGCCAACCTTGTCCGTCCTTCTGGCGGAGTTGTTCATTTGAGACCTCCAGTCTCACCAGATTCAAGGATAGGTCATCAACCCCGTCACCTTCTTCCTATTCTC
AGGCACACAAGAAAAGAGTACAAGGATAAGGCTGGGTACCTTATCCTGGTAACACTATGTGATACGGCCCGCTTTGTATTCGATACAGACGTATTAATCCAAGGCGTCCA
TGTAGGAGACATGCGAGCGGGGGTATCCTATGCAATGAGTTTGCATAAGACCGGACCACGAAATAGTAAACCACTAGATGTAACACCGTTAACTACATGGGTGAGAGTGG
CCAATACGCCGACTCAATATGCCGTCCTTTTTGGGGACAAGACCGAGTGGGAGGCTGGGGACATGACAACACAAGAAGGAATTCACTCCTTCCCACTTCTAGGAGAAGTA
GATGAGTGTTCCCTTAAGAGGTTGTTCATTAGAGGAGCGCTGGTACTTAAGGAGACAGATGTAACACAGGGAGCAATCAAGAATCATCCGTTGAAGTATTCAGAGTCGTC
AACGGAGATCAAATGGTGGGACAAATTCAATTTTCAAAACGCCACCCTCAACAAGGTCAAAGAATGGTTCGCAGCAAATGGATATCTCCAAGATATCGACATCAAGAGGA
ATGCAGAATTCCTCAATGACAAATCAAAGCTTTTGGCAGCCCTCGCACAGACAACCTCTGATGCAGACTTCCAGAGGATTCTACAAATGGCAGACTCCTCTGCAAGCGAG
TCACCTGCTTCTTCTGCACAAGGAGAAGAGAACGAACCAGACTACGATCTGGATGATCCATTCCTTGACTCACAATCCATAGTTATAGCGGGGAAGATCAAATGGTGGTG
TTCTTCGTTGTTGCAGAGCAATCAAGAATCATCCGTTGAAGTATTCAGAGTCGTCAACGGAGATAATCTTCTCAAGATTACCATTCAAAAGGGTAGTCTTGTCATCCATT
ATAAATGGACAAGAGTTCTAACAGACTTCAACATGACAACAGGAGATAAGAATTTCTTACAGTCTCCTGTCTCTGGAGGTTTAGAGCAACAGCCACCAAAGGTGATCAGC
TACTCCTCCTTTGAGATGAGACACTTCTTAGCTAATTCTAATTCCGGAATAACTCATATTCCTATAGGCATTGGCCACCATTATTTGGTCAAGGATTTGTTAACACACTT
AACTCATCCTCGTAAGTGTAACCCTCCCACTTTCAAATCCCAGAGGCACACACCAACAGGCTACCAAAGGGAAAGACAACTGTTGGTAATTAGCCTAGGTGACCCTATCC
TTTTCAGGAGTTCACGGTGTTACGAATCTTATGACCAGGCCCTCCGAAGGGATGGTTGCTCCAGGACAACACGCAGGCGGATCATAAGACTCTCACGGTGTGACCTATGG
GTGGAGACCGAGGGATACGTTGACACGTGTCCCGCTTCCACTTACTATGGGCATTTCCCATTCACCTTGTTATTGACTCATGCAAACACTTTCCGAAGGGAGGTCGCGCT
AAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGATTTACTCAATAAGAAGCGGATTAGACCATCCAAGAGTCCCTGGTCATGCGCAGCCTTCTATGTCAACAACGCAGGCGAACAGGAGCGTGGTGTGCCAAGACC
CACTTGCAAGCCGCTTTTTCAAAGGCTCAAGAAAAATCCAAAACCATGGAGCGATGAGCATACTAAAGCTGTTCAAAAGATAAAAGCTTTAGTCAAATCAATACCCTGTT
TGAGTCTTATCAACCCAAAAGCCAATCTGATAGTGGAAACAGATGCATCAGATATTGGATATGGGGGAATCCTTAAACAGTGTTTAGATAAGGAGGAATCAATTGTCCGA
TTTCATTCAGGAATTTGGAATGACACTCAAAAGAATTACTCCACAGTTAAAAAGGAGTTGCTTGCTATAGTACTCTGTGTTAAAAAATTCCAAGCACCTCCACCCATGAG
TGCTGATCAATATGCGATGGATTTAGGGTTTACTTCGGTAACTCGATCCAGGTCCAGGCAAGGTGATATACGCCCGATGCCTCCAATGGAGTCATCCACTCCACCTTCAA
GGCCTTCGGCCAACCTTGTCCGTCCTTCTGGCGGAGTTGTTCATTTGAGACCTCCAGTCTCACCAGATTCAAGGATAGGTCATCAACCCCGTCACCTTCTTCCTATTCTC
AGGCACACAAGAAAAGAGTACAAGGATAAGGCTGGGTACCTTATCCTGGTAACACTATGTGATACGGCCCGCTTTGTATTCGATACAGACGTATTAATCCAAGGCGTCCA
TGTAGGAGACATGCGAGCGGGGGTATCCTATGCAATGAGTTTGCATAAGACCGGACCACGAAATAGTAAACCACTAGATGTAACACCGTTAACTACATGGGTGAGAGTGG
CCAATACGCCGACTCAATATGCCGTCCTTTTTGGGGACAAGACCGAGTGGGAGGCTGGGGACATGACAACACAAGAAGGAATTCACTCCTTCCCACTTCTAGGAGAAGTA
GATGAGTGTTCCCTTAAGAGGTTGTTCATTAGAGGAGCGCTGGTACTTAAGGAGACAGATGTAACACAGGGAGCAATCAAGAATCATCCGTTGAAGTATTCAGAGTCGTC
AACGGAGATCAAATGGTGGGACAAATTCAATTTTCAAAACGCCACCCTCAACAAGGTCAAAGAATGGTTCGCAGCAAATGGATATCTCCAAGATATCGACATCAAGAGGA
ATGCAGAATTCCTCAATGACAAATCAAAGCTTTTGGCAGCCCTCGCACAGACAACCTCTGATGCAGACTTCCAGAGGATTCTACAAATGGCAGACTCCTCTGCAAGCGAG
TCACCTGCTTCTTCTGCACAAGGAGAAGAGAACGAACCAGACTACGATCTGGATGATCCATTCCTTGACTCACAATCCATAGTTATAGCGGGGAAGATCAAATGGTGGTG
TTCTTCGTTGTTGCAGAGCAATCAAGAATCATCCGTTGAAGTATTCAGAGTCGTCAACGGAGATAATCTTCTCAAGATTACCATTCAAAAGGGTAGTCTTGTCATCCATT
ATAAATGGACAAGAGTTCTAACAGACTTCAACATGACAACAGGAGATAAGAATTTCTTACAGTCTCCTGTCTCTGGAGGTTTAGAGCAACAGCCACCAAAGGTGATCAGC
TACTCCTCCTTTGAGATGAGACACTTCTTAGCTAATTCTAATTCCGGAATAACTCATATTCCTATAGGCATTGGCCACCATTATTTGGTCAAGGATTTGTTAACACACTT
AACTCATCCTCGTAAGTGTAACCCTCCCACTTTCAAATCCCAGAGGCACACACCAACAGGCTACCAAAGGGAAAGACAACTGTTGGTAATTAGCCTAGGTGACCCTATCC
TTTTCAGGAGTTCACGGTGTTACGAATCTTATGACCAGGCCCTCCGAAGGGATGGTTGCTCCAGGACAACACGCAGGCGGATCATAAGACTCTCACGGTGTGACCTATGG
GTGGAGACCGAGGGATACGTTGACACGTGTCCCGCTTCCACTTACTATGGGCATTTCCCATTCACCTTGTTATTGACTCATGCAAACACTTTCCGAAGGGAGGTCGCGCT
AAAATGA
Protein sequenceShow/hide protein sequence
MEDLLNKKRIRPSKSPWSCAAFYVNNAGEQERGVPRPTCKPLFQRLKKNPKPWSDEHTKAVQKIKALVKSIPCLSLINPKANLIVETDASDIGYGGILKQCLDKEESIVR
FHSGIWNDTQKNYSTVKKELLAIVLCVKKFQAPPPMSADQYAMDLGFTSVTRSRSRQGDIRPMPPMESSTPPSRPSANLVRPSGGVVHLRPPVSPDSRIGHQPRHLLPIL
RHTRKEYKDKAGYLILVTLCDTARFVFDTDVLIQGVHVGDMRAGVSYAMSLHKTGPRNSKPLDVTPLTTWVRVANTPTQYAVLFGDKTEWEAGDMTTQEGIHSFPLLGEV
DECSLKRLFIRGALVLKETDVTQGAIKNHPLKYSESSTEIKWWDKFNFQNATLNKVKEWFAANGYLQDIDIKRNAEFLNDKSKLLAALAQTTSDADFQRILQMADSSASE
SPASSAQGEENEPDYDLDDPFLDSQSIVIAGKIKWWCSSLLQSNQESSVEVFRVVNGDNLLKITIQKGSLVIHYKWTRVLTDFNMTTGDKNFLQSPVSGGLEQQPPKVIS
YSSFEMRHFLANSNSGITHIPIGIGHHYLVKDLLTHLTHPRKCNPPTFKSQRHTPTGYQRERQLLVISLGDPILFRSSRCYESYDQALRRDGCSRTTRRRIIRLSRCDLW
VETEGYVDTCPASTYYGHFPFTLLLTHANTFRREVALK