; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg26382 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg26382
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionReverse transcriptase
Genome locationCarg_Chr19:1199449..1200201
RNA-Seq ExpressionCarg26382
SyntenyCarg26382
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0068077.1 reverse transcriptase [Cucumis melo var. makuwa]6.5e-7565.47Show/hide
Query:  MGFLATTYHVVFDSLRPRTPDE----SPMEELEAYKIVFEAYTF-DSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAK
        MGFL+TTY ++FDSLR RTP+E     P+EELEAYKIVFE YTF  SEQ PY       E+EVDFQEPME+FP++ + LP+      E EAKTEE +EA+
Subjt:  MGFLATTYHVVFDSLRPRTPDE----SPMEELEAYKIVFEAYTF-DSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAK

Query:  FVNRDEKHEIMNDLK----ESSSSSRSESSPPWSSPGSF-GRDYSS---LGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYE--------K
          NR+  +E+M DL+    ESS SSR+ESS PWSSPGSF  R+Y+S   LGSYGSMRKEKEWRRTLACKLFEERH+SE TEGMDSLWETYE        K
Subjt:  FVNRDEKHEIMNDLK----ESSSSSRSESSPPWSSPGSF-GRDYSS---LGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYE--------K

Query:  KEKDNRK-------SKKEEEEEEEEEEEEGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH
        KEK N K        KK ++++EEEE+ EGQLCCLQALKFSAGKMNLGM +PNL+KMTKALKGFGWL+R GSRKRLIH
Subjt:  KEKDNRK-------SKKEEEEEEEEEEEEGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH

KAG7011337.1 hypothetical protein SDJN02_26241, partial [Cucurbita argyrosperma subsp. argyrosperma]2.2e-131100Show/hide
Query:  MGFLATTYHVVFDSLRPRTPDESPMEELEAYKIVFEAYTFDSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAKFVNRD
        MGFLATTYHVVFDSLRPRTPDESPMEELEAYKIVFEAYTFDSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAKFVNRD
Subjt:  MGFLATTYHVVFDSLRPRTPDESPMEELEAYKIVFEAYTFDSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAKFVNRD

Query:  EKHEIMNDLKESSSSSRSESSPPWSSPGSFGRDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYEKKEKDNRKSKKEEEEEEEEEEE
        EKHEIMNDLKESSSSSRSESSPPWSSPGSFGRDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYEKKEKDNRKSKKEEEEEEEEEEE
Subjt:  EKHEIMNDLKESSSSSRSESSPPWSSPGSFGRDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYEKKEKDNRKSKKEEEEEEEEEEE

Query:  EGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH
        EGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH
Subjt:  EGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH

KAG7036245.1 hypothetical protein SDJN02_03047, partial [Cucurbita argyrosperma subsp. argyrosperma]1.4e-7766.78Show/hide
Query:  MGFLATTYHVVFDSLRPRTPDE----SPMEELEAYKIVFEAYTFDSEQNPY-----EHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKT---
        MGFLATTY VVFDSLRPRTP+E     PMEELEAYKIVFE YTF +EQN Y     E ++ EVEVEVD +E ME+FP K++  PENPL   E EAKT   
Subjt:  MGFLATTYHVVFDSLRPRTPDE----SPMEELEAYKIVFEAYTFDSEQNPY-----EHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKT---

Query:  EELQEAKFVNRDEKHEIMNDL--------KESSSSSRSESSPPWSSPGSFGRDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYE--
        EE  E K   RD    I N++        +ESS SSRSESS PWSSPGSF RDY SLGSYGSMRKEKEWRRTLACKLFEERH+SE TEGMDSLWETYE  
Subjt:  EELQEAKFVNRDEKHEIMNDL--------KESSSSSRSESSPPWSSPGSFGRDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYE--

Query:  ------KKEKDNRKSKK--------EEEEEEEEEEEEGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH
              K EK N KSKK        E+E+E+E E+ EGQLCCLQALKFSAGKMNLGM RPNL+KM+KALKGFGWLSR GSRKR +H
Subjt:  ------KKEKDNRKSKK--------EEEEEEEEEEEEGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH

XP_022963518.1 uncharacterized protein LOC111463823 [Cucurbita moschata]7.1e-13099.2Show/hide
Query:  MGFLATTYHVVFDSLRPRTPDESPMEELEAYKIVFEAYTFDSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAKFVNRD
        MGFLATTYHVVFDSLRPRTPDESPMEELEAYKIVFEAYTF SEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAKF NRD
Subjt:  MGFLATTYHVVFDSLRPRTPDESPMEELEAYKIVFEAYTFDSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAKFVNRD

Query:  EKHEIMNDLKESSSSSRSESSPPWSSPGSFGRDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYEKKEKDNRKSKKEEEEEEEEEEE
        EKHEIMNDLKESSSSSRSESSPPWSSPGSFGRDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYEKKEKDNRKSKKEEEEEEEEEEE
Subjt:  EKHEIMNDLKESSSSSRSESSPPWSSPGSFGRDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYEKKEKDNRKSKKEEEEEEEEEEE

Query:  EGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH
        EGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH
Subjt:  EGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH

XP_038887002.1 stress response protein NST1 [Benincasa hispida]5.9e-8466.78Show/hide
Query:  MGFLATTYHVVFDSLRPRTPDES-----PMEELEAYKIVFEAYTFDSEQNPY-EHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEA
        MGFLATTY VVFDSLRPRTP+E+     PMEELEAYKIVFE YTF SEQ PY +    EVEVE DFQEPME+FPE+I+ L ENPL++  +E KTEEL+EA
Subjt:  MGFLATTYHVVFDSLRPRTPDES-----PMEELEAYKIVFEAYTFDSEQNPY-EHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEA

Query:  KFVNRDE----------------KHEIMNDLK----ESSSSSRSESSPPWSSPGSF-GRDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSL
        +  N DE                ++E+M DL+    ESS SSR+ESS PWSSPGSF  R+Y+SLGSYGSMRKEKEWRRTLACKLFEERH+SE TEGMDSL
Subjt:  KFVNRDE----------------KHEIMNDLK----ESSSSSRSESSPPWSSPGSF-GRDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSL

Query:  WETYEKKE--------------KDNRKSKKEEEEEEEEEEEEGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRK-RLIH
        WETYEK E              K  +K +K+EEE+EEEE+ EGQLCCLQALKFSAGKMNLGM RPNL+KMTKALKGFGWLSR GSR+ RLIH
Subjt:  WETYEKKE--------------KDNRKSKKEEEEEEEEEEEEGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRK-RLIH

TrEMBL top hitse value%identityAlignment
A0A0A0L955 Uncharacterized protein6.0e-6663.77Show/hide
Query:  LRPRTPDE----SPMEELEAYKIVFEAYTF-DSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAKFVNRDEKHEIMNDL
        LR RTP+E     P+EELEAYKIVFE YTF  SEQ PY       E+EVDFQE ME+FP++ + LP+      E EAKTEE +EA+  NR+  +E+M DL
Subjt:  LRPRTPDE----SPMEELEAYKIVFEAYTF-DSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAKFVNRDEKHEIMNDL

Query:  K----ESSSSSRSESSPPWSSPGSF-GRDYS---SLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYE--------KKEKDNRKS------
        +    ESS SSR+ESS PWSSPGSF  R+Y+   +LGSYGSMRKEKEWRRTLACKLFEERH+SE TEGMDSLWETYE        KKEK N KS      
Subjt:  K----ESSSSSRSESSPPWSSPGSF-GRDYS---SLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYE--------KKEKDNRKS------

Query:  --KKEEEEEEEEEEEEGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH
          K ++++EEEE+ E+GQLCCLQALKFSAGKMNLGM +PNL+KMTKALKGFGWL+R GSRK+LIH
Subjt:  --KKEEEEEEEEEEEEGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH

A0A1S4E2S0 uncharacterized protein LOC1034992176.4e-6865.15Show/hide
Query:  LRPRTPDE----SPMEELEAYKIVFEAYTF-DSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAKFVNRDEKHEIMNDL
        LR RTP+E     P+EELEAYKIVFE YTF  SEQ PY       E+EVDFQEPME+FP++ + LP+      E EAKTEE +EA+  NR+  +E+M DL
Subjt:  LRPRTPDE----SPMEELEAYKIVFEAYTF-DSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAKFVNRDEKHEIMNDL

Query:  K----ESSSSSRSESSPPWSSPGSF-GRDYSS---LGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYE--------KKEKDNRK-------
        +    ESS SSR+ESS PWSSPGSF  R+Y+S   LGSYGSMRKEKEWRRTLACKLFEERH+SE TEGMDSLWETYE        KKEK N K       
Subjt:  K----ESSSSSRSESSPPWSSPGSF-GRDYSS---LGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYE--------KKEKDNRK-------

Query:  SKKEEEEEEEEEEEEGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH
         KK ++++EEEE+ EGQLCCLQALKFSAGKMNLGM +PNL+KMTKALKGFGWL+R GSRKRLIH
Subjt:  SKKEEEEEEEEEEEEGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH

A0A5D3DQW2 Reverse transcriptase3.2e-7565.47Show/hide
Query:  MGFLATTYHVVFDSLRPRTPDE----SPMEELEAYKIVFEAYTF-DSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAK
        MGFL+TTY ++FDSLR RTP+E     P+EELEAYKIVFE YTF  SEQ PY       E+EVDFQEPME+FP++ + LP+      E EAKTEE +EA+
Subjt:  MGFLATTYHVVFDSLRPRTPDE----SPMEELEAYKIVFEAYTF-DSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAK

Query:  FVNRDEKHEIMNDLK----ESSSSSRSESSPPWSSPGSF-GRDYSS---LGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYE--------K
          NR+  +E+M DL+    ESS SSR+ESS PWSSPGSF  R+Y+S   LGSYGSMRKEKEWRRTLACKLFEERH+SE TEGMDSLWETYE        K
Subjt:  FVNRDEKHEIMNDLK----ESSSSSRSESSPPWSSPGSF-GRDYSS---LGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYE--------K

Query:  KEKDNRK-------SKKEEEEEEEEEEEEGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH
        KEK N K        KK ++++EEEE+ EGQLCCLQALKFSAGKMNLGM +PNL+KMTKALKGFGWL+R GSRKRLIH
Subjt:  KEKDNRK-------SKKEEEEEEEEEEEEGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH

A0A6J1E4P5 uncharacterized protein LOC1110260375.6e-7260.92Show/hide
Query:  MGFLATTYHVVFDSLRPRTPDE------SPM-EELEAYKIVFEAYTF----DSEQNPY---EHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEA
        MG LATTY +VFD LRPRTPDE       P+ EELEAY IVFE YTF    ++E+NPY      + EVEV+VD +EP+ +FPE IK LPENP+   + E 
Subjt:  MGFLATTYHVVFDSLRPRTPDE------SPM-EELEAYKIVFEAYTF----DSEQNPY---EHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEA

Query:  K--TEELQEAKFVNRDEKHEIMNDL---KESSSSSRSESSPPWSSPGSFGRDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYEKKE
        K   E+L++    +  E  E++      ++SS SSRSESS PWSSPGSFGR+YSSLGSYGSMRKEKEWRRTLACKLFEERH++E +EGMDSLWETYEK E
Subjt:  K--TEELQEAKFVNRDEKHEIMNDL---KESSSSSRSESSPPWSSPGSFGRDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYEKKE

Query:  K------DNRKSK---------KEEEEEEEEEEEEGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH
               +N+K K         K+ +EEE+E+ +EGQLCCLQALKFSAGKMNLGM RPNL+KM+KA KGFGWL+R GSRK LIH
Subjt:  K------DNRKSK---------KEEEEEEEEEEEEGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH

A0A6J1HI04 uncharacterized protein LOC1114638233.4e-13099.2Show/hide
Query:  MGFLATTYHVVFDSLRPRTPDESPMEELEAYKIVFEAYTFDSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAKFVNRD
        MGFLATTYHVVFDSLRPRTPDESPMEELEAYKIVFEAYTF SEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAKF NRD
Subjt:  MGFLATTYHVVFDSLRPRTPDESPMEELEAYKIVFEAYTFDSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAKFVNRD

Query:  EKHEIMNDLKESSSSSRSESSPPWSSPGSFGRDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYEKKEKDNRKSKKEEEEEEEEEEE
        EKHEIMNDLKESSSSSRSESSPPWSSPGSFGRDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYEKKEKDNRKSKKEEEEEEEEEEE
Subjt:  EKHEIMNDLKESSSSSRSESSPPWSSPGSFGRDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYEKKEKDNRKSKKEEEEEEEEEEE

Query:  EGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH
        EGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH
Subjt:  EGQLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFGWLSRKGSRKRLIH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G25130.1 unknown protein8.6e-2536.21Show/hide
Query:  MEELEAYKIVFEA---------------YTFDSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAKFVNRDEKH--EIMN
        +EELEAYK+V EA                TF  +   +E ++ E   +   +E +E     I+ L    +I+ E E +T++ ++ +   +  KH  +++ 
Subjt:  MEELEAYKIVFEA---------------YTFDSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAKFVNRDEKH--EIMN

Query:  DLKE---------------SSSSSRSESSPPWSSPGSFG------------RDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYE--
        D +E                 S++ S   P  S+    G             D  SL S+GSMRKEKEWRRTLACKLFEERH+++  +GMD LWETYE  
Subjt:  DLKE---------------SSSSSRSESSPPWSSPGSFG------------RDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYE--

Query:  -----KKEKDNRKSKK-----------EEEEEEEEEEEEG----QLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFG--WLSRKGSRK
             + E++ +K KK           E+E   EEE+++G    QLCCLQALKFS GKM+LG+ RPNL+K++KA KG G  + + K S+K
Subjt:  -----KKEKDNRKSKK-----------EEEEEEEEEEEEG----QLCCLQALKFSAGKMNLGMRRPNLMKMTKALKGFG--WLSRKGSRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTTTCTTGCTACTACTTACCATGTTGTTTTTGACAGTTTACGCCCAAGAACACCAGACGAATCCCCCATGGAAGAGCTTGAAGCTTACAAAATCGTGTTTGAGGC
TTACACTTTTGACTCTGAACAAAACCCATATGAACACAGTCTCCAAGAAGTGGAAGTGGAAGTCGATTTTCAAGAACCCATGGAGCATTTTCCCGAGAAAATCAAAACTC
TCCCGGAAAATCCCCTGATAATAGCAGAAGTTGAAGCAAAAACAGAGGAATTACAAGAAGCCAAATTCGTAAACAGAGACGAAAAGCATGAAATCATGAATGATTTGAAA
GAATCATCGAGTTCTTCAAGATCTGAATCGAGTCCTCCATGGAGTTCACCAGGGAGTTTTGGTAGAGATTATTCATCATTAGGAAGCTACGGATCGATGAGGAAAGAGAA
AGAATGGCGAAGAACACTCGCCTGTAAGCTCTTCGAAGAGAGGCACAGTTCAGAGGAAACAGAAGGGATGGATTCACTATGGGAAACATACGAGAAGAAGGAGAAAGACA
ATAGAAAATCCAAGAAAGAAGAAGAAGAAGAAGAAGAAGAAGAGGAAGAAGAAGGGCAACTTTGCTGCTTACAGGCACTGAAATTCTCGGCGGGGAAGATGAATTTGGGA
ATGAGGAGACCAAATCTTATGAAAATGACCAAAGCTTTGAAGGGATTTGGATGGTTGAGTAGAAAAGGAAGTAGAAAGAGATTGATCCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCTTTCTTGCTACTACTTACCATGTTGTTTTTGACAGTTTACGCCCAAGAACACCAGACGAATCCCCCATGGAAGAGCTTGAAGCTTACAAAATCGTGTTTGAGGC
TTACACTTTTGACTCTGAACAAAACCCATATGAACACAGTCTCCAAGAAGTGGAAGTGGAAGTCGATTTTCAAGAACCCATGGAGCATTTTCCCGAGAAAATCAAAACTC
TCCCGGAAAATCCCCTGATAATAGCAGAAGTTGAAGCAAAAACAGAGGAATTACAAGAAGCCAAATTCGTAAACAGAGACGAAAAGCATGAAATCATGAATGATTTGAAA
GAATCATCGAGTTCTTCAAGATCTGAATCGAGTCCTCCATGGAGTTCACCAGGGAGTTTTGGTAGAGATTATTCATCATTAGGAAGCTACGGATCGATGAGGAAAGAGAA
AGAATGGCGAAGAACACTCGCCTGTAAGCTCTTCGAAGAGAGGCACAGTTCAGAGGAAACAGAAGGGATGGATTCACTATGGGAAACATACGAGAAGAAGGAGAAAGACA
ATAGAAAATCCAAGAAAGAAGAAGAAGAAGAAGAAGAAGAAGAGGAAGAAGAAGGGCAACTTTGCTGCTTACAGGCACTGAAATTCTCGGCGGGGAAGATGAATTTGGGA
ATGAGGAGACCAAATCTTATGAAAATGACCAAAGCTTTGAAGGGATTTGGATGGTTGAGTAGAAAAGGAAGTAGAAAGAGATTGATCCATTGA
Protein sequenceShow/hide protein sequence
MGFLATTYHVVFDSLRPRTPDESPMEELEAYKIVFEAYTFDSEQNPYEHSLQEVEVEVDFQEPMEHFPEKIKTLPENPLIIAEVEAKTEELQEAKFVNRDEKHEIMNDLK
ESSSSSRSESSPPWSSPGSFGRDYSSLGSYGSMRKEKEWRRTLACKLFEERHSSEETEGMDSLWETYEKKEKDNRKSKKEEEEEEEEEEEEGQLCCLQALKFSAGKMNLG
MRRPNLMKMTKALKGFGWLSRKGSRKRLIH