; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032990 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032990
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr11:39593165..39595455
RNA-Seq ExpressionLag0032990
SyntenyLag0032990
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG5574372.1 hypothetical protein H5410_054506 [Solanum commersonii]8.9e-3024.54Show/hide
Query:  IDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSVFGQ
        +D+FL S     +F N   + + R  SDH P+ L  G  +   A F+    W+N   F + V +WW      G P   F  KLK LK+++K W+   FG+
Subjt:  IDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSVFGQ

Query:  SKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLKE---------------------------------------
           KK+ L  EL+ ++  +    LT D++  R  +   L  ++ NEE  WRQ+ ++ WLK+                                       
Subjt:  SKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLKE---------------------------------------

Query:  -----------------------------------------------------------------------------APKKVTKAIEKLYHRFLWSGSSE
                                                                                      PK + K + KL   FLW G+ E
Subjt:  -----------------------------------------------------------------------------APKKVTKAIEKLYHRFLWSGSSE

Query:  KKGYHLLRWSHIQLPMEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKFGLA--WRQDPILG--GCLA------------------------
        K+GY+L++W  + +   +GG GI  +  +N SLL KW WRF  E  ALWR+ I  K+GL   W  + + G  GC                          
Subjt:  KKGYHLLRWSHIQLPMEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKFGLA--WRQDPILG--GCLA------------------------

Query:  ---------GRISLETLVPISIQPIAQEKASIANLWSTNEGTWNLFLRRNLLESEILEWAMLSHHLSSF-SLSNRDDAWTWQLESDGLFSTGS
                 G ++L  L P       Q+  +IA +W T +G W+L  RR L + E+   A L H L+ F       D   W+L S G+F+  S
Subjt:  ---------GRISLETLVPISIQPIAQEKASIANLWSTNEGTWNLFLRRNLLESEILEWAMLSHHLSSF-SLSNRDDAWTWQLESDGLFSTGS

PWA97226.1 reverse transcriptase domain, Reverse transcriptase zinc-binding domain protein [Artemisia annua]4.4e-2922.53Show/hide
Query:  IDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSVFGQ
        ID+FL+ D +  K+ NA+ R ++R+ SDH P+  ++    +GP PFRL  +W+     +  + S  +   + G P    + KLK L+  LK+W       
Subjt:  IDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSVFGQ

Query:  SKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLK--------------------------------EAPKKVTK
          E +S L +E    E R E   L   D+   +E K AL  I  ++    RQ+ ++KW                                  E P  V +
Subjt:  SKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLK--------------------------------EAPKKVTK

Query:  AIEKLY--------------------HRFLWSGSSEKKGYHLLRWSHIQLPMEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKFGLA----
         I K +                     RFLW+G+SE+K    + W  I  P  +GG G+  +   N +LL KWSWRF +E  +LW+ +I +  G +    
Subjt:  AIEKLY--------------------HRFLWSGSSEKKGYHLLRWSHIQLPMEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKFGLA----

Query:  -----------WRQDPILG---------------GCLAG-----------------RISLETLVPISIQPIAQEKASIANLWSTNEGTWNLFLRRNLLES
                   W+Q   +G               GCL                   RI    L  +      +    I  + +    TWN + R      
Subjt:  -----------WRQDPILG---------------GCLAG-----------------RISLETLVPISIQPIAQEKASIANLWSTNEGTWNLFLRRNLLES

Query:  EILEWAMLSHHLSSFSLSNRDDAWTWQLESDGLFSTGSLT-----------------KNWLP------------------------------------HP
        EI E  +L   +   +     D W W+  +DG+FS  S                   K W+P                                    H 
Subjt:  EILEWAMLSHHLSSFSLSNRDDAWTWQLESDGLFSTGSLT-----------------KNWLP------------------------------------HP

Query:  FWDHVQGAFGWHFARPGNIQTLHHYSLLGHPF-----------KNDT-----KILWRNFLYAFFWNLWLERNARIFNNKQQNIYAFIESTSYLAMYWSSL
          + +   F            +  +  L H F           +ND+     + + R  +Y   W +W ERNARIF+NK +     ++   Y + +W   
Subjt:  FWDHVQGAFGWHFARPGNIQTLHHYSLLGHPF-----------KNDT-----KILWRNFLYAFFWNLWLERNARIFNNKQQNIYAFIESTSYLAMYWSSL

Query:  NS--------PFCNYPL
         S         +CNYPL
Subjt:  NS--------PFCNYPL

RVW99725.1 DNA repair protein RAD50 [Vitis vinifera]5.4e-2736.93Show/hide
Query:  IDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSVFGQ
        +D+FL S+     F  +    L R +SDH+PI L+    KWGP PFR    W+ H +F  +  SWW+     GW GH F++KL+ +K +LK WN + FG 
Subjt:  IDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSVFGQ

Query:  SKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLKEAPKKVTKAIEKLYHR
         KE+K  +S E+++++  E+ G L++D + +R   K  L  +   EEI W+Q+ K+KW+KE          KL+H+
Subjt:  SKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLKEAPKKVTKAIEKLYHR

RVW99725.1 DNA repair protein RAD50 [Vitis vinifera]3.0e-0944.05Show/hide
Query:  LKEAPKKVTKAIEKLYHRFLWSGSSEKKGYHLLRWSHIQLPMEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKFG
        L + P  V   IE+L   FLWSG  E K  HL+RW  +  P   GG GI  I  +N +LL KW WRF RE  +LW ++I + +G
Subjt:  LKEAPKKVTKAIEKLYHRFLWSGSSEKKGYHLLRWSHIQLPMEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKFG

XP_014630645.2 uncharacterized protein LOC102661789 [Glycine max]1.5e-3232.94Show/hide
Query:  IDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSVFGQ
        +D+FL+S   +TK+  +    LDRN   H  + L      WGP PFR+   W+ + SF + V   W +    GW G    +K+K LK+ LK WN   FG 
Subjt:  IDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSVFGQ

Query:  SKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLKEA----------PKKVTKAIEKLYHRFLWSGSSEKKGYHL
        + +K   +  EL+ LE       L+   +  R +++  L   + + E L RQ+ +  W+KE           PKKV   +  L  RFLW G S+      
Subjt:  SKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLKEA----------PKKVTKAIEKLYHRFLWSGSSEKKGYHL

Query:  LRWSHIQLPMEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKFG
        ++W  + LP E+GG GI DI   N+++L KW W  + +   LW KI+ +K+G
Subjt:  LRWSHIQLPMEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKFG

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]2.1e-3141.11Show/hide
Query:  TLIDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSVF
        +LID FL+++G I K     A+R+ R +SDHFPI L+ G+  WG  PFR    W++H +F   +++WW N P  GWPGHG + KLK LK  +K W    F
Subjt:  TLIDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSVF

Query:  GQSKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLKEAPKKVTKAIEKLYHRFL
             +K  L+  ++SL+  E    +T D    RI+ K  L+S+ A EE  WRQ+CK KWL E  +       K +HRFL
Subjt:  GQSKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLKEAPKKVTKAIEKLYHRFL

TrEMBL top hitse value%identityAlignment
A0A2N9ESC2 Endo/exonuclease/phosphatase domain-containing protein1.2e-3229.62Show/hide
Query:  MTLIDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSV
        M+ ID+FL S      F+    RRL +  SDHFPI L+ G    G +PF     W+    F+N + +WW +    G P      KLK LK +LK+WN   
Subjt:  MTLIDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSV

Query:  FGQSKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLKEA--------------PKKVTKAIEKLYHRFLWSGSS
        FG   +KK+ L   L   +   E  +  +  I   I +K    S+S  + +   +  K+  +K                P  V   I+K+   FLW G  
Subjt:  FGQSKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLKEA--------------PKKVTKAIEKLYHRFLWSGSS

Query:  EKKGYHLLRWSHIQLPMEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKFGLAWRQDPILGGCLAGRISLETLVPISIQPIAQEKASIANLW
        E   +HL+ WS I  P   GG GI ++   N +LL KW WRF  E  ALWR++I +K+           GCL G    +  VP S         +  +LW
Subjt:  EKKGYHLLRWSHIQLPMEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKFGLAWRQDPILGGCLAGRISLETLVPISIQPIAQEKASIANLW

Query:  STNEGTWNLFLRRNLLESEILEWAMLSHHLSSFSLSNRDDAW---TWQLES-----DGLFS---------TGSLTKNWLPHPFWDHVQGAFGWHFARPGN
              W  F       SE L + +   HL  F  S     W    W+LES     D L+S           S  K    +    HV   FG  +  P  
Subjt:  STNEGTWNLFLRRNLLESEILEWAMLSHHLSSFSLSNRDDAW---TWQLES-----DGLFS---------TGSLTKNWLPHPFWDHVQGAFGWHFARPGN

Query:  IQTLHHYSLLGHPFKNDTKILWRNFLYAFFWNLWLERNARIFNNKQQNI
        I  L      G     + +I WR   +   W +W ERN+R F +K++N+
Subjt:  IQTLHHYSLLGHPFKNDTKILWRNFLYAFFWNLWLERNARIFNNKQQNI

A0A2N9F8S0 zf-RVT domain-containing protein2.1e-2928.32Show/hide
Query:  MTLIDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSV
        M+ ID+FL SD     F +   +RL R  SDHFPI L  G       PFR    W+    F   V  WW +    G PG+    KLK LK +LK+WN  V
Subjt:  MTLIDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSV

Query:  FGQSKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLKEA-----------------------------------
        FG    K++ L  EL  L+   +L  LT  +  ++  I A L S S  EEI WRQ+ +  WL+E                                    
Subjt:  FGQSKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLKEA-----------------------------------

Query:  ---------------------------------------------------------------PKKVTKAIEKLYHRFLWSGSSEKKGYHLLRWSHIQLP
                                                                       P +V   +EKL   FLW G  E   +HL+ WS I  P
Subjt:  ---------------------------------------------------------------PKKVTKAIEKLYHRFLWSGSSEKKGYHLLRWSHIQLP

Query:  MEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKFGLAW
        +  GG  I ++   N +LL KW WR+  E +ALWR ++  K+G  W
Subjt:  MEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKFGLAW

A0A2U1QGT5 Reverse transcriptase domain, Reverse transcriptase zinc-binding domain protein2.1e-2922.53Show/hide
Query:  IDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSVFGQ
        ID+FL+ D +  K+ NA+ R ++R+ SDH P+  ++    +GP PFRL  +W+     +  + S  +   + G P    + KLK L+  LK+W       
Subjt:  IDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSVFGQ

Query:  SKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLK--------------------------------EAPKKVTK
          E +S L +E    E R E   L   D+   +E K AL  I  ++    RQ+ ++KW                                  E P  V +
Subjt:  SKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLK--------------------------------EAPKKVTK

Query:  AIEKLY--------------------HRFLWSGSSEKKGYHLLRWSHIQLPMEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKFGLA----
         I K +                     RFLW+G+SE+K    + W  I  P  +GG G+  +   N +LL KWSWRF +E  +LW+ +I +  G +    
Subjt:  AIEKLY--------------------HRFLWSGSSEKKGYHLLRWSHIQLPMEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKFGLA----

Query:  -----------WRQDPILG---------------GCLAG-----------------RISLETLVPISIQPIAQEKASIANLWSTNEGTWNLFLRRNLLES
                   W+Q   +G               GCL                   RI    L  +      +    I  + +    TWN + R      
Subjt:  -----------WRQDPILG---------------GCLAG-----------------RISLETLVPISIQPIAQEKASIANLWSTNEGTWNLFLRRNLLES

Query:  EILEWAMLSHHLSSFSLSNRDDAWTWQLESDGLFSTGSLT-----------------KNWLP------------------------------------HP
        EI E  +L   +   +     D W W+  +DG+FS  S                   K W+P                                    H 
Subjt:  EILEWAMLSHHLSSFSLSNRDDAWTWQLESDGLFSTGSLT-----------------KNWLP------------------------------------HP

Query:  FWDHVQGAFGWHFARPGNIQTLHHYSLLGHPF-----------KNDT-----KILWRNFLYAFFWNLWLERNARIFNNKQQNIYAFIESTSYLAMYWSSL
          + +   F            +  +  L H F           +ND+     + + R  +Y   W +W ERNARIF+NK +     ++   Y + +W   
Subjt:  FWDHVQGAFGWHFARPGNIQTLHHYSLLGHPF-----------KNDT-----KILWRNFLYAFFWNLWLERNARIFNNKQQNIYAFIESTSYLAMYWSSL

Query:  NS--------PFCNYPL
         S         +CNYPL
Subjt:  NS--------PFCNYPL

A0A6J1E2G6 uncharacterized protein LOC1110254051.0e-3141.11Show/hide
Query:  TLIDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSVF
        +LID FL+++G I K     A+R+ R +SDHFPI L+ G+  WG  PFR    W++H +F   +++WW N P  GWPGHG + KLK LK  +K W    F
Subjt:  TLIDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSVF

Query:  GQSKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLKEAPKKVTKAIEKLYHRFL
             +K  L+  ++SL+  E    +T D    RI+ K  L+S+ A EE  WRQ+CK KWL E  +       K +HRFL
Subjt:  GQSKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLKEAPKKVTKAIEKLYHRFL

A5C3T9 Uncharacterized protein2.6e-2736.93Show/hide
Query:  IDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSVFGQ
        +D+FL S+     F  +    L R +SDH+PI L+    KWGP PFR    W+ H +F  +  SWW+     GW GH F++KL+ +K +LK WN + FG 
Subjt:  IDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSVFGQ

Query:  SKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLKEAPKKVTKAIEKLYHR
         KE+K  +S E+++++  E+ G L++D +  R   K  L  +   EEI W+Q+ K+KW+KE          KL+H+
Subjt:  SKEKKSCLSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLKEAPKKVTKAIEKLYHR

A5C3T9 Uncharacterized protein5.0e-0240.32Show/hide
Query:  ILWRNFLYAFFWNLWLERNARIFNNKQQNIYAFIESTSYLAMYWSSLNSPFCNYPLSTFISQ
        ILW+N   A  W +W ERNARIF +K +N     +S  +LA  W+  ++ F   PL+ F  Q
Subjt:  ILWRNFLYAFFWNLWLERNARIFNNKQQNIYAFIESTSYLAMYWSSLNSPFCNYPLSTFISQ

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.4e-0932.91Show/hide
Query:  PKKVTKAIEKLYHRFLWSGSSEKKGYHLLRWSHIQLPMEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKF
        P+ +   +++L   FLW  ++EKK  HL++WS +  P +EGG G+      N +L++K  WR  +E  +LW  ++  K+
Subjt:  PKKVTKAIEKLYHRFLWSGSSEKKGYHLLRWSHIQLPMEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKF

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein7.1e-0935.44Show/hide
Query:  PKKVTKAIEKLYHRFLWSGSSEKKGYHLLRWSHIQLPMEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKF
        PK V K I  +   F W    E KG H   W H+     EGG G  DI   N++LL K  WR    P++L  K+  +++
Subjt:  PKKVTKAIEKLYHRFLWSGSSEKKGYHLLRWSHIQLPMEEGGRGIFDIHKKNISLLAKWSWRFYREPKALWRKIITTKF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCCTTATCGACAAGTTTCTTATATCCGATGGAATCATAACCAAATTCACAAATGCCACTGCTCGCCGACTGGACAGAAACTCCTCTGATCATTTCCCTATCAGCCT
CAATTTGGGGAAAGAGAAATGGGGACCAGCCCCTTTTAGGCTCAACATCGCTTGGGTTAATCATAGCTCCTTTCTCAATACAGTGGATTCTTGGTGGAAAAACACTCCCT
CCATGGGCTGGCCAGGACATGGGTTCATACAAAAATTGAAAGGCCTCAAGAAGGAGCTTAAGCAATGGAATCACTCTGTTTTCGGTCAGTCTAAAGAGAAAAAATCCTGC
CTGAGCAGAGAGCTTTCTTCCTTAGAACATAGGGAGGAGCTTGGTCAGCTTACCGCAGATGATATCAATAGAAGAATTGAGATAAAGGCTGCTCTTATCTCTATCTCGGC
CAATGAGGAGATCTTGTGGCGACAGCAATGCAAGCTGAAATGGCTTAAAGAAGCTCCAAAGAAGGTCACCAAGGCCATTGAGAAGTTATACCACAGGTTCCTATGGAGTG
GCAGTTCCGAGAAGAAAGGCTACCACCTTTTGAGGTGGTCTCATATTCAACTGCCTATGGAAGAGGGAGGTCGGGGCATTTTTGATATTCACAAGAAGAACATTTCTCTA
TTAGCTAAATGGTCTTGGAGATTCTATCGCGAGCCGAAAGCCCTTTGGAGGAAAATTATCACTACTAAATTTGGCCTTGCTTGGCGACAAGACCCTATTTTGGGAGGATG
TTTGGCTGGGCGCATCTCCCTTGAAACACTCGTACCCATCTCTATTCAACCTATCGCTCAAGAAAAGGCCAGCATTGCCAATTTATGGAGCACAAATGAGGGGACCTGGA
ATCTCTTTTTAAGAAGGAATTTACTTGAATCTGAAATCCTAGAATGGGCCATGTTATCTCACCACCTCTCATCTTTCTCCCTCTCTAATAGGGATGATGCTTGGACTTGG
CAGCTTGAAAGTGATGGTTTGTTCTCCACTGGGTCGCTCACTAAAAATTGGCTTCCTCATCCTTTTTGGGACCATGTTCAAGGCGCTTTTGGATGGCACTTTGCTAGACC
AGGGAACATCCAAACTCTTCATCATTATTCTCTTCTTGGTCACCCCTTCAAAAACGACACTAAGATTCTTTGGCGGAACTTTTTGTATGCATTCTTCTGGAACTTGTGGT
TAGAGAGAAATGCTAGAATCTTCAATAATAAGCAGCAGAATATTTATGCTTTCATCGAATCCACATCATACCTTGCTATGTATTGGAGCAGTCTCAATTCCCCCTTTTGT
AACTACCCCTTATCTACCTTTATATCTCAATGGGGATCTCTCTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGACCCTTATCGACAAGTTTCTTATATCCGATGGAATCATAACCAAATTCACAAATGCCACTGCTCGCCGACTGGACAGAAACTCCTCTGATCATTTCCCTATCAGCCT
CAATTTGGGGAAAGAGAAATGGGGACCAGCCCCTTTTAGGCTCAACATCGCTTGGGTTAATCATAGCTCCTTTCTCAATACAGTGGATTCTTGGTGGAAAAACACTCCCT
CCATGGGCTGGCCAGGACATGGGTTCATACAAAAATTGAAAGGCCTCAAGAAGGAGCTTAAGCAATGGAATCACTCTGTTTTCGGTCAGTCTAAAGAGAAAAAATCCTGC
CTGAGCAGAGAGCTTTCTTCCTTAGAACATAGGGAGGAGCTTGGTCAGCTTACCGCAGATGATATCAATAGAAGAATTGAGATAAAGGCTGCTCTTATCTCTATCTCGGC
CAATGAGGAGATCTTGTGGCGACAGCAATGCAAGCTGAAATGGCTTAAAGAAGCTCCAAAGAAGGTCACCAAGGCCATTGAGAAGTTATACCACAGGTTCCTATGGAGTG
GCAGTTCCGAGAAGAAAGGCTACCACCTTTTGAGGTGGTCTCATATTCAACTGCCTATGGAAGAGGGAGGTCGGGGCATTTTTGATATTCACAAGAAGAACATTTCTCTA
TTAGCTAAATGGTCTTGGAGATTCTATCGCGAGCCGAAAGCCCTTTGGAGGAAAATTATCACTACTAAATTTGGCCTTGCTTGGCGACAAGACCCTATTTTGGGAGGATG
TTTGGCTGGGCGCATCTCCCTTGAAACACTCGTACCCATCTCTATTCAACCTATCGCTCAAGAAAAGGCCAGCATTGCCAATTTATGGAGCACAAATGAGGGGACCTGGA
ATCTCTTTTTAAGAAGGAATTTACTTGAATCTGAAATCCTAGAATGGGCCATGTTATCTCACCACCTCTCATCTTTCTCCCTCTCTAATAGGGATGATGCTTGGACTTGG
CAGCTTGAAAGTGATGGTTTGTTCTCCACTGGGTCGCTCACTAAAAATTGGCTTCCTCATCCTTTTTGGGACCATGTTCAAGGCGCTTTTGGATGGCACTTTGCTAGACC
AGGGAACATCCAAACTCTTCATCATTATTCTCTTCTTGGTCACCCCTTCAAAAACGACACTAAGATTCTTTGGCGGAACTTTTTGTATGCATTCTTCTGGAACTTGTGGT
TAGAGAGAAATGCTAGAATCTTCAATAATAAGCAGCAGAATATTTATGCTTTCATCGAATCCACATCATACCTTGCTATGTATTGGAGCAGTCTCAATTCCCCCTTTTGT
AACTACCCCTTATCTACCTTTATATCTCAATGGGGATCTCTCTTGTAA
Protein sequenceShow/hide protein sequence
MTLIDKFLISDGIITKFTNATARRLDRNSSDHFPISLNLGKEKWGPAPFRLNIAWVNHSSFLNTVDSWWKNTPSMGWPGHGFIQKLKGLKKELKQWNHSVFGQSKEKKSC
LSRELSSLEHREELGQLTADDINRRIEIKAALISISANEEILWRQQCKLKWLKEAPKKVTKAIEKLYHRFLWSGSSEKKGYHLLRWSHIQLPMEEGGRGIFDIHKKNISL
LAKWSWRFYREPKALWRKIITTKFGLAWRQDPILGGCLAGRISLETLVPISIQPIAQEKASIANLWSTNEGTWNLFLRRNLLESEILEWAMLSHHLSSFSLSNRDDAWTW
QLESDGLFSTGSLTKNWLPHPFWDHVQGAFGWHFARPGNIQTLHHYSLLGHPFKNDTKILWRNFLYAFFWNLWLERNARIFNNKQQNIYAFIESTSYLAMYWSSLNSPFC
NYPLSTFISQWGSLL