CuGenDBv2

Spg035792 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg035792
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: scaffold5:37758037..37761670
RNA-Seq Expression: Spg035792
Synteny: Spg035792
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
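
The genome location field uses a `seqid:start..end` string. As a minimal sketch, the snippet below parses that string and reports the span length. It assumes the coordinates are 1-based and inclusive, which the record does not state; `GenomeLocation` and `parse_location` are hypothetical helper names, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'scaffold5:37758037..37761670'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("scaffold5:37758037..37761670")
print(loc.seqid, loc.start, loc.end, loc.length)
# scaffold5 37758037 37761670 3634
```

Under that assumption the gene spans 3634 bp on scaffold5.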


Homology
GenBank top hits | e value | % identity | Alignment
KAA0037436.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 1.5e-37 | 35
Query:  SVWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSME
        S+W  +++ +   K + +   P  +G +S  S K PWR I K      N     +  G+   FW   WI +  L  T+P LY +S+ + A I +LW+   
Subjt:  SVWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSME

Query:  GAWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQS-NIND--FYVQLWKGPMPKKVKFFLWELSHECINTAD
          WNLH RR L + EI  WT+L  IL   +L   +++  W     G ++  S  ++L S   + N+N+  FY  LWK  +PKK KFF+W L HE INT D
Subjt:  GAWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQS-NIND--FYVQLWKGPMPKKVKFFLWELSHECINTAD

Query:  LIQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSL
        ++Q+R P T L P+ C L   D+E   H+F  C    ++WD L +  G +F    DI+S+
Subjt:  LIQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSL

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 7.9e-39 | 32.82
Query:  VWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSMEG
        +WK L+  +   +K G   +  K       S+  PW+++ +     Y ++  +V  G+   FW D W GN PL +  P L+ +S  ++  + + WN    
Subjt:  VWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSMEG

Query:  AWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQSNIN---DFYVQLWKGPMPKKVKFFLWELSHECINTADL
         W+LH+ R L + E   W ++   L             W L  N +F T S+ + +A +  S  N   + Y  LWK   PKK KFF+W L H CINTAD 
Subjt:  AWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQSNIN---DFYVQLWKGPMPKKVKFFLWELSHECINTADL

Query:  IQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSLLQS
        +Q+R P  TLSPN C +C K  E   HLF  C Y+  +W   +    W+   P D+QSL+Q+
Subjt:  IQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSLLQS

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 9.3e-40 | 34.35
Query:  VWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSMEG
        +WK L+  +   +K G   +  K       S+  PW+++       Y ++  +V  G+   FW D W GN PL +  P L+ +S  ++  + DLWN    
Subjt:  VWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSMEG

Query:  AWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQSNIN---DFYVQLWKGPMPKKVKFFLWELSHECINTADL
         WN+H+ R L + E   W ++   L             WKL  N +F T S+ K L+ ++ S  N     Y  LWK   PKK KFF+W L H CINTAD 
Subjt:  AWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQSNIN---DFYVQLWKGPMPKKVKFFLWELSHECINTADL

Query:  IQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSLLQS
        +Q+R P  TLSPN C +C K  E   HLF  C Y+  +W   Q    W+   P D++SL Q+
Subjt:  IQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSLLQS

KAA0044556.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 9.3e-40 | 34.35
Query:  VWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSMEG
        +WK L+  +   +K G   +  K       S+  PW+++       Y ++  +V  G+   FW D W GN PL +  P L+ +S  ++  + DLWN    
Subjt:  VWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSMEG

Query:  AWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQSNIN---DFYVQLWKGPMPKKVKFFLWELSHECINTADL
         WN+H+ R L + E   W ++   L             WKL  N +F T S+ K L+ ++ S  N     Y  LWK   PKK KFF+W L H CINTAD 
Subjt:  AWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQSNIN---DFYVQLWKGPMPKKVKFFLWELSHECINTADL

Query:  IQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSLLQS
        +Q+R P  TLSPN C +C K  E   HLF  C Y+  +W   Q    W+   P D++SL Q+
Subjt:  IQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSLLQS

TYK00226.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 1.5e-37 | 35
Query:  SVWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSME
        S+W  +++ +   K + +   P  +G +S  S K PWR I K      N     +  G+   FW   WI +  L  T+P LY +S+ + A I +LW+   
Subjt:  SVWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSME

Query:  GAWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQS-NIND--FYVQLWKGPMPKKVKFFLWELSHECINTAD
          WNLH RR L + EI  WT+L  IL   +L   +++  W     G ++  S  ++L S   + N+N+  FY  LWK  +PKK KFF+W L HE INT D
Subjt:  GAWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQS-NIND--FYVQLWKGPMPKKVKFFLWELSHECINTAD

Query:  LIQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSL
        ++Q+R P T L P+ C L   D+E   H+F  C    ++WD L +  G +F    DI+S+
Subjt:  LIQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSL

TrEMBL top hits | e value | % identity | Alignment
A0A5A7T5L3 LINE-1 retrotransposable element ORF2 protein | 7.2e-38 | 35
Query:  SVWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSME
        S+W  +++ +   K + +   P  +G +S  S K PWR I K      N     +  G+   FW   WI +  L  T+P LY +S+ + A I +LW+   
Subjt:  SVWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSME

Query:  GAWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQS-NIND--FYVQLWKGPMPKKVKFFLWELSHECINTAD
          WNLH RR L + EI  WT+L  IL   +L   +++  W     G ++  S  ++L S   + N+N+  FY  LWK  +PKK KFF+W L HE INT D
Subjt:  GAWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQS-NIND--FYVQLWKGPMPKKVKFFLWELSHECINTAD

Query:  LIQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSL
        ++Q+R P T L P+ C L   D+E   H+F  C    ++WD L +  G +F    DI+S+
Subjt:  LIQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSL

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein | 3.8e-39 | 32.82
Query:  VWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSMEG
        +WK L+  +   +K G   +  K       S+  PW+++ +     Y ++  +V  G+   FW D W GN PL +  P L+ +S  ++  + + WN    
Subjt:  VWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSMEG

Query:  AWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQSNIN---DFYVQLWKGPMPKKVKFFLWELSHECINTADL
         W+LH+ R L + E   W ++   L             W L  N +F T S+ + +A +  S  N   + Y  LWK   PKK KFF+W L H CINTAD 
Subjt:  AWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQSNIN---DFYVQLWKGPMPKKVKFFLWELSHECINTADL

Query:  IQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSLLQS
        +Q+R P  TLSPN C +C K  E   HLF  C Y+  +W   +    W+   P D+QSL+Q+
Subjt:  IQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSLLQS

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein | 4.5e-40 | 34.35
Query:  VWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSMEG
        +WK L+  +   +K G   +  K       S+  PW+++       Y ++  +V  G+   FW D W GN PL +  P L+ +S  ++  + DLWN    
Subjt:  VWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSMEG

Query:  AWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQSNIN---DFYVQLWKGPMPKKVKFFLWELSHECINTADL
         WN+H+ R L + E   W ++   L             WKL  N +F T S+ K L+ ++ S  N     Y  LWK   PKK KFF+W L H CINTAD 
Subjt:  AWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQSNIN---DFYVQLWKGPMPKKVKFFLWELSHECINTADL

Query:  IQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSLLQS
        +Q+R P  TLSPN C +C K  E   HLF  C Y+  +W   Q    W+   P D++SL Q+
Subjt:  IQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSLLQS

A0A5A7TR15 LINE-1 retrotransposable element ORF2 protein | 4.5e-40 | 34.35
Query:  VWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSMEG
        +WK L+  +   +K G   +  K       S+  PW+++       Y ++  +V  G+   FW D W GN PL +  P L+ +S  ++  + DLWN    
Subjt:  VWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSMEG

Query:  AWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQSNIN---DFYVQLWKGPMPKKVKFFLWELSHECINTADL
         WN+H+ R L + E   W ++   L             WKL  N +F T S+ K L+ ++ S  N     Y  LWK   PKK KFF+W L H CINTAD 
Subjt:  AWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQSNIN---DFYVQLWKGPMPKKVKFFLWELSHECINTADL

Query:  IQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSLLQS
        +Q+R P  TLSPN C +C K  E   HLF  C Y+  +W   Q    W+   P D++SL Q+
Subjt:  IQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSLLQS

A0A5D3BPP1 LINE-1 retrotransposable element ORF2 protein | 7.2e-38 | 35
Query:  SVWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSME
        S+W  +++ +   K + +   P  +G +S  S K PWR I K      N     +  G+   FW   WI +  L  T+P LY +S+ + A I +LW+   
Subjt:  SVWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSME

Query:  GAWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQS-NIND--FYVQLWKGPMPKKVKFFLWELSHECINTAD
          WNLH RR L + EI  WT+L  IL   +L   +++  W     G ++  S  ++L S   + N+N+  FY  LWK  +PKK KFF+W L HE INT D
Subjt:  GAWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQS-NIND--FYVQLWKGPMPKKVKFFLWELSHECINTAD

Query:  LIQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSL
        ++Q+R P T L P+ C L   D+E   H+F  C    ++WD L +  G +F    DI+S+
Subjt:  LIQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSL

SwissProt top hits | e value | % identity | Alignment
P0C2F6 Putative ribonuclease H protein At1g65750 | 6.5e-12 | 26.12
Query:  SSVWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHK-----LQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIID
        +S+W TLV    + KKY V     ++ D      KG W S  +     L+ +V + +    G G +  FW D W+  +PL +   +  R +     +  D
Subjt:  SSVWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHK-----LQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIID

Query:  LWNSMEGAWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLA--SSTQSNINDFYVQLWKGPMPKKVKFFLWELSHEC
        LW    G     +    T     E  ++   L    +  A D   WK  ++G FS  S  + L      + N+  F+  LWK  +P++VK FLW + ++ 
Subjt:  LWNSMEGAWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLA--SSTQSNINDFYVQLWKGPMPKKVKFFLWELSHEC

Query:  INTADLIQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIW
        + T +   RR+   +   N C +C    ES +H+   C     IW
Subjt:  INTADLIQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIW

Arabidopsis top hits | e value | % identity | Alignment
AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 1.6e-05 | 25.13
Query:  LQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSM-EGAWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWK
        L+ L    ++  +G G    FW D W    PL +     Y     R  L   +  ++    W L L R+   + I +  S    ++  S    EDS+ W 
Subjt:  LQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTFPSLYRVSQKREALIIDLWNSM-EGAWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWK

Query:  LEKNGLFSTGSLTKKL--ASSTQSNINDFYVQLW-KGPMPKKVKFFLWELSHECINTADLIQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIW
        +   G+   G  + +   A   ++   D+   +W KG +PK   F +W    + + T    QR      +    CCLC  ++ES+ HL   C +AA +W
Subjt:  LEKNGLFSTGSLTKKL--ASSTQSNINDFYVQLW-KGPMPKKVKFFLWELSHECINTADLIQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIW

AT3G09510.1 Ribonuclease H-like superfamily protein | 5.1e-04 | 24.32
Query:  DSWFWKLEKNGLFSTGSLTKKLASSTQSNIN---------DFYVQLWKGPMPKKVKFFLWELSHECINTADLIQRRNPGTTLSPNCCCLCYKDSESQVHL
        D   W     G ++  S    L     +NI          D   ++W  P+  K+K FLW    + + T + +  R  G  + P+ C  C++++ES  H 
Subjt:  DSWFWKLEKNGLFSTGSLTKKLASSTQSNIN---------DFYVQLWKGPMPKKVKFFLWELSHECINTADLIQRRNPGTTLSPNCCCLCYKDSESQVHL

Query:  FSKCVYAASIW
           C +A   W
Subjt:  FSKCVYAASIW

AT4G29090.1 Ribonuclease H-like superfamily protein | 1.5e-08 | 21.66
Query:  WRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTF-----PSLYRVSQKREALIIDLWNSMEGAWNLHLRRNLTEEEIFEWTSLSHILSCFSL
        W+SIH  Q+++       VG G+  + W   W+ ++P          P     S      + DL +     W   +   L  E   E   +  +      
Subjt:  WRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIGNEPLKVTF-----PSLYRVSQKREALIIDLWNSMEGAWNLHLRRNLTEEEIFEWTSLSHILSCFSL

Query:  KEAEDSWFWKLEKNGLFSTGS-------LTKKLASS---TQSNINDFYVQLWKGPMPKKVKFFLWELSHECINTADLIQRRNPGTTLSPNCCCL-CYKDS
        +   DS+ W    +G ++  S       +  K +S    ++ ++N  Y ++WK     K++ FLW+     +  A  +  R+    LS    C+ C    
Subjt:  KEAEDSWFWKLEKNGLFSTGS-------LTKKLASS---TQSNINDFYVQLWKGPMPKKVKFFLWELSHECINTADLIQRRNPGTTLSPNCCCL-CYKDS

Query:  ESQVHLFSKCVYAASIW
        E+  HL  KC +A   W
Subjt:  ESQVHLFSKCVYAASIW


Sequences
CDS sequence
GTTGCATTCGATGGCCTCTATTTGCCCTCTTCTAGGGGTGATCGTCGATCGGTCGGCGTCAGTTTTGGGCTCAAACCGATGCCGACCCCCGACATGTCGATTTCCAACGA
TCGGTTCTCGTCAGTTTGGAAGACGTTGGTTACGCGAAGATGCATTGCCAAAAAGTATGGAGTTGCTTTAAATCCTTTCAAACTGGGAGATCACTCCATTAAATCCTCTA
AAGGGCCATGGAGATCTATTCATAAATTGCAGCAACTTGTCTATAATTCCATGGAGATCAGAGTTGGAAAAGGTGATAAAACAATGTTCTGGGATGACCTATGGATTGGA
AATGAACCTCTAAAAGTCACATTCCCCTCTCTCTACAGGGTATCCCAAAAAAGAGAAGCCCTCATTATTGACCTATGGAACTCTATGGAAGGGGCTTGGAACCTACACTT
AAGGAGAAATCTCACAGAAGAAGAAATTTTTGAATGGACATCTCTCTCCCACATTCTCTCTTGTTTCTCTCTAAAAGAGGCAGAGGACTCATGGTTTTGGAAGCTTGAAA
AGAATGGTCTATTCTCTACAGGATCACTTACCAAAAAACTGGCCTCCTCTACTCAGTCCAACATAAATGATTTCTATGTTCAGCTATGGAAAGGCCCCATGCCGAAAAAG
GTAAAATTCTTTCTATGGGAGCTCAGCCACGAATGTATAAATACAGCCGATCTCATACAAAGGAGAAATCCGGGGACAACCCTTTCCCCAAATTGCTGCTGCCTATGTTA
CAAAGATTCGGAGTCTCAAGTCCATCTCTTCAGTAAGTGTGTTTATGCTGCATCTATCTGGGACTACCTTCAACAGGCTTTTGGATGGTCCTTTGTTCGACCGGGAGACA
TCCAATCCTTGCTTCAATCCTCTCTCACCGGGCATCCCTTTAAAAAGGAAAAGAAGTTTCTTTGGAGGACCTTTCTCCCAGCGAGCAAAGAGAACGAAGATATGTTGGAA
GAAAACGACAGTATTATTGAGGCTTCGGACGAGGAAATCCTAGAGGGGTACCATCGATGCTTCATGAGCGAAAATAGGGAAGAAGGAAATATGAGTCTTGATCATGTTGA
GGAAACTCTGGATAGAGTCGAGGGATTGCTCCCGAAAGGCATAACGAACCATGAGGATGAGCCAAACGAACCTGACAACATGGACGAGCAGTCGAAAACAAACCCAAGGG
CCCTGGCCATAATCTCGCCAAAGAGTATGGATGAAACTAGCTCCTTAGAAGGATTCTCGATTAGCAAAGAAATTGTTCTTACTCTTCAAAGAAATAATTTGTGCATCAAA
CCTATCTCGGGTTCGAATATGAAAAAAGGCAGTACCGCTCAGAGGAGGCGTAACAGGGAAATGACGAGCCACCTAAGAACCTGGGAAAAGGAAGTAGAAGATACCAACGA
AAATGAAGATCTCGAAGAGTATGTTGATCTAGTCGATAGCTTAGTCAGGGCCGAAAAGGAAACTAACCTTGCTAAGGTTGATAGGAGGATAGTGAAATCCATTTGGAGCT
CTAGGCATGTTGCGTTGCTGTCCTTAGATGCTTGTAACTCAGCCAGGGGTATCATTATTCTGTGGAAGGAGAATTCGATTGATGTTGTGAACTCGTTTTTGGGGGTTTTC
TCGATTACTATTCAGTGCACGCTCCAAGGGCAGAAGGAGGGTTGGATTACTGGTGTTTACGACCCGTGTGATTACCAGGATAGAAAACATTTCCTTTAA
mRNA sequence
GTTGCATTCGATGGCCTCTATTTGCCCTCTTCTAGGGGTGATCGTCGATCGGTCGGCGTCAGTTTTGGGCTCAAACCGATGCCGACCCCCGACATGTCGATTTCCAACGA
TCGGTTCTCGTCAGTTTGGAAGACGTTGGTTACGCGAAGATGCATTGCCAAAAAGTATGGAGTTGCTTTAAATCCTTTCAAACTGGGAGATCACTCCATTAAATCCTCTA
AAGGGCCATGGAGATCTATTCATAAATTGCAGCAACTTGTCTATAATTCCATGGAGATCAGAGTTGGAAAAGGTGATAAAACAATGTTCTGGGATGACCTATGGATTGGA
AATGAACCTCTAAAAGTCACATTCCCCTCTCTCTACAGGGTATCCCAAAAAAGAGAAGCCCTCATTATTGACCTATGGAACTCTATGGAAGGGGCTTGGAACCTACACTT
AAGGAGAAATCTCACAGAAGAAGAAATTTTTGAATGGACATCTCTCTCCCACATTCTCTCTTGTTTCTCTCTAAAAGAGGCAGAGGACTCATGGTTTTGGAAGCTTGAAA
AGAATGGTCTATTCTCTACAGGATCACTTACCAAAAAACTGGCCTCCTCTACTCAGTCCAACATAAATGATTTCTATGTTCAGCTATGGAAAGGCCCCATGCCGAAAAAG
GTAAAATTCTTTCTATGGGAGCTCAGCCACGAATGTATAAATACAGCCGATCTCATACAAAGGAGAAATCCGGGGACAACCCTTTCCCCAAATTGCTGCTGCCTATGTTA
CAAAGATTCGGAGTCTCAAGTCCATCTCTTCAGTAAGTGTGTTTATGCTGCATCTATCTGGGACTACCTTCAACAGGCTTTTGGATGGTCCTTTGTTCGACCGGGAGACA
TCCAATCCTTGCTTCAATCCTCTCTCACCGGGCATCCCTTTAAAAAGGAAAAGAAGTTTCTTTGGAGGACCTTTCTCCCAGCGAGCAAAGAGAACGAAGATATGTTGGAA
GAAAACGACAGTATTATTGAGGCTTCGGACGAGGAAATCCTAGAGGGGTACCATCGATGCTTCATGAGCGAAAATAGGGAAGAAGGAAATATGAGTCTTGATCATGTTGA
GGAAACTCTGGATAGAGTCGAGGGATTGCTCCCGAAAGGCATAACGAACCATGAGGATGAGCCAAACGAACCTGACAACATGGACGAGCAGTCGAAAACAAACCCAAGGG
CCCTGGCCATAATCTCGCCAAAGAGTATGGATGAAACTAGCTCCTTAGAAGGATTCTCGATTAGCAAAGAAATTGTTCTTACTCTTCAAAGAAATAATTTGTGCATCAAA
CCTATCTCGGGTTCGAATATGAAAAAAGGCAGTACCGCTCAGAGGAGGCGTAACAGGGAAATGACGAGCCACCTAAGAACCTGGGAAAAGGAAGTAGAAGATACCAACGA
AAATGAAGATCTCGAAGAGTATGTTGATCTAGTCGATAGCTTAGTCAGGGCCGAAAAGGAAACTAACCTTGCTAAGGTTGATAGGAGGATAGTGAAATCCATTTGGAGCT
CTAGGCATGTTGCGTTGCTGTCCTTAGATGCTTGTAACTCAGCCAGGGGTATCATTATTCTGTGGAAGGAGAATTCGATTGATGTTGTGAACTCGTTTTTGGGGGTTTTC
TCGATTACTATTCAGTGCACGCTCCAAGGGCAGAAGGAGGGTTGGATTACTGGTGTTTACGACCCGTGTGATTACCAGGATAGAAAACATTTCCTTTAA
Protein sequence
VAFDGLYLPSSRGDRRSVGVSFGLKPMPTPDMSISNDRFSSVWKTLVTRRCIAKKYGVALNPFKLGDHSIKSSKGPWRSIHKLQQLVYNSMEIRVGKGDKTMFWDDLWIG
NEPLKVTFPSLYRVSQKREALIIDLWNSMEGAWNLHLRRNLTEEEIFEWTSLSHILSCFSLKEAEDSWFWKLEKNGLFSTGSLTKKLASSTQSNINDFYVQLWKGPMPKK
VKFFLWELSHECINTADLIQRRNPGTTLSPNCCCLCYKDSESQVHLFSKCVYAASIWDYLQQAFGWSFVRPGDIQSLLQSSLTGHPFKKEKKFLWRTFLPASKENEDMLE
ENDSIIEASDEEILEGYHRCFMSENREEGNMSLDHVEETLDRVEGLLPKGITNHEDEPNEPDNMDEQSKTNPRALAIISPKSMDETSSLEGFSISKEIVLTLQRNNLCIK
PISGSNMKKGSTAQRRRNREMTSHLRTWEKEVEDTNENEDLEEYVDLVDSLVRAEKETNLAKVDRRIVKSIWSSRHVALLSLDACNSARGIIILWKENSIDVVNSFLGVF
SITIQCTLQGQKEGWITGVYDPCDYQDRKHFL
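
The protein sequence above can be compared against the CDS by translating the CDS with the standard genetic code. The following is a minimal sketch, not the database's own pipeline: `translate` is a hypothetical helper, the table is the standard nuclear code, translation is assumed to start at the first base (frame 0), and any codon outside the table is rendered as `X`.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("GTTGCATTCGATGGCCTCTAT"))
# VAFDGLY
```

The result matches the first seven residues of the listed protein sequence. Passing the full CDS, with or without its line breaks, allows the same comparison over the whole record.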