CuGenDBv2

Lag0041050 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0041050
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotran_gag_3 domain-containing protein
Genome location: chr13:11310693..11312478
RNA-Seq Expression: Lag0041050
Synteny: Lag0041050
Gene Ontology terms: NA
InterPro domains: IPR029472 - Retrotransposon Copia-like, N-terminal
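The genome location above can be split into a sequence name and a coordinate pair when the record is processed by script. The following is a minimal sketch, not part of the database: the `Location` type and the `parse_location` helper are hypothetical names, and the coordinates are assumed to be 1-based and inclusive.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not a CuGenDB API)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'chr13:11310693..11312478'."""
    match = re.fullmatch(r"(\S+?):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("chr13:11310693..11312478")
print(loc.seqid, loc.start, loc.end, loc.length)  # chr13 11310693 11312478 1786
```

Under the inclusive-coordinate assumption the gene spans 1,786 bp on chr13.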


Homology
GenBank top hits | e value | % identity | Alignment
KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia] | 7.1e-35 | 39.34
Query:  IQSANSNSAS-------NPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAP------------PKTLTT----------LSYVI
        + S+NS+S+S       +P V LL NICNL+S++L S+NYVLW+FQ+T LLK+HKLF +IDG+   P             + L T          L+YV+
Subjt:  IQSANSNSAS-------NPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAP------------PKTLTT----------LSYVI

Query:  GCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKKL---------------------------EDLVIYAVNGLPSAYNVFKMSLRTRSQNLTFDELH
        G  T+K VW+ L K +S S+R+N+V +KS+LQ+I+KK                            EDL+IYA+NGLP+ YN F+ S+RTRS  +TF+ELH
Subjt:  GCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKKL---------------------------EDLVIYAVNGLPSAYNVFKMSLRTRSQNLTFDELH

Query:  ILMKTEETTIDKQSKSDD--TNTVTQLAMAASLDFNGKGNNSNWRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFSPGQLSSTSPSPGQFSSKVHCQIC
        +L+K EE+ + KQSK DD        LA + SL       N+N+    RGRGR       GRG G+   S   Q    G  SS        ++   CQIC
Subjt:  ILMKTEETTIDKQSKSDD--TNTVTQLAMAASLDFNGKGNNSNWRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFSPGQLSSTSPSPGQFSSKVHCQIC

Query:  QRYGH
         R GH
Subjt:  QRYGH

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma] | 7.1e-35 | 39.34
Query:  IQSANSNSAS-------NPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAP------------PKTLTT----------LSYVI
        + S+NS+S+S       +P V LL NICNL+S++L S+NYVLW+FQ+T LLK+HKLF +IDG+   P             + L T          L+YV+
Subjt:  IQSANSNSAS-------NPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAP------------PKTLTT----------LSYVI

Query:  GCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKKL---------------------------EDLVIYAVNGLPSAYNVFKMSLRTRSQNLTFDELH
        G  T+K VW+ L K +S S+R+N+V +KS+LQ+I+KK                            EDL+IYA+NGLP+ YN F+ S+RTRS  +TF+ELH
Subjt:  GCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKKL---------------------------EDLVIYAVNGLPSAYNVFKMSLRTRSQNLTFDELH

Query:  ILMKTEETTIDKQSKSDD--TNTVTQLAMAASLDFNGKGNNSNWRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFSPGQLSSTSPSPGQFSSKVHCQIC
        +L+K EE+ + KQSK DD        LA + SL       N+N+    RGRGR       GRG G+   S   Q    G  SS        ++   CQIC
Subjt:  ILMKTEETTIDKQSKSDD--TNTVTQLAMAASLDFNGKGNNSNWRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFSPGQLSSTSPSPGQFSSKVHCQIC

Query:  QRYGH
         R GH
Subjt:  QRYGH

TYK17989.1 uncharacterized protein E5676_scaffold306G002980 [Cucumis melo var. makuwa] | 1.6e-34 | 38
Query:  SANSNSASNPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAPPKTLT-------------------------------------
        S+ S   S+P + LL NICNL+S+RL S+NY LW+FQ  P+LK+HKL+ +ID SI  PPKT++                                     
Subjt:  SANSNSASNPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAPPKTLT-------------------------------------

Query:  ----------TLSYVIGCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKKL---------------------------EDLVIYAVNGLPSAYNVFK
                   L+YV+GC+++  VW  LE+H+S + RTNIV +KS+LQ I+KK                            EDLVIYA+NGLP  YN F+
Subjt:  ----------TLSYVIGCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKKL---------------------------EDLVIYAVNGLPSAYNVFK

Query:  MSLRTRSQNLTFDELHILMKTEETTIDKQSKSDDTNTVTQLAMAASLDFN
         S++TRSQ ++F ELHIL+K+EE+ ++KQ+K +D  +V   AM A+ + N
Subjt:  MSLRTRSQNLTFDELHILMKTEETTIDKQSKSDDTNTVTQLAMAASLDFN

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo] | 6.0e-34 | 35.37
Query:  MSSSTEQTLIQSANSNSASNPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAPPKTLTT-------------------------
        MSSST  TL  S+    + +P + LL NICNL+S+RL S+N+VLW+FQ+T +LK+HKL+ +IDG+   PP+T  +                         
Subjt:  MSSSTEQTLIQSANSNSASNPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAPPKTLTT-------------------------

Query:  -----------LSYVIGCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKK---------------------------LEDLVIYAVNGLPSAYNVFK
                   L+YV+G  ++K VWD L K +S  +R+N+V +KS+LQ+I KK                            EDL+IYA+NGLP+ YN F+
Subjt:  -----------LSYVIGCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKK---------------------------LEDLVIYAVNGLPSAYNVFK

Query:  MSLRTRSQNLTFDELHILMKTEETTIDKQSKSDD---------TNTVTQLAMAASLDFNGKGNNSNWRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFS
         S+RTRSQ +TF+ELH+L++ EE+ + KQSK DD         +++ + L+ A + D N    N + +  G GR  FD      RG G+          S
Subjt:  MSLRTRSQNLTFDELHILMKTEETTIDKQSKSDD---------TNTVTQLAMAASLDFNGKGNNSNWRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFS

Query:  PGQLSSTSPSPGQFSSKVHCQICQRYGH
        P Q S          +   CQIC R GH
Subjt:  PGQLSSTSPSPGQFSSKVHCQICQRYGH

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia] | 7.1e-35 | 35.13
Query:  SANSNSASNPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAP-------------PKTLTT-----------------------
        S N+    +  + LL NICNLVS+RL S++++LW+FQ+T +LK+HKLF +IDGS++AP             P T T+                       
Subjt:  SANSNSASNPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAP-------------PKTLTT-----------------------

Query:  -------LSYVIGCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKKLED---------------------------LVIYAVNGLPSAYNVFKMSLR
               L+YV+   T+K VW+ LEKH+S ++RTN+V +KS+LQSI KK E+                           L+IYA+NGL + YN    S+R
Subjt:  -------LSYVIGCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKKLED---------------------------LVIYAVNGLPSAYNVFKMSLR

Query:  TRSQNLTFDELHILMKTEETTIDKQSKSDDTNTVTQLAMAASLDFNGKGNNSN-WRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFSPGQLSSTSPSPG
        TR+Q+++F+ELH+ MK+EE+ I+KQ K +D  T      A+S     + +  +  +S+ RGRG     +N GRG+ N  P+  NQ    G+ S    +  
Subjt:  TRSQNLTFDELHILMKTEETTIDKQSKSDDTNTVTQLAMAASLDFNGKGNNSN-WRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFSPGQLSSTSPSPG

Query:  QFSSKVHCQICQRYGH
        Q  ++  CQIC + GH
Subjt:  QFSSKVHCQICQRYGH

TrEMBL top hits | e value | % identity | Alignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X2 | 2.9e-34 | 35.37
Query:  MSSSTEQTLIQSANSNSASNPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAPPKTLTT-------------------------
        MSSST  TL  S+    + +P + LL NICNL+S+RL S+N+VLW+FQ+T +LK+HKL+ +IDG+   PP+T  +                         
Subjt:  MSSSTEQTLIQSANSNSASNPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAPPKTLTT-------------------------

Query:  -----------LSYVIGCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKK---------------------------LEDLVIYAVNGLPSAYNVFK
                   L+YV+G  ++K VWD L K +S  +R+N+V +KS+LQ+I KK                            EDL+IYA+NGLP+ YN F+
Subjt:  -----------LSYVIGCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKK---------------------------LEDLVIYAVNGLPSAYNVFK

Query:  MSLRTRSQNLTFDELHILMKTEETTIDKQSKSDD---------TNTVTQLAMAASLDFNGKGNNSNWRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFS
         S+RTRSQ +TF+ELH+L++ EE+ + KQSK DD         +++ + L+ A + D N    N + +  G GR  FD      RG G+          S
Subjt:  MSLRTRSQNLTFDELHILMKTEETTIDKQSKSDD---------TNTVTQLAMAASLDFNGKGNNSNWRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFS

Query:  PGQLSSTSPSPGQFSSKVHCQICQRYGH
        P Q S          +   CQIC R GH
Subjt:  PGQLSSTSPSPGQFSSKVHCQICQRYGH

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X1 | 2.9e-34 | 35.37
Query:  MSSSTEQTLIQSANSNSASNPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAPPKTLTT-------------------------
        MSSST  TL  S+    + +P + LL NICNL+S+RL S+N+VLW+FQ+T +LK+HKL+ +IDG+   PP+T  +                         
Subjt:  MSSSTEQTLIQSANSNSASNPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAPPKTLTT-------------------------

Query:  -----------LSYVIGCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKK---------------------------LEDLVIYAVNGLPSAYNVFK
                   L+YV+G  ++K VWD L K +S  +R+N+V +KS+LQ+I KK                            EDL+IYA+NGLP+ YN F+
Subjt:  -----------LSYVIGCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKK---------------------------LEDLVIYAVNGLPSAYNVFK

Query:  MSLRTRSQNLTFDELHILMKTEETTIDKQSKSDD---------TNTVTQLAMAASLDFNGKGNNSNWRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFS
         S+RTRSQ +TF+ELH+L++ EE+ + KQSK DD         +++ + L+ A + D N    N + +  G GR  FD      RG G+          S
Subjt:  MSLRTRSQNLTFDELHILMKTEETTIDKQSKSDD---------TNTVTQLAMAASLDFNGKGNNSNWRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFS

Query:  PGQLSSTSPSPGQFSSKVHCQICQRYGH
        P Q S          +   CQIC R GH
Subjt:  PGQLSSTSPSPGQFSSKVHCQICQRYGH

A0A5D3CLI6 T4.5 | 2.9e-34 | 35.37
Query:  MSSSTEQTLIQSANSNSASNPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAPPKTLTT-------------------------
        MSSST  TL  S+    + +P + LL NICNL+S+RL S+N+VLW+FQ+T +LK+HKL+ +IDG+   PP+T  +                         
Subjt:  MSSSTEQTLIQSANSNSASNPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAPPKTLTT-------------------------

Query:  -----------LSYVIGCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKK---------------------------LEDLVIYAVNGLPSAYNVFK
                   L+YV+G  ++K VWD L K +S  +R+N+V +KS+LQ+I KK                            EDL+IYA+NGLP+ YN F+
Subjt:  -----------LSYVIGCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKK---------------------------LEDLVIYAVNGLPSAYNVFK

Query:  MSLRTRSQNLTFDELHILMKTEETTIDKQSKSDD---------TNTVTQLAMAASLDFNGKGNNSNWRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFS
         S+RTRSQ +TF+ELH+L++ EE+ + KQSK DD         +++ + L+ A + D N    N + +  G GR  FD      RG G+          S
Subjt:  MSLRTRSQNLTFDELHILMKTEETTIDKQSKSDD---------TNTVTQLAMAASLDFNGKGNNSNWRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFS

Query:  PGQLSSTSPSPGQFSSKVHCQICQRYGH
        P Q S          +   CQIC R GH
Subjt:  PGQLSSTSPSPGQFSSKVHCQICQRYGH

A0A5D3D3T6 Retrotran_gag_3 domain-containing protein | 7.6e-35 | 38
Query:  SANSNSASNPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAPPKTLT-------------------------------------
        S+ S   S+P + LL NICNL+S+RL S+NY LW+FQ  P+LK+HKL+ +ID SI  PPKT++                                     
Subjt:  SANSNSASNPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAPPKTLT-------------------------------------

Query:  ----------TLSYVIGCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKKL---------------------------EDLVIYAVNGLPSAYNVFK
                   L+YV+GC+++  VW  LE+H+S + RTNIV +KS+LQ I+KK                            EDLVIYA+NGLP  YN F+
Subjt:  ----------TLSYVIGCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKKL---------------------------EDLVIYAVNGLPSAYNVFK

Query:  MSLRTRSQNLTFDELHILMKTEETTIDKQSKSDDTNTVTQLAMAASLDFN
         S++TRSQ ++F ELHIL+K+EE+ ++KQ+K +D  +V   AM A+ + N
Subjt:  MSLRTRSQNLTFDELHILMKTEETTIDKQSKSDDTNTVTQLAMAASLDFN

A0A6J1D9L6 uncharacterized protein LOC111018892 | 3.4e-35 | 35.13
Query:  SANSNSASNPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAP-------------PKTLTT-----------------------
        S N+    +  + LL NICNLVS+RL S++++LW+FQ+T +LK+HKLF +IDGS++AP             P T T+                       
Subjt:  SANSNSASNPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAP-------------PKTLTT-----------------------

Query:  -------LSYVIGCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKKLED---------------------------LVIYAVNGLPSAYNVFKMSLR
               L+YV+   T+K VW+ LEKH+S ++RTN+V +KS+LQSI KK E+                           L+IYA+NGL + YN    S+R
Subjt:  -------LSYVIGCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKKLED---------------------------LVIYAVNGLPSAYNVFKMSLR

Query:  TRSQNLTFDELHILMKTEETTIDKQSKSDDTNTVTQLAMAASLDFNGKGNNSN-WRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFSPGQLSSTSPSPG
        TR+Q+++F+ELH+ MK+EE+ I+KQ K +D  T      A+S     + +  +  +S+ RGRG     +N GRG+ N  P+  NQ    G+ S    +  
Subjt:  TRSQNLTFDELHILMKTEETTIDKQSKSDDTNTVTQLAMAASLDFNGKGNNSN-WRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFSPGQLSSTSPSPG

Query:  QFSSKVHCQICQRYGH
        Q  ++  CQIC + GH
Subjt:  QFSSKVHCQICQRYGH

SwissProt top hits | e value | % identity | Alignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 8.7e-04 | 21.63
Query:  RLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAPPKTLTT--------------------LSYVIG------------CQTAKDVWDKLEKHFSYSNRTN
        +L S+NY++W  Q+  L   ++L  ++DGS T PP T+ T                     S V+G              TA  +W+ L K ++  +  +
Subjt:  RLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAPPKTLTT--------------------LSYVIG------------CQTAKDVWDKLEKHFSYSNRTN

Query:  IVGMKSEL--------------QSIAKKLEDLVIY------------AVNGLPSAYNVFKMSLRTRSQNLTFDELHILMKTEETTIDKQSKSDDTNTVTQ
        +  ++++L              Q +  + + L +              +  LP  Y      +  +    T  E+H      E  ++ +SK    ++ T 
Subjt:  IVGMKSEL--------------QSIAKKLEDLVIY------------AVNGLPSAYNVFKMSLRTRSQNLTFDELHILMKTEETTIDKQSKSDDTNTVTQ

Query:  LAMAASLDFNGKGNNSNWRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFSPGQLSSTSPSPGQFSSKVH---CQICQRYGH
        + + A+   +     +N  +NG    R+D NRN             N +  P Q SST+  P    SK +   CQIC   GH
Subjt:  LAMAASLDFNGKGNNSNWRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFSPGQLSSTSPSPGQFSSKVH---CQICQRYGH

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAGTAGGCATGTCACAAAATATGGCAGAATTTCGATCCATATCGACCCAGAGGTAAAAAAACCAGTTGGAGAATATGCGACGCAATTCAACATAGTGATTGGTACGGT
CGTCCAGGAGTCTTTCTCGGTTCGATTCAACACATGGGATGATGTAATTGAAGAGACTAAGAACCTAGTAAAGGCTCGACTGCTGAAGACTGACATTATCTGTGCAACAG
GTTCGAGGACCCGAATTATATTGGAAGATTCTTCTAGATTGATCTTCGTTCTCGTCAAGATGTCATCCAGCACTGAACAAACCCTAATTCAGTCTGCGAATTCGAATTCT
GCTTCGAATCCTTTCGTCTCGCTCCTCCACAACATCTGCAATTTGGTATCTGTACGTCTTGGTTCCTCGAATTATGTTCTCTGGCGTTTCCAGATCACACCTCTCTTGAA
ATCACACAAACTCTTCAAGTATATTGATGGATCGATCACTGCTCCACCAAAGACTCTGACAACATTGTCTTATGTTATCGGCTGCCAGACTGCTAAGGACGTTTGGGACA
AACTTGAGAAGCATTTCTCCTATTCTAATCGAACGAACATTGTTGGCATGAAATCTGAATTACAGAGTATTGCGAAGAAACTAGAGGATTTAGTCATTTATGCTGTCAAT
GGACTACCGTCTGCCTATAATGTCTTCAAAATGTCCCTGCGCACACGATCTCAGAACTTGACTTTTGATGAACTCCACATCCTTATGAAAACAGAGGAAACAACTATTGA
TAAACAGTCCAAGAGTGATGACACTAATACTGTTACTCAGCTTGCTATGGCTGCTAGCCTGGATTTTAATGGGAAAGGAAACAACAGTAATTGGCGTTCCAATGGCAGAG
GACGTGGTCGATTTGATGGTAATCGAAATGGCGGTCGTGGCCGTGGGAACTCTTTCCCTTCCAATGCTAATCAGTCCTTTTCTCCAGGCCAATTATCCTCTACTTCACCT
TCTCCAGGTCAATTTTCCAGCAAGGTTCACTGTCAGATCTGCCAACGATATGGTCATTGA
mRNA sequence
ATGAGTAGGCATGTCACAAAATATGGCAGAATTTCGATCCATATCGACCCAGAGGTAAAAAAACCAGTTGGAGAATATGCGACGCAATTCAACATAGTGATTGGTACGGT
CGTCCAGGAGTCTTTCTCGGTTCGATTCAACACATGGGATGATGTAATTGAAGAGACTAAGAACCTAGTAAAGGCTCGACTGCTGAAGACTGACATTATCTGTGCAACAG
GTTCGAGGACCCGAATTATATTGGAAGATTCTTCTAGATTGATCTTCGTTCTCGTCAAGATGTCATCCAGCACTGAACAAACCCTAATTCAGTCTGCGAATTCGAATTCT
GCTTCGAATCCTTTCGTCTCGCTCCTCCACAACATCTGCAATTTGGTATCTGTACGTCTTGGTTCCTCGAATTATGTTCTCTGGCGTTTCCAGATCACACCTCTCTTGAA
ATCACACAAACTCTTCAAGTATATTGATGGATCGATCACTGCTCCACCAAAGACTCTGACAACATTGTCTTATGTTATCGGCTGCCAGACTGCTAAGGACGTTTGGGACA
AACTTGAGAAGCATTTCTCCTATTCTAATCGAACGAACATTGTTGGCATGAAATCTGAATTACAGAGTATTGCGAAGAAACTAGAGGATTTAGTCATTTATGCTGTCAAT
GGACTACCGTCTGCCTATAATGTCTTCAAAATGTCCCTGCGCACACGATCTCAGAACTTGACTTTTGATGAACTCCACATCCTTATGAAAACAGAGGAAACAACTATTGA
TAAACAGTCCAAGAGTGATGACACTAATACTGTTACTCAGCTTGCTATGGCTGCTAGCCTGGATTTTAATGGGAAAGGAAACAACAGTAATTGGCGTTCCAATGGCAGAG
GACGTGGTCGATTTGATGGTAATCGAAATGGCGGTCGTGGCCGTGGGAACTCTTTCCCTTCCAATGCTAATCAGTCCTTTTCTCCAGGCCAATTATCCTCTACTTCACCT
TCTCCAGGTCAATTTTCCAGCAAGGTTCACTGTCAGATCTGCCAACGATATGGTCATTGA
Protein sequence
MSRHVTKYGRISIHIDPEVKKPVGEYATQFNIVIGTVVQESFSVRFNTWDDVIEETKNLVKARLLKTDIICATGSRTRIILEDSSRLIFVLVKMSSSTEQTLIQSANSNS
ASNPFVSLLHNICNLVSVRLGSSNYVLWRFQITPLLKSHKLFKYIDGSITAPPKTLTTLSYVIGCQTAKDVWDKLEKHFSYSNRTNIVGMKSELQSIAKKLEDLVIYAVN
GLPSAYNVFKMSLRTRSQNLTFDELHILMKTEETTIDKQSKSDDTNTVTQLAMAASLDFNGKGNNSNWRSNGRGRGRFDGNRNGGRGRGNSFPSNANQSFSPGQLSSTSP
SPGQFSSKVHCQICQRYGH
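
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame. The `translate` helper and the variable names are illustrative, and only the first 27 and last 12 nucleotides of the CDS listed above are used to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), built from the
# conventional T, C, A, G ordering of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Whitespace and line breaks are ignored. Codons containing characters
    other than A, C, G, T raise KeyError in this sketch.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGAGTAGGCATGTCACAAAATATGGC"  # first 27 nt of the CDS above
cds_end = "TATGGTCATTGA"                   # last 12 nt, including the stop codon

print(translate(cds_start))  # MSRHVTKYG
print(translate(cds_end))    # YGH
```

Both fragments agree with the listed protein, which begins with MSRHVTKYG and ends with YGH. Passing the full CDS as a single string (line breaks are ignored) would allow the complete sequences to be compared the same way.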