; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041431 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041431
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr13:17657929..17660992
RNA-Seq ExpressionLag0041431
SyntenyLag0041431
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8676815.1 hypothetical protein F3Y22_tig00111582pilonHSYRG01273 [Hibiscus syriacus]8.3e-2724.83Show/hide
Query:  RRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNICILLAVDYVSKWVEAIACHQNDAKTV----------------------------------
        + GN+  + EMPL  ILE+ELFDVWGIDFM PFP S G + ILLAVDYVSKWVEAIA   ND+KT+                                  
Subjt:  RRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNICILLAVDYVSKWVEAIACHQNDAKTV----------------------------------

Query:  --------ASYLPWAIEIQ---VFTKIPFEFSMAKTRARKERDNEEEEVPVTPEEGWEDVSATVVEEDPKEPEEQNPEQTEPRVADTEEVQEGNTEEIQE
                A +LP  +E +   V  ++ F+  +A+ + R    NE EE      E     +A + +E  K   + +  +  PR  +   V      +I+ 
Subjt:  --------ASYLPWAIEIQ---VFTKIPFEFSMAKTRARKERDNEEEEVPVTPEEGWEDVSATVVEEDPKEPEEQNPEQTEPRVADTEEVQEGNTEEIQE

Query:  IQN-----------------------EDVREEQAEVAPEKGN----EPV--QEARVEVIMLEVPKRRRIKRKTGRVRVVRAFYANIDKEDGFQVIVRGV-
        I N                       + +R   AE  PE+       PV  + ++ +    E  K R    K  ++    +F    + + G    +  + 
Subjt:  IQN-----------------------EDVREEQAEVAPEKGN----EPV--QEARVEVIMLEVPKRRRIKRKTGRVRVVRAFYANIDKEDGFQVIVRGV-

Query:  -EVDWS-----PSAINALYNLQNFPHATYNEMAVAPSNEKLSDVVREVGIEGARWQLSKTEKRTFQSTYLKREANTCMGFIRQRMLPMTHDSTVSRERVL
          + W      P ++N   ++ +  H  + + A    +    +++ ++  E   W   +T + +     L+  A     F++ +++P +H++TVS  R+L
Subjt:  -EVDWS-----PSAINALYNLQNFPHATYNEMAVAPSNEKLSDVVREVGIEGARWQLSKTEKRTFQSTYLKREANTCMGFIRQRMLPMTHDSTVSRERVL

Query:  LAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVLENEGDVILFDKGIIITSNLARLQRMQEVRQGGLVYD-------INTILEQLT
        L  +I+ S  IDVG+IIV ++  C  KK   L FPN IT LC++  V EN  D IL     I  + L  L  ++  +    +++        N     L 
Subjt:  LAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVLENEGDVILFDKGIIITSNLARLQRMQEVRQGGLVYD-------INTILEQLT

Query:  LSTSRQEFSERQEFAERQALTFWNYVKNCDVNLKKALQENFSKPYPALPTFPEDLLNPWVLPPPIERGEGDD
        L  +  +              F+ YVK+ D  ++   QE         P F +++L+ +     +E  + ++
Subjt:  LSTSRQEFSERQEFAERQALTFWNYVKNCDVNLKKALQENFSKPYPALPTFPEDLLNPWVLPPPIERGEGDD

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.1e-3034.08Show/hide
Query:  VRVVRAFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEKLSDVVREVGIEGARWQLSKTEKRTFQSTYLKREANTCMGFIR
        V +VR FYAN+   +   V VRGV+V WS  AINA++ L + P   ++E     + + L  V+  V   GA W +S     T   + L   A     F++
Subjt:  VRVVRAFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEKLSDVVREVGIEGARWQLSKTEKRTFQSTYLKREANTCMGFIR

Query:  QRMLPMTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCK--RAGVLENEGDVILFDKGIIITSNLARL----------
         R+LP TH  TVS++R+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC+  RA  L NE    L + G I    +AR+          
Subjt:  QRMLPMTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCK--RAGVLENEGDVILFDKGIIITSNLARL----------

Query:  ----QRMQEVRQGGLVYDINTILEQLTLSTSRQEFSERQ-----EFAERQALTFWNYVKNCDVNLKKALQENFSKPYPALPTFPEDLLNPWVLPPPIERG
             R           DI   L+ L    S+QE  +       +   +Q   FW Y K  D  LKKALQ NF++P P  P FP+++L    L    E  
Subjt:  ----QRMQEVRQGGLVYDINTILEQLTLSTSRQEFSERQ-----EFAERQALTFWNYVKNCDVNLKKALQENFSKPYPALPTFPEDLLNPWVLPPPIERG

Query:  EGDDGNEQGQE
           DG+ +  E
Subjt:  EGDDGNEQGQE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]2.5e-3134.11Show/hide
Query:  VVRAFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEKLSDVVREVGIEGARWQLSKTEKRTFQSTYLKREANTCMGFIRQR
        +VR FYAN+   +   + VRGV+V WS  AINA++ L + P   ++E     +  +L  V+  V   GA W +S     T   + L   A     F++ R
Subjt:  VVRAFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEKLSDVVREVGIEGARWQLSKTEKRTFQSTYLKREANTCMGFIRQR

Query:  MLPMTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVLENEGDVILFDKGIIITSNLARL---------QRMQE
        +LP TH   VS++R+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC+ A  L NE    L + G I    +AR+         Q+   
Subjt:  MLPMTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVLENEGDVILFDKGIIITSNLARL---------QRMQE

Query:  VRQGGLVYDINT--ILEQLTLSTSRQEFSERQEFAERQALTFWNYVKNCDVNLKKALQENFSKPYPALPTFPEDLLNPWVLPPPIERGEGDDGNEQGQE
         R            +L+QL     R     +QE   +Q   FW Y K  D  LKKALQ NF++P P  P FP+++L    L    E     DG+ +  E
Subjt:  VRQGGLVYDINT--ILEQLTLSTSRQEFSERQEFAERQALTFWNYVKNCDVNLKKALQENFSKPYPALPTFPEDLLNPWVLPPPIERGEGDDGNEQGQE

XP_023521407.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785222 [Cucurbita pepo subsp. pepo]4.1e-2672.15Show/hide
Query:  FYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNICILLAVDYVSKWVEAIACHQNDAKTVASYL
        F K CD CQR GN+  ++E+PL  ILEVELFDVWGIDFMGPFPPS GN+ IL+AVDYVSKWVEAIAC  ND KTV  +L
Subjt:  FYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNICILLAVDYVSKWVEAIACHQNDAKTVASYL

XP_042757945.1 uncharacterized protein LOC111885853 [Lactuca sativa]3.1e-2670.89Show/hide
Query:  FYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNICILLAVDYVSKWVEAIACHQNDAKTVASYL
        F K+CD CQR GN+G R EMPL+ I+EVELFDVWGIDFMGPF PS+G + IL+AVDYVSKWVEA+AC +NDA+TV ++L
Subjt:  FYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNICILLAVDYVSKWVEAIACHQNDAKTVASYL

TrEMBL top hitse value%identityAlignment
A0A1U7XG07 uncharacterized protein LOC1042340821.9e-2469.14Show/hide
Query:  HWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNICILLAVDYVSKWVEAIACHQNDAKTVASYL
        H F K+CD CQR G +  R EMPL  ILEVELFDVWGIDFMGPFPPS GN  ILLAVDYVSKW+E IA   NDA  VA+++
Subjt:  HWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNICILLAVDYVSKWVEAIACHQNDAKTVASYL

A0A2P5BCG4 Uncharacterized protein (Fragment)1.0e-3034.08Show/hide
Query:  VRVVRAFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEKLSDVVREVGIEGARWQLSKTEKRTFQSTYLKREANTCMGFIR
        V +VR FYAN+   +   V VRGV+V WS  AINA++ L + P   ++E     + + L  V+  V   GA W +S     T   + L   A     F++
Subjt:  VRVVRAFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEKLSDVVREVGIEGARWQLSKTEKRTFQSTYLKREANTCMGFIR

Query:  QRMLPMTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCK--RAGVLENEGDVILFDKGIIITSNLARL----------
         R+LP TH  TVS++R+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC+  RA  L NE    L + G I    +AR+          
Subjt:  QRMLPMTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCK--RAGVLENEGDVILFDKGIIITSNLARL----------

Query:  ----QRMQEVRQGGLVYDINTILEQLTLSTSRQEFSERQ-----EFAERQALTFWNYVKNCDVNLKKALQENFSKPYPALPTFPEDLLNPWVLPPPIERG
             R           DI   L+ L    S+QE  +       +   +Q   FW Y K  D  LKKALQ NF++P P  P FP+++L    L    E  
Subjt:  ----QRMQEVRQGGLVYDINTILEQLTLSTSRQEFSERQ-----EFAERQALTFWNYVKNCDVNLKKALQENFSKPYPALPTFPEDLLNPWVLPPPIERG

Query:  EGDDGNEQGQE
           DG+ +  E
Subjt:  EGDDGNEQGQE

A0A2P5DXM3 Uncharacterized protein1.2e-3134.11Show/hide
Query:  VVRAFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEKLSDVVREVGIEGARWQLSKTEKRTFQSTYLKREANTCMGFIRQR
        +VR FYAN+   +   + VRGV+V WS  AINA++ L + P   ++E     +  +L  V+  V   GA W +S     T   + L   A     F++ R
Subjt:  VVRAFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEKLSDVVREVGIEGARWQLSKTEKRTFQSTYLKREANTCMGFIRQR

Query:  MLPMTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVLENEGDVILFDKGIIITSNLARL---------QRMQE
        +LP TH   VS++R+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC+ A  L NE    L + G I    +AR+         Q+   
Subjt:  MLPMTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVLENEGDVILFDKGIIITSNLARL---------QRMQE

Query:  VRQGGLVYDINT--ILEQLTLSTSRQEFSERQEFAERQALTFWNYVKNCDVNLKKALQENFSKPYPALPTFPEDLLNPWVLPPPIERGEGDDGNEQGQE
         R            +L+QL     R     +QE   +Q   FW Y K  D  LKKALQ NF++P P  P FP+++L    L    E     DG+ +  E
Subjt:  VRQGGLVYDINT--ILEQLTLSTSRQEFSERQEFAERQALTFWNYVKNCDVNLKKALQENFSKPYPALPTFPEDLLNPWVLPPPIERGEGDDGNEQGQE

A0A6A2Y697 Reverse transcriptase domain-containing protein4.0e-2724.83Show/hide
Query:  RRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNICILLAVDYVSKWVEAIACHQNDAKTV----------------------------------
        + GN+  + EMPL  ILE+ELFDVWGIDFM PFP S G + ILLAVDYVSKWVEAIA   ND+KT+                                  
Subjt:  RRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNICILLAVDYVSKWVEAIACHQNDAKTV----------------------------------

Query:  --------ASYLPWAIEIQ---VFTKIPFEFSMAKTRARKERDNEEEEVPVTPEEGWEDVSATVVEEDPKEPEEQNPEQTEPRVADTEEVQEGNTEEIQE
                A +LP  +E +   V  ++ F+  +A+ + R    NE EE      E     +A + +E  K   + +  +  PR  +   V      +I+ 
Subjt:  --------ASYLPWAIEIQ---VFTKIPFEFSMAKTRARKERDNEEEEVPVTPEEGWEDVSATVVEEDPKEPEEQNPEQTEPRVADTEEVQEGNTEEIQE

Query:  IQN-----------------------EDVREEQAEVAPEKGN----EPV--QEARVEVIMLEVPKRRRIKRKTGRVRVVRAFYANIDKEDGFQVIVRGV-
        I N                       + +R   AE  PE+       PV  + ++ +    E  K R    K  ++    +F    + + G    +  + 
Subjt:  IQN-----------------------EDVREEQAEVAPEKGN----EPV--QEARVEVIMLEVPKRRRIKRKTGRVRVVRAFYANIDKEDGFQVIVRGV-

Query:  -EVDWS-----PSAINALYNLQNFPHATYNEMAVAPSNEKLSDVVREVGIEGARWQLSKTEKRTFQSTYLKREANTCMGFIRQRMLPMTHDSTVSRERVL
          + W      P ++N   ++ +  H  + + A    +    +++ ++  E   W   +T + +     L+  A     F++ +++P +H++TVS  R+L
Subjt:  -EVDWS-----PSAINALYNLQNFPHATYNEMAVAPSNEKLSDVVREVGIEGARWQLSKTEKRTFQSTYLKREANTCMGFIRQRMLPMTHDSTVSRERVL

Query:  LAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVLENEGDVILFDKGIIITSNLARLQRMQEVRQGGLVYD-------INTILEQLT
        L  +I+ S  IDVG+IIV ++  C  KK   L FPN IT LC++  V EN  D IL     I  + L  L  ++  +    +++        N     L 
Subjt:  LAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVLENEGDVILFDKGIIITSNLARLQRMQEVRQGGLVYD-------INTILEQLT

Query:  LSTSRQEFSERQEFAERQALTFWNYVKNCDVNLKKALQENFSKPYPALPTFPEDLLNPWVLPPPIERGEGDD
        L  +  +              F+ YVK+ D  ++   QE         P F +++L+ +     +E  + ++
Subjt:  LSTSRQEFSERQEFAERQALTFWNYVKNCDVNLKKALQENFSKPYPALPTFPEDLLNPWVLPPPIERGEGDD

A0A6J1DZ22 uncharacterized protein LOC1110255867.6e-2670.13Show/hide
Query:  KQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNICILLAVDYVSKWVEAIACHQNDAKTVASYL
        + C+ CQR GN+  R EMPLTYILE+  FDVWG+DF+GPFPPSNGN+ ILLAVDYVSKWVEA+AC  +DAK VA +L
Subjt:  KQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNICILLAVDYVSKWVEAIACHQNDAKTVASYL

SwissProt top hitse value%identityAlignment
P92516 Uncharacterized mitochondrial protein AtMg007505.1e-1151.67Show/hide
Query:  HWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM-------GPFPPSNGNICI
        H F   CDACQR+GN   R+EMP  +ILEVE+FDVWGI FM        P  P+ G +C+
Subjt:  HWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM-------GPFPPSNGNICI

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein3.6e-1251.67Show/hide
Query:  HWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM-------GPFPPSNGNICI
        H F   CDACQR+GN   R+EMP  +ILEVE+FDVWGI FM        P  P+ G +C+
Subjt:  HWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM-------GPFPPSNGNICI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACTCTTCGCCGTATGGAGGCCATTTCAGCAGTTAGAGGACAGCTATGAGGATTTTGCATTGTGGATTTTTCTGGCCTACTTTATGGTCCATTGGTTCTACAAGCA
ATGTGATGCTTGCCAAAGGAGAGGAAACTTAGGGCCTAGAGATGAAATGCCTCTTACTTACATTTTGGAAGTTGAATTATTCGATGTTTGGGGTATTGACTTTATGGGGC
CATTTCCTCCTTCTAATGGCAATATTTGTATCTTATTGGCAGTTGATTACGTGTCCAAGTGGGTTGAGGCCATCGCATGCCATCAGAATGATGCCAAGACAGTAGCAAGT
TATTTGCCATGGGCAATTGAAATTCAAGTTTTTACCAAAATTCCTTTTGAGTTTTCAATGGCTAAAACGAGAGCAAGAAAAGAGAGAGATAATGAGGAAGAGGAGGTACC
TGTTACGCCCGAAGAAGGTTGGGAAGATGTTTCTGCCACAGTGGTTGAAGAAGATCCGAAGGAACCAGAGGAACAAAATCCAGAGCAGACAGAGCCAAGAGTTGCGGATA
CAGAGGAAGTTCAAGAAGGGAATACCGAGGAAATTCAAGAAATACAGAATGAGGATGTGCGAGAGGAACAAGCAGAGGTTGCGCCTGAAAAAGGTAATGAGCCAGTGCAG
GAGGCTCGAGTGGAGGTGATCATGCTGGAGGTACCCAAACGTCGCCGCATTAAGCGAAAAACGGGTCGCGTCAGGGTGGTACGCGCATTTTATGCTAATATTGACAAAGA
AGATGGTTTCCAAGTGATTGTTCGAGGAGTCGAAGTGGACTGGAGTCCTAGTGCTATTAACGCACTGTATAACCTTCAGAATTTCCCCCACGCAACGTATAATGAGATGG
CTGTAGCGCCATCTAATGAGAAACTAAGTGATGTTGTGCGGGAGGTGGGTATTGAAGGGGCACGGTGGCAGCTGTCAAAGACAGAGAAAAGGACGTTTCAGTCAACTTAT
TTGAAGAGGGAAGCGAACACATGCATGGGATTTATCAGACAGAGGATGCTTCCAATGACTCATGACTCGACGGTCTCGAGGGAACGGGTTCTTTTGGCTTTCGCGATTTT
GCGGTCTCTCAGCATTGATGTAGGGAAGATTATTGTTAATGAGATTTCTGGTTGTTGGAAGAAGAAGGTGGGGAAGCTATTTTTTCCAAATACTATTACCATGCTTTGTA
AGAGAGCAGGGGTTCTAGAGAATGAGGGAGATGTGATTTTGTTTGACAAGGGGATCATCATCACGTCTAACTTGGCACGACTTCAGCGTATGCAGGAGGTACGTCAGGGT
GGACTTGTCTACGACATCAACACGATTTTAGAACAACTAACACTTTCGACCAGTAGGCAAGAGTTTTCCGAGAGGCAAGAGTTTGCCGAGAGGCAAGCTTTGACCTTCTG
GAACTATGTTAAAAATTGTGATGTCAATCTGAAGAAGGCGCTACAAGAGAATTTTTCCAAACCGTATCCAGCCCTTCCAACATTCCCTGAAGACTTATTGAATCCCTGGG
TTCTACCCCCACCAATTGAAAGAGGAGAAGGGGATGATGGAAATGAACAGGGCCAAGAGGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCACTCTTCGCCGTATGGAGGCCATTTCAGCAGTTAGAGGACAGCTATGAGGATTTTGCATTGTGGATTTTTCTGGCCTACTTTATGGTCCATTGGTTCTACAAGCA
ATGTGATGCTTGCCAAAGGAGAGGAAACTTAGGGCCTAGAGATGAAATGCCTCTTACTTACATTTTGGAAGTTGAATTATTCGATGTTTGGGGTATTGACTTTATGGGGC
CATTTCCTCCTTCTAATGGCAATATTTGTATCTTATTGGCAGTTGATTACGTGTCCAAGTGGGTTGAGGCCATCGCATGCCATCAGAATGATGCCAAGACAGTAGCAAGT
TATTTGCCATGGGCAATTGAAATTCAAGTTTTTACCAAAATTCCTTTTGAGTTTTCAATGGCTAAAACGAGAGCAAGAAAAGAGAGAGATAATGAGGAAGAGGAGGTACC
TGTTACGCCCGAAGAAGGTTGGGAAGATGTTTCTGCCACAGTGGTTGAAGAAGATCCGAAGGAACCAGAGGAACAAAATCCAGAGCAGACAGAGCCAAGAGTTGCGGATA
CAGAGGAAGTTCAAGAAGGGAATACCGAGGAAATTCAAGAAATACAGAATGAGGATGTGCGAGAGGAACAAGCAGAGGTTGCGCCTGAAAAAGGTAATGAGCCAGTGCAG
GAGGCTCGAGTGGAGGTGATCATGCTGGAGGTACCCAAACGTCGCCGCATTAAGCGAAAAACGGGTCGCGTCAGGGTGGTACGCGCATTTTATGCTAATATTGACAAAGA
AGATGGTTTCCAAGTGATTGTTCGAGGAGTCGAAGTGGACTGGAGTCCTAGTGCTATTAACGCACTGTATAACCTTCAGAATTTCCCCCACGCAACGTATAATGAGATGG
CTGTAGCGCCATCTAATGAGAAACTAAGTGATGTTGTGCGGGAGGTGGGTATTGAAGGGGCACGGTGGCAGCTGTCAAAGACAGAGAAAAGGACGTTTCAGTCAACTTAT
TTGAAGAGGGAAGCGAACACATGCATGGGATTTATCAGACAGAGGATGCTTCCAATGACTCATGACTCGACGGTCTCGAGGGAACGGGTTCTTTTGGCTTTCGCGATTTT
GCGGTCTCTCAGCATTGATGTAGGGAAGATTATTGTTAATGAGATTTCTGGTTGTTGGAAGAAGAAGGTGGGGAAGCTATTTTTTCCAAATACTATTACCATGCTTTGTA
AGAGAGCAGGGGTTCTAGAGAATGAGGGAGATGTGATTTTGTTTGACAAGGGGATCATCATCACGTCTAACTTGGCACGACTTCAGCGTATGCAGGAGGTACGTCAGGGT
GGACTTGTCTACGACATCAACACGATTTTAGAACAACTAACACTTTCGACCAGTAGGCAAGAGTTTTCCGAGAGGCAAGAGTTTGCCGAGAGGCAAGCTTTGACCTTCTG
GAACTATGTTAAAAATTGTGATGTCAATCTGAAGAAGGCGCTACAAGAGAATTTTTCCAAACCGTATCCAGCCCTTCCAACATTCCCTGAAGACTTATTGAATCCCTGGG
TTCTACCCCCACCAATTGAAAGAGGAGAAGGGGATGATGGAAATGAACAGGGCCAAGAGGACTGA
Protein sequenceShow/hide protein sequence
MSLFAVWRPFQQLEDSYEDFALWIFLAYFMVHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNICILLAVDYVSKWVEAIACHQNDAKTVAS
YLPWAIEIQVFTKIPFEFSMAKTRARKERDNEEEEVPVTPEEGWEDVSATVVEEDPKEPEEQNPEQTEPRVADTEEVQEGNTEEIQEIQNEDVREEQAEVAPEKGNEPVQ
EARVEVIMLEVPKRRRIKRKTGRVRVVRAFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEKLSDVVREVGIEGARWQLSKTEKRTFQSTY
LKREANTCMGFIRQRMLPMTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVLENEGDVILFDKGIIITSNLARLQRMQEVRQG
GLVYDINTILEQLTLSTSRQEFSERQEFAERQALTFWNYVKNCDVNLKKALQENFSKPYPALPTFPEDLLNPWVLPPPIERGEGDDGNEQGQED