CuGenDBv2

Lag0016574 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0016574
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: RNase H domain-containing protein
Genome location: chr12:39121177..39125038
RNA-Seq Expression: Lag0016574
Synteny: Lag0016574
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_022149385.1 uncharacterized protein LOC111017816 [Momordica charantia] (e value: 3.6e-37; % identity: 31.99)
Query:  SPGPLNDTLVVLIPKVASP---------------------------RKILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLD---
        SP PLN+TL+ LIPKV  P                           +++L + IS NQSAFV GR V++NAI+GYEC+HS++      F W     D   
Subjt:  SPGPLNDTLVVLIPKVASP---------------------------RKILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLD---

Query:  ----------MSKRLGGMEFLGG---------------CVGSSGSEGLVVRNILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDLSS---LFAVPVIKD
                  +S+   G+  + G                  +S  E   V+ IL  Y++ SGQT+NF+KS+ +FSPNT L+    + S   L  VP  + 
Subjt:  ----------MSKRLGGMEFLGG---------------CVGSSGSEGLVVRNILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDLSS---LFAVPVIKD

Query:  RIWKQLQGWQGSAGGKEVLIKTVVQAIPCYTINCFRLPMNLVHEIHQLIARFW----------------------------------------------P
                      G++  +K ++QAIP Y++NCF+LP+ L+H+    +ARFW                                              P
Subjt:  RIWKQLQGWQGSAGGKEVLIKTVVQAIPCYTINCFRLPMNLVHEIHQLIARFW----------------------------------------------P

Query:  SFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWIPRNYGLRERIP
        S+IWRSLL GR +L  G+ WR+GDG +VPIY SNWIPR+  L    P
Subjt:  SFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWIPRNYGLRERIP

XP_023894138.1 uncharacterized protein LOC112006071 [Quercus suber] (e value: 2.9e-31; % identity: 27.96)
Query:  PGPLNDTLVVLIPKVASP---------------------------RKILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLDMSKR
        P PLN T + LIPK  +P                           +++L   IS +QSAF++GR + +N ++ YE +H +K    GK G+  LKLDMSK 
Subjt:  PGPLNDTLVVLIPKVASP---------------------------RKILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLDMSKR

Query:  LGGMEF-------------------------------------LGGCVGSSG---------------SEGL-----------------VVRN--------
           +++                                     +G    S G               SEGL                 + RN        
Subjt:  LGGMEF-------------------------------------LGGCVGSSG---------------SEGL-----------------VVRN--------

Query:  ---------------------ILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDLSSLFAVP------------------------VIKDRIWKQLQGWQ
                             IL  YE+VSGQ +N +K+ + FS +T L  Q  +     V                          IK R+WK+LQGW+
Subjt:  ---------------------ILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDLSSLFAVP------------------------VIKDRIWKQLQGWQ

Query:  G---SAGGKEVLIKTVVQAIPCYTINCFRLPMNLVHEIHQLIARFWP-------------SFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWIP
        G   S  G+EVLIK+V+QAIP YT++CF+ P+ L HEI    ARF+P             S+ WRS++ GRD++ +G  WR+GDG+ + I+  NW+P
Subjt:  G---SAGGKEVLIKTVVQAIPCYTINCFRLPMNLVHEIHQLIARFWP-------------SFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWIP

XP_023908235.1 uncharacterized protein LOC112019924 [Quercus suber] (e value: 1.5e-30; % identity: 28.64)
Query:  LNDTLVVLIPKVASPRK---------------------------ILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLDMSKRLGG
        +N T +VLIPKV +P K                           +L   ISP QSAFV GR + +N +L YE +H++  R  GK G+  LKLD+SK    
Subjt:  LNDTLVVLIPKVASPRK---------------------------ILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLDMSKRLGG

Query:  ME--FLGG-----------------CVGSSGSEGLV----------------------------------------------------------------
        +E  FL G                 CV ++    LV                                                                
Subjt:  ME--FLGG-----------------CVGSSGSEGLV----------------------------------------------------------------

Query:  ---------------VRNILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDLSSLFAV------------------------PVIKDRIWKQLQGWQG--
                       +  IL++Y K SGQ++N +KS V FS N  +  + +   +  V                          IKDR+WK+LQGW+G  
Subjt:  ---------------VRNILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDLSSLFAV------------------------PVIKDRIWKQLQGWQG--

Query:  -SAGGKEVLIKTVVQAIPCYTINCFRLPMNLVHEIHQLIARFWP-------------SFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWIPRNYGLR
         S  GKEVLIK V QAIP YT++ F+LP+ L  E   L AR++P             SF+WRSLL    +L  G  WR+GDGRS+ +Y   WIP  Y   
Subjt:  -SAGGKEVLIKTVVQAIPCYTINCFRLPMNLVHEIHQLIARFWP-------------SFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWIPRNYGLR

Query:  ERIPSCKFIDED
        + + S +  DE+
Subjt:  ERIPSCKFIDED

XP_030478079.1 uncharacterized protein LOC115695128 [Cannabis sativa] (e value: 5.3e-33; % identity: 27.66)
Query:  PLNDTLVVLIPKVASP---------------------------RKILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLDMSKRLG
        PLN+TL+ LI KV  P                           + +L   ISPNQSAF+ GR + +N I+  E  HS+K +T GK GW  +KLDM+K   
Subjt:  PLNDTLVVLIPKVASP---------------------------RKILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLDMSKRLG

Query:  GME--FLGGCV------------GSSGSEGLVVRNILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDLSSLFAVPV-----------------------
         +E  F+   +             +S S   +++++LE+YE+ +GQ +NF KS + FSPN  L  Q  +S    +PV                       
Subjt:  GME--FLGGCV------------GSSGSEGLVVRNILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDLSSLFAVPV-----------------------

Query:  -IKDRIWKQLQGWQG---SAGGKEVLIKTVVQAIPCYTINCFRLPMNLVHEIHQLIARFW----------------------------------------
         ++D++W  L  W+    S GGKE L+K+V+QAIP Y++ CFRLP+   H +  ++A FW                                        
Subjt:  -IKDRIWKQLQGWQG---SAGGKEVLIKTVVQAIPCYTINCFRLPMNLVHEIHQLIARFW----------------------------------------

Query:  --------------------------------------PSFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWIP
                                              PSF+WRS+ WG++LL +GI  +IG+G++      +WIP
Subjt:  --------------------------------------PSFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWIP

XP_030483228.1 uncharacterized protein LOC115699823 [Cannabis sativa] (e value: 1.5e-32; % identity: 27.15)
Query:  EEESPGPLNDTLVVLIPKVASPRKILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLDMSKRLGGME--FLGGCVGSSGSEGLV-
        E  SP  LN TL+ LIPK    + +        QSAF+  R + +N ++ +E +H LK +T G+ G++ LKLDMSK    +E  F+ G +G  G   L+ 
Subjt:  EEESPGPLNDTLVVLIPKVASPRKILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLDMSKRLGGME--FLGGCVGSSGSEGLV-

Query:  ---------------------------------VRNILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDLSSLFAVPV----------------------
                                         ++ +L+ Y + SGQ +N DKS+++FSPNT  AA++   ++  +P+                      
Subjt:  ---------------------------------VRNILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDLSSLFAVPV----------------------

Query:  --IKDRIWKQLQGWQG---SAGGKEVLIKTVVQAIPCYTINCFRLPMNLVHEIHQLIARFW---------------------------------------
          IK+RIWK L  W     S GGKEVL+K V+Q+IP YT++CF+LP+    EI  +++ FW                                       
Subjt:  --IKDRIWKQLQGWQG---SAGGKEVLIKTVVQAIPCYTINCFRLPMNLVHEIHQLIARFW---------------------------------------

Query:  -----------------------PSFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWIP
                                S  W+ + WGR+LL +GI  ++G+G  +      WIP
Subjt:  -----------------------PSFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWIP

TrEMBL top hits (columns: e value, % identity, alignment)
A0A2N9G219 RNase H domain-containing protein (e value: 3.1e-39; % identity: 36.3)
Query:  EESPGPLNDTLVVLIPKVASPR--KILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLDMSKRLGGMEFLGGCVGSSGSEGLVVR
        E  P  L + L  LI KV + R  KIL   IS  QSAFV GR + +N ++ +E +H +  +  GK G   LKLDMSK    +E+           G V +
Subjt:  EESPGPLNDTLVVLIPKVASPR--KILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLDMSKRLGGMEFLGGCVGSSGSEGLVVR

Query:  NILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDLSSLFAVP------------------------VIKDRIWKQLQGWQG---SAGGKEVLIKTVVQAI
         IL +YEK SGQ +N  K+ + FS NT  A Q D+  +  VP                         IK+R+W +++GW+    S  G+E+LIK VVQAI
Subjt:  NILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDLSSLFAVP------------------------VIKDRIWKQLQGWQG---SAGGKEVLIKTVVQAI

Query:  PCYTINCFRLPMNLVHEIHQLIARFW-----------------------------PSFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWI
        P YT+NCF+LP+ L  EI  +I RFW                              SF WRS+L  + L+  G+ WR+GDG  +PI  SNW+
Subjt:  PCYTINCFRLPMNLVHEIHQLIARFW-----------------------------PSFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWI

A0A6J1D5K1 uncharacterized protein LOC111017816 (e value: 1.7e-37; % identity: 31.99)
Query:  SPGPLNDTLVVLIPKVASP---------------------------RKILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLD---
        SP PLN+TL+ LIPKV  P                           +++L + IS NQSAFV GR V++NAI+GYEC+HS++      F W     D   
Subjt:  SPGPLNDTLVVLIPKVASP---------------------------RKILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLD---

Query:  ----------MSKRLGGMEFLGG---------------CVGSSGSEGLVVRNILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDLSS---LFAVPVIKD
                  +S+   G+  + G                  +S  E   V+ IL  Y++ SGQT+NF+KS+ +FSPNT L+    + S   L  VP  + 
Subjt:  ----------MSKRLGGMEFLGG---------------CVGSSGSEGLVVRNILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDLSS---LFAVPVIKD

Query:  RIWKQLQGWQGSAGGKEVLIKTVVQAIPCYTINCFRLPMNLVHEIHQLIARFW----------------------------------------------P
                      G++  +K ++QAIP Y++NCF+LP+ L+H+    +ARFW                                              P
Subjt:  RIWKQLQGWQGSAGGKEVLIKTVVQAIPCYTINCFRLPMNLVHEIHQLIARFW----------------------------------------------P

Query:  SFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWIPRNYGLRERIP
        S+IWRSLL GR +L  G+ WR+GDG +VPIY SNWIPR+  L    P
Subjt:  SFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWIPRNYGLRERIP

A0A803P621 Uncharacterized protein (e value: 7.8e-30; % identity: 27.84)
Query:  PGPLNDTLVVLIPKVASP---------------------------RKILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLDMSKR
        P PLN T+++LIPK   P                           + +L + IS  QSAF+  R + +N ++ +E IH +K RT G  G   +KLDMSK 
Subjt:  PGPLNDTLVVLIPKVASP---------------------------RKILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLDMSKR

Query:  LGGMEFL-------------GGCVGSSGSEG--LVVRNILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDLSSLFAVPV--------------------
           +E+               G +     E   L ++ + ++Y + SGQT+N +KS ++FSPNT LAAQ       ++ +                    
Subjt:  LGGMEFL-------------GGCVGSSGSEG--LVVRNILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDLSSLFAVPV--------------------

Query:  ----IKDRIWKQLQGWQG---SAGGKEVLIKTVVQAIPCYTINCFRLP-MNLV---------------HEIHQLIAR-FW--------------------
            IK+++W+ +  W     S+GGKE+L++ VVQ+IP Y ++CFRLP M L+               H    L+A+  W                    
Subjt:  ----IKDRIWKQLQGWQG---SAGGKEVLIKTVVQAIPCYTINCFRLP-MNLV---------------HEIHQLIAR-FW--------------------

Query:  ------------PSFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWIPRN
                    PS  W+ +L+GR+LL +G+ W++G+GR++      WIP N
Subjt:  ------------PSFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWIPRN

A0A803QBU0 Uncharacterized protein (e value: 3.5e-30; % identity: 25.24)
Query:  PGPLNDTLVVLIPKVASP---------------------------RKILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLDMSKR
        P  LN TL+ LIPK+  P                           +++L + IS  QSAF+  R + +N ++ +E +H++K +T GK G  +LKLDMSK 
Subjt:  PGPLNDTLVVLIPKVASP---------------------------RKILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLDMSKR

Query:  LG---------GMEFLGGCVGSS----------------------------------------GSEGLVVRNILELYEKVSGQTMNFDKSMVAFSPNTVL
                   GM+ +G   G S                                         S  L ++ +L++Y + SGQ +N DK +++FSPNT L
Subjt:  LG---------GMEFLGGCVGSS----------------------------------------GSEGLVVRNILELYEKVSGQTMNFDKSMVAFSPNTVL

Query:  AAQYDLSSLFAVPV------------------------IKDRIWKQLQGWQG---SAGGKEVLIKTVVQAIPCYTINCFRLPMNLVHEIHQLIARFW---
        AAQ       ++P+                        IK+RIWK +Q W     SAGG+EVL+K VVQ+IP Y ++CFRLP+   +++  +++ FW   
Subjt:  AAQYDLSSLFAVPV------------------------IKDRIWKQLQGWQG---SAGGKEVLIKTVVQAIPCYTINCFRLPMNLVHEIHQLIARFW---

Query:  --------------------------------------------------------------------------PSFIWRSLLWGRDLLCQGIWWRIGDG
                                                                                  PS  W+ + WGR+LL +G+ W+IG+G
Subjt:  --------------------------------------------------------------------------PSFIWRSLLWGRDLLCQGIWWRIGDG

Query:  RSVPIYNSNWIP
        R +      WIP
Subjt:  RSVPIYNSNWIP

B9FYK3 Uncharacterized protein (e value: 3.5e-30; % identity: 27.11)
Query:  ESPGPLNDTLVVLIPKVASP---------------------------RKILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLDMS
        E P   N+T+VVLIPK+ +P                           ++IL   IS NQSAFV GR +++N +L YE  H L+ +  G   +  LKLDMS
Subjt:  ESPGPLNDTLVVLIPKVASP---------------------------RKILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLDMS

Query:  KRLGGMEF---------LGGC-------VGSSGSEGLVVRNILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDL------------SSLFAVPV-----
        K    +E+         LG C       +G   +    ++N+L LYE   GQT+N DKS + FS N+    + ++                 +P+     
Subjt:  KRLGGMEF---------LGGC-------VGSSGSEGLVVRNILELYEKVSGQTMNFDKSMVAFSPNTVLAAQYDL------------SSLFAVPV-----

Query:  -------IKDRIWKQLQGWQG---SAGGKEVLIKTVVQAIPCYTINCFRLPMNLVHEIHQLIARFW----------------------------------
               +KDR+WK+LQGW+    S  GKE+LIK+VVQ+IP Y ++CF L   L +E+  L+ RFW                                  
Subjt:  -------IKDRIWKQLQGWQG---SAGGKEVLIKTVVQAIPCYTINCFRLPMNLVHEIHQLIARFW----------------------------------

Query:  -------------------------------------------PSFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWIPRNYGLRERIP
                                                    S+ WRS++ G   L +G+ WR+GDG ++ I++  W+P     R   P
Subjt:  -------------------------------------------PSFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWIPRNYGLRERIP

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGTTAGTTGGAAACACCAGTAATCTACTAAGGCCTTTTTTTCTTAAGGTACGGGGCTTGCAACTGCTGGAAAAACAACCCCAACGGCGGCGACACCAAACCCCGACGAC
AGCAACGGAACACGGCGAAACTGGCCTTGGAGAGCAGCTGAACGGGCGTGAGGATAGTCTCGTGGTTTCAAGGTCGGCGGCGACTAGGCCTTCGAATGGCGGTGGTCTCA
CTAGAGGCGAGAGGTTGAGAGGGGAGACGCGACGGAAGAGGCAGACGAGAGCGAGAGAGAGCTACGGGGGAGGGAGAGTGGTGGCTGAAGGTGGAAGGGGAAAGGGGCTG
CGCACGGGGCTGAAAGGAAAAAAAATAGGTTTTTCCAGCATCGATTCATGGAGGAGAAGCTTCTCCTCCCCCTCTCGTTTCAGACCTTCCGTTTCAGATTTGAAACGGAA
GGGGGAGGAGGAGCTCCTCCCTCCCCAAATCGGCCCCTGTTTAAAAAAAACAGAGGAAAGAAAAGAAAAGAGAAGAAGGAAAGAAGAAGAAAAGAAGGAAAAAAAAAGTT
TCTGGCAGCCGCCGCCGTCGTTCGCCGGCCGACCACCGGAACTGAAGGAAGAAGAGGAATCTCCTGGGCCTCTGAATGATACTCTAGTGGTGTTGATTCCAAAAGTGGCC
TCCCCTAGAAAGATCCTGAAAGTGGCTATTTCTCCCAACCAAAGTGCCTTTGTTCGAGGGAGATCTGTTATTGAAAATGCGATTTTGGGCTATGAATGTATCCATTCCCT
TAAAGGCAGAACGGGTGGGAAATTTGGATGGACGACCTTAAAACTTGATATGAGTAAGCGACTGGGTGGAATGGAGTTTCTTGGAGGTTGTGTTGGGTCATCTGGGTCTG
AAGGTCTTGTGGTCCGAAATATACTGGAGTTGTATGAGAAAGTCTCGGGCCAAACAATGAATTTTGATAAGTCTATGGTGGCCTTTAGTCCAAATACAGTATTGGCTGCT
CAGTATGACTTGAGTAGCCTTTTCGCGGTTCCAGTGATTAAAGATCGTATTTGGAAGCAATTACAAGGCTGGCAAGGTTCTGCAGGGGGAAAGGAGGTTCTAATCAAGAC
GGTAGTCCAAGCCATTCCATGTTACACTATAAATTGTTTCCGCTTACCAATGAATCTGGTCCATGAAATTCATCAACTTATTGCCCGATTTTGGCCTTCATTCATATGGC
GCAGCTTGCTGTGGGGTCGAGATCTGCTCTGTCAGGGTATATGGTGGCGCATCGGTGATGGTCGTTCTGTTCCAATTTATAATTCGAATTGGATTCCTCGAAATTATGGC
TTACGGGAGCGGATACCGTCTTGCAAATTCATCGACGAAGACTATGGTTATCTGGTGGAATCGGTTTCGGCAAATGAATGTGGCGCCAAAAAATTAAAATCTTTATGTGG
AGATTATGCTTGA
mRNA sequence
ATGTTAGTTGGAAACACCAGTAATCTACTAAGGCCTTTTTTTCTTAAGGTACGGGGCTTGCAACTGCTGGAAAAACAACCCCAACGGCGGCGACACCAAACCCCGACGAC
AGCAACGGAACACGGCGAAACTGGCCTTGGAGAGCAGCTGAACGGGCGTGAGGATAGTCTCGTGGTTTCAAGGTCGGCGGCGACTAGGCCTTCGAATGGCGGTGGTCTCA
CTAGAGGCGAGAGGTTGAGAGGGGAGACGCGACGGAAGAGGCAGACGAGAGCGAGAGAGAGCTACGGGGGAGGGAGAGTGGTGGCTGAAGGTGGAAGGGGAAAGGGGCTG
CGCACGGGGCTGAAAGGAAAAAAAATAGGTTTTTCCAGCATCGATTCATGGAGGAGAAGCTTCTCCTCCCCCTCTCGTTTCAGACCTTCCGTTTCAGATTTGAAACGGAA
GGGGGAGGAGGAGCTCCTCCCTCCCCAAATCGGCCCCTGTTTAAAAAAAACAGAGGAAAGAAAAGAAAAGAGAAGAAGGAAAGAAGAAGAAAAGAAGGAAAAAAAAAGTT
TCTGGCAGCCGCCGCCGTCGTTCGCCGGCCGACCACCGGAACTGAAGGAAGAAGAGGAATCTCCTGGGCCTCTGAATGATACTCTAGTGGTGTTGATTCCAAAAGTGGCC
TCCCCTAGAAAGATCCTGAAAGTGGCTATTTCTCCCAACCAAAGTGCCTTTGTTCGAGGGAGATCTGTTATTGAAAATGCGATTTTGGGCTATGAATGTATCCATTCCCT
TAAAGGCAGAACGGGTGGGAAATTTGGATGGACGACCTTAAAACTTGATATGAGTAAGCGACTGGGTGGAATGGAGTTTCTTGGAGGTTGTGTTGGGTCATCTGGGTCTG
AAGGTCTTGTGGTCCGAAATATACTGGAGTTGTATGAGAAAGTCTCGGGCCAAACAATGAATTTTGATAAGTCTATGGTGGCCTTTAGTCCAAATACAGTATTGGCTGCT
CAGTATGACTTGAGTAGCCTTTTCGCGGTTCCAGTGATTAAAGATCGTATTTGGAAGCAATTACAAGGCTGGCAAGGTTCTGCAGGGGGAAAGGAGGTTCTAATCAAGAC
GGTAGTCCAAGCCATTCCATGTTACACTATAAATTGTTTCCGCTTACCAATGAATCTGGTCCATGAAATTCATCAACTTATTGCCCGATTTTGGCCTTCATTCATATGGC
GCAGCTTGCTGTGGGGTCGAGATCTGCTCTGTCAGGGTATATGGTGGCGCATCGGTGATGGTCGTTCTGTTCCAATTTATAATTCGAATTGGATTCCTCGAAATTATGGC
TTACGGGAGCGGATACCGTCTTGCAAATTCATCGACGAAGACTATGGTTATCTGGTGGAATCGGTTTCGGCAAATGAATGTGGCGCCAAAAAATTAAAATCTTTATGTGG
AGATTATGCTTGA
Protein sequence
MLVGNTSNLLRPFFLKVRGLQLLEKQPQRRRHQTPTTATEHGETGLGEQLNGREDSLVVSRSAATRPSNGGGLTRGERLRGETRRKRQTRARESYGGGRVVAEGGRGKGL
RTGLKGKKIGFSSIDSWRRSFSSPSRFRPSVSDLKRKGEEELLPPQIGPCLKKTEERKEKRRRKEEEKKEKKSFWQPPPSFAGRPPELKEEEESPGPLNDTLVVLIPKVA
SPRKILKVAISPNQSAFVRGRSVIENAILGYECIHSLKGRTGGKFGWTTLKLDMSKRLGGMEFLGGCVGSSGSEGLVVRNILELYEKVSGQTMNFDKSMVAFSPNTVLAA
QYDLSSLFAVPVIKDRIWKQLQGWQGSAGGKEVLIKTVVQAIPCYTINCFRLPMNLVHEIHQLIARFWPSFIWRSLLWGRDLLCQGIWWRIGDGRSVPIYNSNWIPRNYG
LRERIPSCKFIDEDYGYLVESVSANECGAKKLKSLCGDYA