; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg027307 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg027307
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionFanconi-associated nuclease
Genome locationscaffold7:39631212..39634995
RNA-Seq ExpressionSpg027307
SyntenySpg027307
Gene Ontology termsGO:0009987 - cellular process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]2.5e-2026.3Show/hide
Query:  RKAVCVQLPYDRFVNNSARAKYAEFLKRDFLFERGF------NGDLPHFLRTGIADHGWELFCAKPESVNAKVVREFYANIDKEDGFQVIVRGVEVDWSP
        R A    + + +F N+ A+A++  F  R+  FE GF      +G     +   +    W  F   P SVNA +V+EFYANI K +   + VRG ++ ++ 
Subjt:  RKAVCVQLPYDRFVNNSARAKYAEFLKRDFLFERGF------NGDLPHFLRTGIADHGWELFCAKPESVNAKVVREFYANIDKEDGFQVIVRGVEVDWSP

Query:  SAINALYNLQNF--PHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLS
         AIN  ++LQ     HA + E      + +    + ++  E   W                                          R+LL  +++ S  
Subjt:  SAINALYNLQNF--PHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLS

Query:  IDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDKGIIDTPNLARLQRMQEVRQGGLVY-------DINMILEQLALSPN-RQEFA
        IDVG+I+V ++  C  KK   L FPN IT LC++  V E+    IL     I    L  L  ++  +    V+       + N  +  LAL     Q  A
Subjt:  IDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDKGIIDTPNLARLQRMQEVRQGGLVY-------DINMILEQLALSPN-RQEFA

Query:  ERQAL-----TFWNYVKNRDANLKKALQENFSKPYPALPGFPEDLL
        +  AL      F+ YVK+RD  ++   QE         PGFP+++L
Subjt:  ERQAL-----TFWNYVKNRDANLKKALQENFSKPYPALPGFPEDLL

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]4.7e-1930.12Show/hide
Query:  RFVNNSARAKYAEFLK-RDFLFERGF-------NGDLPHFLRTGIADHGWELFCAKPESVNAKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F   +A  +Y   ++ R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+       V VRGV+V WS  AINA++ L
Subjt:  RFVNNSARAKYAEFLK-RDFLFERGF-------NGDLPHFLRTGIADHGWELFCAKPESVNAKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI
         + P   ++E +   +   L   +  V + GA W                                         +R+LL  ++L   SI+VG+++ +EI
Subjt:  QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI

Query:  SGCWKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDKGIIDTPNLARL
          C  +K G LFFP+ IT LC+ A  P       L + G ID   +AR+
Subjt:  SGCWKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDKGIIDTPNLARL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.3e-2929.11Show/hide
Query:  RFVNNSARAKYAEFLK-RDFLFERGF-------NGDLPHFLRTGIADHGWELFCAKPESVNAKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F   +A  +Y   ++ R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+   +   V VRGV+V WS  AINA++ L
Subjt:  RFVNNSARAKYAEFLK-RDFLFERGF-------NGDLPHFLRTGIADHGWELFCAKPESVNAKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI
         + P   ++E +   + + L   +  V   GA W                                         +R+LL  ++L   SI+VG+++ +EI
Subjt:  QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI

Query:  SGCWKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDKGIIDTPNLARL---------QRMQEVR---------QGGLVYDINMILEQLALSPNRQ----
          C  +K G LFFP+ IT LC+ A  P       L + G ID   +AR+         Q+    R          G ++  +  + ++L+    +Q    
Subjt:  SGCWKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDKGIIDTPNLARL---------QRMQEVR---------QGGLVYDINMILEQLALSPNRQ----

Query:  ---EFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPGFPEDLL
           +   +Q   FW Y K RD  LKKALQ NF++P P  P FP+++L
Subjt:  ---EFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPGFPEDLL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]9.2e-2330.04Show/hide
Query:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------
        +VREFYAN+   +   + VRGV+V WS  AINA++ L + P   ++E +   +  +L   +  V   GA W                             
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------

Query:  ------------ERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDKGIIDTPNLARL--------------
                    +R+LL  ++L   SI+VG+++ +EI  C  +K G LFFP+ IT LC+ A    +E    L + G ID   +AR+              
Subjt:  ------------ERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDKGIIDTPNLARL--------------

Query:  QRMQEVRQGGLVYDINMILEQLALSPNRQEFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPGFPEDLL
         R           D+   L+ L    ++QE   +Q   FW Y K RD  LKKALQ NF++P P  P FP+++L
Subjt:  QRMQEVRQGGLVYDINMILEQLALSPNRQEFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPGFPEDLL

XP_004514776.1 uncharacterized protein LOC101493401 isoform X3 [Cicer arietinum]1.2e-1723.5Show/hide
Query:  YDRFVNNSARAKYAEFLK-RDFLFERGFNGD-------LPHFLRTGIADHGWELFCAKPESVNAKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALY
        + +F+N   + K+   +K R+F  E GF+ +       LP  L + I  H W+ F     +  A +VREFY+ I +     V+VRGV V ++P  +N  +
Subjt:  YDRFVNNSARAKYAEFLK-RDFLFERGFNGD-------LPHFLRTGIADHGWELFCAKPESVNAKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALY

Query:  NL-------QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW------------------------------------------ERVLLAFAILRSLSI
        NL        N     Y  +    S+E+L+  ++ + + G  W                                          +R+LL + ++   SI
Subjt:  NL-------QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW------------------------------------------ERVLLAFAILRSLSI

Query:  DVGKIVVNEISGC--WKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDK---GIIDTPNLARLQRMQEVRQGGLVYDINMIL-----------EQLALS
        +VGKI+ +EI  C   KKK  +L FP+ I+ LC R GV  ++   ++ ++   G+ D       + M   R+G +  +    +           E+  + 
Subjt:  DVGKIVVNEISGC--WKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDK---GIIDTPNLARLQRMQEVRQGGLVYDINMIL-----------EQLALS

Query:  PNRQEFA--------------ERQALTFWNYVKNRDANLKKALQENFSKPYPALPGFPEDLLNPWIPPPPVERGEEDDENELG
          +++F                +Q   FW + K      +K  + NF K     P FP+++L P++  P  E+G+  D  E G
Subjt:  PNRQEFA--------------ERQALTFWNYVKNRDANLKKALQENFSKPYPALPGFPEDLLNPWIPPPPVERGEEDDENELG

TrEMBL top hitse value%identityAlignment
A0A1S2Z475 uncharacterized protein LOC101493401 isoform X35.6e-1823.5Show/hide
Query:  YDRFVNNSARAKYAEFLK-RDFLFERGFNGD-------LPHFLRTGIADHGWELFCAKPESVNAKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALY
        + +F+N   + K+   +K R+F  E GF+ +       LP  L + I  H W+ F     +  A +VREFY+ I +     V+VRGV V ++P  +N  +
Subjt:  YDRFVNNSARAKYAEFLK-RDFLFERGFNGD-------LPHFLRTGIADHGWELFCAKPESVNAKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALY

Query:  NL-------QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW------------------------------------------ERVLLAFAILRSLSI
        NL        N     Y  +    S+E+L+  ++ + + G  W                                          +R+LL + ++   SI
Subjt:  NL-------QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW------------------------------------------ERVLLAFAILRSLSI

Query:  DVGKIVVNEISGC--WKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDK---GIIDTPNLARLQRMQEVRQGGLVYDINMIL-----------EQLALS
        +VGKI+ +EI  C   KKK  +L FP+ I+ LC R GV  ++   ++ ++   G+ D       + M   R+G +  +    +           E+  + 
Subjt:  DVGKIVVNEISGC--WKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDK---GIIDTPNLARLQRMQEVRQGGLVYDINMIL-----------EQLALS

Query:  PNRQEFA--------------ERQALTFWNYVKNRDANLKKALQENFSKPYPALPGFPEDLLNPWIPPPPVERGEEDDENELG
          +++F                +Q   FW + K      +K  + NF K     P FP+++L P++  P  E+G+  D  E G
Subjt:  PNRQEFA--------------ERQALTFWNYVKNRDANLKKALQENFSKPYPALPGFPEDLLNPWIPPPPVERGEEDDENELG

A0A2P5AGA5 Uncharacterized protein (Fragment)2.3e-1930.12Show/hide
Query:  RFVNNSARAKYAEFLK-RDFLFERGF-------NGDLPHFLRTGIADHGWELFCAKPESVNAKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F   +A  +Y   ++ R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+       V VRGV+V WS  AINA++ L
Subjt:  RFVNNSARAKYAEFLK-RDFLFERGF-------NGDLPHFLRTGIADHGWELFCAKPESVNAKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI
         + P   ++E +   +   L   +  V + GA W                                         +R+LL  ++L   SI+VG+++ +EI
Subjt:  QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI

Query:  SGCWKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDKGIIDTPNLARL
          C  +K G LFFP+ IT LC+ A  P       L + G ID   +AR+
Subjt:  SGCWKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDKGIIDTPNLARL

A0A2P5BCG4 Uncharacterized protein (Fragment)1.1e-2929.11Show/hide
Query:  RFVNNSARAKYAEFLK-RDFLFERGF-------NGDLPHFLRTGIADHGWELFCAKPESVNAKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F   +A  +Y   ++ R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+   +   V VRGV+V WS  AINA++ L
Subjt:  RFVNNSARAKYAEFLK-RDFLFERGF-------NGDLPHFLRTGIADHGWELFCAKPESVNAKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI
         + P   ++E +   + + L   +  V   GA W                                         +R+LL  ++L   SI+VG+++ +EI
Subjt:  QNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLSIDVGKIVVNEI

Query:  SGCWKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDKGIIDTPNLARL---------QRMQEVR---------QGGLVYDINMILEQLALSPNRQ----
          C  +K G LFFP+ IT LC+ A  P       L + G ID   +AR+         Q+    R          G ++  +  + ++L+    +Q    
Subjt:  SGCWKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDKGIIDTPNLARL---------QRMQEVR---------QGGLVYDINMILEQLALSPNRQ----

Query:  ---EFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPGFPEDLL
           +   +Q   FW Y K RD  LKKALQ NF++P P  P FP+++L
Subjt:  ---EFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPGFPEDLL

A0A2P5DXM3 Uncharacterized protein4.5e-2330.04Show/hide
Query:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------
        +VREFYAN+   +   + VRGV+V WS  AINA++ L + P   ++E +   +  +L   +  V   GA W                             
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------

Query:  ------------ERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDKGIIDTPNLARL--------------
                    +R+LL  ++L   SI+VG+++ +EI  C  +K G LFFP+ IT LC+ A    +E    L + G ID   +AR+              
Subjt:  ------------ERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDKGIIDTPNLARL--------------

Query:  QRMQEVRQGGLVYDINMILEQLALSPNRQEFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPGFPEDLL
         R           D+   L+ L    ++QE   +Q   FW Y K RD  LKKALQ NF++P P  P FP+++L
Subjt:  QRMQEVRQGGLVYDINMILEQLALSPNRQEFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPGFPEDLL

A0A6A3BU96 Uncharacterized protein1.2e-2026.3Show/hide
Query:  RKAVCVQLPYDRFVNNSARAKYAEFLKRDFLFERGF------NGDLPHFLRTGIADHGWELFCAKPESVNAKVVREFYANIDKEDGFQVIVRGVEVDWSP
        R A    + + +F N+ A+A++  F  R+  FE GF      +G     +   +    W  F   P SVNA +V+EFYANI K +   + VRG ++ ++ 
Subjt:  RKAVCVQLPYDRFVNNSARAKYAEFLKRDFLFERGF------NGDLPHFLRTGIADHGWELFCAKPESVNAKVVREFYANIDKEDGFQVIVRGVEVDWSP

Query:  SAINALYNLQNF--PHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLS
         AIN  ++LQ     HA + E      + +    + ++  E   W                                          R+LL  +++ S  
Subjt:  SAINALYNLQNF--PHAAYNEMVVVPSNEQLSDAVREVGIEGARW-----------------------------------------ERVLLAFAILRSLS

Query:  IDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDKGIIDTPNLARLQRMQEVRQGGLVY-------DINMILEQLALSPN-RQEFA
        IDVG+I+V ++  C  KK   L FPN IT LC++  V E+    IL     I    L  L  ++  +    V+       + N  +  LAL     Q  A
Subjt:  IDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDKGIIDTPNLARLQRMQEVRQGGLVY-------DINMILEQLALSPN-RQEFA

Query:  ERQAL-----TFWNYVKNRDANLKKALQENFSKPYPALPGFPEDLL
        +  AL      F+ YVK+RD  ++   QE         PGFP+++L
Subjt:  ERQAL-----TFWNYVKNRDANLKKALQENFSKPYPALPGFPEDLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCTTGTCACAGCGTCGAGACGCTGTGACTGCTGCCAACCAAATTAGAAAAGAAAAGGCAGCGTCGAGACGCTCCAAGAGTAGCGTCCCGACGCTGGTTCAGAAGGT
TGTTGCAGCAAACTCTATTTATTCCAAACACTTCGGTAAGTGCTTCTCACTCCATTTTCGTGCTTCAAATCTTTGCTGGTTACATTCTTCTTTCTTTATTGCTTTTCTCT
GTAAAACCCTTGAATCCTTCATGGCAAAAACAAGAGCGAGGAAAGAAAGAGAGAATGAGGAGGAAGAGATATCGATTACCTCTGAGGTGCAAAAGGTAAAGGCGAAGAAG
AAAAAGACATCGGAGGAGAAAGAAGCCAAGAGAAGAATACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGGATGATGCTGTCACAGTAGAAGAAGGAGACCC
AAAAGAACCTGAAAATCAGAACCCTGAACGGACTAACCCGGCAGTTGAGGATACGCAAGAAATTCAAGAAAAGCAGGCTGAGGAGGTGCGAGAACATGCAGAGGTTGCAC
TGGAAGAAGGAAACGAGCCAGTTCAAGAAGCTCGTGTTGAGGTCATCATGCCAGAAGTACCGAAGCGATGCCGCATAAAGCGAAAGGCTGTGTGCGTCCAGTTGCCGTAT
GATCGCTTCGTCAACAATTCTGCCAGAGCAAAATATGCTGAGTTTCTGAAAAGAGACTTCCTGTTTGAGAGAGGATTCAACGGTGATCTTCCGCATTTTCTGAGGACCGG
CATTGCAGACCATGGCTGGGAGTTGTTTTGTGCAAAGCCTGAGTCTGTGAACGCAAAGGTGGTGCGCGAATTTTATGCTAATATTGACAAAGAAGATGGTTTCCAAGTGA
TTGTTCGAGGAGTCGAGGTTGACTGGAGTCCTAGTGCTATTAACGCACTGTATAACCTTCAGAATTTCCCCCACGCAGCGTATAATGAGATGGTTGTGGTGCCATCTAAT
GAGCAGCTGAGTGACGCTGTGCGGGAAGTGGGAATTGAAGGGGCACGGTGGGAACGAGTTCTTCTGGCTTTCGCGATTTTGCGGTCTCTCAGCATCGATGTAGGGAAGAT
TGTTGTGAATGAAATTTCTGGTTGTTGGAAGAAGAAGGTGGGGAAGCTGTTTTTCCCAAATACTATTACCATGCTTTGCAAGAGAGCAGGGGTTCCAGAGGATGAAGGAG
GTGTTATTTTGTTTGACAAGGGGATCATCGACACGCCTAACTTGGCACGACTTCAGCGTATGCAAGAGGTGCGTCAGGGTGGGCTTGTCTACGACATCAACATGATTTTA
GAACAACTAGCACTTTCGCCCAATAGGCAAGAGTTTGCCGAGAGGCAAGCTTTGACCTTCTGGAACTATGTTAAAAATCGTGATGCCAATCTGAAGAAGGCGCTTCAAGA
AAATTTTTCCAAGCCATATCCAGCCCTTCCTGGATTCCCTGAGGATTTATTGAACCCCTGGATACCACCCCCACCGGTTGAAAGAGGAGAAGAGGATGATGAAAATGAGC
TGGGCCAAGAGGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCTTGTCACAGCGTCGAGACGCTGTGACTGCTGCCAACCAAATTAGAAAAGAAAAGGCAGCGTCGAGACGCTCCAAGAGTAGCGTCCCGACGCTGGTTCAGAAGGT
TGTTGCAGCAAACTCTATTTATTCCAAACACTTCGGTAAGTGCTTCTCACTCCATTTTCGTGCTTCAAATCTTTGCTGGTTACATTCTTCTTTCTTTATTGCTTTTCTCT
GTAAAACCCTTGAATCCTTCATGGCAAAAACAAGAGCGAGGAAAGAAAGAGAGAATGAGGAGGAAGAGATATCGATTACCTCTGAGGTGCAAAAGGTAAAGGCGAAGAAG
AAAAAGACATCGGAGGAGAAAGAAGCCAAGAGAAGAATACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGGATGATGCTGTCACAGTAGAAGAAGGAGACCC
AAAAGAACCTGAAAATCAGAACCCTGAACGGACTAACCCGGCAGTTGAGGATACGCAAGAAATTCAAGAAAAGCAGGCTGAGGAGGTGCGAGAACATGCAGAGGTTGCAC
TGGAAGAAGGAAACGAGCCAGTTCAAGAAGCTCGTGTTGAGGTCATCATGCCAGAAGTACCGAAGCGATGCCGCATAAAGCGAAAGGCTGTGTGCGTCCAGTTGCCGTAT
GATCGCTTCGTCAACAATTCTGCCAGAGCAAAATATGCTGAGTTTCTGAAAAGAGACTTCCTGTTTGAGAGAGGATTCAACGGTGATCTTCCGCATTTTCTGAGGACCGG
CATTGCAGACCATGGCTGGGAGTTGTTTTGTGCAAAGCCTGAGTCTGTGAACGCAAAGGTGGTGCGCGAATTTTATGCTAATATTGACAAAGAAGATGGTTTCCAAGTGA
TTGTTCGAGGAGTCGAGGTTGACTGGAGTCCTAGTGCTATTAACGCACTGTATAACCTTCAGAATTTCCCCCACGCAGCGTATAATGAGATGGTTGTGGTGCCATCTAAT
GAGCAGCTGAGTGACGCTGTGCGGGAAGTGGGAATTGAAGGGGCACGGTGGGAACGAGTTCTTCTGGCTTTCGCGATTTTGCGGTCTCTCAGCATCGATGTAGGGAAGAT
TGTTGTGAATGAAATTTCTGGTTGTTGGAAGAAGAAGGTGGGGAAGCTGTTTTTCCCAAATACTATTACCATGCTTTGCAAGAGAGCAGGGGTTCCAGAGGATGAAGGAG
GTGTTATTTTGTTTGACAAGGGGATCATCGACACGCCTAACTTGGCACGACTTCAGCGTATGCAAGAGGTGCGTCAGGGTGGGCTTGTCTACGACATCAACATGATTTTA
GAACAACTAGCACTTTCGCCCAATAGGCAAGAGTTTGCCGAGAGGCAAGCTTTGACCTTCTGGAACTATGTTAAAAATCGTGATGCCAATCTGAAGAAGGCGCTTCAAGA
AAATTTTTCCAAGCCATATCCAGCCCTTCCTGGATTCCCTGAGGATTTATTGAACCCCTGGATACCACCCCCACCGGTTGAAAGAGGAGAAGAGGATGATGAAAATGAGC
TGGGCCAAGAGGACTGA
Protein sequenceShow/hide protein sequence
MLLSQRRDAVTAANQIRKEKAASRRSKSSVPTLVQKVVAANSIYSKHFGKCFSLHFRASNLCWLHSSFFIAFLCKTLESFMAKTRARKERENEEEEISITSEVQKVKAKK
KKTSEEKEAKRRIRQQRAEEQEKATEDDAVTVEEGDPKEPENQNPERTNPAVEDTQEIQEKQAEEVREHAEVALEEGNEPVQEARVEVIMPEVPKRCRIKRKAVCVQLPY
DRFVNNSARAKYAEFLKRDFLFERGFNGDLPHFLRTGIADHGWELFCAKPESVNAKVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVVPSN
EQLSDAVREVGIEGARWERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKRAGVPEDEGGVILFDKGIIDTPNLARLQRMQEVRQGGLVYDINMIL
EQLALSPNRQEFAERQALTFWNYVKNRDANLKKALQENFSKPYPALPGFPEDLLNPWIPPPPVERGEEDDENELGQED