; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg020938 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg020938
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRegulator of rDNA transcription protein 15
Genome locationscaffold9:669954..672596
RNA-Seq ExpressionSpg020938
SyntenySpg020938
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU51071.1 hypothetical protein TSUD_84300 [Trifolium subterraneum]2.9e-4559.47Show/hide
Query:  WRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNAAPIGREADGGRRCAPVGCGTVRAGPPIDLGHGPMRIETAAKAQSFVTLVETSSSR
        W+S+LSGG G+SPLEGGAGEGESPV LGPCRTT  C+RVGLFGNAAPIGREADGGRRC PVGCGTV  GPPID GH P RI   A+      L ETSSSR
Subjt:  WRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNAAPIGREADGGRRCAPVGCGTVRAGPPIDLGHGPMRIETAAKAQSFVTLVETSSSR

Query:  SWLAARPRGCFGICALLASACGL--PIRPVLKHGPRSLTWFECEHACR----------DP-KDGELCLSGAKPEETMVEARSDTDVQIVR
          +AAR  G   +  L A   GL  P    L+   +       + A +          DP  DGELCLSGAK EET+VEARSDT+VQIVR
Subjt:  SWLAARPRGCFGICALLASACGL--PIRPVLKHGPRSLTWFECEHACR----------DP-KDGELCLSGAKPEETMVEARSDTDVQIVR

GEV87049.1 hypothetical protein [Tanacetum cinerariifolium]3.3e-4957.56Show/hide
Query:  FLSYYSQRPSATNISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVMHHCPPRNP----------------PSDSLR----------
        +L Y  +R SAT+ISA ASMKNVAKCDTWCELQ+P NHRVFERKLRP P GRGHVCLGV H   P                     D LR          
Subjt:  FLSYYSQRPSATNISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVMHHCPPRNP----------------PSDSLR----------

Query:  SLRTTSSRASEDFYVDPLNIVPKDDALDATPGQAGLPAEFKHINKRRKRSVRVVAWRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNA
        +L   S   +     +  N +P DDA  ATPGQAGLPAEFKHINKRRKR+++   +      GPG SPLEGGAGEGESPV  GPCRTTRRC  VGLFGNA
Subjt:  SLRTTSSRASEDFYVDPLNIVPKDDALDATPGQAGLPAEFKHINKRRKRSVRVVAWRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNA

Query:  APIGR
        APIGR
Subjt:  APIGR

KAF1855774.1 hypothetical protein Lal_00045041 [Lupinus albus]9.3e-6845.12Show/hide
Query:  SGGPGTSPLEGG-----AGEGESPVALGPCRTTRRCQRVGLFGNAAPIGREADGGRRCAPVGCGTVRAGPPIDLGHGPMRIETAAKAQSFVTLVETSSSR
        SG  G  P  GG        GE P+A           R G          + DGGRRCA VGCGT RAGPPID GHGP RI   A+A++   LVETSSSR
Subjt:  SGGPGTSPLEGG-----AGEGESPVALGPCRTTRRCQRVGLFGNAAPIGREADGGRRCAPVGCGTVRAGPPIDLGHGPMRIETAAKAQSFVTLVETSSSR

Query:  SWLAARPRG-CFGICALLASACGLPIRPVLKHGPRSLTWFEC--EHACRDPKDGELCLSGAKPEETMVEARSDTDVQIVREFY----RVKPMIRGIGGAT
         WL AR    CFG C L ASACGLPIRPVLKHGPRSLT      +H CRDPKDGELCLSGAKPEET+VEARSDTDVQIVR  +     V   I G  G  
Subjt:  SWLAARPRG-CFGICALLASACGLPIRPVLKHGPRSLTWFEC--EHACRDPKDGELCLSGAKPEETMVEARSDTDVQIVREFY----RVKPMIRGIGGAT

Query:  PS------------------------TYSQTLNRYDGVAALLSRATE----------------SRAP---------------------------------
         S                         +   +NR  G  A L    +                SR P                                 
Subjt:  PS------------------------TYSQTLNRYDGVAALLSRATE----------------SRAP---------------------------------

Query:  ------SGPFLP----AHPGNGSAGGR--VQRL-----------EEHRTSRATGRWNN-VGKGSRQNGSVTSGKGLALRAGHGVPSPEPVGYRWTARAAS
               G  LP    A P  GS+G +   +R+            E R     GR +N +   SRQNGSVTSGKGLALRAGHG PSPEPVG RWTARAA 
Subjt:  ------SGPFLP----AHPGNGSAGGR--VQRL-----------EEHRTSRATGRWNN-VGKGSRQNGSVTSGKGLALRAGHGVPSPEPVGYRWTARAAS

Query:  AARPGHRVPARGRIGNGSLGSLPRASNSQLRTGTDKGNPTV
        AAR G RVP    +G G  G LPRASNS+LRTGTDKGNPTV
Subjt:  AARPGHRVPARGRIGNGSLGSLPRASNSQLRTGTDKGNPTV

KAF7800676.1 atp synthase subunit beta [Senna tora]4.8e-4837.07Show/hide
Query:  MKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVMHHCPPR-NPPSDSLR--SLRTTSSRASEDFYVDPLNIVPKDDALDATP--GQAGLPAEF
        MKNVAKCDTWCELQ+P NHRVFERKLRP+P GRGHVCLGV H  P   +PP D  R  + R T       + V P      D+A    P  G+A   A+ 
Subjt:  MKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVMHHCPPR-NPPSDSLR--SLRTTSSRASEDFYVDPLNIVPKDDALDATP--GQAGLPAEF

Query:  KHINKRRKRSVRVVAWRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNAAPIGREADGGRRCAPVGCGTVRAGPPIDLGHGPMRIETAA
          +                        P +G      +P    P          G  G     G  ADGG RC PVGCGT  AG PI  G GPM+I  AA
Subjt:  KHINKRRKRSVRVVAWRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNAAPIGREADGGRRCAPVGCGTVRAGPPIDLGHGPMRIETAA

Query:  KAQSFVTLVETSSSRSWLAA--------------RPRGC-FGICA-----LLASACGLPIRPVLKHGPRSLTW-------FECEHACRDPKDGELCLSGA
        +AQ+    VETSSSR W+AA              RP G  FG  A     +L+   G  + P  +HG  S  W       + C    R P  G    +  
Subjt:  KAQSFVTLVETSSSRSWLAA--------------RPRGC-FGICA-----LLASACGLPIRPVLKHGPRSLTW-------FECEHACRDPKDGELCLSGA

Query:  KPEETMVEARSD-----TDVQIVREFYRVKPMIRGIGGATPSTYSQTLNRYDGVAALLSRATESRAPSGPFLPAHPGNGSAGGRVQRLEEHRTSRATGRW
         P + +   R       +   + R   R K      GGA         ++Y        R            PAHPGNGSAGGRVQRLEEH     +G  
Subjt:  KPEETMVEARSD-----TDVQIVREFYRVKPMIRGIGGATPSTYSQTLNRYDGVAALLSRATESRAPSGPFLPAHPGNGSAGGRVQRLEEHRTSRATGRW

Query:  NNVGKGSRQNGSVTSGK-GLALRAGHGVPSPEPVGYRWTARAASAARPGHRVPARGRIGNGSLG
            +        T G+    +R+    PSPEPVG RWTARAA AAR G RVPA GR GNG  G
Subjt:  NNVGKGSRQNGSVTSGK-GLALRAGHGVPSPEPVGYRWTARAASAARPGHRVPARGRIGNGSLG

KAG5568993.1 hypothetical protein H5410_064037 [Solanum commersonii]7.2e-4439.8Show/hide
Query:  ASSCTHAPSRCLNKTPAQVAPWCAEGRAFLSYYSQRPSATNISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVMHHCPPRNPPSDS
        A S T+ P R   +   + A     GRA L +  +R SAT+ISALASMKNVAKCDTWCELQ+P NHRVFERKLRP+P GRGHVCLGV H   PR P   +
Subjt:  ASSCTHAPSRCLNKTPAQVAPWCAEGRAFLSYYSQRPSATNISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVMHHCPPRNPPSDS

Query:  L-RSLRTTSSRA---------SEDFYVDPLNIVPKDDALDATP----------GQAGLPAEFKHIN-KRRKRSVRV------VAWRSVL--SGGPGTSPL
          R L + + RA         S D         P     D  P          G+A   A+   +  K     VR       VA R  +  SG  G  P 
Subjt:  L-RSLRTTSSRA---------SEDFYVDPLNIVPKDDALDATP----------GQAGLPAEFKHIN-KRRKRSVRV------VAWRSVL--SGGPGTSPL

Query:  EGG-----AGEGESPVALGPCRTTRRCQRVGLFGNAAPIGREADGGRRCAPVGCGTVRAGPPIDLGHGPMRIETAAKAQSFVTLVETSSSRSWLAARPRG
         GG        GE P+A           R G       + RE+              + GPPI  G GP+RI  AAKA++  T VET S R W    P G
Subjt:  EGG-----AGEGESPVALGPCRTTRRCQRVGLFGNAAPIGREADGGRRCAPVGCGTVRAGPPIDLGHGPMRIETAAKAQSFVTLVETSSSRSWLAARPRG

Query:  CFGICALLASACGLPI----------------------RPVLKHGPRSLTWFE--CEHACRDPKDGELCLSGAKPEETMVEARSDTDVQIVR
              L A   GL +                      +PV +   +   W +  CEH+CRDPKDGELCLSGAKPEET+VEARSDTDVQIVR
Subjt:  CFGICALLASACGLPI----------------------RPVLKHGPRSLTWFE--CEHACRDPKDGELCLSGAKPEETMVEARSDTDVQIVR

TrEMBL top hitse value%identityAlignment
A0A2N9IL11 Uncharacterized protein8.6e-4354.64Show/hide
Query:  DPANHRVFERKLRPEPSGRGHVCLGVMHHCPPRNPPSDSLRSLRTTS------------SRASEDFYVDPLNIVPKDDALDATPGQAGLPAEFKHINKRR
        +P NHRVFERKLRP P GRGHVCLGV H CP   PP    R  R  S            SR          ++ P     +ATPGQAGLPAEFKHINKRR
Subjt:  DPANHRVFERKLRPEPSGRGHVCLGVMHHCPPRNPPSDSLRSLRTTS------------SRASEDFYVDPLNIVPKDDALDATPGQAGLPAEFKHINKRR

Query:  K------------------------RSVRVVAWRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNAAPIGR-----EADGGRR
        K                        R VR+V WRSVLSGGPG SPLEGGAGEGESPV  GPCRTTRRC+RVGLFGNAAPIGR     +  GG+R
Subjt:  K------------------------RSVRVVAWRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNAAPIGR-----EADGGRR

A0A2Z6P3S7 Retrotrans_gag domain-containing protein1.4e-4559.47Show/hide
Query:  WRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNAAPIGREADGGRRCAPVGCGTVRAGPPIDLGHGPMRIETAAKAQSFVTLVETSSSR
        W+S+LSGG G+SPLEGGAGEGESPV LGPCRTT  C+RVGLFGNAAPIGREADGGRRC PVGCGTV  GPPID GH P RI   A+      L ETSSSR
Subjt:  WRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNAAPIGREADGGRRCAPVGCGTVRAGPPIDLGHGPMRIETAAKAQSFVTLVETSSSR

Query:  SWLAARPRGCFGICALLASACGL--PIRPVLKHGPRSLTWFECEHACR----------DP-KDGELCLSGAKPEETMVEARSDTDVQIVR
          +AAR  G   +  L A   GL  P    L+   +       + A +          DP  DGELCLSGAK EET+VEARSDT+VQIVR
Subjt:  SWLAARPRGCFGICALLASACGL--PIRPVLKHGPRSLTWFECEHACR----------DP-KDGELCLSGAKPEETMVEARSDTDVQIVR

A0A6A5KZG0 Uncharacterized protein4.5e-6845.12Show/hide
Query:  SGGPGTSPLEGG-----AGEGESPVALGPCRTTRRCQRVGLFGNAAPIGREADGGRRCAPVGCGTVRAGPPIDLGHGPMRIETAAKAQSFVTLVETSSSR
        SG  G  P  GG        GE P+A           R G          + DGGRRCA VGCGT RAGPPID GHGP RI   A+A++   LVETSSSR
Subjt:  SGGPGTSPLEGG-----AGEGESPVALGPCRTTRRCQRVGLFGNAAPIGREADGGRRCAPVGCGTVRAGPPIDLGHGPMRIETAAKAQSFVTLVETSSSR

Query:  SWLAARPRG-CFGICALLASACGLPIRPVLKHGPRSLTWFEC--EHACRDPKDGELCLSGAKPEETMVEARSDTDVQIVREFY----RVKPMIRGIGGAT
         WL AR    CFG C L ASACGLPIRPVLKHGPRSLT      +H CRDPKDGELCLSGAKPEET+VEARSDTDVQIVR  +     V   I G  G  
Subjt:  SWLAARPRG-CFGICALLASACGLPIRPVLKHGPRSLTWFEC--EHACRDPKDGELCLSGAKPEETMVEARSDTDVQIVREFY----RVKPMIRGIGGAT

Query:  PS------------------------TYSQTLNRYDGVAALLSRATE----------------SRAP---------------------------------
         S                         +   +NR  G  A L    +                SR P                                 
Subjt:  PS------------------------TYSQTLNRYDGVAALLSRATE----------------SRAP---------------------------------

Query:  ------SGPFLP----AHPGNGSAGGR--VQRL-----------EEHRTSRATGRWNN-VGKGSRQNGSVTSGKGLALRAGHGVPSPEPVGYRWTARAAS
               G  LP    A P  GS+G +   +R+            E R     GR +N +   SRQNGSVTSGKGLALRAGHG PSPEPVG RWTARAA 
Subjt:  ------SGPFLP----AHPGNGSAGGR--VQRL-----------EEHRTSRATGRWNN-VGKGSRQNGSVTSGKGLALRAGHGVPSPEPVGYRWTARAAS

Query:  AARPGHRVPARGRIGNGSLGSLPRASNSQLRTGTDKGNPTV
        AAR G RVP    +G G  G LPRASNS+LRTGTDKGNPTV
Subjt:  AARPGHRVPARGRIGNGSLGSLPRASNSQLRTGTDKGNPTV

A0A6A5L014 Uncharacterized protein7.7e-4443.62Show/hide
Query:  AKAQSFVTLVETSSSRSWLAARPRG-CFGICALLASACGLPIRPVLKHGPRSLTWFEC--EHACRDPKDGELCLSGAKPEETMVEARSDTDVQIVREFY-
        A+A++   LVETSSSR WL AR    CFG C L ASACGLPIRPVLKHGPRSLT      +H CRDPKDGELCLSGAKPEET+VEARSDTDVQIVR  + 
Subjt:  AKAQSFVTLVETSSSRSWLAARPRG-CFGICALLASACGLPIRPVLKHGPRSLTWFEC--EHACRDPKDGELCLSGAKPEETMVEARSDTDVQIVREFY-

Query:  ---RVKPMIRGIGGATPS------------------------TYSQTLNRYDGVAALLSRATE----------------SRAP-----------------
            V   I G  G   S                         +   +NR  G  A L    +                SR P                 
Subjt:  ---RVKPMIRGIGGATPS------------------------TYSQTLNRYDGVAALLSRATE----------------SRAP-----------------

Query:  ----------------------SGPFLP----AHPGNGSAGGR--VQRL-----------EEHRTSRATGRWNN-VGKGSRQNGSVTSGKGLALRAGHGV
                               G  LP    A P  GS+G +   +R+            E R     GR +N +   SRQNGSVTSGKGLALRAGHG 
Subjt:  ----------------------SGPFLP----AHPGNGSAGGR--VQRL-----------EEHRTSRATGRWNN-VGKGSRQNGSVTSGKGLALRAGHGV

Query:  PSPEPVGYRWTARAASAARPGHRVPARGRIGNGSLGS
        PSPEPVG RWTARAA AAR G RVP  GRIGNG  G+
Subjt:  PSPEPVGYRWTARAASAARPGHRVPARGRIGNGSLGS

A0A6A5L6L4 Uncharacterized protein3.8e-4344.07Show/hide
Query:  LVETSSSRSWLAARPRG-CFGICALLASACGLPIRPVLKHGPRSLTWFEC--EHACRDPKDGELCLSGAKPEETMVEARSDTDVQIVREFY----RVKPM
        LVETSSSR WL AR    CFG C L ASACGLPIRPVLKHGPRSLT      +H CRDPKDGELCLSGAKPEET+VEARSDTDVQIVR  +     V   
Subjt:  LVETSSSRSWLAARPRG-CFGICALLASACGLPIRPVLKHGPRSLTWFEC--EHACRDPKDGELCLSGAKPEETMVEARSDTDVQIVREFY----RVKPM

Query:  IRGIGGATPS------------------------TYSQTLNRYDGVAALLSRATE----------------SRAP-------------------------
        I G  G   S                         +   +NR  G  A L    +                SR P                         
Subjt:  IRGIGGATPS------------------------TYSQTLNRYDGVAALLSRATE----------------SRAP-------------------------

Query:  --------------SGPFLP----AHPGNGSAGGR--VQRL-----------EEHRTSRATGRWNN-VGKGSRQNGSVTSGKGLALRAGHGVPSPEPVGY
                       G  LP    A P  GS+G +   +R+            E R     GR +N +   SRQNGSVTSGKGLALRAGHG PSPEPVG 
Subjt:  --------------SGPFLP----AHPGNGSAGGR--VQRL-----------EEHRTSRATGRWNN-VGKGSRQNGSVTSGKGLALRAGHGVPSPEPVGY

Query:  RWTARAASAARPGHRVPARGRIGNGSLGS
        RWTARAA AAR G RVP  GRIGNG  G+
Subjt:  RWTARAASAARPGHRVPARGRIGNGSLGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTAAACATCAAACGACCCGCGAACGCGTTCACGAACTTTCTGTGCCTGGGGGGGGAGCATCGTCTTGCACGCATGCACCCTCCCGGTGCCTCAACAAAACCCCGGC
GCAGGTCGCGCCTTGGTGTGCGGAGGGCAGAGCATTCTTGTCGTATTATTCACAACGACCCTCGGCAACGAATATCTCGGCTCTCGCATCGATGAAGAACGTAGCAAAAT
GCGATACTTGGTGTGAATTGCAGGATCCTGCGAACCACCGAGTCTTTGAACGCAAGTTGCGCCCGGAGCCTTCTGGCCGAGGGCACGTCTGCCTGGGCGTCATGCATCAC
TGCCCCCCACGCAACCCCCCTTCGGATTCGTTGCGCAGTCTGCGCACCACCTCCTCGCGAGCGAGCGAGGACTTCTATGTCGACCCTCTGAACATCGTCCCCAAAGACGA
TGCTCTCGACGCGACCCCAGGTCAGGCGGGATTACCCGCTGAGTTTAAGCATATCAATAAGCGGAGGAAAAGAAGCGTCCGAGTTGTAGCCTGGAGAAGCGTCCTCAGCG
GCGGACCGGGCACAAGTCCCCTGGAAGGGGGCGCCGGAGAGGGTGAGAGCCCCGTTGCGCTCGGACCCTGTCGCACCACGAGGCGCTGTCAACGAGTCGGGTTGTTTGGG
AATGCAGCCCCAATCGGGCGGGAAGCGGATGGGGGCCGGCGATGTGCCCCAGTCGGATGTGGAACGGTGAGAGCCGGTCCGCCGATCGACTTGGGGCATGGACCGATGCG
GATTGAGACGGCGGCCAAAGCCCAGTCCTTTGTTACGCTTGTGGAGACGTCGTCGTCCCGATCGTGGCTGGCAGCGCGCCCTCGGGGGTGCTTCGGCATCTGCGCGCTCC
TGGCATCGGCCTGTGGGCTCCCCATTCGACCCGTCTTGAAACACGGACCAAGGAGTCTGACATGGTTCGAGTGTGAGCATGCCTGTCGGGACCCGAAAGATGGTGAACTA
TGCCTGAGCGGGGCGAAGCCAGAGGAAACTATGGTGGAGGCTCGCAGCGATACTGACGTGCAAATCGTTCGCGAGTTCTATCGGGTAAAGCCAATGATTAGAGGCATCGG
GGGCGCAACGCCCTCGACCTATTCTCAAACTTTAAATAGGTACGACGGCGTGGCTGCTTTGTTGAGCCGCGCCACGGAATCGAGAGCTCCAAGTGGGCCATTTTTGCCTG
CCCACCCTGGAAACGGCTCAGCCGGAGGTAGGGTCCAGCGGTTGGAAGAGCACCGCACGTCGCGTGCCACTGGTCGATGGAACAATGTAGGCAAGGGAAGTCGGCAAAAT
GGATCCGTAACCTCGGGAAAAGGATTGGCTCTGAGGGCTGGGCACGGGGTTCCCAGTCCCGAACCCGTCGGCTATCGATGGACTGCTCGAGCTGCTTCCGCGGCGAGACC
GGGTCACCGCGTGCCGGCCAGGGGACGGATTGGGAACGGCTCTCTCGGGAGCCTTCCCCGGGCGTCAAACAGTCAACTCAGAACTGGTACGGACAAGGGGAATCCGACTG
TTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCTAAACATCAAACGACCCGCGAACGCGTTCACGAACTTTCTGTGCCTGGGGGGGGAGCATCGTCTTGCACGCATGCACCCTCCCGGTGCCTCAACAAAACCCCGGC
GCAGGTCGCGCCTTGGTGTGCGGAGGGCAGAGCATTCTTGTCGTATTATTCACAACGACCCTCGGCAACGAATATCTCGGCTCTCGCATCGATGAAGAACGTAGCAAAAT
GCGATACTTGGTGTGAATTGCAGGATCCTGCGAACCACCGAGTCTTTGAACGCAAGTTGCGCCCGGAGCCTTCTGGCCGAGGGCACGTCTGCCTGGGCGTCATGCATCAC
TGCCCCCCACGCAACCCCCCTTCGGATTCGTTGCGCAGTCTGCGCACCACCTCCTCGCGAGCGAGCGAGGACTTCTATGTCGACCCTCTGAACATCGTCCCCAAAGACGA
TGCTCTCGACGCGACCCCAGGTCAGGCGGGATTACCCGCTGAGTTTAAGCATATCAATAAGCGGAGGAAAAGAAGCGTCCGAGTTGTAGCCTGGAGAAGCGTCCTCAGCG
GCGGACCGGGCACAAGTCCCCTGGAAGGGGGCGCCGGAGAGGGTGAGAGCCCCGTTGCGCTCGGACCCTGTCGCACCACGAGGCGCTGTCAACGAGTCGGGTTGTTTGGG
AATGCAGCCCCAATCGGGCGGGAAGCGGATGGGGGCCGGCGATGTGCCCCAGTCGGATGTGGAACGGTGAGAGCCGGTCCGCCGATCGACTTGGGGCATGGACCGATGCG
GATTGAGACGGCGGCCAAAGCCCAGTCCTTTGTTACGCTTGTGGAGACGTCGTCGTCCCGATCGTGGCTGGCAGCGCGCCCTCGGGGGTGCTTCGGCATCTGCGCGCTCC
TGGCATCGGCCTGTGGGCTCCCCATTCGACCCGTCTTGAAACACGGACCAAGGAGTCTGACATGGTTCGAGTGTGAGCATGCCTGTCGGGACCCGAAAGATGGTGAACTA
TGCCTGAGCGGGGCGAAGCCAGAGGAAACTATGGTGGAGGCTCGCAGCGATACTGACGTGCAAATCGTTCGCGAGTTCTATCGGGTAAAGCCAATGATTAGAGGCATCGG
GGGCGCAACGCCCTCGACCTATTCTCAAACTTTAAATAGGTACGACGGCGTGGCTGCTTTGTTGAGCCGCGCCACGGAATCGAGAGCTCCAAGTGGGCCATTTTTGCCTG
CCCACCCTGGAAACGGCTCAGCCGGAGGTAGGGTCCAGCGGTTGGAAGAGCACCGCACGTCGCGTGCCACTGGTCGATGGAACAATGTAGGCAAGGGAAGTCGGCAAAAT
GGATCCGTAACCTCGGGAAAAGGATTGGCTCTGAGGGCTGGGCACGGGGTTCCCAGTCCCGAACCCGTCGGCTATCGATGGACTGCTCGAGCTGCTTCCGCGGCGAGACC
GGGTCACCGCGTGCCGGCCAGGGGACGGATTGGGAACGGCTCTCTCGGGAGCCTTCCCCGGGCGTCAAACAGTCAACTCAGAACTGGTACGGACAAGGGGAATCCGACTG
TTTAA
Protein sequenceShow/hide protein sequence
MPKHQTTRERVHELSVPGGGASSCTHAPSRCLNKTPAQVAPWCAEGRAFLSYYSQRPSATNISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVMHH
CPPRNPPSDSLRSLRTTSSRASEDFYVDPLNIVPKDDALDATPGQAGLPAEFKHINKRRKRSVRVVAWRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFG
NAAPIGREADGGRRCAPVGCGTVRAGPPIDLGHGPMRIETAAKAQSFVTLVETSSSRSWLAARPRGCFGICALLASACGLPIRPVLKHGPRSLTWFECEHACRDPKDGEL
CLSGAKPEETMVEARSDTDVQIVREFYRVKPMIRGIGGATPSTYSQTLNRYDGVAALLSRATESRAPSGPFLPAHPGNGSAGGRVQRLEEHRTSRATGRWNNVGKGSRQN
GSVTSGKGLALRAGHGVPSPEPVGYRWTARAASAARPGHRVPARGRIGNGSLGSLPRASNSQLRTGTDKGNPTV