; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007490 (gene) of Snake gourd v1 genome

Gene IDTan0007490
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionNuclear transcription factor Y subunit C-4, putative
Genome locationLG09:47417757..47418816
RNA-Seq ExpressionTan0007490
SyntenyTan0007490
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031439.1 uncharacterized protein E6C27_scaffold139G001960 [Cucumis melo var. makuwa]3.9e-7273.53Show/hide
Query:  LVILTLPCIMSILGRESG-FEFFAVSDMVDSEQLDLLFRDLGHEGLAINGHKALILSSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVF
        +VILT PCI+SILG+ESG  EFF+VSDMVDS +LDL FRDLGHEG + NGHK LILSSA  KGL Q+RVLDGDEHKL+IV+DSDFDRTGLFSDDSFDFV 
Subjt:  LVILTLPCIMSILGRESG-FEFFAVSDMVDSEQLDLLFRDLGHEGLAINGHKALILSSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVF

Query:  PSGLVDSDFVDRILRIGGIVAFPL-NKNDPSNHFQKKPNYRPVFLNRYSSIIVAMEKTAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIK
            +DSDF+DRIL+ GGIVAFPL N NDPSNHF+KKPNY+P+FLNRY+SIIVAMEKTA++D LV ASASRRRL + SL T  AALR LED      P K
Subjt:  PSGLVDSDFVDRILRIGGIVAFPL-NKNDPSNHFQKKPNYRPVFLNRYSSIIVAMEKTAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIK

Query:  TGRK
         GRK
Subjt:  TGRK

XP_008455527.1 PREDICTED: uncharacterized protein LOC103495679 [Cucumis melo]7.4e-8770.68Show/hide
Query:  MDFAPLNRAKSEGFDNGGFNSESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESG-FEFFAVSDMVDSEQLDLLFRDLGHEGLAINGHKALI
        MD A  NR  +  FDN  +NS++HLVI FP+ RIL VIS S FFA+VILT PCI+SILG+ESG  EFF+VSDMVDS +LDL FRDLGHEG + NGHK LI
Subjt:  MDFAPLNRAKSEGFDNGGFNSESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESG-FEFFAVSDMVDSEQLDLLFRDLGHEGLAINGHKALI

Query:  LSSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVFPSGLVDSDFVDRILRIGGIVAFPL-NKNDPSNHFQKKPNYRPVFLNRYSSIIVAM
        LSSA  KGL Q+RVLDGDEHKL+IV+DSDFDRTGLFSDDSFDFV     +DSDF+DRIL+ GGIVAFPL N NDPSNHF+KKPNY+P+FLNRY+SIIVAM
Subjt:  LSSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVFPSGLVDSDFVDRILRIGGIVAFPL-NKNDPSNHFQKKPNYRPVFLNRYSSIIVAM

Query:  EKTAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIKTGRK
        EKTA++D LV ASASRRRL + SL T  AALR LED      P K GRK
Subjt:  EKTAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIKTGRK

XP_011659719.1 uncharacterized protein LOC105436238 [Cucumis sativus]4.8e-8669.72Show/hide
Query:  MDFAPLNRAKSEGFDNGGFNSESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESG-FEFFAVSDMVDSEQLDLLFRDLGHEGLAINGHKALI
        MD A  NR  +  FDN  +NS++HLVI FP  +IL VIS S FFA+VILT PCI+SILG+E+G  EFF+V DMVDSE+LDL FRDLGHEG + NGHK LI
Subjt:  MDFAPLNRAKSEGFDNGGFNSESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESG-FEFFAVSDMVDSEQLDLLFRDLGHEGLAINGHKALI

Query:  LSSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVFPSGLVDSDFVDRILRIGGIVAFPL-NKNDPSNHFQKKPNYRPVFLNRYSSIIVAM
        LSSA   GL Q+RVLDGDEHKL+IV+DSDFDRTGLFSDDSFDFV   G +DSDF+DRIL+IGGIVAFPL N NDPS+HF+KKPNY+PVFLNRY+SIIVAM
Subjt:  LSSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVFPSGLVDSDFVDRILRIGGIVAFPL-NKNDPSNHFQKKPNYRPVFLNRYSSIIVAM

Query:  EKTAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIKTGRKNK
        EKT M+D+LV  SASRRRL + SL TR AALR LED      P + GRK K
Subjt:  EKTAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIKTGRKNK

XP_022141924.1 uncharacterized protein LOC111012177 [Momordica charantia]1.3e-9464.38Show/hide
Query:  MDFAPLNRAKSE----GFDN---GGFNSESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESGFEFFAVSDMVDSEQLDLLFRDLGHEGLAIN
        MDFA  NRAK++    GF N   G +NS++HLVIKFPDPRILHVISRSLF ALVILTLPCI+SILGRES  EF +VSD+VDS QLDLLFRD G+EG+ IN
Subjt:  MDFAPLNRAKSE----GFDN---GGFNSESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESGFEFFAVSDMVDSEQLDLLFRDLGHEGLAIN

Query:  GHKALILSSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVFPSGLVDSDFVDRILRIGGIVAFPLNKNDPSNHFQKKPNYRPVFLNRYSS
        G KA+ILSS    GLTQVRV+D DE KLDIVLDSDFD++GLFSDDSFDFVF  G VDSDF+DRIL+ GGI+AFP   + PSNHFQKKPNYRPVFL+RYSS
Subjt:  GHKALILSSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVFPSGLVDSDFVDRILRIGGIVAFPLNKNDPSNHFQKKPNYRPVFLNRYSS

Query:  IIVAMEKTAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIKTGRKNK---RRI--------------------FITVGLQKANKDMVQYFD
        IIVAMEKTAM D +V +SASRR L QFS  T KAA+RGLE+ +L   P K   K     R+I                    F+TVGL + N DM+QYFD
Subjt:  IIVAMEKTAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIKTGRKNK---RRI--------------------FITVGLQKANKDMVQYFD

Query:  ENYTQE
        +NY ++
Subjt:  ENYTQE

XP_038889013.1 uncharacterized protein LOC120078778 [Benincasa hispida]2.0e-10076.86Show/hide
Query:  MDFAPLNRAKSEGFDNGGFNSESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESGFEFFAVSDMVDSEQLDLLFRDLGHEGLAINGHKALIL
        MDF   NR  S+ FD   +NS++HLVIKFP+ +IL VIS SLFFA+ ILT P I+SILG+ESG EFF+VSDM+DSEQLDL FRDLGHEGL INGHKALIL
Subjt:  MDFAPLNRAKSEGFDNGGFNSESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESGFEFFAVSDMVDSEQLDLLFRDLGHEGLAINGHKALIL

Query:  SSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVFPSGLVDSDFVDRILRIGGIVAFPLNKNDPSNHFQKKPNYRPVFLNRYSSIIVAMEK
        SSA  KGL Q+RVLDGDEHKL+IV+DSDFDR+GLFSDDSFDFV  SGLVDSDF+DRIL+IGGIVAFPLN NDPSNHFQKKPNYRPVFLNRYSSIIV MEK
Subjt:  SSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVFPSGLVDSDFVDRILRIGGIVAFPLNKNDPSNHFQKKPNYRPVFLNRYSSIIVAMEK

Query:  TAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPP------PIKTGRKNK
        TAM+DQLV AS+SRRRL QFSL TR AALR LED LLEPP      P K GRK K
Subjt:  TAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPP------PIKTGRKNK

TrEMBL top hitse value%identityAlignment
A0A0A0K451 Uncharacterized protein2.3e-8669.72Show/hide
Query:  MDFAPLNRAKSEGFDNGGFNSESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESG-FEFFAVSDMVDSEQLDLLFRDLGHEGLAINGHKALI
        MD A  NR  +  FDN  +NS++HLVI FP  +IL VIS S FFA+VILT PCI+SILG+E+G  EFF+V DMVDSE+LDL FRDLGHEG + NGHK LI
Subjt:  MDFAPLNRAKSEGFDNGGFNSESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESG-FEFFAVSDMVDSEQLDLLFRDLGHEGLAINGHKALI

Query:  LSSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVFPSGLVDSDFVDRILRIGGIVAFPL-NKNDPSNHFQKKPNYRPVFLNRYSSIIVAM
        LSSA   GL Q+RVLDGDEHKL+IV+DSDFDRTGLFSDDSFDFV   G +DSDF+DRIL+IGGIVAFPL N NDPS+HF+KKPNY+PVFLNRY+SIIVAM
Subjt:  LSSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVFPSGLVDSDFVDRILRIGGIVAFPL-NKNDPSNHFQKKPNYRPVFLNRYSSIIVAM

Query:  EKTAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIKTGRKNK
        EKT M+D+LV  SASRRRL + SL TR AALR LED      P + GRK K
Subjt:  EKTAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIKTGRKNK

A0A1R3GZT9 Uncharacterized protein7.1e-5134Show/hide
Query:  LNRAKSEGFDNGGFNSESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESGFEFFAV-----SDMVDSEQLDLLFRDLGHEGLAINGHKALIL
        +N+++S G     FN ++ LVIK PDPR    +SRSLF A+VI+TLP + S+L   SG  F  +     S  +D E L+LL++D  +EGL   GHKALIL
Subjt:  LNRAKSEGFDNGGFNSESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESGFEFFAV-----SDMVDSEQLDLLFRDLGHEGLAINGHKALIL

Query:  SSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVFPSGLVDSDFVDRILRIGGIVAFPLNKNDPSNHFQKKPNYRPVFLNRYSSIIVAMEK
        SS A +G+     +   + ++D+VL+ D ++     D+ FDFVF SG +DS FVDR+++IGGI+A  L  ++ S+ FQK+ +YR V+L +Y+S IVAM K
Subjt:  SSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVFPSGLVDSDFVDRILRIGGIVAFPLNKNDPSNHFQKKPNYRPVFLNRYSSIIVAMEK

Query:  TAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIKTGRKN--------------------KRRIFITVGLQKANKDMVQYFDENY-------
           S+  +  S ++RRLCQ +++ +KAAL+GLED L EPP     + +                     RR+F+ VG  +    ++++FD+NY       
Subjt:  TAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIKTGRKN--------------------KRRIFITVGLQKANKDMVQYFDENY-------

Query:  -----------------------------TQERIKSLRSTKLTWNLIKSHRAASVEE-------------------DERGGYWECLSLYGSLKDEGIAEH
                                      +E +      +L  ++I+    + V+E                     +  YWECL+LYG L+DEG+A H
Subjt:  -----------------------------TQERIKSLRSTKLTWNLIKSHRAASVEE-------------------DERGGYWECLSLYGSLKDEGIAEH

A0A1S3C0P0 uncharacterized protein LOC1034956793.6e-8770.68Show/hide
Query:  MDFAPLNRAKSEGFDNGGFNSESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESG-FEFFAVSDMVDSEQLDLLFRDLGHEGLAINGHKALI
        MD A  NR  +  FDN  +NS++HLVI FP+ RIL VIS S FFA+VILT PCI+SILG+ESG  EFF+VSDMVDS +LDL FRDLGHEG + NGHK LI
Subjt:  MDFAPLNRAKSEGFDNGGFNSESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESG-FEFFAVSDMVDSEQLDLLFRDLGHEGLAINGHKALI

Query:  LSSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVFPSGLVDSDFVDRILRIGGIVAFPL-NKNDPSNHFQKKPNYRPVFLNRYSSIIVAM
        LSSA  KGL Q+RVLDGDEHKL+IV+DSDFDRTGLFSDDSFDFV     +DSDF+DRIL+ GGIVAFPL N NDPSNHF+KKPNY+P+FLNRY+SIIVAM
Subjt:  LSSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVFPSGLVDSDFVDRILRIGGIVAFPL-NKNDPSNHFQKKPNYRPVFLNRYSSIIVAM

Query:  EKTAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIKTGRK
        EKTA++D LV ASASRRRL + SL T  AALR LED      P K GRK
Subjt:  EKTAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIKTGRK

A0A5A7SQ50 Uncharacterized protein1.9e-7273.53Show/hide
Query:  LVILTLPCIMSILGRESG-FEFFAVSDMVDSEQLDLLFRDLGHEGLAINGHKALILSSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVF
        +VILT PCI+SILG+ESG  EFF+VSDMVDS +LDL FRDLGHEG + NGHK LILSSA  KGL Q+RVLDGDEHKL+IV+DSDFDRTGLFSDDSFDFV 
Subjt:  LVILTLPCIMSILGRESG-FEFFAVSDMVDSEQLDLLFRDLGHEGLAINGHKALILSSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVF

Query:  PSGLVDSDFVDRILRIGGIVAFPL-NKNDPSNHFQKKPNYRPVFLNRYSSIIVAMEKTAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIK
            +DSDF+DRIL+ GGIVAFPL N NDPSNHF+KKPNY+P+FLNRY+SIIVAMEKTA++D LV ASASRRRL + SL T  AALR LED      P K
Subjt:  PSGLVDSDFVDRILRIGGIVAFPL-NKNDPSNHFQKKPNYRPVFLNRYSSIIVAMEKTAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIK

Query:  TGRK
         GRK
Subjt:  TGRK

A0A6J1CK51 uncharacterized protein LOC1110121776.1e-9564.38Show/hide
Query:  MDFAPLNRAKSE----GFDN---GGFNSESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESGFEFFAVSDMVDSEQLDLLFRDLGHEGLAIN
        MDFA  NRAK++    GF N   G +NS++HLVIKFPDPRILHVISRSLF ALVILTLPCI+SILGRES  EF +VSD+VDS QLDLLFRD G+EG+ IN
Subjt:  MDFAPLNRAKSE----GFDN---GGFNSESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESGFEFFAVSDMVDSEQLDLLFRDLGHEGLAIN

Query:  GHKALILSSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVFPSGLVDSDFVDRILRIGGIVAFPLNKNDPSNHFQKKPNYRPVFLNRYSS
        G KA+ILSS    GLTQVRV+D DE KLDIVLDSDFD++GLFSDDSFDFVF  G VDSDF+DRIL+ GGI+AFP   + PSNHFQKKPNYRPVFL+RYSS
Subjt:  GHKALILSSAAAKGLTQVRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVFPSGLVDSDFVDRILRIGGIVAFPLNKNDPSNHFQKKPNYRPVFLNRYSS

Query:  IIVAMEKTAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIKTGRKNK---RRI--------------------FITVGLQKANKDMVQYFD
        IIVAMEKTAM D +V +SASRR L QFS  T KAA+RGLE+ +L   P K   K     R+I                    F+TVGL + N DM+QYFD
Subjt:  IIVAMEKTAMSDQLVNASASRRRLCQFSLQTRKAALRGLEDALLEPPPIKTGRKNK---RRI--------------------FITVGLQKANKDMVQYFD

Query:  ENYTQE
        +NY ++
Subjt:  ENYTQE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G58120.1 BEST Arabidopsis thaliana protein match is: methyltransferases (TAIR:AT5G01710.1)1.2e-2329.82Show/hide
Query:  SESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESGFEFFAVSDMVDS---EQLDLLFRDLGHEGLAINGHKALILSSAAAKGLTQVRVLDGD
        S +   +K     +L +  RS   AL+ L+    +S+L  + G    A S  V+S   E L LL  DL  +GL   G KAL LS    +           
Subjt:  SESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESGFEFFAVSDMVDS---EQLDLLFRDLGHEGLAINGHKALILSSAAAKGLTQVRVLDGD

Query:  EHKLDIVLDSDFDRTGLFSDDSFDFVFP-SGLVDS-DFVDRILRIGGIVAFPLNKNDPSNHFQKKPNYRPVFLNRYSSIIVAMEKTAMSDQLVNASASRR
        E  + +V  SD +   +  D++FDF F  S  +DS +F+DR L++GGI    LN  D   +F K PNY  V++      ++ M KT  ++Q  +  A+ R
Subjt:  EHKLDIVLDSDFDRTGLFSDDSFDFVFP-SGLVDS-DFVDRILRIGGIVAFPLNKNDPSNHFQKKPNYRPVFLNRYSSIIVAMEKTAMSDQLVNASASRR

Query:  RLCQFSLQ-TRKAALRGLEDALLEPPPIKTGRKN----------------------KRRIFITVGLQKANKDMVQYFDENYTQERIK-------------
        +L   + +  R+ ALR LED LLEPP   + +                         RR+FI VG  K +  M ++F ENY     K             
Subjt:  RLCQFSLQ-TRKAALRGLEDALLEPPPIKTGRKN----------------------KRRIFITVGLQKANKDMVQYFDENYTQERIK-------------

Query:  SLRSTKL---TW---------NLIKSHRAASVEE----------DE------------RG---------GYWECLSLYGSLKDEGIAEH
        SL S K+    W          ++    A  VEE          DE            RG          YWECL+LYG L+DEG+A H
Subjt:  SLRSTKL---TW---------NLIKSHRAASVEE----------DE------------RG---------GYWECLSLYGSLKDEGIAEH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTTGCTCCTCTTAATCGAGCCAAGAGCGAAGGATTCGACAATGGCGGCTTTAATTCTGAGTCCCATTTGGTTATTAAGTTTCCTGATCCACGAATTCTTCATGT
GATTTCTCGTTCGTTGTTTTTCGCTTTGGTTATTCTCACGTTGCCTTGTATTATGTCCATTCTTGGGCGAGAAAGTGGGTTTGAGTTTTTTGCTGTGTCTGATATGGTTG
ATTCTGAGCAATTGGATTTGCTTTTTCGTGATTTGGGTCACGAAGGCTTGGCCATTAACGGCCATAAGGCTCTCATTTTGAGCTCTGCTGCAGCTAAGGGCCTGACTCAG
GTTCGTGTGTTGGATGGTGATGAACACAAACTTGATATTGTTCTGGATTCTGATTTTGATCGAACTGGGTTGTTTTCTGATGATTCTTTTGATTTTGTGTTTCCTTCGGG
CCTTGTGGACTCTGATTTCGTGGATAGAATTCTGAGAATCGGTGGCATTGTGGCGTTTCCACTTAATAAAAATGACCCATCAAATCATTTTCAAAAGAAACCAAATTACA
GGCCTGTGTTTCTCAACAGATACAGCTCCATTATTGTGGCAATGGAGAAGACAGCCATGTCTGATCAGCTGGTTAATGCTTCAGCTTCAAGAAGACGCCTCTGTCAATTC
TCATTGCAAACTAGAAAAGCAGCTTTGAGAGGGCTTGAGGATGCTCTACTTGAGCCACCCCCAATTAAGACCGGGAGGAAAAACAAGCGAAGGATCTTCATCACAGTCGG
CCTGCAAAAAGCGAATAAAGACATGGTTCAATACTTTGATGAGAACTACACCCAAGAAAGGATCAAGAGTTTGAGGTCCACAAAATTGACTTGGAACCTGATCAAGAGTC
ATCGAGCAGCGTCGGTTGAGGAAGACGAACGCGGAGGTTATTGGGAATGCTTGTCTTTGTATGGAAGTTTGAAAGATGAGGGAATTGCAGAGCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTTGCTCCTCTTAATCGAGCCAAGAGCGAAGGATTCGACAATGGCGGCTTTAATTCTGAGTCCCATTTGGTTATTAAGTTTCCTGATCCACGAATTCTTCATGT
GATTTCTCGTTCGTTGTTTTTCGCTTTGGTTATTCTCACGTTGCCTTGTATTATGTCCATTCTTGGGCGAGAAAGTGGGTTTGAGTTTTTTGCTGTGTCTGATATGGTTG
ATTCTGAGCAATTGGATTTGCTTTTTCGTGATTTGGGTCACGAAGGCTTGGCCATTAACGGCCATAAGGCTCTCATTTTGAGCTCTGCTGCAGCTAAGGGCCTGACTCAG
GTTCGTGTGTTGGATGGTGATGAACACAAACTTGATATTGTTCTGGATTCTGATTTTGATCGAACTGGGTTGTTTTCTGATGATTCTTTTGATTTTGTGTTTCCTTCGGG
CCTTGTGGACTCTGATTTCGTGGATAGAATTCTGAGAATCGGTGGCATTGTGGCGTTTCCACTTAATAAAAATGACCCATCAAATCATTTTCAAAAGAAACCAAATTACA
GGCCTGTGTTTCTCAACAGATACAGCTCCATTATTGTGGCAATGGAGAAGACAGCCATGTCTGATCAGCTGGTTAATGCTTCAGCTTCAAGAAGACGCCTCTGTCAATTC
TCATTGCAAACTAGAAAAGCAGCTTTGAGAGGGCTTGAGGATGCTCTACTTGAGCCACCCCCAATTAAGACCGGGAGGAAAAACAAGCGAAGGATCTTCATCACAGTCGG
CCTGCAAAAAGCGAATAAAGACATGGTTCAATACTTTGATGAGAACTACACCCAAGAAAGGATCAAGAGTTTGAGGTCCACAAAATTGACTTGGAACCTGATCAAGAGTC
ATCGAGCAGCGTCGGTTGAGGAAGACGAACGCGGAGGTTATTGGGAATGCTTGTCTTTGTATGGAAGTTTGAAAGATGAGGGAATTGCAGAGCATTAA
Protein sequenceShow/hide protein sequence
MDFAPLNRAKSEGFDNGGFNSESHLVIKFPDPRILHVISRSLFFALVILTLPCIMSILGRESGFEFFAVSDMVDSEQLDLLFRDLGHEGLAINGHKALILSSAAAKGLTQ
VRVLDGDEHKLDIVLDSDFDRTGLFSDDSFDFVFPSGLVDSDFVDRILRIGGIVAFPLNKNDPSNHFQKKPNYRPVFLNRYSSIIVAMEKTAMSDQLVNASASRRRLCQF
SLQTRKAALRGLEDALLEPPPIKTGRKNKRRIFITVGLQKANKDMVQYFDENYTQERIKSLRSTKLTWNLIKSHRAASVEEDERGGYWECLSLYGSLKDEGIAEH