; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018982 (gene) of Snake gourd v1 genome

Gene IDTan0018982
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1685)
Genome locationLG02:1632355..1633351
RNA-Seq ExpressionTan0018982
SyntenyTan0018982
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572385.1 hypothetical protein SDJN03_29113, partial [Cucurbita argyrosperma subsp. sororia]6.7e-9079.4Show/hide
Query:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLLSTNFPPPVTVHFS-NQKLQTILSGKVTEFSGDGEGKPAEIPAKKK
        MAAEEILPLFDL WFQ AVF GKPL  TR E+PENR  SPVMQV+KVRSQSEY LS+NF PP T ++S NQKLQ ILSGKVTEFSG GEGK    PAKKK
Subjt:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLLSTNFPPPVTVHFS-NQKLQTILSGKVTEFSGDGEGKPAEIPAKKK

Query:  QEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKR
         EGSE +RR+KRGRGLSKSLSDLEFEELKGFMDLGFVF EEDK NS+LASIIPGLQRLG+KT E+E       GIE+GV  SRPYLSEAW+AVEEE EKR
Subjt:  QEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKR

Query:  VLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR
         LMKWRVP LGATEMDMKDHLKFWAHTVASTVR
Subjt:  VLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR

XP_022952129.1 uncharacterized protein LOC111454895 [Cucurbita moschata]5.1e-9079.4Show/hide
Query:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLLSTNFPPPVTVHFS-NQKLQTILSGKVTEFSGDGEGKPAEIPAKKK
        MAAEEILPLFDL WFQ AVF GKPL  TR E+PENR  SPVMQV+KVRSQSEY LS+NF PP T ++S NQKLQ ILSGKVTEFSG GEGK    PAKKK
Subjt:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLLSTNFPPPVTVHFS-NQKLQTILSGKVTEFSGDGEGKPAEIPAKKK

Query:  QEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKR
         EGSE +RR+KRGRGLSKSLSDLEFEELKGFMDLGFVF EEDK NS+LASIIPGLQRLG+KT E+E       GIE+GV  SRPYLSEAW+AVEEE EKR
Subjt:  QEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKR

Query:  VLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR
         LMKWRVP LGATEMDMKDHLKFWAHTVASTVR
Subjt:  VLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR

XP_022969253.1 uncharacterized protein LOC111468311 [Cucurbita maxima]3.7e-8878.54Show/hide
Query:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLLSTNFPPPVTVHFS-NQKLQTILSGKVTEFSGDGEGKPAEIPAKKK
        MAAEEIL LFDL WFQ AVF G PL  TR E+PENR  SPVMQV+KVRSQSEY LS+NF PP T ++S NQKLQ ILSGKVTEFSG G GK    PAKKK
Subjt:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLLSTNFPPPVTVHFS-NQKLQTILSGKVTEFSGDGEGKPAEIPAKKK

Query:  QEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKR
         EGSE +RR+KRGRGLSKSLSDLEFEELKGFMDLGFVF EEDK NS+LASIIPGLQRLGE+T E EE    ++GIE+GV  SRPYLSEAW+AVEEE EKR
Subjt:  QEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKR

Query:  VLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR
         LMKWRVP LGATEMDMKDHLKFWAHTVASTVR
Subjt:  VLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR

XP_023554329.1 uncharacterized protein LOC111811624 [Cucurbita pepo subsp. pepo]1.9e-8978.97Show/hide
Query:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLLSTNFPPPVTVHFS-NQKLQTILSGKVTEFSGDGEGKPAEIPAKKK
        MAAEEILPLFDL WFQ AVF GKPL  TR E+PENR  SPVMQV+KVRSQSEY LS+NF PP TV++S NQKLQ ILSGKVTEF+G+ EGK    PAKKK
Subjt:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLLSTNFPPPVTVHFS-NQKLQTILSGKVTEFSGDGEGKPAEIPAKKK

Query:  QEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKR
         EGSE +RR+KRGRGLSKSLSDLEFEELKGFMDLGFVF EEDK NS+LASIIPGLQRLG+KT E+E       GIE+GV  SRPYLSEAW+AVEEE EKR
Subjt:  QEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKR

Query:  VLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR
         LMKWRVP LGATEMDMKDHLKFWAHTVASTVR
Subjt:  VLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR

XP_038887878.1 uncharacterized protein LOC120077867 [Benincasa hispida]1.3e-9680.42Show/hide
Query:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLLST-NFPPPVTVHFS-------NQKLQTILSGKVTEFSGDGEGKPA
        MAAEEILPLFDL WFQ A+F GKPL QT   APE RF SPV QV+K RSQSEYLLS+ +FPPP T  +S       +QKLQTILSGKV EF+G+GEGKPA
Subjt:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLLST-NFPPPVTVHFS-------NQKLQTILSGKVTEFSGDGEGKPA

Query:  EIPAKKKQEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAV
        EIPAKKK EG+ENKRRKKRG+GLSKSLSDLEFEELKGFMDLGFVF EEDKN+SNLASIIPGLQRLG+KTGE EE+K     IENGV  SRPYLSEAWEAV
Subjt:  EIPAKKKQEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAV

Query:  EEENEKRVLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR
        EEENEKR+LMKWRVPGLGATEMDMKDHLKFWAHTVASTVR
Subjt:  EEENEKRVLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR

TrEMBL top hitse value%identityAlignment
A0A0A0K3T7 Uncharacterized protein5.5e-7470.09Show/hide
Query:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLL-STNFPPPVTVHFSNQKLQTILSGKVTEFSGDGEGKPAEIPAKKK
        MAAEEILPLFDL WFQ A+F+ K   +T        F SPV QV+K+RSQSEYLL S +FPPP T   SNQKL+TILSGKVTEF G+ EG+ A    KKK
Subjt:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLL-STNFPPPVTVHFSNQKLQTILSGKVTEFSGDGEGKPAEIPAKKK

Query:  QEGSENK-RRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEK
         EG+E+K RRKK+G+GLSKSLSDLEFEELKGFMDLGFVF EEDKN+SNL SIIPGL RLG K  + EEK+      ENGV + RPYLSEAW+A+EEENEK
Subjt:  QEGSENK-RRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEK

Query:  RVLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR
         +LMKWRVP LGATEMD+K HLKFWAHTVASTVR
Subjt:  RVLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR

A0A1S3C1U3 uncharacterized protein LOC1034954958.0e-7370.51Show/hide
Query:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLL-STNFPPPVTVHFSNQKLQTILSGKVTEFSGDGEGKPAEIPAKKK
        MAAEEILPLFDL WFQ A+F  KPL +T  +       SPVM   K+RSQSEYLL S +FPPPVT   SNQKL+T+LSG+VTEF G GEGK A     KK
Subjt:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLL-STNFPPPVTVHFSNQKLQTILSGKVTEFSGDGEGKPAEIPAKKK

Query:  QEGSENK-RRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEK
         EG+ENK RRKK+ +GLSKSLSDLEFEELKGFMDLGFVF EEDKN+SNL SIIPGL RLG +    EEK+      ENGV + RPYLSEAWEA+EEENEK
Subjt:  QEGSENK-RRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEK

Query:  RVLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR
         VLMKWRVP LGATEMD+K HLKFWAHTVASTVR
Subjt:  RVLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR

A0A6J1D2N3 uncharacterized protein LOC1110167781.6e-8171.02Show/hide
Query:  MAAEEILPLFDLLWFQHAVFAGKPLSQTR-----PEAPENRFHSPVMQVIKVRSQSEYLLSTNFPPPVTVHFS------NQKLQTILSGKVTEFSGDGEG
        MA+EEIL LFD  WFQH VFAGKPL +T+       APEN   SP+MQVI+ RSQSEYLL ++FP P T ++S      N+KLQTILSG+VTEFSG+  G
Subjt:  MAAEEILPLFDLLWFQHAVFAGKPLSQTR-----PEAPENRFHSPVMQVIKVRSQSEYLLSTNFPPPVTVHFS------NQKLQTILSGKVTEFSGDGEG

Query:  KPAEIPAKKKQEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIE-NGVSISRPYLSEA
        K    PAKKK  G+E K RK+RGRGLSKSLSDLEFEELKGFMDLGFVF EEDK NS+LASIIPGLQRLG KTGE EE+KE K+  + N + +SRPYLSEA
Subjt:  KPAEIPAKKKQEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIE-NGVSISRPYLSEA

Query:  WEAVEEENEKRVLMKWRVPGLG-ATEMDMKDHLKFWAHTVASTVR
        WEA +EENEKR+LMKWRVP LG ATEMDMKDHLKFWAHTVASTVR
Subjt:  WEAVEEENEKRVLMKWRVPGLG-ATEMDMKDHLKFWAHTVASTVR

A0A6J1GKQ8 uncharacterized protein LOC1114548952.5e-9079.4Show/hide
Query:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLLSTNFPPPVTVHFS-NQKLQTILSGKVTEFSGDGEGKPAEIPAKKK
        MAAEEILPLFDL WFQ AVF GKPL  TR E+PENR  SPVMQV+KVRSQSEY LS+NF PP T ++S NQKLQ ILSGKVTEFSG GEGK    PAKKK
Subjt:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLLSTNFPPPVTVHFS-NQKLQTILSGKVTEFSGDGEGKPAEIPAKKK

Query:  QEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKR
         EGSE +RR+KRGRGLSKSLSDLEFEELKGFMDLGFVF EEDK NS+LASIIPGLQRLG+KT E+E       GIE+GV  SRPYLSEAW+AVEEE EKR
Subjt:  QEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKR

Query:  VLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR
         LMKWRVP LGATEMDMKDHLKFWAHTVASTVR
Subjt:  VLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR

A0A6J1I0F9 uncharacterized protein LOC1114683111.8e-8878.54Show/hide
Query:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLLSTNFPPPVTVHFS-NQKLQTILSGKVTEFSGDGEGKPAEIPAKKK
        MAAEEIL LFDL WFQ AVF G PL  TR E+PENR  SPVMQV+KVRSQSEY LS+NF PP T ++S NQKLQ ILSGKVTEFSG G GK    PAKKK
Subjt:  MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLLSTNFPPPVTVHFS-NQKLQTILSGKVTEFSGDGEGKPAEIPAKKK

Query:  QEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKR
         EGSE +RR+KRGRGLSKSLSDLEFEELKGFMDLGFVF EEDK NS+LASIIPGLQRLGE+T E EE    ++GIE+GV  SRPYLSEAW+AVEEE EKR
Subjt:  QEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKR

Query:  VLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR
         LMKWRVP LGATEMDMKDHLKFWAHTVASTVR
Subjt:  VLMKWRVPGLGATEMDMKDHLKFWAHTVASTVR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G31560.1 Protein of unknown function (DUF1685)1.9e-0530.83Show/hide
Query:  SKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQ---RLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKRVLMKWRVPGLGAT
        +KSL+D + EELKG +DLGF F  ++     L + +P L+    + +K  + +++   K   E+    S P  + A            +  W++   G  
Subjt:  SKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQ---RLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKRVLMKWRVPGLGAT

Query:  EMDMKDHLKFWAHTVASTVR
          D+K  LK+WA TVA TVR
Subjt:  EMDMKDHLKFWAHTVASTVR

AT2G31560.2 Protein of unknown function (DUF1685)1.9e-0530.83Show/hide
Query:  SKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQ---RLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKRVLMKWRVPGLGAT
        +KSL+D + EELKG +DLGF F  ++     L + +P L+    + +K  + +++   K   E+    S P  + A            +  W++   G  
Subjt:  SKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQ---RLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKRVLMKWRVPGLGAT

Query:  EMDMKDHLKFWAHTVASTVR
          D+K  LK+WA TVA TVR
Subjt:  EMDMKDHLKFWAHTVASTVR

AT2G42760.1 unknown protein4.4e-3136.67Show/hide
Query:  MAAEEILPLFDLLWFQHAVFA-------GKP---------LSQTRPEAPENRFHSPVMQVIKVRSQSEYLLSTN-------------FPPP---VTVHFS
        MA EE+L LF+  W +  +F        GK          L + R E     F  PV  +++     E +++T+             F  P   + V  +
Subjt:  MAAEEILPLFDLLWFQHAVFA-------GKP---------LSQTRPEAPENRFHSPVMQVIKVRSQSEYLLSTN-------------FPPP---VTVHFS

Query:  NQKLQTILSGKVTEFSGDGEGKPAEIPAKKKQEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTG--EKEE
          KLQTILSGK  E +     +   + ++K+++  + K++        KS+SDLE+EELKGFMDLGFVF E+D  +S+L SI+PGLQRL +K     KEE
Subjt:  NQKLQTILSGKVTEFSGDGEGKPAEIPAKKKQEGSENKRRKKRGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTG--EKEE

Query:  KKEGKKGIENGVSISRPYLSEAWEAVEEENEKRVL---MKWRVPG-LGATEMDMKDHLKFWAHTVASTVR
        ++E ++    G   +RPYLSEAW+       K+ +   +KWRVP    A+E+D+KD+L+ WAH VAST+R
Subjt:  KKEGKKGIENGVSISRPYLSEAWEAVEEENEKRVL---MKWRVPG-LGATEMDMKDHLKFWAHTVASTVR

AT2G43340.1 Protein of unknown function (DUF1685)1.9e-0530Show/hide
Query:  SKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQ---RLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKRVLMKWRVPGLGAT
        +KSL+D + EELKG +DLGF F  E+     L + +P L+    + +K  +++         E   S+    +S              +  W++   G  
Subjt:  SKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQ---RLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKRVLMKWRVPGLGAT

Query:  EMDMKDHLKFWAHTVASTVR
          D+K  LKFWA  VA TVR
Subjt:  EMDMKDHLKFWAHTVASTVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCTGAAGAAATCCTTCCCCTTTTCGATCTACTCTGGTTTCAGCACGCAGTTTTTGCCGGAAAACCGCTTTCACAGACACGTCCCGAGGCGCCGGAAAACCGCTT
TCACAGCCCCGTAATGCAAGTGATTAAGGTGAGATCTCAAAGCGAGTATCTTCTCAGCACGAATTTCCCACCTCCCGTAACCGTTCATTTCTCCAATCAAAAGCTTCAAA
CCATTCTTTCCGGTAAGGTAACGGAGTTTTCCGGCGACGGAGAGGGGAAACCGGCGGAAATTCCGGCGAAGAAGAAACAGGAAGGGAGTGAAAATAAAAGAAGAAAAAAA
AGAGGCAGAGGGTTGAGTAAGAGCTTATCGGACCTTGAATTTGAAGAGTTGAAGGGATTTATGGATTTGGGATTTGTGTTTTGTGAGGAAGATAAAAATAATTCGAATTT
GGCTTCGATAATTCCAGGGTTGCAGAGATTGGGGGAAAAAACAGGGGAAAAAGAGGAAAAAAAAGAGGGAAAAAAAGGGATTGAAAATGGAGTTTCAATTTCAAGGCCAT
ATTTGTCTGAAGCTTGGGAAGCTGTTGAAGAAGAAAATGAAAAAAGGGTTTTGATGAAATGGAGAGTTCCAGGTTTGGGAGCAACTGAAATGGACATGAAAGATCATCTC
AAGTTCTGGGCTCATACAGTTGCTTCAACTGTGAGATAA
mRNA sequenceShow/hide mRNA sequence
AAACTGTTTAAACCCTCATTTTCCTCCTCAGAATTTTCCCAGCAGCAATGGCAGCTGAAGAAATCCTTCCCCTTTTCGATCTACTCTGGTTTCAGCACGCAGTTTTTGCC
GGAAAACCGCTTTCACAGACACGTCCCGAGGCGCCGGAAAACCGCTTTCACAGCCCCGTAATGCAAGTGATTAAGGTGAGATCTCAAAGCGAGTATCTTCTCAGCACGAA
TTTCCCACCTCCCGTAACCGTTCATTTCTCCAATCAAAAGCTTCAAACCATTCTTTCCGGTAAGGTAACGGAGTTTTCCGGCGACGGAGAGGGGAAACCGGCGGAAATTC
CGGCGAAGAAGAAACAGGAAGGGAGTGAAAATAAAAGAAGAAAAAAAAGAGGCAGAGGGTTGAGTAAGAGCTTATCGGACCTTGAATTTGAAGAGTTGAAGGGATTTATG
GATTTGGGATTTGTGTTTTGTGAGGAAGATAAAAATAATTCGAATTTGGCTTCGATAATTCCAGGGTTGCAGAGATTGGGGGAAAAAACAGGGGAAAAAGAGGAAAAAAA
AGAGGGAAAAAAAGGGATTGAAAATGGAGTTTCAATTTCAAGGCCATATTTGTCTGAAGCTTGGGAAGCTGTTGAAGAAGAAAATGAAAAAAGGGTTTTGATGAAATGGA
GAGTTCCAGGTTTGGGAGCAACTGAAATGGACATGAAAGATCATCTCAAGTTCTGGGCTCATACAGTTGCTTCAACTGTGAGATAACATTTTCTTTGTACATTTTATGGT
AACTTTTTTTTTTTAAAACTATTTTGGAGTGAAATTTTGTGGGTTTTGTTTGTTGTGTTTGGAAATTTAAATGTAGATAGATAGATGAAGAAATCTTTCTCTTTCTCTCT
TTTAATTATTGATGTTCAGTTGTTCTGTTGTGTTCTTTGCAGTAGAAACTAAATGAAAATTTGCTTAGTTTCAAACTGAAGGATTTTTTTTAACAAAAATAAATAAATAA
ACTGAAG
Protein sequenceShow/hide protein sequence
MAAEEILPLFDLLWFQHAVFAGKPLSQTRPEAPENRFHSPVMQVIKVRSQSEYLLSTNFPPPVTVHFSNQKLQTILSGKVTEFSGDGEGKPAEIPAKKKQEGSENKRRKK
RGRGLSKSLSDLEFEELKGFMDLGFVFCEEDKNNSNLASIIPGLQRLGEKTGEKEEKKEGKKGIENGVSISRPYLSEAWEAVEEENEKRVLMKWRVPGLGATEMDMKDHL
KFWAHTVASTVR