; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G01757 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G01757
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionReverse transcriptase
Genome locationClcChr04:5717241..5721675
RNA-Seq ExpressionClc04G01757
SyntenyClc04G01757
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_019150696.1 PREDICTED: uncharacterized protein LOC109147552 [Ipomoea nil]4.8e-5043.84Show/hide
Query:  RKEARERRTAEERFSAEGTIEELLEEEETPMEKPVERILWDLSAPNIDRQPLCITYP--------------------AMKGEDPHKHLKKFIIVCEGMRP
        RKE ++R ++        T + + EE  + M +P  R L +L+APN++ QPLCIT P                     + GEDPHKHLK+F +VC GM+P
Subjt:  RKEARERRTAEERFSAEGTIEELLEEEETPMEKPVERILWDLSAPNIDRQPLCITYP--------------------AMKGEDPHKHLKKFIIVCEGMRP

Query:  HGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQ-------------------------------TYFY
         GVT+E I+LRAFPFSLKD AKDWLY LPP SV TW ++++ FL KFF ASRAT IRKE YGI Q                                YFY
Subjt:  HGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQ-------------------------------TYFY

Query:  EGLIPSDRSTIDATSVVTLVNKTPTEARALISTMAENAQQFGTRALCNVFAAKQQPQEPVAEVSYAGNQHVDECLTT-----TVGAVKGTRA
        EGL+P+DRS +DA S  +L +KTP  AR LISTMAEN+QQ+GTRA  ++          V EVS + +++    LTT      VG +K T A
Subjt:  EGLIPSDRSTIDATSVVTLVNKTPTEARALISTMAENAQQFGTRALCNVFAAKQQPQEPVAEVSYAGNQHVDECLTT-----TVGAVKGTRA

XP_019180076.1 PREDICTED: uncharacterized protein LOC109175288 [Ipomoea nil]4.0e-4946.72Show/hide
Query:  RKEARERRTAEERFSAEGTIEELLEEEETPMEKPVERILWDLSAPNIDRQPLCITYP--------------------AMKGEDPHKHLKKFIIVCEGMRP
        RKE ++R ++        T + + EE  + M +P  R L +L+AP++++QPLCIT P                     + GEDPHKHLK+F +VC GM+P
Subjt:  RKEARERRTAEERFSAEGTIEELLEEEETPMEKPVERILWDLSAPNIDRQPLCITYP--------------------AMKGEDPHKHLKKFIIVCEGMRP

Query:  HGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQ-------------------------------TYFY
         GVT+E I+LRAFPFSLKD AKDWLY L P SV TW ++++ FL KFF ASRAT IRKE YGI Q                                YFY
Subjt:  HGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQ-------------------------------TYFY

Query:  EGLIPSDRSTIDATSVVTLVNKTPTEARALISTMAENAQQFGTR
        EGL+P+DRS +DA S  +L +KTPT+AR LISTMAEN+QQ+GTR
Subjt:  EGLIPSDRSTIDATSVVTLVNKTPTEARALISTMAENAQQFGTR

XP_019198426.1 PREDICTED: uncharacterized protein LOC109192308 [Ipomoea nil]3.1e-4951.87Show/hide
Query:  RKEARERRTAEERFSAEGTIEELLEEEETPMEKPVERILWDLSAPNIDRQPLCITYP--------------------AMKGEDPHKHLKKFIIVCEGMRP
        RKE ++R ++        T + + EE  + M +P  R L +L+APN+++QPLCIT P                     + GEDPHKHLK+F +VC GM+P
Subjt:  RKEARERRTAEERFSAEGTIEELLEEEETPMEKPVERILWDLSAPNIDRQPLCITYP--------------------AMKGEDPHKHLKKFIIVCEGMRP

Query:  HGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQTYFYEGLIPSDRSTIDATSVVTLVNKTPTEARALI
         GVT+E I+LRAFPFSLKD AKDWLY LPP SV TW ++++ FL KFF ASRAT IRKE YGI Q      L+P+DRS +DA S  +L +KT T+AR LI
Subjt:  HGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQTYFYEGLIPSDRSTIDATSVVTLVNKTPTEARALI

Query:  STMAENAQQFGTRA
        STMAEN+QQ+GTRA
Subjt:  STMAENAQQFGTRA

XP_024041424.1 uncharacterized protein LOC112098931 [Citrus clementina]7.6e-4842.07Show/hide
Query:  LRRKEAR-ERRTAEERFSAEGTIEELLE----EEETPMEK--PVERILWDLSAPNIDRQPLCITYPAMK--------------------GEDPHKHLKKF
        LR++  R ++R++    S   T+ +L+E     EE  M++  PVER L +L+ P++++QPLCI Y  ++                    GEDPHKHLK+F
Subjt:  LRRKEAR-ERRTAEERFSAEGTIEELLE----EEETPMEK--PVERILWDLSAPNIDRQPLCITYPAMK--------------------GEDPHKHLKKF

Query:  IIVCEGMRPHGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQ--------------------------
         +VC  MRP GVTEEQI LRAFPFS+   AKDWLY+LPPGS+TTW  L++QFL K+F ASRA  IRK+  GI Q                          
Subjt:  IIVCEGMRPHGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQ--------------------------

Query:  -----TYFYEGLIPSDRSTIDATSVVTLVNKTPTEARALISTMAENAQQFGTRALCNVFAAKQQPQEPVAEVSYAGNQHVDECLTTTVGAVKGTRARLAD
              YFYEGL   DRS IDA S   LVNKTPT+AR LIS MA NAQQFG+R                     A ++ V+E  T ++  ++   ++LA 
Subjt:  -----TYFYEGLIPSDRSTIDATSVVTLVNKTPTEARALISTMAENAQQFGTRALCNVFAAKQQPQEPVAEVSYAGNQHVDECLTTTVGAVKGTRARLAD

Query:  AVSRLDTQ-QKNLPSQPTPN-VQNVSAI
         VSRL++Q    LPSQ   N  QNVSA+
Subjt:  AVSRLDTQ-QKNLPSQPTPN-VQNVSAI

XP_031131881.1 uncharacterized protein LOC116033267 [Ipomoea triloba]1.5e-4846.12Show/hide
Query:  RKEARERRTAEERFSAEGTIEELLEEEETPMEKPVERILWDLSAPNIDRQPLCITYP--------------------AMKGEDPHKHLKKFIIVCEGMRP
        RKE ++R ++        T + + EE  + M +   R L +L+ P++++QPLCIT P                     + GEDPHKHLK+F +VC GM+P
Subjt:  RKEARERRTAEERFSAEGTIEELLEEEETPMEKPVERILWDLSAPNIDRQPLCITYP--------------------AMKGEDPHKHLKKFIIVCEGMRP

Query:  HGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQ-------------------------------TYFY
         GVT+E I LRAFPFSLKD AKDWLY +PP SVTTW ++++ FL KFF AS+AT IRKE YGI Q                                YFY
Subjt:  HGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQ-------------------------------TYFY

Query:  EGLIPSDRSTIDATSVVTLVNKTPTEARALISTMAENAQQFGTRA
        EGL+P DRS +DA S  +LV+KTPT+AR LISTMA+N+QQ+GTRA
Subjt:  EGLIPSDRSTIDATSVVTLVNKTPTEARALISTMAENAQQFGTRA

TrEMBL top hitse value%identityAlignment
A0A6P6SET8 uncharacterized protein LOC1136905522.6e-4643.11Show/hide
Query:  RKEARERRTAEERFSAEGTIEELL----EEEETPMEKPVERILWDLSAPNIDRQPLCITYPAMK--------------------GEDPHKHLKKFIIVCE
        RK  RER  A    +   ++ + +    E EE       ER L +L+AP++++QPLCITYP ++                    GEDPHKHLK+F +VC 
Subjt:  RKEARERRTAEERFSAEGTIEELL----EEEETPMEKPVERILWDLSAPNIDRQPLCITYPAMK--------------------GEDPHKHLKKFIIVCE

Query:  GMRPHGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQ-------------------------------
         M+P GVTEEQI LRAFPFSL D AKDWLY+LP GS++TW ++++ FL KFF ASRA  IRK+  GI Q                               
Subjt:  GMRPHGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQ-------------------------------

Query:  TYFYEGLIPSDRSTIDATSVVTLVNKTPTEARALISTMAENAQQFGTRALCNVFAAKQQPQEPVAEVSYAGNQHVDECLTTTV
         YFYEGL  +DR  IDA S  +LVNKTPTEAR LIS+MA NAQQFG R        +      V EVS +  +   +CLT+ V
Subjt:  TYFYEGLIPSDRSTIDATSVVTLVNKTPTEARALISTMAENAQQFGTRALCNVFAAKQQPQEPVAEVSYAGNQHVDECLTTTV

A0A6P6T081 uncharacterized protein LOC1136965151.4e-4442.54Show/hide
Query:  VLRRKEARERRTAEERFSAE--------------GTIEELLEEEETPMEKPVERILWDLSAPNIDRQPLCITYPAMK----------------------G
        + RR     R+  EE  SA               G      E+EE PM     R L +L+APN+++QPLCIT+P++                       G
Subjt:  VLRRKEARERRTAEERFSAE--------------GTIEELLEEEETPMEKPVERILWDLSAPNIDRQPLCITYPAMK----------------------G

Query:  EDPHKHLKKFIIVCEGMRPHGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQ----------------
        E+P+KHL++F +VC  M+P G+TEEQI ++AFPFSLKD AKDWLY+L PGS+TTW +L++ FL K+F ASRA+ +RKE  GI Q                
Subjt:  EDPHKHLKKFIIVCEGMRPHGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQ----------------

Query:  ---------------TYFYEGLIPSDRSTIDATSVVTLVNKTPTEARALISTMAENAQQFGTRALCNV
                        YFYEGL+  DRS IDA S   LVNKTP EAR LI  MAEN+QQFGTR  C +
Subjt:  ---------------TYFYEGLIPSDRSTIDATSVVTLVNKTPTEARALISTMAENAQQFGTRALCNV

A0A6P6UJL6 Reverse transcriptase2.0e-4640.6Show/hide
Query:  RILWDLSAPNIDRQPLCITYP----------------------AMKGEDPHKHLKKFIIVCEGMRPHGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVT
        R L +L+AP++++QPLCIT+P                       + GE+P+KHL++F +VC  M+P G+TEEQI +RAFPFSLKD AKDWLY+LPPGS+T
Subjt:  RILWDLSAPNIDRQPLCITYP----------------------AMKGEDPHKHLKKFIIVCEGMRPHGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVT

Query:  TWKELQRQFLGKFFLASRATRIRKENYGIFQ-------------------------------TYFYEGLIPSDRSTIDATSVVTLVNKTPTEARALISTM
        TW +L+++FL K+F ASRA  +RKE  GI Q                                YFYE L+  DRS IDA     LVNKTP  A  LI  M
Subjt:  TWKELQRQFLGKFFLASRATRIRKENYGIFQ-------------------------------TYFYEGLIPSDRSTIDATSVVTLVNKTPTEARALISTM

Query:  AENAQQFGTRALCNVFAAKQQPQEPVAEVSYAGNQHVDECLTTTVGAVKGTRARLADAVSRLDTQ-QKNLPSQPTPNVQNVSAISMSCAMNPLPEKPV
        AEN+QQFG+R        +  P   V EV  +  Q     LT+ V +++    ++A  ++RL++Q Q  LPSQP  N +NVSA+++         +PV
Subjt:  AENAQQFGTRALCNVFAAKQQPQEPVAEVSYAGNQHVDECLTTTVGAVKGTRARLADAVSRLDTQ-QKNLPSQPTPNVQNVSAISMSCAMNPLPEKPV

A0A6P6WXZ3 uncharacterized protein LOC1137355111.3e-4542.76Show/hide
Query:  RKEARERRTAEERFSAEGTIEELL----EEEETPMEKPVERILWDLSAPNIDRQPLCITYPAMK--------------------GEDPHKHLKKFIIVCE
        RK  RER  A    +   ++ + +    E EE       ER L +L+AP++++QPLCITYP ++                    GEDPHKHLK+F +VC 
Subjt:  RKEARERRTAEERFSAEGTIEELL----EEEETPMEKPVERILWDLSAPNIDRQPLCITYPAMK--------------------GEDPHKHLKKFIIVCE

Query:  GMRPHGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQ-------------------------------
         M+P GVTEEQI LRAFPFSL D AKDWLY+L  GS++TW ++++ FL KFF ASRA  IRK+  GI Q                               
Subjt:  GMRPHGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQ-------------------------------

Query:  TYFYEGLIPSDRSTIDATSVVTLVNKTPTEARALISTMAENAQQFGTRALCNVFAAKQQPQEPVAEVSYAGNQHVDECLTTTV
         YFYEGL  +DR  IDA S  +LVNKTPTEAR+LIS+MA NAQQFG R        +      V EVS +  +   +CLT+ V
Subjt:  TYFYEGLIPSDRSTIDATSVVTLVNKTPTEARALISTMAENAQQFGTRALCNVFAAKQQPQEPVAEVSYAGNQHVDECLTTTV

A0A6P6X8T1 Reverse transcriptase1.0e-4542.4Show/hide
Query:  RKEARERRTAEERFSAEGTIEELL----EEEETPMEKPVERILWDLSAPNIDRQPLCITYPAMK--------------------GEDPHKHLKKFIIVCE
        RK  RER  A    +   ++ + +    E EE       ER L +L+AP++++QPLCITYP ++                    GEDPHKHLK+F ++C 
Subjt:  RKEARERRTAEERFSAEGTIEELL----EEEETPMEKPVERILWDLSAPNIDRQPLCITYPAMK--------------------GEDPHKHLKKFIIVCE

Query:  GMRPHGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQ-------------------------------
         M+P GVTEEQI LRAFPF L D AKDWLY+LP GS++TW ++++ FL KFF ASRA  IRK+  GI Q                               
Subjt:  GMRPHGVTEEQINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQ-------------------------------

Query:  TYFYEGLIPSDRSTIDATSVVTLVNKTPTEARALISTMAENAQQFGTRALCNVFAAKQQPQEPVAEVSYAGNQHVDECLTTTV
         YFYEGL  +DR  IDA S  +LVNKTPTEAR+LIS+MA NAQQFG R               V EVS +  +   +CLT+ V
Subjt:  TYFYEGLIPSDRSTIDATSVVTLVNKTPTEARALISTMAENAQQFGTRALCNVFAAKQQPQEPVAEVSYAGNQHVDECLTTTV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATCAAGGCTATGGTAACTATTTTATAGCTTGGGTATTAGGGATTCTTTTTGTAATCGATGTGCTTAGAAGGAAGGAAGCTAGAGAGCGAAGAACTGCAGAGGAACG
TTTCAGTGCAGAAGGCACAATTGAAGAGCTTTTAGAAGAAGAAGAAACACCAATGGAAAAACCTGTAGAGCGCATATTGTGGGACCTCTCAGCCCCAAACATTGATCGAC
AACCCCTTTGCATTACCTATCCCGCCATGAAAGGTGAGGACCCCCATAAACATCTTAAGAAGTTCATCATTGTGTGCGAGGGGATGAGGCCCCATGGTGTGACTGAGGAG
CAAATCAACCTTAGAGCTTTCCCCTTCTCCTTGAAGGACGATGCGAAGGATTGGCTTTACTTTCTTCCTCCTGGTTCTGTGACCACTTGGAAGGAATTGCAGAGGCAGTT
TTTGGGGAAGTTCTTCCTTGCATCACGTGCTACACGTATAAGGAAAGAGAATTATGGGATCTTTCAGACGTACTTCTATGAAGGGCTCATCCCTAGTGATAGAAGTACCA
TCGATGCAACTAGTGTTGTCACCTTGGTCAACAAGACTCCTACAGAGGCTAGGGCACTCATCTCCACCATGGCAGAGAATGCACAGCAGTTCGGGACGCGAGCTCTCTGC
AATGTCTTTGCTGCCAAGCAGCAACCCCAAGAGCCTGTAGCTGAGGTTAGTTATGCTGGCAATCAACATGTGGATGAATGTTTAACTACAACTGTTGGGGCGGTCAAGGG
AACTAGGGCTAGGCTAGCTGATGCTGTGAGTAGGTTGGACACTCAACAGAAGAATCTTCCTTCCCAGCCCACACCAAACGTGCAGAATGTGAGTGCCATCTCTATGAGCT
GCGCGATGAACCCCTTGCCTGAAAAGCCCGTAGGTGAACCTGTGGACATAGTGAGTGATATTAGCTCAGTGAAGAAACGTGGGGTAAGCTTTGATCCACCTCTTAATTTA
AATGTTTCTACTAATTTGCCTCGTGCTCCCTTCCTCAGCAGGTTGGCTGTTGCTCAGGGAGGATCACTTCAGGAGAAGGTGAGGTTAGTTCTATCCCCTGAAGTTCAGAG
GCCTACTATTGAGGGAAAGAGGGTGTGGACCAAAGTTCGTCAGAAGGAAAAGAAGGGAGAGAATAAACAAGAAGGGTGTGGACCAAAGTACGTTTTCTGGAGACCCTTTC
TAAAGACCGCGAAAGTGGTGATCGATGTGGATGAAGGATCTCTATCTCTAAGGCATGGAGAAAAAATTGAAAAATTCTTTATTTCTAATGATTCCTCCACCACTAATCTC
GAGTGTTTTAGCTCTATGGGGACTCGCAATTCCAGATGGATATGGATGACGAAGCCAAGTGTAGTTGAGCCTAACGACCTTAAAAATTTGCCCTTCCTGGAGGGCACAAC
GTCGCGATGCTACCATCACGACAGAACTACTAATAGCAGTTCTGTCCGCGTGAGAGGGGATTTCTTCACTCTTCTTCCTTCTTTTAAACATTTTCTCCGTGGGTTTTTGG
CCATTTGTGCGTATTGGATAGCATCATTGGCATTTGGAGTCTTCACCAATCTTAAGTCTTCATTGCTGATTTTGCTTGAGATTTTGTGTGGTTTCCCTAAATGTTGTGTT
GTGCTAAAGTTGGAGGAAGGAATTGAAGTGTTGAAGCTTGGAAAGGAATTTTTTAGACTTGAACCTAGCATTGTGTTCTTAGAGTCTTATCCTATGGCACCTAAGAAAAG
CAAGGGTAAGGGTGTAGCATCTAGTTCCACTAGAGACCAGAATGAGCCATCATGCGATGGTACCATGGTCGCTAATGTTATAATTCCTGAGCGAGGGCTTAGGCTCGAGG
CTGATTTCTTCACCCAAATCTCCACTAAAACTCAACGTCGTGGGTGGGAATTATTGGCTCGGCAACCTGATGTGGCCATAGTTCTCGTAGTGAGGAAATTTTACTACAGT
ATAGTTGAGGATTTTGATGAGTCTTATGTTAGAGGTCGGTCGGTATCTTTCTCCCCTTCTGTCATAAATAGTTTATTCCACCTTCCTAACATCCATAGAGACGAGTACTC
TGAGTTTTCTTATGGCAAGATCGACTATGACACCATGGCCAAGACCCTAGGTGGGCCTGGGACGCGATGGGTGGTCAAACGAGAGGCACCAGTTAGGATAAGAGGGACCG
ACCTCTTTGTTTCTGGCCAGATTTGGCACGCTTTTATTTGCACTAGGTTCATGCTTGTGACACACTTAGCGGACATCATACGTGCATCAATCCAAGGAGCTCGAGAGGAG
AGTCAACTGGAGCGTATTAAGAAGAGGATCTCTACTTTGCTCACATTTATTTTGACCCTCACCTTCCTAGGGCGCATTCCCCTGGCTCTGATTCTTAAACCCCATTTCGC
TTCTAGTGCGCTTCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTATCAAGGCTATGGTAACTATTTTATAGCTTGGGTATTAGGGATTCTTTTTGTAATCGATGTGCTTAGAAGGAAGGAAGCTAGAGAGCGAAGAACTGCAGAGGAACG
TTTCAGTGCAGAAGGCACAATTGAAGAGCTTTTAGAAGAAGAAGAAACACCAATGGAAAAACCTGTAGAGCGCATATTGTGGGACCTCTCAGCCCCAAACATTGATCGAC
AACCCCTTTGCATTACCTATCCCGCCATGAAAGGTGAGGACCCCCATAAACATCTTAAGAAGTTCATCATTGTGTGCGAGGGGATGAGGCCCCATGGTGTGACTGAGGAG
CAAATCAACCTTAGAGCTTTCCCCTTCTCCTTGAAGGACGATGCGAAGGATTGGCTTTACTTTCTTCCTCCTGGTTCTGTGACCACTTGGAAGGAATTGCAGAGGCAGTT
TTTGGGGAAGTTCTTCCTTGCATCACGTGCTACACGTATAAGGAAAGAGAATTATGGGATCTTTCAGACGTACTTCTATGAAGGGCTCATCCCTAGTGATAGAAGTACCA
TCGATGCAACTAGTGTTGTCACCTTGGTCAACAAGACTCCTACAGAGGCTAGGGCACTCATCTCCACCATGGCAGAGAATGCACAGCAGTTCGGGACGCGAGCTCTCTGC
AATGTCTTTGCTGCCAAGCAGCAACCCCAAGAGCCTGTAGCTGAGGTTAGTTATGCTGGCAATCAACATGTGGATGAATGTTTAACTACAACTGTTGGGGCGGTCAAGGG
AACTAGGGCTAGGCTAGCTGATGCTGTGAGTAGGTTGGACACTCAACAGAAGAATCTTCCTTCCCAGCCCACACCAAACGTGCAGAATGTGAGTGCCATCTCTATGAGCT
GCGCGATGAACCCCTTGCCTGAAAAGCCCGTAGGTGAACCTGTGGACATAGTGAGTGATATTAGCTCAGTGAAGAAACGTGGGGTAAGCTTTGATCCACCTCTTAATTTA
AATGTTTCTACTAATTTGCCTCGTGCTCCCTTCCTCAGCAGGTTGGCTGTTGCTCAGGGAGGATCACTTCAGGAGAAGGTGAGGTTAGTTCTATCCCCTGAAGTTCAGAG
GCCTACTATTGAGGGAAAGAGGGTGTGGACCAAAGTTCGTCAGAAGGAAAAGAAGGGAGAGAATAAACAAGAAGGGTGTGGACCAAAGTACGTTTTCTGGAGACCCTTTC
TAAAGACCGCGAAAGTGGTGATCGATGTGGATGAAGGATCTCTATCTCTAAGGCATGGAGAAAAAATTGAAAAATTCTTTATTTCTAATGATTCCTCCACCACTAATCTC
GAGTGTTTTAGCTCTATGGGGACTCGCAATTCCAGATGGATATGGATGACGAAGCCAAGTGTAGTTGAGCCTAACGACCTTAAAAATTTGCCCTTCCTGGAGGGCACAAC
GTCGCGATGCTACCATCACGACAGAACTACTAATAGCAGTTCTGTCCGCGTGAGAGGGGATTTCTTCACTCTTCTTCCTTCTTTTAAACATTTTCTCCGTGGGTTTTTGG
CCATTTGTGCGTATTGGATAGCATCATTGGCATTTGGAGTCTTCACCAATCTTAAGTCTTCATTGCTGATTTTGCTTGAGATTTTGTGTGGTTTCCCTAAATGTTGTGTT
GTGCTAAAGTTGGAGGAAGGAATTGAAGTGTTGAAGCTTGGAAAGGAATTTTTTAGACTTGAACCTAGCATTGTGTTCTTAGAGTCTTATCCTATGGCACCTAAGAAAAG
CAAGGGTAAGGGTGTAGCATCTAGTTCCACTAGAGACCAGAATGAGCCATCATGCGATGGTACCATGGTCGCTAATGTTATAATTCCTGAGCGAGGGCTTAGGCTCGAGG
CTGATTTCTTCACCCAAATCTCCACTAAAACTCAACGTCGTGGGTGGGAATTATTGGCTCGGCAACCTGATGTGGCCATAGTTCTCGTAGTGAGGAAATTTTACTACAGT
ATAGTTGAGGATTTTGATGAGTCTTATGTTAGAGGTCGGTCGGTATCTTTCTCCCCTTCTGTCATAAATAGTTTATTCCACCTTCCTAACATCCATAGAGACGAGTACTC
TGAGTTTTCTTATGGCAAGATCGACTATGACACCATGGCCAAGACCCTAGGTGGGCCTGGGACGCGATGGGTGGTCAAACGAGAGGCACCAGTTAGGATAAGAGGGACCG
ACCTCTTTGTTTCTGGCCAGATTTGGCACGCTTTTATTTGCACTAGGTTCATGCTTGTGACACACTTAGCGGACATCATACGTGCATCAATCCAAGGAGCTCGAGAGGAG
AGTCAACTGGAGCGTATTAAGAAGAGGATCTCTACTTTGCTCACATTTATTTTGACCCTCACCTTCCTAGGGCGCATTCCCCTGGCTCTGATTCTTAAACCCCATTTCGC
TTCTAGTGCGCTTCTTTAG
Protein sequenceShow/hide protein sequence
MYQGYGNYFIAWVLGILFVIDVLRRKEARERRTAEERFSAEGTIEELLEEEETPMEKPVERILWDLSAPNIDRQPLCITYPAMKGEDPHKHLKKFIIVCEGMRPHGVTEE
QINLRAFPFSLKDDAKDWLYFLPPGSVTTWKELQRQFLGKFFLASRATRIRKENYGIFQTYFYEGLIPSDRSTIDATSVVTLVNKTPTEARALISTMAENAQQFGTRALC
NVFAAKQQPQEPVAEVSYAGNQHVDECLTTTVGAVKGTRARLADAVSRLDTQQKNLPSQPTPNVQNVSAISMSCAMNPLPEKPVGEPVDIVSDISSVKKRGVSFDPPLNL
NVSTNLPRAPFLSRLAVAQGGSLQEKVRLVLSPEVQRPTIEGKRVWTKVRQKEKKGENKQEGCGPKYVFWRPFLKTAKVVIDVDEGSLSLRHGEKIEKFFISNDSSTTNL
ECFSSMGTRNSRWIWMTKPSVVEPNDLKNLPFLEGTTSRCYHHDRTTNSSSVRVRGDFFTLLPSFKHFLRGFLAICAYWIASLAFGVFTNLKSSLLILLEILCGFPKCCV
VLKLEEGIEVLKLGKEFFRLEPSIVFLESYPMAPKKSKGKGVASSSTRDQNEPSCDGTMVANVIIPERGLRLEADFFTQISTKTQRRGWELLARQPDVAIVLVVRKFYYS
IVEDFDESYVRGRSVSFSPSVINSLFHLPNIHRDEYSEFSYGKIDYDTMAKTLGGPGTRWVVKREAPVRIRGTDLFVSGQIWHAFICTRFMLVTHLADIIRASIQGAREE
SQLERIKKRISTLLTFILTLTFLGRIPLALILKPHFASSALL