CuGenDBv2

Spg028568 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg028568
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: CACTA en-spm transposon protein
Genome location: scaffold7:17602643..17604134
RNA-Seq Expression: Spg028568
Synteny: Spg028568
Gene Ontology terms: NA
InterPro domains: NA
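
The genome location field follows a seqid:start..end pattern. A minimal sketch of splitting it into parts and computing the span is given here; the function names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption that this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


seqid, start, end = parse_location("scaffold7:17602643..17604134")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 1492 bp on scaffold7.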


Homology
GenBank top hits (e value, % identity, alignment)
KAA0056379.1 hypothetical protein E6C27_scaffold186G001230 [Cucumis melo var. makuwa] (e value: 2.3e-59; % identity: 44.98)
Query:  GRSRNQRGRGRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKARLLTYFDLDMSKRHVVK
        G     + RGR  RG+ RNIELD++V  H +I+IEI E+ GKPV  +  + +  IGT  R+T+PLSC  WKAVP  VR+ V   L T+F+ D +   V K
Subjt:  GRSRNQRGRGRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKARLLTYFDLDMSKRHVVK

Query:  YIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEWKKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQ
        Y+ + M +TF+EFR +L+KYY +F+D  +AR NPP RI + EDWN++CDRWET  WKK K+  DVD++++FHE+HF +++GW+ND AKDAYLEMQ+II +
Subjt:  YIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEWKKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQ

Query:  STQEGATPIPPDKVCKQVLGHRLGHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASYADMQESNVALKSQLSMWESRW
        ST+ G   I   K CK VLG R   ++ +      S  S+V+SS+         EK + EM  +K       E+N  L  +L+ WE  +
Subjt:  STQEGATPIPPDKVCKQVLGHRLGHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASYADMQESNVALKSQLSMWESRW

XP_038887408.1 poly [ADP-ribose] polymerase 1-like isoform X1 [Benincasa hispida] (e value: 7.9e-100; % identity: 54.57)
Query:  IGSST---VQAASGSRGRSRN-QRGRGRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKA
        IGSST      ASGSR  SR   RG  RRTRGH RN+ELDR+V +H RIRIEI E++GKPVC  A +FS +IGTI R+T+PL C  W  V K+VRD V  
Subjt:  IGSST---VQAASGSRGRSRN-QRGRGRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKA

Query:  RLL---------------------------TYFDLDMSKRHVVKYIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEW
        +LL                           +YFD D+ K+HV KY+ + + +TFKE+R DLYK+Y  F+DP +AR  PP+RI +  DWNLLC+RWET EW
Subjt:  RLL---------------------------TYFDLDMSKRHVVKYIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEW

Query:  K-------------------------------KIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATPIPPDKVCKQVLGHRLGH
        K                               KIKE RDVDQVDLF +SHFCE+DGWVN+ AKDAYLEMQ++++ S QE  TP+   +VCKQVLGHR G+
Subjt:  K-------------------------------KIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATPIPPDKVCKQVLGHRLGH

Query:  IKGLGWDPKPSSSSSVTS-SQYEKELEKKVEKMEGEMEQMKASYADMQESNVALKSQLSMWESRWSDIQNFLGRDQREDGPSN
        IKGLG +PKPSSSSSVTS  Q +KELEKK+EKME EM QMKA+Y  M+E+NVAL SQLSMWE RW++IQN LGR Q +DG SN
Subjt:  IKGLGWDPKPSSSSSVTS-SQYEKELEKKVEKMEGEMEQMKASYADMQESNVALKSQLSMWESRWSDIQNFLGRDQREDGPSN

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida] (e value: 3.1e-104; % identity: 58.71)
Query:  IGSST---VQAASGSRGRSRN-QRGRGRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKA
        IGSST      ASGSR  SR   RG  RRTRGH RN+ELDR+V +H RIRIEI E++GKPVC  A +FS +IGTI R+T+PL C  W  V K+VRD V  
Subjt:  IGSST---VQAASGSRGRSRN-QRGRGRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKA

Query:  RLLTYFDLDMSKRHVVKYIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEWK--------------------------
        +LL+YFD D+ K+HV KY+ + + +TFKE+R DLYK+Y  F+DP +AR  PP+RI +  DWNLLC+RWET EWK                          
Subjt:  RLLTYFDLDMSKRHVVKYIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEWK--------------------------

Query:  -----KIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATPIPPDKVCKQVLGHRLGHIKGLGWDPKPSSSSSVTS-SQYEKELE
             KIKE RDVDQVDLF +SHFCE+DGWVN+ AKDAYLEMQ++++ S QE  TP+   +VCKQVLGHR G+IKGLG +PKPSSSSSVTS  Q +KELE
Subjt:  -----KIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATPIPPDKVCKQVLGHRLGHIKGLGWDPKPSSSSSVTS-SQYEKELE

Query:  KKVEKMEGEMEQMKASYADMQESNVALKSQLSMWESRWSDIQNFLGRDQREDGPSN
        KK+EKME EM QMKA+Y  M+E+NVAL SQLSMWE RW++IQN LGR Q +DG SN
Subjt:  KKVEKMEGEMEQMKASYADMQESNVALKSQLSMWESRWSDIQNFLGRDQREDGPSN

XP_038887410.1 poly [ADP-ribose] polymerase 1-like isoform X3 [Benincasa hispida] (e value: 2.1e-81; % identity: 47.91)
Query:  IGSST---VQAASGSRGRSRN-QRGRGRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKA
        IGSST      ASGSR  SR   RG  RRTRGH RN+ELDR+V +H RIRIEI E++GKPVC  A +FS +IGTI R+T+PL C  W  V K+VRD V  
Subjt:  IGSST---VQAASGSRGRSRN-QRGRGRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKA

Query:  RLL---------------------------TYFDLDMSKRHVVKYIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEW
        +LL                           +YFD D+ K+HV KY+ + + +TFKE+R DLYK+Y  F+DP +AR  PP+RI +  DWNLLC+RWET EW
Subjt:  RLL---------------------------TYFDLDMSKRHVVKYIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEW

Query:  K-------------------------------KIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATPIPPDKVCKQVLGHRLGH
        K                               KIKE RDVDQVDLF +SHFCE+DGWVN+ AKDAYLEMQ++++ S QE  TP                 
Subjt:  K-------------------------------KIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATPIPPDKVCKQVLGHRLGH

Query:  IKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASYADMQESNVALKSQLSMWESRWSDIQNFLGRDQREDGPSN
                       ++S + +KELEKK+EKME EM QMKA+Y  M+E+NVAL SQLSMWE RW++IQN LGR Q +DG SN
Subjt:  IKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASYADMQESNVALKSQLSMWESRWSDIQNFLGRDQREDGPSN

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida] (e value: 7.9e-100; % identity: 54.57)
Query:  IGSST---VQAASGSRGRSRN-QRGRGRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKA
        IGSST      ASGSR  SR   RG  RRTRGH RN+ELDR+V +H RIRIEI E++GKPVC  A +FS +IGTI R+T+PL C  W  V K+VRD V  
Subjt:  IGSST---VQAASGSRGRSRN-QRGRGRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKA

Query:  RLL---------------------------TYFDLDMSKRHVVKYIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEW
        +LL                           +YFD D+ K+HV KY+ + + +TFKE+R DLYK+Y  F+DP +AR  PP+RI +  DWNLLC+RWET EW
Subjt:  RLL---------------------------TYFDLDMSKRHVVKYIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEW

Query:  K-------------------------------KIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATPIPPDKVCKQVLGHRLGH
        K                               KIKE RDVDQVDLF +SHFCE+DGWVN+ AKDAYLEMQ++++ S QE  TP+   +VCKQVLGHR G+
Subjt:  K-------------------------------KIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATPIPPDKVCKQVLGHRLGH

Query:  IKGLGWDPKPSSSSSVTS-SQYEKELEKKVEKMEGEMEQMKASYADMQESNVALKSQLSMWESRWSDIQNFLGRDQREDGPSN
        IKGLG +PKPSSSSSVTS  Q +KELEKK+EKME EM QMKA+Y  M+E+NVAL SQLSMWE RW++IQN LGR Q +DG SN
Subjt:  IKGLGWDPKPSSSSSVTS-SQYEKELEKKVEKMEGEMEQMKASYADMQESNVALKSQLSMWESRWSDIQNFLGRDQREDGPSN

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SPZ3 Transposase (e value: 1.2e-50; % identity: 38.6)
Query:  RGRSRNQRGRGRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKARLLTYFDLDMSKRHVV
        R +SR ++ R R  RG+ RNIELD++V  H +++IEI E+ GKPV  +A + +  IGT  R+T+ LSC  WKA+P  V++ +  R  T+F+ D +   V 
Subjt:  RGRSRNQRGRGRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKARLLTYFDLDMSKRHVV

Query:  KYIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEWKKIKENR-------------------------------DVDQV
        KY+ + M + F+EFR  L+KYY +F+D  +AR NPP++I + EDWN++CDRWET  WKK +E                                 DVD+V
Subjt:  KYIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEWKKIKENR-------------------------------DVDQV

Query:  DLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATPIPPDKVCKQVLGHRLGHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASY
        ++F E+HF E++GW+ND AKDAY    +II +ST+ G   I   K CK VLG     I  L       S  S  SS  EKE               K   
Subjt:  DLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATPIPPDKVCKQVLGHRLGHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASY

Query:  ADMQESNVALKSQLSMWESRWSDIQNFLG
        A ++E N  L  +L+ WE RW+DI+  +G
Subjt:  ADMQESNVALKSQLSMWESRWSDIQNFLG

A0A5A7TFG0 Transposon protein, putative, CACTA, En/Spm sub-class (e value: 1.4e-57; % identity: 46.91)
Query:  GRSRNQRGRGRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKARLLTYFDLDMSKRHVVK
        G     + RGR  RG+ RNIELD++V  H +I+IEI E+ GKPV  +A + +  IGT  R+T+PLSC  WKAVP  VR+ V  RL T+F+ D +   V K
Subjt:  GRSRNQRGRGRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKARLLTYFDLDMSKRHVVK

Query:  YIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEWKKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQ
        Y+ + M + F+EFR DL+KYY +F+D  +AR NPP R I  EDWN++CDRWET  WK  K+  DVD++++FHE+HF E++GW+ND AKDAYLEMQ+II +
Subjt:  YIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEWKKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQ

Query:  STQEGATPIPPDKVCKQVLGHRLGHIKGLGWDPKPSSS-SSVTSSQYEKE------LEKKVEKMEGEMEQMKASY
        ST+ G   I   K C+ VLG R         +P+   S  S  SS  EKE      L++  EK+  E+ + + SY
Subjt:  STQEGATPIPPDKVCKQVLGHRLGHIKGLGWDPKPSSS-SSVTSSQYEKE------LEKKVEKMEGEMEQMKASY

A0A5A7TRX4 DUF4216 domain-containing protein (e value: 1.6e-53; % identity: 41.16)
Query:  GRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKARLLTYFDLDMSKRHVVKYIHKLMSST
        GR  RG+ RNIELD++V  H +I+IEI E+ GKPV  +A + +  IGT  R+T+PLSC  WKAVP  VR+ V   L T+F+ D +   V KY+ + M +T
Subjt:  GRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKARLLTYFDLDMSKRHVVKYIHKLMSST

Query:  FKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEWKKIKENR-------------------------------DVDQVDLFHESHFCE
        F+EFR DL+KYY +F+D  +AR NP  RI + EDWN++CDRWET  WKK +E                                 DVD++++FHE+HF E
Subjt:  FKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEWKKIKENR-------------------------------DVDQVDLFHESHFCE

Query:  RDGWVNDVAKDAYLEMQKIIDQSTQEGATPIPPDKVCKQVLGHRLGHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASYADMQESNVAL
        ++GW ND AKDAYLEMQ+II +ST+ G   I   K C+ VLG R            P S  S+ S+     +    EK + EM  +K       E+N  L
Subjt:  RDGWVNDVAKDAYLEMQKIIDQSTQEGATPIPPDKVCKQVLGHRLGHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASYADMQESNVAL

Query:  KSQLSMWESRW
          +L+ WE  +
Subjt:  KSQLSMWESRW

A0A5A7US78 Uncharacterized protein (e value: 1.1e-59; % identity: 44.98)
Query:  GRSRNQRGRGRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKARLLTYFDLDMSKRHVVK
        G     + RGR  RG+ RNIELD++V  H +I+IEI E+ GKPV  +  + +  IGT  R+T+PLSC  WKAVP  VR+ V   L T+F+ D +   V K
Subjt:  GRSRNQRGRGRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKARLLTYFDLDMSKRHVVK

Query:  YIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEWKKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQ
        Y+ + M +TF+EFR +L+KYY +F+D  +AR NPP RI + EDWN++CDRWET  WKK K+  DVD++++FHE+HF +++GW+ND AKDAYLEMQ+II +
Subjt:  YIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEWKKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQ

Query:  STQEGATPIPPDKVCKQVLGHRLGHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASYADMQESNVALKSQLSMWESRW
        ST+ G   I   K CK VLG R   ++ +      S  S+V+SS+         EK + EM  +K       E+N  L  +L+ WE  +
Subjt:  STQEGATPIPPDKVCKQVLGHRLGHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASYADMQESNVALKSQLSMWESRW

A0A6J1DUH3 uncharacterized protein LOC111023212 (e value: 9.2e-54; % identity: 48.06)
Query:  YFDLDMSKRHVVKYIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEWK------------------------------
        +F +D+SKR V K+I + M  +FK++R DL++YY EFEDP +AR NPPER+ N EDWN LCDRWET EWK                              
Subjt:  YFDLDMSKRHVVKYIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEWK------------------------------

Query:  -KIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATPIPPDKVCKQVLGHRLGHIKGLGWDPKPS-----SSSSVTSSQ-YEKEL
         KIKE  D+  VDLF ESH+ E+DG VND A+DAY  MQ +I   TQEG  P+   + C++VLG R  H+KGLG+ P+P+     SSS+VTSS  YEKEL
Subjt:  -KIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLEMQKIIDQSTQEGATPIPPDKVCKQVLGHRLGHIKGLGWDPKPS-----SSSSVTSSQ-YEKEL

Query:  EKKVEKMEGEMEQMKASYADMQESNVALKSQLSMWESRWSDIQNFLGRDQREDGPSNN
        EKKVE ME EM +MK         N  LK  +S WE RW++I  F+   ++ DGPSNN
Subjt:  EKKVEKMEGEMEQMKASYADMQESNVALKSQLSMWESRWSDIQNFLGRDQREDGPSNN

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
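
In the pairwise alignments listed in the homology section, the unlabeled middle line follows the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. A minimal sketch of counting these for a single alignment block is given here. The function name `midline_stats` and the short input strings are hypothetical, and the percentages reported on this page are presumably computed over the full alignment, so per-block figures need not match them.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block."""
    # Trailing spaces of the midline are often lost in text extraction,
    # so pad it back to the length of the query line.
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    length = len(query)
    return {
        "length": length,
        "identical": identical,
        "positives": positives,
        "percent_identity": round(100 * identical / length, 2),
    }


# Hypothetical toy block, not taken from the record.
print(midline_stats("ACDEFGHIKL", "AC+EF  IK"))
```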

Sequences
CDS sequence
TCGGAGGCACGTGATGCACTTGCAAATGCTAACCGACTAGTAATTGGTTCATCCACTGTTCAAGCTGCGTCAGGATCTAGAGGTCGTTCAAGAAATCAACGAGGACGAGG
AAGGCGGACTAGAGGACATTGTCGGAATATTGAACTAGACCGATATGTGGCTCTTCATTGGAGGATCAGAATTGAGATCACCGAGCAGATTGGAAAACCAGTATGCGGTT
GGGCTATGAGGTTTAGTGGCTCTATTGGTACCATAACAAGGAGCACGGTTCCTTTGAGTTGTGCGACATGGAAGGCTGTACCAAAACAAGTACGGGACGCTGTGAAGGCT
CGTTTGTTGACATATTTTGACCTGGATATGTCAAAGAGACATGTGGTGAAGTACATACACAAACTCATGTCATCAACTTTCAAGGAATTTCGAGTGGATTTATATAAATA
TTATTGGGAGTTTGAGGACCCTGCAAAAGCCCGTCAAAATCCACCCGAGAGGATTATAAACCTTGAAGATTGGAATCTTCTATGTGATCGATGGGAGACACTTGAGTGGA
AGAAAATCAAAGAGAATCGGGATGTTGACCAGGTAGATTTGTTCCATGAAAGTCATTTTTGTGAAAGAGATGGATGGGTCAACGATGTTGCCAAAGATGCATATCTAGAG
ATGCAAAAAATCATAGATCAATCAACACAAGAAGGCGCAACACCGATTCCCCCAGACAAAGTATGTAAGCAGGTGTTGGGTCATCGATTAGGCCACATCAAAGGCCTAGG
TTGGGACCCAAAACCCAGCTCATCATCCAGTGTCACATCATCACAATATGAAAAAGAACTAGAAAAGAAGGTTGAGAAGATGGAAGGTGAAATGGAACAGATGAAGGCTT
CTTACGCAGATATGCAGGAATCAAATGTTGCCCTGAAGTCACAATTGTCGATGTGGGAAAGTAGATGGTCTGACATTCAAAACTTTTTGGGGCGAGATCAAAGAGAGGAT
GGACCTTCAAACAATTAG
mRNA sequence
TCGGAGGCACGTGATGCACTTGCAAATGCTAACCGACTAGTAATTGGTTCATCCACTGTTCAAGCTGCGTCAGGATCTAGAGGTCGTTCAAGAAATCAACGAGGACGAGG
AAGGCGGACTAGAGGACATTGTCGGAATATTGAACTAGACCGATATGTGGCTCTTCATTGGAGGATCAGAATTGAGATCACCGAGCAGATTGGAAAACCAGTATGCGGTT
GGGCTATGAGGTTTAGTGGCTCTATTGGTACCATAACAAGGAGCACGGTTCCTTTGAGTTGTGCGACATGGAAGGCTGTACCAAAACAAGTACGGGACGCTGTGAAGGCT
CGTTTGTTGACATATTTTGACCTGGATATGTCAAAGAGACATGTGGTGAAGTACATACACAAACTCATGTCATCAACTTTCAAGGAATTTCGAGTGGATTTATATAAATA
TTATTGGGAGTTTGAGGACCCTGCAAAAGCCCGTCAAAATCCACCCGAGAGGATTATAAACCTTGAAGATTGGAATCTTCTATGTGATCGATGGGAGACACTTGAGTGGA
AGAAAATCAAAGAGAATCGGGATGTTGACCAGGTAGATTTGTTCCATGAAAGTCATTTTTGTGAAAGAGATGGATGGGTCAACGATGTTGCCAAAGATGCATATCTAGAG
ATGCAAAAAATCATAGATCAATCAACACAAGAAGGCGCAACACCGATTCCCCCAGACAAAGTATGTAAGCAGGTGTTGGGTCATCGATTAGGCCACATCAAAGGCCTAGG
TTGGGACCCAAAACCCAGCTCATCATCCAGTGTCACATCATCACAATATGAAAAAGAACTAGAAAAGAAGGTTGAGAAGATGGAAGGTGAAATGGAACAGATGAAGGCTT
CTTACGCAGATATGCAGGAATCAAATGTTGCCCTGAAGTCACAATTGTCGATGTGGGAAAGTAGATGGTCTGACATTCAAAACTTTTTGGGGCGAGATCAAAGAGAGGAT
GGACCTTCAAACAATTAG
Protein sequence
SEARDALANANRLVIGSSTVQAASGSRGRSRNQRGRGRRTRGHCRNIELDRYVALHWRIRIEITEQIGKPVCGWAMRFSGSIGTITRSTVPLSCATWKAVPKQVRDAVKA
RLLTYFDLDMSKRHVVKYIHKLMSSTFKEFRVDLYKYYWEFEDPAKARQNPPERIINLEDWNLLCDRWETLEWKKIKENRDVDQVDLFHESHFCERDGWVNDVAKDAYLE
MQKIIDQSTQEGATPIPPDKVCKQVLGHRLGHIKGLGWDPKPSSSSSVTSSQYEKELEKKVEKMEGEMEQMKASYADMQESNVALKSQLSMWESRWSDIQNFLGRDQRED
GPSNN
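
As a consistency check between the CDS and protein sequences listed in this record, the CDS can be translated with the standard genetic code. This is a minimal sketch: the function name `translate` is hypothetical, and use of the standard code (NCBI translation table 1) is an assumption. Only the first 30 and the last 18 nucleotides of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as X.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("TCGGAGGCACGTGATGCACTTGCAAATGCT"))  # start of the CDS
print(translate("GGACCTTCAAACAATTAG"))              # end of the CDS
```

The two fragments translate to SEARDALANA and GPSNN, matching the start and end of the listed protein sequence. The listed CDS begins with TCG rather than an ATG start codon.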