; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005902 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005902
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold11:663583..666808
RNA-Seq ExpressionSpg005902
SyntenySpg005902
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN70669.1 hypothetical protein VITISV_037506 [Vitis vinifera]4.5e-3434.16Show/hide
Query:  GAAEGILILWSDPDFTIKETIQGLFSLSIHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTSMNSFN
        GA+ GILI+W   +   +E +   F +S+   + D    W+SAIYGP    +R +FW EL D+ GL    W +G DFNV R S EK     +T+SM  F+
Subjt:  GAAEGILILWSDPDFTIKETIQGLFSLSIHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTSMNSFN

Query:  QWIATYQLIDIPLQN-----------DCDSLSSDQMSQRTHLREQI------EDITAREHTYWRQRCKLNWLKEGDENTKFFHRFMAARKRKNSISELLS
         +I   +L D PL+N               L+ + +SQR   + ++      E++  RE  YWRQ+ ++ W+KEGD N+KF+H+    R+ +  I EL +
Subjt:  QWIATYQLIDIPLQN-----------DCDSLSSDQMSQRTHLREQI------EDITAREHTYWRQRCKLNWLKEGDENTKFFHRFMAARKRKNSISELLS

Query:  RNGVGLLSAKDIEEEFIDFYKKLYTKDSSSRFLPTNLEWGRIN
          G+ L +AK I EE + +++KLY   +   +    L+W  I+
Subjt:  RNGVGLLSAKDIEEEFIDFYKKLYTKDSSSRFLPTNLEWGRIN

KAA0044449.1 hypothetical protein E6C27_scaffold46G001820 [Cucumis melo var. makuwa]3.3e-4024.65Show/hide
Query:  NHSTRSITIDRKTFSIAFDELSRGSCAKITERSRNSTHSLSLSWKSLNWLASSFHSLTKEPCSYKFFSEFRGDGYVLCLEKLRNKHGFF-----------
        N   R  ++++K F ++ D+ SR S   ITE     + S++++  SL WL  +F +L   P + +FF E R   + L ++ + N+ G+            
Subjt:  NHSTRSITIDRKTFSIAFDELSRGSCAKITERSRNSTHSLSLSWKSLNWLASSFHSLTKEPCSYKFFSEFRGDGYVLCLEKLRNKHGFF-----------

Query:  -------------------------ETSNQKVDHRSHTY-----KEILEQRPQHPTSIHMQPPQK--APMIETSAPPLKLDTN-DEWKDVIVVERFSPKD
                                 +TS++K     H Y     KE ++Q   + +S   + P+K  A  + +S+     +++  + K    ++R    D
Subjt:  -------------------------ETSNQKVDHRSHTY-----KEILEQRPQHPTSIHMQPPQK--APMIETSAPPLKLDTN-DEWKDVIVVERFSPKD

Query:  NWPSIRETIANLTSR----CSINPFQDNKALIHCYGHDHTLNLCNNSKWTLLGNFRLKFYPLTTISYQQNQKISFYGGWVNIHNLPLNLWTDTVFQYIGE
        +W  I + + + T +        PF  +KAL+     +    LC N  WT +G F +KF   +  ++   + I  YGGW     +PL++W    F  IGE
Subjt:  NWPSIRETIANLTSR----CSINPFQDNKALIHCYGHDHTLNLCNNSKWTLLGNFRLKFYPLTTISYQQNQKISFYGGWVNIHNLPLNLWTDTVFQYIGE

Query:  QCGGFESLSDHTMRRLVITEASIKIKENPTGFIPAAIRLPAALTGDAIV--TAHITGE-IEELNKKENRSNSEFNSENTEALNKEIIEPKISHQITPLRS
          GGF   +  ++ +L +TEA IK+KEN TGF+PA I++      D I+    H  G+ + E N   + S ++  +EN    N    +      +  +  
Subjt:  QCGGFESLSDHTMRRLVITEASIKIKENPTGFIPAAIRLPAALTGDAIV--TAHITGE-IEELNKKENRSNSEFNSENTEALNKEIIEPKISHQITPLRS

Query:  RD-----EKGKKILHD----SPTQPLNPILPKPSTLRIGTKSSTHNL----KIVGSDTEDYLT-----------------SPL-----------------
         D     +KG K +++     PT+ +       S L +  +  + N     KI   D    +T                 SP+                 
Subjt:  RD-----EKGKKILHD----SPTQPLNPILPKPSTLRIGTKSSTHNL----KIVGSDTEDYLT-----------------SPL-----------------

Query:  -SNHSGP--------HLIMPNIMAQTEKPS----------TSKNLEPSMFAESTMDQSSQLEPTDHE----INPLPLLTMGPTFNPTNQPSPVPNQCQLS
         S HS P         +    + A++++P+           S+++E      S      +++P + E    +  + LL+    F+    PS +P+    S
Subjt:  -SNHSGP--------HLIMPNIMAQTEKPS----------TSKNLEPSMFAESTMDQSSQLEPTDHE----INPLPLLTMGPTFNPTNQPSPVPNQCQLS

Query:  PT---VAVSPVNDLLCKAHPHGESSSNFLNLPLSI--------RQIAPILIEHGLCIMA-IPPPKKKVGAAEGILILWSDPDFTI----KETIQGLFSLS
        PT   +    +  ++  AH   E      N+            R++   L E+ L + A        V     I IL   P+  +    ++ I G FS+S
Subjt:  PT---VAVSPVNDLLCKAHPHGESSSNFLNLPLSI--------RQIAPILIEHGLCIMA-IPPPKKKVGAAEGILILWSDPDFTI----KETIQGLFSLS

Query:  IHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTSMNSFNQWIATYQLIDIPLQN
        I +   +G  +WLSAIYGP +   R  FW+EL +L  +    WILGGDFNV RW  E S+  P + SM  FN +I+   LID PL N
Subjt:  IHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTSMNSFNQWIATYQLIDIPLQN

RVW60988.1 putative ribonuclease H protein [Vitis vinifera]1.1e-3233.61Show/hide
Query:  GAAEGILILWSDPDFTIKETIQGLFSLSIHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTS-----
        GA+ GI+ILW    F   E + G FS+++ +   +   FWL+++YGP +   R +FW EL DL GL   RW +GGDFNV R   EK  D           
Subjt:  GAAEGILILWSDPDFTIKETIQGLFSLSIHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTS-----

Query:  --MNSFNQWIAT-----------YQLIDIPLQNDCDSLSSDQMSQRTHLREQIEDITAREHTYWRQRCKLNWLKEGDENTKFFHRFMAARKRKNSISELL
             F  W               + + I L     +L+ D +S+RT  R+++ED+  +E   WRQ+ ++ W+KEGD N+KFFHR    R+ +  I  L+
Subjt:  --MNSFNQWIAT-----------YQLIDIPLQNDCDSLSSDQMSQRTHLREQIEDITAREHTYWRQRCKLNWLKEGDENTKFFHRFMAARKRKNSISELL

Query:  SRNGVGLLSAKDIEEEFIDFYKKLYTKDSSSRFLPTNLEWGRIN
        S+ G  L + + I EE ++F+ KLY+K     +    ++W  I+
Subjt:  SRNGVGLLSAKDIEEEFIDFYKKLYTKDSSSRFLPTNLEWGRIN

RVW61143.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]2.9e-3330.94Show/hide
Query:  GAAEGILILWSDPDFTIKETIQGLFSLSIHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTSMNSFN
        GA+ GILI+W       +E + G FS+SI   M      WLSA+YGP  S +R +FW EL D+AGL   RW +GGDFNV R S EK     +T  M  F+
Subjt:  GAAEGILILWSDPDFTIKETIQGLFSLSIHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTSMNSFN

Query:  QWIATYQLIDIPLQ--------------------------------------------------------NDCDSLSSDQMSQRTHLREQIEDITAREHT
        ++I   +LID PL+                                                         + +  S  ++ QR   + ++E++  RE  
Subjt:  QWIATYQLIDIPLQ--------------------------------------------------------NDCDSLSSDQMSQRTHLREQIEDITAREHT

Query:  YWRQRCKLNWLKEGDENTKFFHRFMAARKRKNSISELLSRNGVGLLSAKDIEEEFIDFYKKLYTKDSSSRFLPTNLEW
        +WRQ+ ++ W+KEGD N+KFFH+    R+ +  I EL + +G+ L + + I+EE + +++KLY   S   +    L+W
Subjt:  YWRQRCKLNWLKEGDENTKFFHRFMAARKRKNSISELLSRNGVGLLSAKDIEEEFIDFYKKLYTKDSSSRFLPTNLEW

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]4.1e-4333.83Show/hide
Query:  GAAEGILILWSDPDFTIKETIQGLFSLSIHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTSMNSFN
        G A GILILW+DPD    E I+G+FSL+I+  ++DGF FW+S IYGP+ ++    FWQEL DL+ L  + WIL GDFNVTRWSWEKS+ +P+T SM  FN
Subjt:  GAAEGILILWSDPDFTIKETIQGLFSLSIHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTSMNSFN

Query:  QWIATYQLIDIPLQN--------------DCDSLSS----------------------------------------------------------------
         +I    LID+PL N              DC  L++                                                                
Subjt:  QWIATYQLIDIPLQN--------------DCDSLSS----------------------------------------------------------------

Query:  ------------------------------------------------------DQMSQRTHLREQIEDITAREHTYWRQRCKLNWLKEGDENTKFFHRF
                                                              DQ   R   +E +  + A+E  +WRQRCK  WL EGDENTKFFHRF
Subjt:  ------------------------------------------------------DQMSQRTHLREQIEDITAREHTYWRQRCKLNWLKEGDENTKFFHRF

Query:  MAARKRKNSISELLSRNGVGLLSAKDIEEEFIDF
        +A ++R++ I+E+LS+ G+GL   KDIEEEFIDF
Subjt:  MAARKRKNSISELLSRNGVGLLSAKDIEEEFIDF

TrEMBL top hitse value%identityAlignment
A0A438CPP7 Transposon TX1 uncharacterized 149 kDa protein3.5e-3237.68Show/hide
Query:  GAAEGILILWSDPDFTIKETIQGLFSLSIHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTSMNSFN
        GA+ GILI+W   +   +E + G FS+S+   +      W+S +YGP    +R +FW EL D+ GL    W +GGDFNV R S EK     VT+SM  F+
Subjt:  GAAEGILILWSDPDFTIKETIQGLFSLSIHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTSMNSFN

Query:  QWIATYQLIDIPLQNDCDSLSSDQMSQRTHLREQIEDITAREHTYWRQRCKLNWLKEGDENTKFFHRFMAARKRKNSISELLSRNGVGLLSAKDIEEEFI
         +I   +L+D PL+N   + S+ Q S       ++E++  RE  +WRQ+ ++ W+KEGD N+KF+H+    R+ +  I EL +  G+ L +A+ I EE +
Subjt:  QWIATYQLIDIPLQNDCDSLSSDQMSQRTHLREQIEDITAREHTYWRQRCKLNWLKEGDENTKFFHRFMAARKRKNSISELLSRNGVGLLSAKDIEEEFI

Query:  DFYKKLY
         +++KLY
Subjt:  DFYKKLY

A0A438FLV0 Putative ribonuclease H protein5.4e-3333.61Show/hide
Query:  GAAEGILILWSDPDFTIKETIQGLFSLSIHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTS-----
        GA+ GI+ILW    F   E + G FS+++ +   +   FWL+++YGP +   R +FW EL DL GL   RW +GGDFNV R   EK  D           
Subjt:  GAAEGILILWSDPDFTIKETIQGLFSLSIHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTS-----

Query:  --MNSFNQWIAT-----------YQLIDIPLQNDCDSLSSDQMSQRTHLREQIEDITAREHTYWRQRCKLNWLKEGDENTKFFHRFMAARKRKNSISELL
             F  W               + + I L     +L+ D +S+RT  R+++ED+  +E   WRQ+ ++ W+KEGD N+KFFHR    R+ +  I  L+
Subjt:  --MNSFNQWIAT-----------YQLIDIPLQNDCDSLSSDQMSQRTHLREQIEDITAREHTYWRQRCKLNWLKEGDENTKFFHRFMAARKRKNSISELL

Query:  SRNGVGLLSAKDIEEEFIDFYKKLYTKDSSSRFLPTNLEWGRIN
        S+ G  L + + I EE ++F+ KLY+K     +    ++W  I+
Subjt:  SRNGVGLLSAKDIEEEFIDFYKKLYTKDSSSRFLPTNLEWGRIN

A0A5A7TTA1 DUF4283 domain-containing protein1.6e-4024.65Show/hide
Query:  NHSTRSITIDRKTFSIAFDELSRGSCAKITERSRNSTHSLSLSWKSLNWLASSFHSLTKEPCSYKFFSEFRGDGYVLCLEKLRNKHGFF-----------
        N   R  ++++K F ++ D+ SR S   ITE     + S++++  SL WL  +F +L   P + +FF E R   + L ++ + N+ G+            
Subjt:  NHSTRSITIDRKTFSIAFDELSRGSCAKITERSRNSTHSLSLSWKSLNWLASSFHSLTKEPCSYKFFSEFRGDGYVLCLEKLRNKHGFF-----------

Query:  -------------------------ETSNQKVDHRSHTY-----KEILEQRPQHPTSIHMQPPQK--APMIETSAPPLKLDTN-DEWKDVIVVERFSPKD
                                 +TS++K     H Y     KE ++Q   + +S   + P+K  A  + +S+     +++  + K    ++R    D
Subjt:  -------------------------ETSNQKVDHRSHTY-----KEILEQRPQHPTSIHMQPPQK--APMIETSAPPLKLDTN-DEWKDVIVVERFSPKD

Query:  NWPSIRETIANLTSR----CSINPFQDNKALIHCYGHDHTLNLCNNSKWTLLGNFRLKFYPLTTISYQQNQKISFYGGWVNIHNLPLNLWTDTVFQYIGE
        +W  I + + + T +        PF  +KAL+     +    LC N  WT +G F +KF   +  ++   + I  YGGW     +PL++W    F  IGE
Subjt:  NWPSIRETIANLTSR----CSINPFQDNKALIHCYGHDHTLNLCNNSKWTLLGNFRLKFYPLTTISYQQNQKISFYGGWVNIHNLPLNLWTDTVFQYIGE

Query:  QCGGFESLSDHTMRRLVITEASIKIKENPTGFIPAAIRLPAALTGDAIV--TAHITGE-IEELNKKENRSNSEFNSENTEALNKEIIEPKISHQITPLRS
          GGF   +  ++ +L +TEA IK+KEN TGF+PA I++      D I+    H  G+ + E N   + S ++  +EN    N    +      +  +  
Subjt:  QCGGFESLSDHTMRRLVITEASIKIKENPTGFIPAAIRLPAALTGDAIV--TAHITGE-IEELNKKENRSNSEFNSENTEALNKEIIEPKISHQITPLRS

Query:  RD-----EKGKKILHD----SPTQPLNPILPKPSTLRIGTKSSTHNL----KIVGSDTEDYLT-----------------SPL-----------------
         D     +KG K +++     PT+ +       S L +  +  + N     KI   D    +T                 SP+                 
Subjt:  RD-----EKGKKILHD----SPTQPLNPILPKPSTLRIGTKSSTHNL----KIVGSDTEDYLT-----------------SPL-----------------

Query:  -SNHSGP--------HLIMPNIMAQTEKPS----------TSKNLEPSMFAESTMDQSSQLEPTDHE----INPLPLLTMGPTFNPTNQPSPVPNQCQLS
         S HS P         +    + A++++P+           S+++E      S      +++P + E    +  + LL+    F+    PS +P+    S
Subjt:  -SNHSGP--------HLIMPNIMAQTEKPS----------TSKNLEPSMFAESTMDQSSQLEPTDHE----INPLPLLTMGPTFNPTNQPSPVPNQCQLS

Query:  PT---VAVSPVNDLLCKAHPHGESSSNFLNLPLSI--------RQIAPILIEHGLCIMA-IPPPKKKVGAAEGILILWSDPDFTI----KETIQGLFSLS
        PT   +    +  ++  AH   E      N+            R++   L E+ L + A        V     I IL   P+  +    ++ I G FS+S
Subjt:  PT---VAVSPVNDLLCKAHPHGESSSNFLNLPLSI--------RQIAPILIEHGLCIMA-IPPPKKKVGAAEGILILWSDPDFTI----KETIQGLFSLS

Query:  IHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTSMNSFNQWIATYQLIDIPLQN
        I +   +G  +WLSAIYGP +   R  FW+EL +L  +    WILGGDFNV RW  E S+  P + SM  FN +I+   LID PL N
Subjt:  IHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTSMNSFNQWIATYQLIDIPLQN

A0A6J1E2G6 uncharacterized protein LOC1110254052.0e-4333.83Show/hide
Query:  GAAEGILILWSDPDFTIKETIQGLFSLSIHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTSMNSFN
        G A GILILW+DPD    E I+G+FSL+I+  ++DGF FW+S IYGP+ ++    FWQEL DL+ L  + WIL GDFNVTRWSWEKS+ +P+T SM  FN
Subjt:  GAAEGILILWSDPDFTIKETIQGLFSLSIHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTSMNSFN

Query:  QWIATYQLIDIPLQN--------------DCDSLSS----------------------------------------------------------------
         +I    LID+PL N              DC  L++                                                                
Subjt:  QWIATYQLIDIPLQN--------------DCDSLSS----------------------------------------------------------------

Query:  ------------------------------------------------------DQMSQRTHLREQIEDITAREHTYWRQRCKLNWLKEGDENTKFFHRF
                                                              DQ   R   +E +  + A+E  +WRQRCK  WL EGDENTKFFHRF
Subjt:  ------------------------------------------------------DQMSQRTHLREQIEDITAREHTYWRQRCKLNWLKEGDENTKFFHRF

Query:  MAARKRKNSISELLSRNGVGLLSAKDIEEEFIDF
        +A ++R++ I+E+LS+ G+GL   KDIEEEFIDF
Subjt:  MAARKRKNSISELLSRNGVGLLSAKDIEEEFIDF

A5BXE3 Uncharacterized protein2.2e-3434.16Show/hide
Query:  GAAEGILILWSDPDFTIKETIQGLFSLSIHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTSMNSFN
        GA+ GILI+W   +   +E +   F +S+   + D    W+SAIYGP    +R +FW EL D+ GL    W +G DFNV R S EK     +T+SM  F+
Subjt:  GAAEGILILWSDPDFTIKETIQGLFSLSIHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTSMNSFN

Query:  QWIATYQLIDIPLQN-----------DCDSLSSDQMSQRTHLREQI------EDITAREHTYWRQRCKLNWLKEGDENTKFFHRFMAARKRKNSISELLS
         +I   +L D PL+N               L+ + +SQR   + ++      E++  RE  YWRQ+ ++ W+KEGD N+KF+H+    R+ +  I EL +
Subjt:  QWIATYQLIDIPLQN-----------DCDSLSSDQMSQRTHLREQI------EDITAREHTYWRQRCKLNWLKEGDENTKFFHRFMAARKRKNSISELLS

Query:  RNGVGLLSAKDIEEEFIDFYKKLYTKDSSSRFLPTNLEWGRIN
          G+ L +AK I EE + +++KLY   +   +    L+W  I+
Subjt:  RNGVGLLSAKDIEEEFIDFYKKLYTKDSSSRFLPTNLEWGRIN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.5e-0629.67Show/hide
Query:  SDQMSQRTHL-REQIEDITAREHTYWRQRCKLNWLKEGDENTKFFHRFMAARKRKNSISELLSRNGVGLLSAKDIEEEFIDFYKKLYTKDS
        SD + +  H+ R++     A   +++RQ+ ++ WL++GD NT+FFH+ + A + KN I  L   + V + +   ++E  + +Y  L   DS
Subjt:  SDQMSQRTHL-REQIEDITAREHTYWRQRCKLNWLKEGDENTKFFHRFMAARKRKNSISELLSRNGVGLLSAKDIEEEFIDFYKKLYTKDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGTCTCCGGCGGCCTCATTACCCTGGAACCACTCCACCCGCTCCATCACCATTGATCGAAAGACTTTTTCCATAGCCTTTGATGAACTTTCTCGAGGAAGCTGTGC
TAAAATCACAGAACGAAGTAGGAACTCTACTCACTCATTGTCTCTTTCTTGGAAATCCCTCAATTGGCTTGCCTCCTCCTTCCATTCACTGACCAAGGAACCTTGCTCGT
ATAAGTTTTTCTCAGAATTTCGAGGAGATGGATATGTTCTTTGTCTTGAAAAGCTCAGAAACAAACATGGTTTCTTTGAAACCAGCAACCAAAAGGTGGACCACCGCTCT
CATACTTACAAAGAAATTCTCGAACAGCGACCTCAGCATCCTACCTCAATTCATATGCAGCCACCGCAAAAAGCCCCCATGATTGAGACAAGCGCGCCGCCACTTAAGCT
TGACACGAATGATGAATGGAAGGATGTGATCGTTGTCGAAAGGTTCTCGCCAAAGGACAATTGGCCTAGTATCCGAGAGACTATTGCCAATCTCACTTCCCGCTGCTCTA
TTAACCCTTTCCAAGACAACAAGGCTCTGATACATTGTTACGGTCACGATCATACCCTCAACCTCTGCAATAATTCGAAATGGACTTTGCTTGGTAATTTCCGCCTGAAA
TTTTACCCATTGACCACAATCTCATATCAACAAAACCAGAAGATAAGTTTCTATGGAGGATGGGTTAATATTCACAATCTTCCCCTCAACCTATGGACTGATACTGTCTT
TCAGTATATCGGGGAACAATGTGGAGGATTTGAATCTTTGTCTGACCACACCATGAGGAGATTAGTTATCACGGAGGCCAGCATTAAAATCAAGGAAAATCCCACTGGTT
TTATCCCTGCCGCCATTCGACTTCCAGCAGCTCTCACCGGAGACGCAATTGTTACCGCACATATCACGGGGGAAATTGAGGAATTGAATAAGAAAGAGAATCGTTCAAAT
TCTGAATTCAATTCGGAAAATACTGAAGCGCTGAATAAGGAAATTATTGAGCCGAAGATCTCGCACCAAATAACACCCTTGCGGTCAAGAGATGAAAAAGGAAAGAAGAT
TCTTCACGATTCTCCCACTCAGCCGCTTAATCCAATTCTTCCTAAACCCTCCACGCTTAGAATTGGGACAAAAAGCTCCACTCATAATTTGAAGATTGTGGGCTCAGATA
CTGAAGATTATTTAACCAGCCCTCTAAGCAACCATTCTGGACCTCACCTAATAATGCCCAATATCATGGCCCAAACCGAGAAACCATCTACCTCTAAAAATCTTGAACCT
AGCATGTTTGCAGAATCCACAATGGATCAATCTTCACAGCTGGAACCCACAGACCATGAAATAAATCCTCTACCTCTTCTCACCATGGGCCCCACTTTCAACCCCACAAA
TCAACCCTCCCCTGTCCCAAACCAATGCCAATTATCCCCAACAGTGGCAGTCTCTCCTGTAAATGATTTATTGTGTAAGGCTCATCCACATGGCGAAAGTAGCTCCAATT
TCCTCAATTTGCCGCTCTCGATAAGGCAAATAGCTCCGATTCTCATTGAACATGGTTTATGCATTATGGCTATTCCTCCTCCAAAAAAGAAAGTGGGAGCAGCTGAAGGT
ATCCTAATTCTTTGGAGTGACCCGGATTTCACTATCAAAGAAACAATTCAAGGTTTGTTTTCTCTATCAATACATATTGTTATGGCTGATGGCTTTGATTTTTGGTTGTC
GGCTATCTATGGTCCTACCAGGAGTGATATGCGGGATGAATTCTGGCAAGAATTACATGACTTAGCAGGCCTGGGAAGAGATAGATGGATTCTTGGAGGCGACTTCAATG
TCACTCGTTGGTCTTGGGAGAAATCTAGCGATCAACCAGTCACTACAAGCATGAATTCTTTTAATCAATGGATTGCAACCTACCAGTTGATTGATATTCCTCTTCAGAAT
GATTGTGATTCTTTATCTTCTGACCAAATGTCTCAAAGGACTCATCTTAGAGAACAAATTGAGGATATTACTGCTCGAGAACATACATACTGGAGACAAAGATGTAAGCT
TAATTGGTTGAAAGAAGGTGATGAGAATACAAAATTTTTTCATCGTTTTATGGCTGCTCGTAAAAGGAAAAATTCAATTTCTGAGTTGTTGTCTCGTAATGGGGTGGGTC
TTTTGTCAGCTAAAGATATTGAGGAGGAGTTCATTGATTTCTATAAGAAGCTATATACGAAAGATTCATCTTCACGATTTTTACCTACCAATCTTGAATGGGGTAGGATT
AATGCCTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGTCTCCGGCGGCCTCATTACCCTGGAACCACTCCACCCGCTCCATCACCATTGATCGAAAGACTTTTTCCATAGCCTTTGATGAACTTTCTCGAGGAAGCTGTGC
TAAAATCACAGAACGAAGTAGGAACTCTACTCACTCATTGTCTCTTTCTTGGAAATCCCTCAATTGGCTTGCCTCCTCCTTCCATTCACTGACCAAGGAACCTTGCTCGT
ATAAGTTTTTCTCAGAATTTCGAGGAGATGGATATGTTCTTTGTCTTGAAAAGCTCAGAAACAAACATGGTTTCTTTGAAACCAGCAACCAAAAGGTGGACCACCGCTCT
CATACTTACAAAGAAATTCTCGAACAGCGACCTCAGCATCCTACCTCAATTCATATGCAGCCACCGCAAAAAGCCCCCATGATTGAGACAAGCGCGCCGCCACTTAAGCT
TGACACGAATGATGAATGGAAGGATGTGATCGTTGTCGAAAGGTTCTCGCCAAAGGACAATTGGCCTAGTATCCGAGAGACTATTGCCAATCTCACTTCCCGCTGCTCTA
TTAACCCTTTCCAAGACAACAAGGCTCTGATACATTGTTACGGTCACGATCATACCCTCAACCTCTGCAATAATTCGAAATGGACTTTGCTTGGTAATTTCCGCCTGAAA
TTTTACCCATTGACCACAATCTCATATCAACAAAACCAGAAGATAAGTTTCTATGGAGGATGGGTTAATATTCACAATCTTCCCCTCAACCTATGGACTGATACTGTCTT
TCAGTATATCGGGGAACAATGTGGAGGATTTGAATCTTTGTCTGACCACACCATGAGGAGATTAGTTATCACGGAGGCCAGCATTAAAATCAAGGAAAATCCCACTGGTT
TTATCCCTGCCGCCATTCGACTTCCAGCAGCTCTCACCGGAGACGCAATTGTTACCGCACATATCACGGGGGAAATTGAGGAATTGAATAAGAAAGAGAATCGTTCAAAT
TCTGAATTCAATTCGGAAAATACTGAAGCGCTGAATAAGGAAATTATTGAGCCGAAGATCTCGCACCAAATAACACCCTTGCGGTCAAGAGATGAAAAAGGAAAGAAGAT
TCTTCACGATTCTCCCACTCAGCCGCTTAATCCAATTCTTCCTAAACCCTCCACGCTTAGAATTGGGACAAAAAGCTCCACTCATAATTTGAAGATTGTGGGCTCAGATA
CTGAAGATTATTTAACCAGCCCTCTAAGCAACCATTCTGGACCTCACCTAATAATGCCCAATATCATGGCCCAAACCGAGAAACCATCTACCTCTAAAAATCTTGAACCT
AGCATGTTTGCAGAATCCACAATGGATCAATCTTCACAGCTGGAACCCACAGACCATGAAATAAATCCTCTACCTCTTCTCACCATGGGCCCCACTTTCAACCCCACAAA
TCAACCCTCCCCTGTCCCAAACCAATGCCAATTATCCCCAACAGTGGCAGTCTCTCCTGTAAATGATTTATTGTGTAAGGCTCATCCACATGGCGAAAGTAGCTCCAATT
TCCTCAATTTGCCGCTCTCGATAAGGCAAATAGCTCCGATTCTCATTGAACATGGTTTATGCATTATGGCTATTCCTCCTCCAAAAAAGAAAGTGGGAGCAGCTGAAGGT
ATCCTAATTCTTTGGAGTGACCCGGATTTCACTATCAAAGAAACAATTCAAGGTTTGTTTTCTCTATCAATACATATTGTTATGGCTGATGGCTTTGATTTTTGGTTGTC
GGCTATCTATGGTCCTACCAGGAGTGATATGCGGGATGAATTCTGGCAAGAATTACATGACTTAGCAGGCCTGGGAAGAGATAGATGGATTCTTGGAGGCGACTTCAATG
TCACTCGTTGGTCTTGGGAGAAATCTAGCGATCAACCAGTCACTACAAGCATGAATTCTTTTAATCAATGGATTGCAACCTACCAGTTGATTGATATTCCTCTTCAGAAT
GATTGTGATTCTTTATCTTCTGACCAAATGTCTCAAAGGACTCATCTTAGAGAACAAATTGAGGATATTACTGCTCGAGAACATACATACTGGAGACAAAGATGTAAGCT
TAATTGGTTGAAAGAAGGTGATGAGAATACAAAATTTTTTCATCGTTTTATGGCTGCTCGTAAAAGGAAAAATTCAATTTCTGAGTTGTTGTCTCGTAATGGGGTGGGTC
TTTTGTCAGCTAAAGATATTGAGGAGGAGTTCATTGATTTCTATAAGAAGCTATATACGAAAGATTCATCTTCACGATTTTTACCTACCAATCTTGAATGGGGTAGGATT
AATGCCTCCTAA
Protein sequenceShow/hide protein sequence
MVSPAASLPWNHSTRSITIDRKTFSIAFDELSRGSCAKITERSRNSTHSLSLSWKSLNWLASSFHSLTKEPCSYKFFSEFRGDGYVLCLEKLRNKHGFFETSNQKVDHRS
HTYKEILEQRPQHPTSIHMQPPQKAPMIETSAPPLKLDTNDEWKDVIVVERFSPKDNWPSIRETIANLTSRCSINPFQDNKALIHCYGHDHTLNLCNNSKWTLLGNFRLK
FYPLTTISYQQNQKISFYGGWVNIHNLPLNLWTDTVFQYIGEQCGGFESLSDHTMRRLVITEASIKIKENPTGFIPAAIRLPAALTGDAIVTAHITGEIEELNKKENRSN
SEFNSENTEALNKEIIEPKISHQITPLRSRDEKGKKILHDSPTQPLNPILPKPSTLRIGTKSSTHNLKIVGSDTEDYLTSPLSNHSGPHLIMPNIMAQTEKPSTSKNLEP
SMFAESTMDQSSQLEPTDHEINPLPLLTMGPTFNPTNQPSPVPNQCQLSPTVAVSPVNDLLCKAHPHGESSSNFLNLPLSIRQIAPILIEHGLCIMAIPPPKKKVGAAEG
ILILWSDPDFTIKETIQGLFSLSIHIVMADGFDFWLSAIYGPTRSDMRDEFWQELHDLAGLGRDRWILGGDFNVTRWSWEKSSDQPVTTSMNSFNQWIATYQLIDIPLQN
DCDSLSSDQMSQRTHLREQIEDITAREHTYWRQRCKLNWLKEGDENTKFFHRFMAARKRKNSISELLSRNGVGLLSAKDIEEEFIDFYKKLYTKDSSSRFLPTNLEWGRI
NAS