; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0010177 (gene) of Chayote v1 genome

Gene IDSed0010177
OrganismSechium edule (Chayote v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationLG07:13755290..13757111
RNA-Seq ExpressionSed0010177
SyntenySed0010177
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia]3.9e-5147.53Show/hide
Query:  SSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGCKTSKD
        S I LLSNI NL+SI+LDSTNY+LW++Q+++LLKAHKL+G+IDG  P P    S           YD WF KDQ L+T++NATLS   L++ +G  TSK 
Subjt:  SSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGCKTSKD

Query:  LWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHMLLKTEE
        +W  + K YSS++R N++NLK +LQ+ISKK  ESID YI+ I  I  +LA V   ++ ED++IY +NGLP+ YN F+TS+RTR+T V+F+ELH+LLK EE
Subjt:  LWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHMLLKTEE

Query:  TAIDLHTKKED-----------SASLQVLAMNNSRGGWRGSNRGRGRNSG-------NRGGRS
        +A+   +K++D           S SL   A   +    RG  RGRG   G        RGG S
Subjt:  TAIDLHTKKED-----------SASLQVLAMNNSRGGWRGSNRGRGRNSG-------NRGGRS

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma]3.9e-5147.53Show/hide
Query:  SSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGCKTSKD
        S I LLSNI NL+SI+LDSTNY+LW++Q+++LLKAHKL+G+IDG  P P    S           YD WF KDQ L+T++NATLS   L++ +G  TSK 
Subjt:  SSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGCKTSKD

Query:  LWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHMLLKTEE
        +W  + K YSS++R N++NLK +LQ+ISKK  ESID YI+ I  I  +LA V   ++ ED++IY +NGLP+ YN F+TS+RTR+T V+F+ELH+LLK EE
Subjt:  LWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHMLLKTEE

Query:  TAIDLHTKKED-----------SASLQVLAMNNSRGGWRGSNRGRGRNSG-------NRGGRS
        +A+   +K++D           S SL   A   +    RG  RGRG   G        RGG S
Subjt:  TAIDLHTKKED-----------SASLQVLAMNNSRGGWRGSNRGRGRNSG-------NRGGRS

TYK17989.1 uncharacterized protein E5676_scaffold306G002980 [Cucumis melo var. makuwa]2.3e-5148.09Show/hide
Query:  NSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDST--------------EYDLWFEKDQTLITLLNATLS
        +S ++LL+NI NL+SIRLDSTNY LW++Q   +LKAHKLYG+ID  IP P +  S    +S+ +T               Y+ W  KDQ  + L+NATLS
Subjt:  NSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDST--------------EYDLWFEKDQTLITLLNATLS

Query:  QTTLSFAIGCKTSKDLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRAT
           L++ +GCK+S  +W+T+ ++YSS TR NI+NLK +LQ ISKKP E I +YI+ I  +  +LA     ++ ED+VIY +NGLP  YN F+TS++TR+ 
Subjt:  QTTLSFAIGCKTSKDLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRAT

Query:  VVSFDELHMLLKTEETAIDLHTKKEDSASLQVLAM
         VSF ELH+LLK+EE+A++  TK+ED  S+Q  AM
Subjt:  VVSFDELHMLLKTEETAIDLHTKKEDSASLQVLAM

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]2.6e-5044.57Show/hide
Query:  SNSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESST---DSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGC
        S S I+LLSNI NL+S+RLDSTN++LW++Q++++LKAHKLYG+IDG  P P +  ++   S+     +  Y+ W  KDQ L+T++NATLS   L++ +G 
Subjt:  SNSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESST---DSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGC

Query:  KTSKDLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHML
         +SK +W+ + K YSS +R N++NLK +LQ+I KKP ESID YI+ I  I  +LA V   I++ED++IY +NGLP+ YN F+TS+RTR+  V+F+ELH+L
Subjt:  KTSKDLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHML

Query:  LKTEETAIDLHTKKEDSASLQVLAMNNS---------------RGGWRGSNRGRGRNS
        L+ EE+A+   +K +DS +   + +++S               RG   G + G GR S
Subjt:  LKTEETAIDLHTKKEDSASLQVLAMNNS---------------RGGWRGSNRGRGRNS

XP_022158689.1 uncharacterized protein LOC111025150 [Momordica charantia]6.1e-5250.95Show/hide
Query:  NSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTD-----STEYDLWFEKDQTLITLLNATLSQTTLSFAIG
        +S I+LLSNI NLVS+RLDS+N++LW++Q++++LKAHKLYG+IDG  P+P +   +  + S+      +  +  W  KD  L+TLLNA LS + L++ +G
Subjt:  NSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTD-----STEYDLWFEKDQTLITLLNATLSQTTLSFAIG

Query:  CKTSKDLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHM
        C +S+ +W+T+ K+YSS++R N++NLK +LQSISKKPG SID Y+Q I  +  +LA V V +D ED++IYT+N LP  +N F+TS+RTR+  VSF+ELH+
Subjt:  CKTSKDLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHM

Query:  LLKTEETAID
        LL +EE AID
Subjt:  LLKTEETAID

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X21.2e-5044.57Show/hide
Query:  SNSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESST---DSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGC
        S S I+LLSNI NL+S+RLDSTN++LW++Q++++LKAHKLYG+IDG  P P +  ++   S+     +  Y+ W  KDQ L+T++NATLS   L++ +G 
Subjt:  SNSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESST---DSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGC

Query:  KTSKDLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHML
         +SK +W+ + K YSS +R N++NLK +LQ+I KKP ESID YI+ I  I  +LA V   I++ED++IY +NGLP+ YN F+TS+RTR+  V+F+ELH+L
Subjt:  KTSKDLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHML

Query:  LKTEETAIDLHTKKEDSASLQVLAMNNS---------------RGGWRGSNRGRGRNS
        L+ EE+A+   +K +DS +   + +++S               RG   G + G GR S
Subjt:  LKTEETAIDLHTKKEDSASLQVLAMNNS---------------RGGWRGSNRGRGRNS

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X11.2e-5044.57Show/hide
Query:  SNSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESST---DSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGC
        S S I+LLSNI NL+S+RLDSTN++LW++Q++++LKAHKLYG+IDG  P P +  ++   S+     +  Y+ W  KDQ L+T++NATLS   L++ +G 
Subjt:  SNSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESST---DSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGC

Query:  KTSKDLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHML
         +SK +W+ + K YSS +R N++NLK +LQ+I KKP ESID YI+ I  I  +LA V   I++ED++IY +NGLP+ YN F+TS+RTR+  V+F+ELH+L
Subjt:  KTSKDLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHML

Query:  LKTEETAIDLHTKKEDSASLQVLAMNNS---------------RGGWRGSNRGRGRNS
        L+ EE+A+   +K +DS +   + +++S               RG   G + G GR S
Subjt:  LKTEETAIDLHTKKEDSASLQVLAMNNS---------------RGGWRGSNRGRGRNS

A0A5D3CLI6 T4.51.2e-5044.57Show/hide
Query:  SNSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESST---DSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGC
        S S I+LLSNI NL+S+RLDSTN++LW++Q++++LKAHKLYG+IDG  P P +  ++   S+     +  Y+ W  KDQ L+T++NATLS   L++ +G 
Subjt:  SNSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESST---DSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGC

Query:  KTSKDLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHML
         +SK +W+ + K YSS +R N++NLK +LQ+I KKP ESID YI+ I  I  +LA V   I++ED++IY +NGLP+ YN F+TS+RTR+  V+F+ELH+L
Subjt:  KTSKDLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHML

Query:  LKTEETAIDLHTKKEDSASLQVLAMNNS---------------RGGWRGSNRGRGRNS
        L+ EE+A+   +K +DS +   + +++S               RG   G + G GR S
Subjt:  LKTEETAIDLHTKKEDSASLQVLAMNNS---------------RGGWRGSNRGRGRNS

A0A5D3D3T6 Retrotran_gag_3 domain-containing protein1.1e-5148.09Show/hide
Query:  NSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDST--------------EYDLWFEKDQTLITLLNATLS
        +S ++LL+NI NL+SIRLDSTNY LW++Q   +LKAHKLYG+ID  IP P +  S    +S+ +T               Y+ W  KDQ  + L+NATLS
Subjt:  NSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDST--------------EYDLWFEKDQTLITLLNATLS

Query:  QTTLSFAIGCKTSKDLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRAT
           L++ +GCK+S  +W+T+ ++YSS TR NI+NLK +LQ ISKKP E I +YI+ I  +  +LA     ++ ED+VIY +NGLP  YN F+TS++TR+ 
Subjt:  QTTLSFAIGCKTSKDLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRAT

Query:  VVSFDELHMLLKTEETAIDLHTKKEDSASLQVLAM
         VSF ELH+LLK+EE+A++  TK+ED  S+Q  AM
Subjt:  VVSFDELHMLLKTEETAIDLHTKKEDSASLQVLAM

A0A6J1E049 uncharacterized protein LOC1110251502.9e-5250.95Show/hide
Query:  NSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTD-----STEYDLWFEKDQTLITLLNATLSQTTLSFAIG
        +S I+LLSNI NLVS+RLDS+N++LW++Q++++LKAHKLYG+IDG  P+P +   +  + S+      +  +  W  KD  L+TLLNA LS + L++ +G
Subjt:  NSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTD-----STEYDLWFEKDQTLITLLNATLSQTTLSFAIG

Query:  CKTSKDLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHM
        C +S+ +W+T+ K+YSS++R N++NLK +LQSISKKPG SID Y+Q I  +  +LA V V +D ED++IYT+N LP  +N F+TS+RTR+  VSF+ELH+
Subjt:  CKTSKDLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHM

Query:  LLKTEETAID
        LL +EE AID
Subjt:  LLKTEETAID

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.9e-0624.55Show/hide
Query:  WRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGCKTSKDLWETMRKYYSSTTRMNIINLKLEL
        W+ ++  LL    L+  +D    +P            D+ + + W + D+   + +   LS   ++  I   T++ +W  +   Y S T  N + LK +L
Subjt:  WRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGCKTSKDLWETMRKYYSSTTRMNIINLKLEL

Query:  QSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVY-NIFKTSLRTRATVVSFDELHMLLKTEETAIDLHTKKEDSASLQVLAMNNS
         ++    G +   ++ + + +I +LA + VKI++ED  I  +N LPS Y N+  T L  + T+   D    LL  E+       KK ++    ++     
Subjt:  QSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVY-NIFKTSLRTRATVVSFDELHMLLKTEETAIDLHTKKEDSASLQVLAMNNS

Query:  RGGWRGSNR-GRGRNSGNRGGRSQ
        R   R SN  GR    G    RS+
Subjt:  RGGWRGSNR-GRGRNSGNRGGRSQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.7e-1024.08Show/hide
Query:  NSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGCKTSK
        N++  L  N+ N+   +L STNY++W  Q+ +L   ++L G++DG    P     TD      + +Y  W  +D+ + + +   +S +         T+ 
Subjt:  NSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGCKTSK

Query:  DLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHMLLKTE
         +WET+RK Y++ +  ++  L+ +L+  +K   ++ID Y+Q + T   +LA +   +D ++ V   +  LP  Y      +  + T  +  E+H  L   
Subjt:  DLWETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHMLLKTE

Query:  ETAIDLHTKKEDSASLQVLAMNNSRGGWRGSNRGRGRNSGNRGGR
        E+ I         +S  V+ +  +    R +      N+GNR  R
Subjt:  ETAIDLHTKKEDSASLQVLAMNNSRGGWRGSNRGRGRNSGNRGGR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.9e-0922.94Show/hide
Query:  RLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGCKTSKDLWETMRKYYSSTTRM
        +L STNY++W  Q+ +L   ++L G++DG  P P     TD      + +Y  W  +D+ + + +   +S +         T+  +WET+RK Y++ +  
Subjt:  RLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGCKTSKDLWETMRKYYSSTTRM

Query:  NIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELH-MLLKTEETAIDLHTKKEDSAS
        ++  L+                      T   +LA +   +D ++ V   +  LP  Y      +  + T  S  E+H  L+  E   + L++ +    +
Subjt:  NIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELH-MLLKTEETAIDLHTKKEDSAS

Query:  LQVLAMNNSRGGWRGSNRGRGRNSGNRGGRS
          V+   N+      +NRG  RN  N   RS
Subjt:  LQVLAMNNSRGGWRGSNRGRGRNSGNRGGRS

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).3.2e-0621.32Show/hide
Query:  YNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGCKTSKDLWETMRKYY
        +++  +  D  NY+ W+ +  S L+  K +G+IDG +P+P             S  Y  W + +  ++  L  +++   L   +  +T+  +WE +R+ +
Subjt:  YNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGCKTSKDLWETMRKYY

Query:  SSTTRMNIINLKLELQSISKKPGESIDVYIQIISTI
             + I  L+  L ++ ++ G+S++ Y   +S +
Subjt:  SSTTRMNIINLKLELQSISKKPGESIDVYIQIISTI

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)7.8e-0525.56Show/hide
Query:  IYLLSNIYNLVSIRLD--STNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDSTEYDLWFEKDQTLITLLNATLS--QTTLSFAIGCKTS
        IY +SNI + + + LD   +NY  WR    +   +  + G+IDG +              T++ + + W ++D  +   L  TL+  Q   SF +   TS
Subjt:  IYLLSNIYNLVSIRLD--STNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDSTEYDLWFEKDQTLITLLNATLS--QTTLSFAIGCKTS

Query:  KDLWETMRKYYSSTTRMNIINLKLELQSISKKPGE-SIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHMLLK
        +D+W  ++  + +      + L  EL+  +K  G+  +  Y + +  +   L  VDV +   ++V+Y +NGL   ++     ++ R    SFD+   +L+
Subjt:  KDLWETMRKYYSSTTRMNIINLKLELQSISKKPGE-SIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHMLLK

Query:  TEETAIDLHTKK-----EDSASLQVLAMNNS--------RGGWRGSNRGRGRNSG---NRGGRSQY
         EE  +    K      + S+S  VLA + +         GG +   RGRGR +     RGGR  Y
Subjt:  TEETAIDLHTKK-----EDSASLQVLAMNNS--------RGGWRGSNRGRGRNSG---NRGGRSQY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAATATCTCAAATTTCTCTAATTCTTCGATCTACTTGCTATCAAACATTTACAATCTTGTGTCTATAAGGTTAGATTCGACAAACTATATACTCTGGCGGTATCA
AATTTCTTCTTTACTCAAAGCCCATAAGCTATATGGGTATATTGACGGGAAAATCCCAGAACCGAAGCAAGAAACTTCAACGGATGGTGAATCATCTACAGATTCAACCG
AATATGATCTTTGGTTTGAGAAAGATCAAACTCTTATCACTCTTCTGAATGCTACGCTATCTCAAACAACCCTATCGTTTGCAATCGGCTGCAAGACTTCGAAAGATCTA
TGGGAAACTATGAGAAAATACTATTCTTCTACAACAAGAATGAACATTATCAATTTGAAGTTAGAACTCCAGAGTATATCCAAGAAACCAGGTGAGTCGATTGATGTCTA
TATTCAAATAATCTCAACTATCATTCATCGATTGGCTGCAGTTGATGTTAAGATTGATCAAGAAGATGTTGTTATTTACACCGTAAATGGTCTCCCTTCTGTTTACAATA
TCTTCAAAACATCACTGAGAACACGAGCGACTGTGGTATCTTTTGACGAACTGCACATGCTATTAAAAACAGAGGAAACTGCAATTGATTTACATACCAAAAAAGAAGAT
TCTGCATCTCTTCAGGTTCTTGCAATGAATAACAGCCGAGGAGGTTGGCGTGGTTCGAACCGCGGACGAGGACGCAATAGTGGTAATCGTGGAGGTCGCAGTCAATATTA
A
mRNA sequenceShow/hide mRNA sequence
GACTGACCTAAAATCACGCTTTCTATTTTCCTGCTTTACCATTTCTCACGCCAAATCCCTAATGTTCTCATTCTCTCTCCTCTCTCTTCCAAATCCCTAATTTCTGCGAA
CCAAGCTCTCTGCAAATCCATGGCGAATATCTCAAATTTCTCTAATTCTTCGATCTACTTGCTATCAAACATTTACAATCTTGTGTCTATAAGGTTAGATTCGACAAACT
ATATACTCTGGCGGTATCAAATTTCTTCTTTACTCAAAGCCCATAAGCTATATGGGTATATTGACGGGAAAATCCCAGAACCGAAGCAAGAAACTTCAACGGATGGTGAA
TCATCTACAGATTCAACCGAATATGATCTTTGGTTTGAGAAAGATCAAACTCTTATCACTCTTCTGAATGCTACGCTATCTCAAACAACCCTATCGTTTGCAATCGGCTG
CAAGACTTCGAAAGATCTATGGGAAACTATGAGAAAATACTATTCTTCTACAACAAGAATGAACATTATCAATTTGAAGTTAGAACTCCAGAGTATATCCAAGAAACCAG
GTGAGTCGATTGATGTCTATATTCAAATAATCTCAACTATCATTCATCGATTGGCTGCAGTTGATGTTAAGATTGATCAAGAAGATGTTGTTATTTACACCGTAAATGGT
CTCCCTTCTGTTTACAATATCTTCAAAACATCACTGAGAACACGAGCGACTGTGGTATCTTTTGACGAACTGCACATGCTATTAAAAACAGAGGAAACTGCAATTGATTT
ACATACCAAAAAAGAAGATTCTGCATCTCTTCAGGTTCTTGCAATGAATAACAGCCGAGGAGGTTGGCGTGGTTCGAACCGCGGACGAGGACGCAATAGTGGTAATCGTG
GAGGTCGCAGTCAATATTAAGTCAATCAAGATCCAAATATGAATCAGAATCATACCTATTATCAGAACCAAAATCAACCTCAGTATCGAAATCAAAATCAGAACTATTAT
CAACCTGGTAATACTTCACTCATGAATTCCAATTTTCCAATCTCTGATTCAGATTCATCTCCACAAATATTACCTTGTCAAATATGTGGGAAACTTGGACACGGTGCGTT
AGATTGCTACAATAGAATGAACTTTACTTATCAAGGCAAAATTCCTCCATCCAAATTGGCTGCAATGGCTGCTTTGTCCTCCAATCAAGGTACAACTCCATCTTCCTCTC
AAGATAGTGGAATATGGCTTTCAGACTTTGGCTGTAATGCACACCTCTCAAGCGATCTCAACAATTTTCGACACTCTACTCCGTATATGGGTGAGGACAACATTCAAGTA
GGAAATGGTGAATCATTAAACATTACACATCAAGGAAATGGACAAAGAACACTTAGGACATTATACTATGGACCTAGCAAAAATGGTCTCTATCCAATGTATTCCAATGA
TATTAAATCCTTTGAAGCACACGCCGGAATTAAAGAGACATCTACTACTTGGCATGATAGGCTTGGACATCCGAATGATGTTATTCTTAGGAAGGTTTTTAGTACCTTCG
AGTCTCCTGTTAAGAATAATTCTACATTTTGCAAGGATTATATAGCTGGGAAAATGATAAAACAACCTTTTTCTAATTCAGTTTCCTTTACTACTATGCCTTTAGAGTTA
CTACATAGTGATGTATGGGGTCCGGC
Protein sequenceShow/hide protein sequence
MANISNFSNSSIYLLSNIYNLVSIRLDSTNYILWRYQISSLLKAHKLYGYIDGKIPEPKQETSTDGESSTDSTEYDLWFEKDQTLITLLNATLSQTTLSFAIGCKTSKDL
WETMRKYYSSTTRMNIINLKLELQSISKKPGESIDVYIQIISTIIHRLAAVDVKIDQEDVVIYTVNGLPSVYNIFKTSLRTRATVVSFDELHMLLKTEETAIDLHTKKED
SASLQVLAMNNSRGGWRGSNRGRGRNSGNRGGRSQY