; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037389 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037389
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr2:5784319..5785972
RNA-Seq ExpressionLag0037389
SyntenyLag0037389
Gene Ontology termsGO:0009987 - cellular process (biological process)
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN66330.1 hypothetical protein VITISV_000598 [Vitis vinifera]9.0e-4838.58Show/hide
Query:  TRGMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPICLTLGKESWGPSPFRFINAWLSHHSFLHT
        T  M+ F+ FI  + L D PL N  +TWS+ + SP    +DRFL S+     F  +    L R TSDH+PI L      WGP+PFRF N WL HH+F  +
Subjt:  TRGMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPICLTLGKESWGPSPFRFINAWLSHHSFLHT

Query:  VDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKWFVE
          SWW+     GW GH F++KL+ +K +L+ WN   FG  KE+K ++  E+A ID  E+   LS      RA  K EL  L   EEI W+Q+ K KW  E
Subjt:  VDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKWFVE

Query:  GDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQRLYSKKANRMPIPDIDDWNP
        GD NS  FH++    R K+ I  + +  G+ +     I  E L ++++LYS         +  DW+P
Subjt:  GDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQRLYSKKANRMPIPDIDDWNP

RVW28221.1 hypothetical protein CK203_083720 [Vitis vinifera]3.1e-4836.28Show/hide
Query:  IDAVGASGGITILWSDPTFKVLEVV------------EAPTR------------GMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDS
        + A GASGGI ILW    FK  E V            +   R             M+ F++FI  + L D PL N  +TWS+ +  P    +DRFL S  
Subjt:  IDAVGASGGITILWSDPTFKVLEVV------------EAPTR------------GMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDS

Query:  IPNKFVSAKARRLDRVTSDHYPICLTLGKESWGPSPFRFINAWLSHHSFLHTVDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLG
          + F  +    L R TSDH PICL      WGP+PFRF N WL H  F      WW+     GW GH F++KLK +K++L+ WNI  FG  +E+K ++ 
Subjt:  IPNKFVSAKARRLDRVTSDHYPICLTLGKESWGPSPFRFINAWLSHHSFLHTVDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLG

Query:  RELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKWFVEGDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQR
         +L  ID  E+   L+      R   + EL  L   EE+ WRQ+ + KW  EGD NS FFHR+    R +  I  ++S  G +L +   I  E ++F+  
Subjt:  RELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKWFVEGDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQR

Query:  LYSKKANRMPIPDIDDW
        LYSK       P+ D W
Subjt:  LYSKKANRMPIPDIDDW

RVW96808.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]9.0e-4837.63Show/hide
Query:  IDAVGASGGITILWSDPTFKVLEVVEAPTRGMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPIC
        + A GASGGI ILW    F+  E V        KFN      A    PL N  +TWS+ +  P    +DRFL S      F  +    L R TSDH PIC
Subjt:  IDAVGASGGITILWSDPTFKVLEVVEAPTRGMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPIC

Query:  LTLGKESWGPSPFRFINAWLSHHSFLHTVDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRA
        L      WGP+PFRF N WL H  F      WW+     GW GH F++KLK +K +L+ WNI  FG  KE+K  +  +L+ ID  E+   L+      R 
Subjt:  LTLGKESWGPSPFRFINAWLSHHSFLHTVDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRA

Query:  EIKAELIFLSANEEIMWRQRCKSKWFVEGDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQRLYSKKANRMPIPDIDDWNP
          + EL  +   EE+ WRQ+ + KW  EGD NS FFHR+    R +  I  ++S  G +L +  +I  E ++F+  LYSK        +  DW P
Subjt:  EIKAELIFLSANEEIMWRQRCKSKWFVEGDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQRLYSKKANRMPIPDIDDWNP

RVX20162.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]9.0e-4838.58Show/hide
Query:  TRGMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPICLTLGKESWGPSPFRFINAWLSHHSFLHT
        T  M+ F+ FI  + L D PL N  +TWS+ + S     +DRFL S+     F  +    L R TSDH+PI L      WGP+PFRF N WL HHSF  +
Subjt:  TRGMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPICLTLGKESWGPSPFRFINAWLSHHSFLHT

Query:  VDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKWFVE
          SWW+     GW GH F++KL+ +K +L+ WN   FG  KE+K  +  E+AIID  E+   LS     +RA  K EL  +   EEI WRQ+ K KW  +
Subjt:  VDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKWFVE

Query:  GDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQRLYSKKANRMPIPDIDDWNP
        GD NS  FH++    R ++ I  + +  G+ L +   I  E L ++++LYS         +  DW+P
Subjt:  GDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQRLYSKKANRMPIPDIDDWNP

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]2.4e-5344.35Show/hide
Query:  TRGMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPICLTLGKESWGPSPFRFINAWLSHHSFLHT
        T+ M  FN FIE ++L D+PL+NG++TWS    + + +LID FL+++   +K     A+R+ R TSDH+PI L  G+ +WG +PFRF N WLSH +F   
Subjt:  TRGMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPICLTLGKESWGPSPFRFINAWLSHHSFLHT

Query:  VDSWWKANPYYGWPGHGFIQKLKGLKTELRSW---NIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKW
        +++WW   P +GWPGHG + KLK LK  ++ W   + +    QKE  TNL   L   D  E   P++    R R + K +L+ + A EE  WRQRCK KW
Subjt:  VDSWWKANPYYGWPGHGFIQKLKGLKTELRSW---NIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKW

Query:  FVEGDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSF
          EGD N+ FFHR +A  RR+S I+EILS  GI L    +IE EF+ F
Subjt:  FVEGDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSF

TrEMBL top hitse value%identityAlignment
A0A438CYG1 Uncharacterized protein1.5e-4836.28Show/hide
Query:  IDAVGASGGITILWSDPTFKVLEVV------------EAPTR------------GMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDS
        + A GASGGI ILW    FK  E V            +   R             M+ F++FI  + L D PL N  +TWS+ +  P    +DRFL S  
Subjt:  IDAVGASGGITILWSDPTFKVLEVV------------EAPTR------------GMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDS

Query:  IPNKFVSAKARRLDRVTSDHYPICLTLGKESWGPSPFRFINAWLSHHSFLHTVDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLG
          + F  +    L R TSDH PICL      WGP+PFRF N WL H  F      WW+     GW GH F++KLK +K++L+ WNI  FG  +E+K ++ 
Subjt:  IPNKFVSAKARRLDRVTSDHYPICLTLGKESWGPSPFRFINAWLSHHSFLHTVDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLG

Query:  RELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKWFVEGDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQR
         +L  ID  E+   L+      R   + EL  L   EE+ WRQ+ + KW  EGD NS FFHR+    R +  I  ++S  G +L +   I  E ++F+  
Subjt:  RELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKWFVEGDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQR

Query:  LYSKKANRMPIPDIDDW
        LYSK       P+ D W
Subjt:  LYSKKANRMPIPDIDDW

A0A438IJB1 Transposon TX1 uncharacterized 149 kDa protein4.3e-4837.63Show/hide
Query:  IDAVGASGGITILWSDPTFKVLEVVEAPTRGMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPIC
        + A GASGGI ILW    F+  E V        KFN      A    PL N  +TWS+ +  P    +DRFL S      F  +    L R TSDH PIC
Subjt:  IDAVGASGGITILWSDPTFKVLEVVEAPTRGMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPIC

Query:  LTLGKESWGPSPFRFINAWLSHHSFLHTVDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRA
        L      WGP+PFRF N WL H  F      WW+     GW GH F++KLK +K +L+ WNI  FG  KE+K  +  +L+ ID  E+   L+      R 
Subjt:  LTLGKESWGPSPFRFINAWLSHHSFLHTVDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRA

Query:  EIKAELIFLSANEEIMWRQRCKSKWFVEGDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQRLYSKKANRMPIPDIDDWNP
          + EL  +   EE+ WRQ+ + KW  EGD NS FFHR+    R +  I  ++S  G +L +  +I  E ++F+  LYSK        +  DW P
Subjt:  EIKAELIFLSANEEIMWRQRCKSKWFVEGDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQRLYSKKANRMPIPDIDDWNP

A0A438KG26 Transposon TX1 uncharacterized 149 kDa protein4.3e-4838.58Show/hide
Query:  TRGMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPICLTLGKESWGPSPFRFINAWLSHHSFLHT
        T  M+ F+ FI  + L D PL N  +TWS+ + S     +DRFL S+     F  +    L R TSDH+PI L      WGP+PFRF N WL HHSF  +
Subjt:  TRGMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPICLTLGKESWGPSPFRFINAWLSHHSFLHT

Query:  VDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKWFVE
          SWW+     GW GH F++KL+ +K +L+ WN   FG  KE+K  +  E+AIID  E+   LS     +RA  K EL  +   EEI WRQ+ K KW  +
Subjt:  VDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKWFVE

Query:  GDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQRLYSKKANRMPIPDIDDWNP
        GD NS  FH++    R ++ I  + +  G+ L +   I  E L ++++LYS         +  DW+P
Subjt:  GDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQRLYSKKANRMPIPDIDDWNP

A0A6J1E2G6 uncharacterized protein LOC1110254051.2e-5344.35Show/hide
Query:  TRGMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPICLTLGKESWGPSPFRFINAWLSHHSFLHT
        T+ M  FN FIE ++L D+PL+NG++TWS    + + +LID FL+++   +K     A+R+ R TSDH+PI L  G+ +WG +PFRF N WLSH +F   
Subjt:  TRGMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPICLTLGKESWGPSPFRFINAWLSHHSFLHT

Query:  VDSWWKANPYYGWPGHGFIQKLKGLKTELRSW---NIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKW
        +++WW   P +GWPGHG + KLK LK  ++ W   + +    QKE  TNL   L   D  E   P++    R R + K +L+ + A EE  WRQRCK KW
Subjt:  VDSWWKANPYYGWPGHGFIQKLKGLKTELRSW---NIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKW

Query:  FVEGDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSF
          EGD N+ FFHR +A  RR+S I+EILS  GI L    +IE EF+ F
Subjt:  FVEGDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSF

A5C3T9 Uncharacterized protein4.3e-4838.58Show/hide
Query:  TRGMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPICLTLGKESWGPSPFRFINAWLSHHSFLHT
        T  M+ F+ FI  + L D PL N  +TWS+ + SP    +DRFL S+     F  +    L R TSDH+PI L      WGP+PFRF N WL HH+F  +
Subjt:  TRGMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPICLTLGKESWGPSPFRFINAWLSHHSFLHT

Query:  VDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKWFVE
          SWW+     GW GH F++KL+ +K +L+ WN   FG  KE+K ++  E+A ID  E+   LS      RA  K EL  L   EEI W+Q+ K KW  E
Subjt:  VDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKWFVE

Query:  GDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQRLYSKKANRMPIPDIDDWNP
        GD NS  FH++    R K+ I  + +  G+ +     I  E L ++++LYS         +  DW+P
Subjt:  GDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQRLYSKKANRMPIPDIDDWNP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein8.7e-1725.4Show/hide
Query:  PTRGMKKFNKFIESAALQDIPLSNGKYTWSSFR-PSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPICLTL-GKESWGPSPFRFINAWLSHHSF
        P RG+++F   +  + L DIP     YTWS+ +  +P +  +DR + +    + F SA A       SDH P  + L          FR+ +   +H +F
Subjt:  PTRGMKKFNKFIESAALQDIPLSNGKYTWSSFR-PSPTMTLIDRFLISDSIPNKFVSAKARRLDRVTSDHYPICLTL-GKESWGPSPFRFINAWLSHHSF

Query:  LHTVDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKW
        L ++   W+     G       + LK  K   +  N Q FG  + K       L  I  +    P S+  FR     + +  F +A  E  +RQ+ + KW
Subjt:  LHTVDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRRAEIKAELIFLSANEEIMWRQRCKSKW

Query:  FVEGDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQRL
          +GD N+ FFH+++ AN+ K+ I  +     + + +  +++   +++Y  L
Subjt:  FVEGDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGATGATCAGATCCCCCTTGTCGAAAATCCCACTCCGTTAAGGATTGAAGACCCGAATAGTAAAAGCTTGCACTTGAGCCACCAAGAGGAAGAGATAGCTTTTGC
TGAAAATTACACGGAAGACATAGAAGAAGATGAATCAGACACAGAGAATGAAGTGTCTGACCCAACAGCTTTTTTGCCCTATCTTTTCCCATGGTTGGCTGAACATGGGA
TGTGTATTATGCCTATCCCAAACAGACAAAAACTGTCATCTGCCGCAAAGAAGAAGAAAAATTGGATCAAGGAATTGGAGAACCTTCAGACTTCTGTTAATTACAACAGA
CCTTCAGCTATTTCCCATATGGGAGGGTCGAGAATTCTCAATGATTATTATATCCTGGAACCAATTGATGCAGTGGGAGCTTCGGGAGGGATCACCATTTTATGGTCGGA
TCCCACATTCAAAGTGCTGGAAGTTGTAGAAGCTCCCACTCGGGGTATGAAAAAATTCAACAAATTCATAGAATCAGCTGCCCTCCAAGATATCCCCCTCTCAAATGGCA
AGTACACATGGTCCAGCTTTCGCCCAAGTCCCACCATGACCCTTATTGATCGATTCTTGATCTCAGATAGCATCCCCAACAAGTTTGTTTCTGCAAAGGCCCGAAGACTT
GATAGAGTCACCTCGGACCACTATCCTATTTGTCTCACTTTGGGTAAAGAATCTTGGGGACCATCTCCTTTTCGCTTTATCAATGCTTGGCTCTCCCATCATTCATTCTT
GCATACAGTAGATTCTTGGTGGAAGGCAAACCCTTATTATGGTTGGCCGGGTCATGGTTTCATTCAGAAATTAAAGGGCTTGAAAACAGAATTAAGAAGCTGGAATATAC
AGATTTTCGGACAACAAAAAGAGAAAAAAACCAATTTGGGTCGGGAACTTGCTATTATTGATAAAAAAGAGGAATGTGCACCTTTATCTGAACAAGACTTCAGAAGAAGA
GCCGAGATTAAAGCAGAATTAATTTTTTTATCAGCTAACGAAGAGATTATGTGGCGCCAAAGATGCAAATCAAAATGGTTTGTCGAGGGAGATGTAAATTCTGCTTTCTT
CCACCGCATTGTTGCTGCTAATAGAAGGAAAAGCTCTATTTCGGAGATCCTATCATCCTCGGGTATTAGCCTTGTTGATGATATAGAAATTGAAAATGAATTTCTCTCCT
TTTATCAAAGGCTTTACTCCAAAAAAGCCAACAGAATGCCTATCCCTGATATTGATGACTGGAACCCATCTCCATGGATCAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGATGATCAGATCCCCCTTGTCGAAAATCCCACTCCGTTAAGGATTGAAGACCCGAATAGTAAAAGCTTGCACTTGAGCCACCAAGAGGAAGAGATAGCTTTTGC
TGAAAATTACACGGAAGACATAGAAGAAGATGAATCAGACACAGAGAATGAAGTGTCTGACCCAACAGCTTTTTTGCCCTATCTTTTCCCATGGTTGGCTGAACATGGGA
TGTGTATTATGCCTATCCCAAACAGACAAAAACTGTCATCTGCCGCAAAGAAGAAGAAAAATTGGATCAAGGAATTGGAGAACCTTCAGACTTCTGTTAATTACAACAGA
CCTTCAGCTATTTCCCATATGGGAGGGTCGAGAATTCTCAATGATTATTATATCCTGGAACCAATTGATGCAGTGGGAGCTTCGGGAGGGATCACCATTTTATGGTCGGA
TCCCACATTCAAAGTGCTGGAAGTTGTAGAAGCTCCCACTCGGGGTATGAAAAAATTCAACAAATTCATAGAATCAGCTGCCCTCCAAGATATCCCCCTCTCAAATGGCA
AGTACACATGGTCCAGCTTTCGCCCAAGTCCCACCATGACCCTTATTGATCGATTCTTGATCTCAGATAGCATCCCCAACAAGTTTGTTTCTGCAAAGGCCCGAAGACTT
GATAGAGTCACCTCGGACCACTATCCTATTTGTCTCACTTTGGGTAAAGAATCTTGGGGACCATCTCCTTTTCGCTTTATCAATGCTTGGCTCTCCCATCATTCATTCTT
GCATACAGTAGATTCTTGGTGGAAGGCAAACCCTTATTATGGTTGGCCGGGTCATGGTTTCATTCAGAAATTAAAGGGCTTGAAAACAGAATTAAGAAGCTGGAATATAC
AGATTTTCGGACAACAAAAAGAGAAAAAAACCAATTTGGGTCGGGAACTTGCTATTATTGATAAAAAAGAGGAATGTGCACCTTTATCTGAACAAGACTTCAGAAGAAGA
GCCGAGATTAAAGCAGAATTAATTTTTTTATCAGCTAACGAAGAGATTATGTGGCGCCAAAGATGCAAATCAAAATGGTTTGTCGAGGGAGATGTAAATTCTGCTTTCTT
CCACCGCATTGTTGCTGCTAATAGAAGGAAAAGCTCTATTTCGGAGATCCTATCATCCTCGGGTATTAGCCTTGTTGATGATATAGAAATTGAAAATGAATTTCTCTCCT
TTTATCAAAGGCTTTACTCCAAAAAAGCCAACAGAATGCCTATCCCTGATATTGATGACTGGAACCCATCTCCATGGATCAAATGA
Protein sequenceShow/hide protein sequence
MQDDQIPLVENPTPLRIEDPNSKSLHLSHQEEEIAFAENYTEDIEEDESDTENEVSDPTAFLPYLFPWLAEHGMCIMPIPNRQKLSSAAKKKKNWIKELENLQTSVNYNR
PSAISHMGGSRILNDYYILEPIDAVGASGGITILWSDPTFKVLEVVEAPTRGMKKFNKFIESAALQDIPLSNGKYTWSSFRPSPTMTLIDRFLISDSIPNKFVSAKARRL
DRVTSDHYPICLTLGKESWGPSPFRFINAWLSHHSFLHTVDSWWKANPYYGWPGHGFIQKLKGLKTELRSWNIQIFGQQKEKKTNLGRELAIIDKKEECAPLSEQDFRRR
AEIKAELIFLSANEEIMWRQRCKSKWFVEGDVNSAFFHRIVAANRRKSSISEILSSSGISLVDDIEIENEFLSFYQRLYSKKANRMPIPDIDDWNPSPWIK