Protein

MCA_03559_1

Length
2,571 amino acids


Browser: contigC:329195-336911+

RNA-seq: read pairs 6258, FPKM 30.1, percentile rank 52.8% (100% = highest expression)

Protein function

EGGNOG:0Q0U7    Inherit from euNOG: The circumsporozoite protein is the immunodominant surface antigen on the sporozoite (the infective stage of the malaria parasite that is transmitted from the mosquito to the vertebrate host)

Protein alignments

                    %id     Aln length  E-value
UniRef50_A8DVU1     52.63%  152         1e-22    Predicted protein (Fragment) n=1 Tax=Nematostella vectensis TaxID=45351 RepID=A8DVU1_NEMVE
A0A060TDS9_BLAAD    28.63%  262         1e-13    ARAD1D46508p OS=Blastobotrys adeninivorans GN=GNLVRS02_ARAD1D46508g PE=4 SV=1
A0A0J9X2E1_GEOCN    52.17%  69          2e-13    Similar to Yarrowia lipolytica YALI0E18722p [Yarrowia lipolytica CLIB122], partial (Partial) (Fragment) OS=Geotrichum candidum GN=BN980_GECA01s00169g PE=4 SV=1
Q6CE66_YARLI        51.90%  158         6e-12    YALI0B18194p OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=YALI0_B18194g PE=4 SV=2

Mitochondrial localization by MitoProt II

Probability of mitochondrial location: 0.0011

Protein family membership

None predicted.

Domains and repeats

None predicted.

Detailed signature matches

Unintegrated signatures (no IPR)
  1. TRANSMEMBRANE (Tran...)
  2. mobidb-lite (disord...)

Protein sequence

>MCA_03559_1
MKLLSLISAQVLLGAAYGALIPTEKQLSAKSPAEIKLQQIENCEKPDLETCIDACYNFNSRMIMAEESSDVCVVLDDNLD
SLYSCIVCGSHYSELNPVSLQILDYIDKCQPNIPVKNCGKLHEKRSMELIARQQTCTGSTGNCLGDCVTLVPRLLLGLLS
FNFCGIVAGALDSITDCLLCGGLSGVSDALVNGLIGCGVNMDPCPESSSTGLPAECETNAPSFGTCFAQCSALIPRILGA
VGSLNVCNIVAESTDSLTQCLLCGALFPLVNSLITGVVLETLVGCDFDTAITCDENAAPTSIEPAFRTVVPESCNTGGLD
SGSCLATCGALIPRILLAVGSVQICEIVSESTDALSQCLLCGALFPAVNELITGVVLERLVGCNFDTEIKCEAISTSATD
SASVTASDSLSVSSASDSLSVSSASDSAPNAPSSIPPECQTNAPSFGTCFAQCSALIPRILGAVGSLNVCNIVAESTDSL
TQCLLCGALFPLVNNLITGVVLETLVGCDFDTAITCNDAPIPSSEPAPAFPTGNIPESCNTDGLDSGSCLASCGALIPRI
LTAVGSVQICEIVSESTDALSQCLLCGALFPAVNQLITGVVLERLVGCNFDTEIKCETISTSATDSASVTASDSLSVSSA
SDSTPNVPSSIPPECQTNAPSFGTCFAQCSALIPRILGAVGSLNVCNIVAESTDSLTQCLLCGALFPLVNNLITGVVLET
LVGCDFDTAITCNDAPIPSSEPAPAFPTGNVPESCNTDGLDSGSCLASCGALIPRILTAVGSVQICEIVSESTDALSQCL
LCGALFPAVNQLITGVVLERLVGCNFDTEIKCETVSTTTTDSASITGSDSESASSAPASASNTDSDSNSASTSNSDDQLS
SNSDASATSTEAGTPSSSSDDGNIPPTAVIPNPCNTASLQEYWCIEKCYSVMTDVDNSATCAELTPIAEDISTCLICASV
FLTVEQQLQDSFRPKLQECGMGIMTSLCPSTTSTPTTVGPDPCQTETLEEYWCIQKCYDIMIFVEDNTVCTDMSTRAEDI
STCLICANTFPAVEQQLQDNFRPLIGQCGLGIIENQCSSSSITSDGSTSGSEEASTNTDASSTDGAGASSTDDSGASSTD
GAGASSTDDAGASSTDDAGASSTDGAGASSTDDSGASSTDGAGASSTDDSGASSTDDSGASSTDGAGASSTDDAGASSTD
DSGASSTDDSGASSTDDAGASSTDDSGASSTDGAGASSTDDSGASSTDDSGASTTDGAGASSTDGAGVSSTDGAGASSTD
DSGASSTDGAGASSSDDSGASSTDGAGASSTDGGGASSTDDSGASSTDGAGASSTDDSGASSTDSAGASSTDDAGASSTD
DSGASSTDDAGASSTDDSGASSTDGAGASSTDNSGASSTDGAGASSTHDSGASSTDGAGASSTDDSGASSTDDSGASSTD
GAGVSLTTDTGILPTSVPSECPLPNPGLVTCITDCTLLIPRILAAVGSLNICNIVAESTDSLTKCLLCGVLFEPVNALIS
GLLLETLVGCDLPIDPRCAPPLPSNTLPASTTGSLTSATEESSTDDAGASSTDDSGASSTDGAGASSTDDSGASTTDGAG
ASSTDGAGASSTDDSGASTTDGAGASSTDGAGVSSTDGAGASSTDDSGASSTDDAGASSTDDSGASSTDDSGASSTDDAG
ASSTDDSGASSTDGAGASSTDDSGASSTDGAGASSTDDAGASSTDDSGASSTDGAGASSTDDSGASSTDDAGASSTDDSG
ASSTDDSGASSTDDSGASTTDGAGASSTDDSGASTTDGAGASSSDGGQVGPTGRSTSDGASSTDGAGASSTDDSGASSTD
GAGVSSTDGAGASSSDGGQVGPTGRSTSDGASSTDGAGASSTDGAGASSTDDSGASSTDGAGVSSTDGAGASSTDGAGAS
STDDSGASSTDGAGVSSTDSAGASSSDGGQVGPTGRSTSDGASSTDGAGASSTDDSGASSTDGAGASSTDGAGVSSSDGG
QVGPTGRSTSDGAGASSTDDSGASSTDDSGASSTDDSGASSTDDSGASSTDGAGASSTDGAGPSSSDGGQVGPTGRSTSD
GASSTDGAGASSTDGGQVGPIGPSTTDGFGLSSTGGVDNSVSHSTTATLTTDTFVPPTFSYIFTNTSASISHDSTYSSIP
QFTSLPSSDGGSLGPAPASSTDGGSLGPAPASSTDGGAVGPAPASSTDGGAVGPAPASSTDGGAVGPAPASSTDGGAVGP
APAPSTDGGAVGPAPAPSTDGGSLGPAPASSTDGGAVGPAPASSTDGGAVGPAPASSTDGGAVGPAPASSTDGGSLGPAP
ASSTDGGAVGPAPTSGPIDNDNNNGGSSPGQGSGDNGDGSDSNGGVAPGQGSGNNGSGSGSDNNGGVAPGQGSGNNGSGS
GSDNNGGAAPGQGSGNNGDGSGSGSDNNGGLAPGQGSGNNGSGSGSDNNGGAAPGQGSGNNGDGSGSGSDNNGGLAPGQG
SGNNGSGSGSDNNGGAAPGQGSGNNGDGSGSGDSNSPGTSSGSDNDNSLGNPGQVGETPPIAQTNAAPSKSSNVALALGF
ALSTCFFLVIA
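
The FASTA record above can be cross-checked against the reported Length field. The following is a minimal sketch, not part of the source database: the helper names `parse_fasta` and `check_length` and the toy record are hypothetical, and the code assumes a plain multi-line FASTA layout in which the record ID is the first whitespace-delimited token of the `>` header line.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a dict of {record_id: sequence}.

    Assumption: the record ID is the first whitespace-delimited token
    after '>' on the header line; sequence lines are concatenated as-is.
    """
    records = {}
    current_id = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between or after records
        if line.startswith(">"):
            tokens = line[1:].split()
            current_id = tokens[0] if tokens else ""
            records[current_id] = []
        elif current_id is not None:
            records[current_id].append(line)
    # Join the wrapped sequence lines into one string per record.
    return {rid: "".join(chunks) for rid, chunks in records.items()}


def check_length(records, record_id, expected_length):
    """Return True if the sequence for record_id has the expected length."""
    return len(records[record_id]) == expected_length


# Toy record (hypothetical, for illustration only): two wrapped lines of 10 residues.
toy = """>toy_protein_1 hypothetical example
MKLLSLISAQ
VLLGAAYGAL
"""
toy_records = parse_fasta(toy)
```

For the record on this page, the Length field reports 2,571 amino acids, so the corresponding check would be `check_length(records, "MCA_03559_1", 2571)` after parsing the sequence block into `records`.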

GO term prediction

Biological Process

None predicted.

Molecular Function

None predicted.

Cellular Component

None predicted.